Query psy1911
Match_columns 342
No_of_seqs 430 out of 2698
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 19:27:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1911.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1911hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd02620 Peptidase_C1A_Cathepsi 100.0 5.1E-29 1.1E-33 230.6 11.1 139 188-340 1-140 (236)
2 KOG1542|consensus 100.0 9.5E-29 2.1E-33 232.9 7.3 95 182-289 152-247 (372)
3 PTZ00049 cathepsin C-like prot 99.9 2.6E-27 5.5E-32 243.1 12.6 103 184-293 378-490 (693)
4 cd02621 Peptidase_C1A_Cathepsi 99.9 3.5E-27 7.5E-32 219.1 9.3 96 187-289 1-100 (243)
5 KOG1543|consensus 99.9 4E-27 8.8E-32 227.2 9.6 92 183-278 105-198 (325)
6 cd02698 Peptidase_C1A_Cathepsi 99.9 3.4E-26 7.4E-31 212.1 10.7 105 63-167 127-238 (239)
7 cd02620 Peptidase_C1A_Cathepsi 99.9 7.2E-26 1.6E-30 209.6 10.8 102 62-163 131-234 (236)
8 KOG1542|consensus 99.9 1.1E-25 2.3E-30 212.3 8.6 112 52-165 252-369 (372)
9 PTZ00203 cathepsin L protease; 99.9 2.1E-25 4.5E-30 217.0 9.3 92 185-289 124-218 (348)
10 cd02621 Peptidase_C1A_Cathepsi 99.9 3.7E-25 8E-30 205.5 10.1 98 70-167 129-242 (243)
11 KOG1543|consensus 99.9 6.2E-25 1.3E-29 212.0 10.9 112 52-164 206-320 (325)
12 PTZ00364 dipeptidyl-peptidase 99.9 4.4E-25 9.6E-30 223.9 9.9 100 184-289 202-308 (548)
13 PTZ00203 cathepsin L protease; 99.9 1.3E-24 2.8E-29 211.4 11.3 100 62-163 235-336 (348)
14 cd02698 Peptidase_C1A_Cathepsi 99.9 9.7E-25 2.1E-29 202.4 9.7 94 187-289 1-98 (239)
15 PTZ00200 cysteine proteinase; 99.9 6.2E-25 1.3E-29 219.4 8.5 91 186-289 233-324 (448)
16 cd02248 Peptidase_C1A Peptidas 99.9 1.9E-24 4E-29 195.3 10.8 103 60-162 101-207 (210)
17 PTZ00021 falcipain-2; Provisio 99.9 1.4E-24 3E-29 218.0 8.0 90 187-289 266-356 (489)
18 smart00645 Pept_C1 Papain fami 99.9 3.2E-24 6.9E-29 189.8 8.1 86 187-278 1-88 (174)
19 PTZ00021 falcipain-2; Provisio 99.9 1E-23 2.2E-28 211.7 10.6 104 62-166 370-487 (489)
20 PTZ00200 cysteine proteinase; 99.9 1.5E-23 3.3E-28 209.4 10.9 104 62-166 336-444 (448)
21 PF00112 Peptidase_C1: Papain 99.9 2.9E-23 6.3E-28 187.5 8.8 104 62-165 109-218 (219)
22 cd02248 Peptidase_C1A Peptidas 99.9 6.4E-23 1.4E-27 185.3 8.6 89 188-289 1-90 (210)
23 PTZ00049 cathepsin C-like prot 99.9 1E-22 2.2E-27 209.3 10.8 91 77-167 561-676 (693)
24 PF00112 Peptidase_C1: Papain 99.9 2E-22 4.4E-27 182.0 8.1 92 187-289 1-94 (219)
25 PTZ00364 dipeptidyl-peptidase 99.9 8.6E-22 1.9E-26 200.0 9.8 101 65-165 333-457 (548)
26 KOG1544|consensus 99.8 9.4E-21 2E-25 177.1 6.9 108 63-170 341-463 (470)
27 KOG1544|consensus 99.8 2.8E-21 6.1E-26 180.6 0.4 100 181-289 203-303 (470)
28 cd02619 Peptidase_C1 C1 Peptid 99.8 1.4E-19 3E-24 163.7 9.7 93 61-153 110-213 (223)
29 PTZ00462 Serine-repeat antigen 99.8 1E-19 2.3E-24 192.2 10.1 85 77-162 684-776 (1004)
30 cd02619 Peptidase_C1 C1 Peptid 99.8 3.6E-19 7.7E-24 161.0 8.8 89 190-290 1-95 (223)
31 smart00645 Pept_C1 Papain fami 99.8 1.3E-18 2.9E-23 153.9 7.1 75 87-162 93-170 (174)
32 PTZ00462 Serine-repeat antigen 99.7 3.3E-17 7.2E-22 173.4 8.8 84 197-289 538-625 (1004)
33 cd00585 Peptidase_C1B Peptidas 99.5 8.4E-15 1.8E-19 146.0 8.4 85 67-152 288-399 (437)
34 COG4870 Cysteine protease [Pos 99.4 8E-13 1.7E-17 126.6 8.2 99 53-153 197-314 (372)
35 PF03051 Peptidase_C1_2: Pepti 98.8 1.5E-08 3.3E-13 101.4 8.9 84 67-151 289-399 (438)
36 COG4870 Cysteine protease [Pos 98.5 2.1E-08 4.6E-13 96.5 -0.3 86 185-278 97-190 (372)
37 cd00585 Peptidase_C1B Peptidas 97.1 0.0011 2.3E-08 67.0 6.5 75 204-279 55-157 (437)
38 PF03051 Peptidase_C1_2: Pepti 96.9 0.00086 1.9E-08 67.6 4.0 74 204-278 56-157 (438)
39 COG3579 PepC Aminopeptidase C 96.8 0.00093 2E-08 64.2 2.9 70 81-151 308-401 (444)
40 PF13529 Peptidase_C39_2: Pept 84.0 1.5 3.2E-05 35.8 4.1 52 77-137 92-144 (144)
41 COG3579 PepC Aminopeptidase C 82.5 1.9 4.2E-05 42.0 4.6 74 204-278 58-159 (444)
42 PF14399 Transpep_BrtH: NlpC/p 80.8 3.5 7.5E-05 39.2 5.8 52 77-135 81-133 (317)
43 PF10500 SR-25: Nuclear RNA-sp 79.8 2.3 5.1E-05 38.9 3.9 6 82-87 141-146 (225)
44 PF10500 SR-25: Nuclear RNA-sp 66.8 7.5 0.00016 35.7 3.9 11 77-87 151-161 (225)
45 PF05543 Peptidase_C47: Stapho 65.8 24 0.00052 31.3 6.8 37 111-152 117-154 (175)
46 KOG4364|consensus 65.8 11 0.00023 39.9 5.2 14 78-91 427-440 (811)
47 KOG4128|consensus 62.3 1.2 2.5E-05 43.3 -2.2 39 112-150 370-412 (457)
48 COG4990 Uncharacterized protei 52.9 24 0.00052 31.6 4.5 50 69-138 116-168 (195)
49 KOG4364|consensus 50.1 28 0.00061 37.0 5.1 10 114-123 463-472 (811)
50 cd02549 Peptidase_C39A A sub-f 48.7 49 0.0011 26.9 5.7 22 113-137 93-114 (141)
51 PF01754 zf-A20: A20-like zinc 47.2 14 0.0003 22.1 1.5 19 198-216 6-24 (25)
52 KOG3925|consensus 42.5 40 0.00087 33.2 4.7 14 254-267 306-319 (371)
53 PF09778 Guanylate_cyc_2: Guan 34.8 1.3E+02 0.0028 27.7 6.4 14 113-126 160-173 (212)
54 smart00259 ZnF_A20 A20-like zi 32.9 24 0.00051 21.3 0.9 19 198-216 7-25 (26)
55 PF15246 NCKAP5: Nck-associate 31.2 50 0.0011 31.9 3.3 36 48-83 125-160 (318)
56 KOG4128|consensus 31.1 21 0.00045 35.0 0.7 76 203-279 62-167 (457)
57 cd00044 CysPc Calpains, domain 25.0 75 0.0016 30.5 3.4 27 113-139 235-263 (315)
58 PF12385 Peptidase_C70: Papain 24.6 1.6E+02 0.0034 25.9 4.9 34 77-125 102-135 (166)
59 KOG1144|consensus 23.4 1.4E+02 0.0031 32.7 5.2 10 77-86 298-307 (1064)
60 PF12569 NARP1: NMDA receptor- 22.4 78 0.0017 32.9 3.2 9 77-85 480-488 (517)
61 KOG3925|consensus 21.4 1.4E+02 0.0031 29.5 4.5 8 85-92 97-104 (371)
No 1
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.96 E-value=5.1e-29 Score=230.58 Aligned_cols=139 Identities=53% Similarity=0.998 Sum_probs=107.6
Q ss_pred CcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHHH
Q psy1911 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWRF 266 (342)
Q Consensus 188 P~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~y 266 (342)
|++||||++|++|..|+||+||+.||||||||++++||++++|++++...+.||+|+||||+.. ..||+||++..||+|
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~ 80 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY 80 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence 8999999999988888999999999999999999999999999988555789999999999876 559999999999999
Q ss_pred hhhcccccCCCCCCCCCcccCCCCCCccCCCCCCccCCCCCccccccccccccccccccccccccccccceeEe
Q psy1911 267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV 340 (342)
Q Consensus 267 ~~~~Gi~~e~~Y~~~~~C~PY~~~~c~~~~~~~~~~C~~~~~~~~p~C~~~C~~~~~~~~y~~d~~~~~~~y~~ 340 (342)
|+++||++|.+| ||....|..+.... .+|.. ++.|+..|. ......|..+.++....|.+
T Consensus 81 i~~~G~~~e~~y-------PY~~~~~~~~~~~~-~~~~~-----~~~~~~~C~-~~~~~~~~~~~~~~~~~~~~ 140 (236)
T cd02620 81 LTTTGVVTGGCQ-------PYTIPPCGHHPEGP-PPCCG-----TPYCTPKCQ-DGCEKTYEEDKHKGKSAYSV 140 (236)
T ss_pred HHhcCCCcCCEe-------cCcCCCCccCCCCC-CCCCC-----CCCCCCCCC-cCCccccceeeeeecceeee
Confidence 999999996655 99876665433222 34542 335555665 33222355555555544543
No 2
>KOG1542|consensus
Probab=99.95 E-value=9.5e-29 Score=232.90 Aligned_cols=95 Identities=35% Similarity=0.653 Sum_probs=88.0
Q ss_pred cCCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchh
Q psy1911 182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQ 261 (342)
Q Consensus 182 ~~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~ 261 (342)
.....||++||||++ +.||||+|||.||||||||+++++|....|+++ ..+.||+|+||||+....||+||.+.
T Consensus 152 ~~~~~lP~~fDWR~k----gaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g--~LvsLSEQeLvDCD~~d~gC~GGl~~ 225 (372)
T KOG1542|consen 152 EPGESLPESFDWRDK----GAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATG--KLVSLSEQELVDCDSCDNGCNGGLMD 225 (372)
T ss_pred CCCCCCCcccchhcc----CCccccccCCcCcchhhhhhhhhhhhHHHhhcC--cccccchhhhhcccCcCCcCCCCChh
Confidence 356789999999999 999999999999999999999999999999999 68999999999998777799999999
Q ss_pred hHHHHh-hhcccccCCCCCCCCCcccCCC
Q psy1911 262 LAWRFW-GHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 262 ~a~~y~-~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
.||+|+ ...||.+|.+| ||++
T Consensus 226 nA~~~~~~~gGL~~E~dY-------PY~g 247 (372)
T KOG1542|consen 226 NAFKYIKKAGGLEKEKDY-------PYTG 247 (372)
T ss_pred HHHHHHHHhCCccccccC-------Cccc
Confidence 999995 55699999999 8876
No 3
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.94 E-value=2.6e-27 Score=243.06 Aligned_cols=103 Identities=29% Similarity=0.536 Sum_probs=91.2
Q ss_pred CCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCc--------cccccCHHHHHhhCCCCCCC
Q psy1911 184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGY--------FTGQISAQHIVACTPNCWGC 255 (342)
Q Consensus 184 ~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~--------~~~~LS~Q~lldC~~~~~GC 255 (342)
..+||++||||+.|++|+.|+||+|||.||||||||++++||+|++|++++. +...||+|+||||+..++||
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC 457 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGC 457 (693)
T ss_pred cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCc
Confidence 5689999999999999999999999999999999999999999999997531 23479999999999877899
Q ss_pred CCCchhhHHHHhhhcccccCCCCCCCCCcccCCC--CCCc
Q psy1911 256 NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL--APCE 293 (342)
Q Consensus 256 ~GG~~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~--~~c~ 293 (342)
+||++..|++|+.++||+++..| ||++ +.|.
T Consensus 458 ~GG~~~~A~kya~~~GI~tEscY-------PY~a~~g~C~ 490 (693)
T PTZ00049 458 NGGFPYLVSKMAKLQGIPLDKVF-------PYTATEQTCP 490 (693)
T ss_pred CCCcHHHHHHHHHHCCCCcCCcc-------CCcCCCCCCC
Confidence 99999999999999999996555 8875 4564
No 4
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.94 E-value=3.5e-27 Score=219.10 Aligned_cols=96 Identities=34% Similarity=0.628 Sum_probs=87.8
Q ss_pred CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCC----ccccccCHHHHHhhCCCCCCCCCCchhh
Q psy1911 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG----YFTGQISAQHIVACTPNCWGCNGGWPQL 262 (342)
Q Consensus 187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~----~~~~~LS~Q~lldC~~~~~GC~GG~~~~ 262 (342)
||++||||+.|++|..|+||+||+.||||||||++++||++++|+++. ...+.||+|+||||+....||+||++..
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~ 80 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL 80 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence 799999999998889999999999999999999999999999999875 2368899999999987667999999999
Q ss_pred HHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911 263 AWRFWGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 263 a~~y~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
|++|+.++||+++.+| ||..
T Consensus 81 a~~~~~~~Gi~~e~~y-------PY~~ 100 (243)
T cd02621 81 VGKFAEDFGIVTEDYF-------PYTA 100 (243)
T ss_pred HHHHHHhcCcCCCcee-------CCCC
Confidence 9999999999996665 8875
No 5
>KOG1543|consensus
Probab=99.94 E-value=4e-27 Score=227.19 Aligned_cols=92 Identities=39% Similarity=0.743 Sum_probs=84.3
Q ss_pred CCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCC-CCCCCchh
Q psy1911 183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQ 261 (342)
Q Consensus 183 ~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~-GC~GG~~~ 261 (342)
...+||++||||++| +.+.|||||+.||||||||++++||+|++|++++ .++.||+||||||+..++ ||+||++.
T Consensus 105 ~~~~~p~s~DwR~~~---~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~-~l~sLSeq~lvdC~~~~~~GC~GG~~~ 180 (325)
T KOG1543|consen 105 DGDDLPDSFDWRDKG---AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGG-KLLSLSEQDLVDCCGECGDGCNGGEPK 180 (325)
T ss_pred chhhCCCCccccccC---CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCC-ccCccChhhhhhccCCCCCCcCCCCHH
Confidence 356899999999997 5677899999999999999999999999999997 788999999999998744 99999999
Q ss_pred hHHHHhhhccccc-CCCC
Q psy1911 262 LAWRFWGHNGVVT-GGDY 278 (342)
Q Consensus 262 ~a~~y~~~~Gi~~-e~~Y 278 (342)
.||+|+.++|+++ +.+|
T Consensus 181 ~A~~yi~~~G~~t~~~~Y 198 (325)
T KOG1543|consen 181 NAFKYIKKNGGVTECENY 198 (325)
T ss_pred HHHHHHHHhCCCCCCcCC
Confidence 9999999999998 6666
No 6
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.93 E-value=3.4e-26 Score=212.09 Aligned_cols=105 Identities=29% Similarity=0.602 Sum_probs=92.9
Q ss_pred eeeeeeEEcCchHH-HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccC-CccEEEEecCCCCC
Q psy1911 63 HYFKKAHMVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDH 140 (342)
Q Consensus 63 ~~i~~~~~v~~~~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~-g~~ywiikNSWG~~ 140 (342)
+++.+|..++..+. +++|+++|||+++|.++++|+.|++|||...++...++|+|+|||||++. +.+|||||||||+.
T Consensus 127 ~~i~~~~~~~~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~ 206 (239)
T cd02698 127 YFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDENGVEYWIVRNSWGEP 206 (239)
T ss_pred EEeeeceecCCHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecCCCCEEEEEEcCCCcc
Confidence 56778888876555 77899999999999999899999999998776666679999999999876 89999999999999
Q ss_pred CCCCCceeeecCC-----cchhhccccccccc
Q psy1911 141 WGDHGTFKILRGE-----NEADIEMGFNNRVE 167 (342)
Q Consensus 141 WG~~Gy~~i~~~~-----~~~gi~~~~~~~v~ 167 (342)
||++|||||.|+. |++||++.+.++.+
T Consensus 207 WGe~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~ 238 (239)
T cd02698 207 WGERGWFRIVTSSYKGARYNLAIEEDCAWADP 238 (239)
T ss_pred cCcCceEEEEccCCcccccccccccceEEEee
Confidence 9999999999999 99999988877653
No 7
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.93 E-value=7.2e-26 Score=209.58 Aligned_cols=102 Identities=54% Similarity=1.018 Sum_probs=89.1
Q ss_pred ceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCCC
Q psy1911 62 SHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139 (342)
Q Consensus 62 ~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG~ 139 (342)
.+++..++.+..++. +.+|+++|||+++|.++++|+.|.+|||...++...++|||+|||||++++.+|||||||||+
T Consensus 131 ~~~~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~ 210 (236)
T cd02620 131 KHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVENGVPYWLAANSWGT 210 (236)
T ss_pred eeeecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCC
Confidence 455667777765433 678999999999999988999999999987665556799999999999888999999999999
Q ss_pred CCCCCCceeeecCCcchhhccccc
Q psy1911 140 HWGDHGTFKILRGENEADIEMGFN 163 (342)
Q Consensus 140 ~WG~~Gy~~i~~~~~~~gi~~~~~ 163 (342)
.||++|||||.|+.|.|||++.+.
T Consensus 211 ~WGe~Gy~ri~~~~~~cgi~~~~~ 234 (236)
T cd02620 211 DWGENGYFRILRGSNECGIESEVV 234 (236)
T ss_pred CCCCCcEEEEEccCccccccccee
Confidence 999999999999999999987764
No 8
>KOG1542|consensus
Probab=99.92 E-value=1.1e-25 Score=212.31 Aligned_cols=112 Identities=25% Similarity=0.471 Sum_probs=98.3
Q ss_pred ccCCCCCcccceeeeeeEEcCchHH--HHHHHHcCCeEEEEeecccccccccccccc---CCCCCccceEEEEeeecccC
Q psy1911 52 RLYLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEN 126 (342)
Q Consensus 52 ~~~~p~s~~~~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~---~~~~~~~~Hav~ivGyg~~~ 126 (342)
.|.... .....+|++|+.++.+++ ++.|+++|||+|+|++ ..+|+|++||+.+ .|....++|+|+|||||..+
T Consensus 252 ~C~~~~-~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g 329 (372)
T KOG1542|consen 252 QCHFDK-SKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG 329 (372)
T ss_pred ccccch-hhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCCcccCCccccCceEEEEeecCCC
Confidence 666665 446799999999999887 5679999999999996 4699999999987 46666689999999999988
Q ss_pred -CccEEEEecCCCCCCCCCCceeeecCCcchhhccccccc
Q psy1911 127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNR 165 (342)
Q Consensus 127 -g~~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~ 165 (342)
..+|||||||||+.||++||+++.||.|.|||...+..+
T Consensus 330 ~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~ 369 (372)
T KOG1542|consen 330 YEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSA 369 (372)
T ss_pred CCCceEEEECCccccccccceEEEeccccccccccchhhh
Confidence 899999999999999999999999999999998776544
No 9
>PTZ00203 cathepsin L protease; Provisional
Probab=99.92 E-value=2.1e-25 Score=216.97 Aligned_cols=92 Identities=33% Similarity=0.589 Sum_probs=83.3
Q ss_pred CCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHH
Q psy1911 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAW 264 (342)
Q Consensus 185 ~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~ 264 (342)
.+||++||||++ +.|+||+|||.||||||||++++||++++|+++ ..+.||+|+||||+..+.||+||++..||
T Consensus 124 ~~lP~~~DWR~~----g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~--~~~~LSeQqLvdC~~~~~GC~GG~~~~a~ 197 (348)
T PTZ00203 124 SAVPDAVDWREK----GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGH--KLVRLSEQQLVSCDHVDNGCGGGLMLQAF 197 (348)
T ss_pred ccCCCCCcCCcC----CCCCCccccCCCccHHHHhhHHHHHHHHHHhcC--CCccCCHHHHHhccCCCCCCCCCCHHHHH
Confidence 368999999998 889999999999999999999999999999988 46789999999998776799999999999
Q ss_pred HHhhhc---ccccCCCCCCCCCcccCCC
Q psy1911 265 RFWGHN---GVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 265 ~y~~~~---Gi~~e~~Y~~~~~C~PY~~ 289 (342)
+|+.++ ||++|.+| ||..
T Consensus 198 ~yi~~~~~ggi~~e~~Y-------PY~~ 218 (348)
T PTZ00203 198 EWVLRNMNGTVFTEKSY-------PYVS 218 (348)
T ss_pred HHHHHhcCCCCCccccC-------CCcc
Confidence 999764 57888888 8864
No 10
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.92 E-value=3.7e-25 Score=205.52 Aligned_cols=98 Identities=41% Similarity=0.778 Sum_probs=82.7
Q ss_pred EcCchHH-HHHHHHcCCeEEEEeeccccccccccccccC-----CCC--------CccceEEEEeeecccC--CccEEEE
Q psy1911 70 MVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-----FGD--------SIGLHAVRVLGWGVEN--DIPYWLV 133 (342)
Q Consensus 70 ~v~~~~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~-----~~~--------~~~~Hav~ivGyg~~~--g~~ywii 133 (342)
.+...+. +++|+++|||+++|++.++|++|++|||... |.. ..++|+|+|||||++. +.+||||
T Consensus 129 ~~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWii 208 (243)
T cd02621 129 GCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIV 208 (243)
T ss_pred ccCCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEE
Confidence 3334444 6789999999999999989999999999765 321 2468999999999876 8899999
Q ss_pred ecCCCCCCCCCCceeeecCCcchhhccccccccc
Q psy1911 134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE 167 (342)
Q Consensus 134 kNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~ 167 (342)
|||||+.||++|||||.|+.|.|||+..+..+++
T Consensus 209 rNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~ 242 (243)
T cd02621 209 KNSWGSSWGEKGYFKIRRGTNECGIESQAVFAYP 242 (243)
T ss_pred EcCCCCCCCcCCeEEEecCCcccCcccceEeecc
Confidence 9999999999999999999999999888766543
No 11
>KOG1543|consensus
Probab=99.92 E-value=6.2e-25 Score=212.00 Aligned_cols=112 Identities=32% Similarity=0.617 Sum_probs=98.5
Q ss_pred ccCCCCCcccceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCC-ccceEEEEeeecccCCc
Q psy1911 52 RLYLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDI 128 (342)
Q Consensus 52 ~~~~p~s~~~~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~-~~~Hav~ivGyg~~~g~ 128 (342)
.|-.... ...+.+.+++.++.++. +.+|+.+|||+|+|++.++|+.|++|||.++++.. .++|+|+|||||..++.
T Consensus 206 ~C~~~~~-~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~~~~~ 284 (325)
T KOG1543|consen 206 TCKSNKK-DKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGTGDGV 284 (325)
T ss_pred CccCCCc-cceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcCCCCc
Confidence 4444443 56788999999999976 78899999999999999999999999999988776 48999999999996568
Q ss_pred cEEEEecCCCCCCCCCCceeeecCCcchhhcccccc
Q psy1911 129 PYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNN 164 (342)
Q Consensus 129 ~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~ 164 (342)
+||||+||||+.||++|||||.|+.+.|+|++...+
T Consensus 285 ~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~~ 320 (325)
T KOG1543|consen 285 DYWIVKNSWGTDWGEKGYFRIARGVNKCGIASEASY 320 (325)
T ss_pred eeEEEEcCCCCCcccCceEEEecCCCchhhhccccc
Confidence 999999999999999999999999999998776655
No 12
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.92 E-value=4.4e-25 Score=223.93 Aligned_cols=100 Identities=25% Similarity=0.424 Sum_probs=88.7
Q ss_pred CCCCCcccccCccCCCCCCCcccccccc---cchHHHHHhhHHHHHHHHHHhCCc----cccccCHHHHHhhCCCCCCCC
Q psy1911 184 AKGLPRNFDAREKWPECPSLRHIADQSN---CGSCWAVSVANAISDRLCIASNGY----FTGQISAQHIVACTPNCWGCN 256 (342)
Q Consensus 184 ~~~lP~sfD~R~~~~~c~~i~~v~dQg~---CGsCwAfa~~~~le~r~~i~~~~~----~~~~LS~Q~lldC~~~~~GC~ 256 (342)
..+||++||||++ .++++|+|||||+. ||||||||++++||+|++|++++. ..+.||+|+||||+..+.||+
T Consensus 202 ~~~LP~sfDWR~~-gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCd 280 (548)
T PTZ00364 202 GDPPPAAWSWGDV-GGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCA 280 (548)
T ss_pred ccCCCCccccCcC-CCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCC
Confidence 4689999999998 34457999999999 999999999999999999998643 357899999999987767999
Q ss_pred CCchhhHHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911 257 GGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 257 GG~~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
||++..|++|++++||++|.+| |.||..
T Consensus 281 GG~p~~A~~yi~~~GI~tE~dY-----~~PY~~ 308 (548)
T PTZ00364 281 GGFPEEVGKFAETFGILTTDSY-----YIPYDS 308 (548)
T ss_pred CCcHHHHHHHHHhCCccccccc-----CCCCCC
Confidence 9999999999999999998887 668864
No 13
>PTZ00203 cathepsin L protease; Provisional
Probab=99.91 E-value=1.3e-24 Score=211.42 Aligned_cols=100 Identities=21% Similarity=0.456 Sum_probs=85.4
Q ss_pred ceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCCC
Q psy1911 62 SHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND 139 (342)
Q Consensus 62 ~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG~ 139 (342)
...+.+|..|+.++. +.+|+++|||+|+|++. +|++|++|||.. |....++|+|+|||||.++|.+|||||||||+
T Consensus 235 ~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~ 312 (348)
T PTZ00203 235 GARIDGYVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGE 312 (348)
T ss_pred ceEecceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecCCCceEEEEEcCCCC
Confidence 456778888876544 56788899999999985 899999999974 54445699999999999888999999999999
Q ss_pred CCCCCCceeeecCCcchhhccccc
Q psy1911 140 HWGDHGTFKILRGENEADIEMGFN 163 (342)
Q Consensus 140 ~WG~~Gy~~i~~~~~~~gi~~~~~ 163 (342)
.||++|||||.++.|.|||...+.
T Consensus 313 ~WGe~GY~ri~rg~n~Cgi~~~~~ 336 (348)
T PTZ00203 313 DWGEKGYVRVTMGVNACLLTGYPV 336 (348)
T ss_pred CcCcCceEEEEcCCCcccccceEE
Confidence 999999999999999999975443
No 14
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.91 E-value=9.7e-25 Score=202.38 Aligned_cols=94 Identities=29% Similarity=0.545 Sum_probs=83.1
Q ss_pred CCcccccCccCCCCCCCccccccc---ccchHHHHHhhHHHHHHHHHHhCCcc-ccccCHHHHHhhCCCCCCCCCCchhh
Q psy1911 187 LPRNFDAREKWPECPSLRHIADQS---NCGSCWAVSVANAISDRLCIASNGYF-TGQISAQHIVACTPNCWGCNGGWPQL 262 (342)
Q Consensus 187 lP~sfD~R~~~~~c~~i~~v~dQg---~CGsCwAfa~~~~le~r~~i~~~~~~-~~~LS~Q~lldC~~~~~GC~GG~~~~ 262 (342)
||++||||+.+.. ..|+||+||+ .||||||||++++||+|++|++++.. .+.||+|+||||+. ..||+||++..
T Consensus 1 lP~~~Dwr~~~~~-~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~~~~ 78 (239)
T cd02698 1 LPKSWDWRNVNGV-NYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGDPGG 78 (239)
T ss_pred CCCCcccccCCCC-cccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcCHHH
Confidence 7999999998432 4899999998 89999999999999999999987543 57899999999986 56999999999
Q ss_pred HHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911 263 AWRFWGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 263 a~~y~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
||+|++++||++|.+| ||..
T Consensus 79 a~~~~~~~Gl~~e~~y-------PY~~ 98 (239)
T cd02698 79 VYEYAHKHGIPDETCN-------PYQA 98 (239)
T ss_pred HHHHHHHcCcCCCCee-------CCcC
Confidence 9999999999996666 8875
No 15
>PTZ00200 cysteine proteinase; Provisional
Probab=99.91 E-value=6.2e-25 Score=219.36 Aligned_cols=91 Identities=30% Similarity=0.611 Sum_probs=83.8
Q ss_pred CCCcccccCccCCCCCCCccccccc-ccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHH
Q psy1911 186 GLPRNFDAREKWPECPSLRHIADQS-NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAW 264 (342)
Q Consensus 186 ~lP~sfD~R~~~~~c~~i~~v~dQg-~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~ 264 (342)
.+|++||||+. +.|+||+||| .||||||||++++||++++|.++ ..+.||+|+||||+..+.||+||++..||
T Consensus 233 ~~P~~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~--~~~~LSeQqLvDC~~~~~GC~GG~~~~A~ 306 (448)
T PTZ00200 233 ITGEGLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRD--KSVDLSEQELVNCDTKSQGCSGGYPDTAL 306 (448)
T ss_pred cCCCCccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHHHhcC--CCeecCHHHHhhccCccCCCCCCcHHHHH
Confidence 46999999998 8899999999 99999999999999999999876 46889999999998777799999999999
Q ss_pred HHhhhcccccCCCCCCCCCcccCCC
Q psy1911 265 RFWGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 265 ~y~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
+|++++||+++.+| ||++
T Consensus 307 ~yi~~~Gi~~e~~Y-------PY~~ 324 (448)
T PTZ00200 307 EYVKNKGLSSSSDV-------PYLA 324 (448)
T ss_pred HHHhhcCccccccC-------CCCC
Confidence 99999999997777 8875
No 16
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.91 E-value=1.9e-24 Score=195.29 Aligned_cols=103 Identities=30% Similarity=0.557 Sum_probs=91.1
Q ss_pred ccceeeeeeEEcCch--HH-HHHHHHcCCeEEEEeeccccccccccccccCCC-CCccceEEEEeeecccCCccEEEEec
Q psy1911 60 PLSHYFKKAHMVPRC--NA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-DSIGLHAVRVLGWGVENDIPYWLVAN 135 (342)
Q Consensus 60 ~~~~~i~~~~~v~~~--~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~-~~~~~Hav~ivGyg~~~g~~ywiikN 135 (342)
....++.+|+.|+.. +. +++|+++|||+++|.+.++|+.|.+|||..+.+ ...++|+|+|||||++.+.+||||+|
T Consensus 101 ~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~~~~~ywiv~N 180 (210)
T cd02248 101 KVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTENGVDYWIVKN 180 (210)
T ss_pred cccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeecCCceEEEEEc
Confidence 456888999999874 33 788999999999999999999999999987665 45679999999999988899999999
Q ss_pred CCCCCCCCCCceeeecCCcchhhcccc
Q psy1911 136 SWNDHWGDHGTFKILRGENEADIEMGF 162 (342)
Q Consensus 136 SWG~~WG~~Gy~~i~~~~~~~gi~~~~ 162 (342)
|||+.||++|||||+++.+.|||....
T Consensus 181 SWG~~WG~~Gy~~i~~~~~~cgi~~~~ 207 (210)
T cd02248 181 SWGTSWGEKGYIRIARGSNLCGIASYA 207 (210)
T ss_pred CCCCccccCcEEEEEcCCCccCceeee
Confidence 999999999999999999999997554
No 17
>PTZ00021 falcipain-2; Provisional
Probab=99.90 E-value=1.4e-24 Score=218.00 Aligned_cols=90 Identities=31% Similarity=0.537 Sum_probs=82.8
Q ss_pred CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHHHH
Q psy1911 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF 266 (342)
Q Consensus 187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~~y 266 (342)
+|++||||+. +.|+||+|||.||||||||++++||++++|+++ ..+.||+|+||||+..+.||+||++..||+|
T Consensus 266 ~P~s~DWR~~----g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g--~~v~LSeQqLVDCs~~n~GC~GG~~~~Af~y 339 (489)
T PTZ00021 266 DHAKYDWRLH----NGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKN--ELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339 (489)
T ss_pred CccccccccC----CCCCCcccccccccHHHHHHHHHHHHHHHHHcC--CCcccCHHHHhhhccCCCCCCCcchHhhhhh
Confidence 5999999998 889999999999999999999999999999988 4688999999999877779999999999999
Q ss_pred hhhc-ccccCCCCCCCCCcccCCC
Q psy1911 267 WGHN-GVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 267 ~~~~-Gi~~e~~Y~~~~~C~PY~~ 289 (342)
+.++ ||++|.+| ||..
T Consensus 340 i~~~gGl~tE~~Y-------PY~~ 356 (489)
T PTZ00021 340 MIELGGLCSEDDY-------PYVS 356 (489)
T ss_pred hhhccccCccccc-------CccC
Confidence 9776 89998888 8864
No 18
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.90 E-value=3.2e-24 Score=189.81 Aligned_cols=86 Identities=40% Similarity=0.790 Sum_probs=78.2
Q ss_pred CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHH
Q psy1911 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWR 265 (342)
Q Consensus 187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~ 265 (342)
||++||||+. +.++||+||+.||+|||||++++||++++|+++. .+.||+|+|++|... ..||+||++..|++
T Consensus 1 lP~~~D~R~~----~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~--~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~ 74 (174)
T smart00645 1 LPESFDWRKK----GAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGK--LVSLSEQQLVDCSTGGNNGCNGGLPDNAFE 74 (174)
T ss_pred CCCcCccccc----CCCCccccCcccchHHHHHHHHHHHHHHHHhcCC--ccccCHHHHhhhcCCCCCCCCCcCHHHHHH
Confidence 7999999998 3689999999999999999999999999999884 678999999999875 44999999999999
Q ss_pred Hhhhc-ccccCCCC
Q psy1911 266 FWGHN-GVVTGGDY 278 (342)
Q Consensus 266 y~~~~-Gi~~e~~Y 278 (342)
|+.++ ||++|.+|
T Consensus 75 ~~~~~~Gi~~e~~~ 88 (174)
T smart00645 75 YIKKNGGLETESCY 88 (174)
T ss_pred HHHHcCCccccccc
Confidence 99998 99996555
No 19
>PTZ00021 falcipain-2; Provisional
Probab=99.90 E-value=1e-23 Score=211.70 Aligned_cols=104 Identities=26% Similarity=0.499 Sum_probs=88.2
Q ss_pred ceeeeeeEEcCchHHHHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccC----------CccEE
Q psy1911 62 SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN----------DIPYW 131 (342)
Q Consensus 62 ~~~i~~~~~v~~~~~~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~----------g~~yw 131 (342)
.+.+.+|..|+..+.+++|+.+|||+|+|+++.+|++|.+|||...|.. .++|||+|||||.++ +.+||
T Consensus 370 ~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YW 448 (489)
T PTZ00021 370 KYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYY 448 (489)
T ss_pred cceeeeEEEecHHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCCCCC-ccceEEEEEEecCcCCcccccccCCCCCEE
Confidence 4678889999865557788889999999999889999999999876654 469999999999753 24799
Q ss_pred EEecCCCCCCCCCCceeeecCC----cchhhcccccccc
Q psy1911 132 LVANSWNDHWGDHGTFKILRGE----NEADIEMGFNNRV 166 (342)
Q Consensus 132 iikNSWG~~WG~~Gy~~i~~~~----~~~gi~~~~~~~v 166 (342)
|||||||+.||++|||||.++. |.|||...+.+++
T Consensus 449 IVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP~ 487 (489)
T PTZ00021 449 IIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVPL 487 (489)
T ss_pred EEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccceeEe
Confidence 9999999999999999999986 5899987766553
No 20
>PTZ00200 cysteine proteinase; Provisional
Probab=99.89 E-value=1.5e-23 Score=209.40 Aligned_cols=104 Identities=28% Similarity=0.510 Sum_probs=87.8
Q ss_pred ceeeeeeEEcCchHHHHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecc--cCCccEEEEecCCCC
Q psy1911 62 SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANSWND 139 (342)
Q Consensus 62 ~~~i~~~~~v~~~~~~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~--~~g~~ywiikNSWG~ 139 (342)
..+|.+|..++..+.+++++.+|||+|+|+++.+|++|++|||..+|... ++|||+|||||. ++|.+||||+||||+
T Consensus 336 ~~~i~~y~~~~~~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~ 414 (448)
T PTZ00200 336 KVYIDSYLVAKGKDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGT 414 (448)
T ss_pred eeEecceEecCHHHHHHHHHhcCCEEEEeecccccccCCCCccccccCCC-CcEEEEEEEecccCCCCCceEEEEcCCCC
Confidence 35677787776555577777899999999998899999999998776554 799999999984 467899999999999
Q ss_pred CCCCCCceeeecC---Ccchhhcccccccc
Q psy1911 140 HWGDHGTFKILRG---ENEADIEMGFNNRV 166 (342)
Q Consensus 140 ~WG~~Gy~~i~~~---~~~~gi~~~~~~~v 166 (342)
.||++|||||.|+ .|.|||...+.+++
T Consensus 415 ~WGe~GY~ri~r~~~g~n~CGI~~~~~~P~ 444 (448)
T PTZ00200 415 DWGENGYMRLERTNEGTDKCGILTVGLTPV 444 (448)
T ss_pred CcccCeeEEEEeCCCCCCcCCccccceeeE
Confidence 9999999999985 48999987766554
No 21
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.89 E-value=2.9e-23 Score=187.53 Aligned_cols=104 Identities=36% Similarity=0.575 Sum_probs=90.6
Q ss_pred ceeeeeeEEcCch--HH-HHHHHHcCCeEEEEeecc-ccccccccccccCC-CCCccceEEEEeeecccCCccEEEEecC
Q psy1911 62 SHYFKKAHMVPRC--NA-MRQIYEHGPLVAIFSVYA-DFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANS 136 (342)
Q Consensus 62 ~~~i~~~~~v~~~--~~-~~~i~~~GPV~v~i~v~~-~f~~y~~Gi~~~~~-~~~~~~Hav~ivGyg~~~g~~ywiikNS 136 (342)
...+..+..+... +. +++|+++|||+++|.+++ +|+.|.+|||..+. ....++|+|+|||||++.+.+||||+||
T Consensus 109 ~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NS 188 (219)
T PF00112_consen 109 YVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDENGKGYWIVKNS 188 (219)
T ss_dssp EBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEETTEEEEEEE-S
T ss_pred cccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeeccccccccccccccccccccccceeeEeeehh
Confidence 3677888888875 23 788999999999999998 69999999998863 3457899999999999999999999999
Q ss_pred CCCCCCCCCceeeecCCc-chhhccccccc
Q psy1911 137 WNDHWGDHGTFKILRGEN-EADIEMGFNNR 165 (342)
Q Consensus 137 WG~~WG~~Gy~~i~~~~~-~~gi~~~~~~~ 165 (342)
||+.||++||+||+++.+ .|||+..+.++
T Consensus 189 WG~~WG~~Gy~~i~~~~~~~c~i~~~~~~~ 218 (219)
T PF00112_consen 189 WGTDWGDNGYFRISYDYNNECGIESQAVYP 218 (219)
T ss_dssp BTTTSTBTTEEEEESSSSSGGGTTSSEEEE
T ss_pred hCCccCCCeEEEEeeCCCCcCccCceeeec
Confidence 999999999999999987 99999888765
No 22
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.88 E-value=6.4e-23 Score=185.25 Aligned_cols=89 Identities=37% Similarity=0.722 Sum_probs=81.1
Q ss_pred CcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHHH
Q psy1911 188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWRF 266 (342)
Q Consensus 188 P~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~y 266 (342)
|++||||+. +.++||+||+.||||||||++++||++++|+++ ..+.||+|+|++|... ..||.||++..|++|
T Consensus 1 P~~~d~r~~----~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~ 74 (210)
T cd02248 1 PESVDWREK----GAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEY 74 (210)
T ss_pred CCcccCCcC----CCCCCCccCCCCcchHHhHHHHHHHHHHHHHcC--CCcccCHHHHhccCCCCCCCCCCCCHHHhHHH
Confidence 889999998 558999999999999999999999999999987 5678999999999876 569999999999999
Q ss_pred hhhcccccCCCCCCCCCcccCCC
Q psy1911 267 WGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 267 ~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
+.++||+++++| ||..
T Consensus 75 ~~~~Gi~~e~~y-------PY~~ 90 (210)
T cd02248 75 VKNGGLASESDY-------PYTG 90 (210)
T ss_pred HHHCCcCccccC-------CccC
Confidence 999999997776 8864
No 23
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.88 E-value=1e-22 Score=209.26 Aligned_cols=91 Identities=41% Similarity=0.747 Sum_probs=78.6
Q ss_pred HHHHHHcCCeEEEEeeccccccccccccccC-------CCC--------------CccceEEEEeeeccc--CCc--cEE
Q psy1911 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-------FGD--------------SIGLHAVRVLGWGVE--NDI--PYW 131 (342)
Q Consensus 77 ~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~-------~~~--------------~~~~Hav~ivGyg~~--~g~--~yw 131 (342)
|++|+.+|||+|+|+++++|++|++|||... |.. ...+|||+|||||.+ +|. +||
T Consensus 561 m~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~~~~YW 640 (693)
T PTZ00049 561 MNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLYKYW 640 (693)
T ss_pred HHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccCCCcccCEE
Confidence 7899999999999999889999999999742 321 136899999999975 353 799
Q ss_pred EEecCCCCCCCCCCceeeecCCcchhhccccccccc
Q psy1911 132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE 167 (342)
Q Consensus 132 iikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~ 167 (342)
|||||||+.||++|||||.|+.|.|||+..+.++.+
T Consensus 641 IVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~p 676 (693)
T PTZ00049 641 IGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEP 676 (693)
T ss_pred EEECCCCCCcccCceEEEEcCCCccCCccceeEEee
Confidence 999999999999999999999999999988877653
No 24
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.87 E-value=2e-22 Score=182.04 Aligned_cols=92 Identities=40% Similarity=0.740 Sum_probs=77.0
Q ss_pred CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCC-CCCCCCCCchhhHHH
Q psy1911 187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWR 265 (342)
Q Consensus 187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~-~~~GC~GG~~~~a~~ 265 (342)
||++||||+.+ +.++||+||+.||+|||||++++||++++++.+ ...+.||+|+|++|+. ...+|+||++..|+.
T Consensus 1 lP~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~-~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~ 76 (219)
T PF00112_consen 1 LPKSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNN-GKNVDLSEQYLIDCSNKYNKGCDGGSPFDALK 76 (219)
T ss_dssp STSSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHT-SSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHH
T ss_pred CCCCEecccCC---CCcCccccCCcccccccchhccceecccccccc-ccccccccccccccccccccccccCcccccce
Confidence 89999999962 359999999999999999999999999999985 3678999999999987 345999999999999
Q ss_pred Hhhh-cccccCCCCCCCCCcccCCC
Q psy1911 266 FWGH-NGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 266 y~~~-~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
|+.+ +||+++.+| ||..
T Consensus 77 ~~~~~~Gi~~e~~~-------pY~~ 94 (219)
T PF00112_consen 77 YIKNNNGIVTEEDY-------PYNG 94 (219)
T ss_dssp HHHHHTSBEBTTTS---------SS
T ss_pred eecccCcccccccc-------cccc
Confidence 9999 999997777 8873
No 25
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.86 E-value=8.6e-22 Score=200.04 Aligned_cols=101 Identities=31% Similarity=0.483 Sum_probs=82.7
Q ss_pred eeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccC---------CC----------CCccceEEEEeeec
Q psy1911 65 FKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---------FG----------DSIGLHAVRVLGWG 123 (342)
Q Consensus 65 i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~---------~~----------~~~~~Hav~ivGyg 123 (342)
+.+|+.+..++. +.+|+.+|||+|+|+++.+|++|.+|||... ++ ....+|+|+|||||
T Consensus 333 I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG 412 (548)
T PTZ00364 333 LGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWG 412 (548)
T ss_pred ecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEec
Confidence 444555554333 7789999999999999989999999998521 11 13469999999999
Q ss_pred c-cCCccEEEEecCCCC--CCCCCCceeeecCCcchhhccccccc
Q psy1911 124 V-ENDIPYWLVANSWND--HWGDHGTFKILRGENEADIEMGFNNR 165 (342)
Q Consensus 124 ~-~~g~~ywiikNSWG~--~WG~~Gy~~i~~~~~~~gi~~~~~~~ 165 (342)
. ++|.+|||||||||+ .||++|||||.|+.|.|||+..+...
T Consensus 413 ~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~~v~~ 457 (548)
T PTZ00364 413 TDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVVVM 457 (548)
T ss_pred ccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccceeeee
Confidence 6 467899999999999 99999999999999999998887654
No 26
>KOG1544|consensus
Probab=99.82 E-value=9.4e-21 Score=177.06 Aligned_cols=108 Identities=37% Similarity=0.686 Sum_probs=93.5
Q ss_pred eeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCC--------CCccceEEEEeeecccCC-----
Q psy1911 63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG--------DSIGLHAVRVLGWGVEND----- 127 (342)
Q Consensus 63 ~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~--------~~~~~Hav~ivGyg~~~g----- 127 (342)
+....-|+|+.+++ |++||++|||.+.|.|.++|+.|.+|||.+... ...+.|+|.|.|||.+.+
T Consensus 341 yq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~ 420 (470)
T KOG1544|consen 341 YQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRT 420 (470)
T ss_pred eeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCe
Confidence 55667889999887 899999999999999999999999999976432 235789999999997632
Q ss_pred ccEEEEecCCCCCCCCCCceeeecCCcchhhccccccccccCC
Q psy1911 128 IPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS 170 (342)
Q Consensus 128 ~~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~~~~ 170 (342)
..|||..||||+.||++|||+|.||.|+|.|+..+..++....
T Consensus 421 ~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgAWGr~~ 463 (470)
T KOG1544|consen 421 LKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGAWGRVG 463 (470)
T ss_pred eEEEEeecccccccccCceEEEeccccchhhhHhhhhhhhccc
Confidence 5899999999999999999999999999999998887765433
No 27
>KOG1544|consensus
Probab=99.81 E-value=2.8e-21 Score=180.57 Aligned_cols=100 Identities=37% Similarity=0.722 Sum_probs=92.8
Q ss_pred ccCCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCC-CCCCCc
Q psy1911 181 CQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGW 259 (342)
Q Consensus 181 ~~~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~-GC~GG~ 259 (342)
.+...+||+.||+|++|| +.|.++.||++|++.|||+++++.+||++|++.+..+..||+|+||+|+...+ ||+||.
T Consensus 203 l~p~~~LPE~F~As~KWp--~liH~plDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~ 280 (470)
T KOG1544|consen 203 LNPGEVLPEAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGR 280 (470)
T ss_pred cCcccccchhhhhhhcCC--ccccCccccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCc
Confidence 355689999999999999 57999999999999999999999999999999999999999999999987666 999999
Q ss_pred hhhHHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911 260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 260 ~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~ 289 (342)
...||=||++.|||+ +.|.||..
T Consensus 281 lDRAWWYlRKrGvVs-------dhCYP~~~ 303 (470)
T KOG1544|consen 281 LDRAWWYLRKRGVVS-------DHCYPFSG 303 (470)
T ss_pred ccchheeeecccccc-------cccccccC
Confidence 999999999999998 77779964
No 28
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.80 E-value=1.4e-19 Score=163.70 Aligned_cols=93 Identities=29% Similarity=0.467 Sum_probs=79.0
Q ss_pred cceeeeeeEEcCchHH---HHHHHHcCCeEEEEeeccccccccccccc------cCCCCCccceEEEEeeecccC--Ccc
Q psy1911 61 LSHYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQ------HNFGDSIGLHAVRVLGWGVEN--DIP 129 (342)
Q Consensus 61 ~~~~i~~~~~v~~~~~---~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~------~~~~~~~~~Hav~ivGyg~~~--g~~ 129 (342)
...++..|..+...+. +++|++.|||+++|.++++|+.|.+|++. ..++...++|||+|||||++. +.+
T Consensus 110 ~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~ 189 (223)
T cd02619 110 AKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKG 189 (223)
T ss_pred cceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCCCCCC
Confidence 3477888888887532 77899999999999999999999999862 223445679999999999886 789
Q ss_pred EEEEecCCCCCCCCCCceeeecCC
Q psy1911 130 YWLVANSWNDHWGDHGTFKILRGE 153 (342)
Q Consensus 130 ywiikNSWG~~WG~~Gy~~i~~~~ 153 (342)
||||+||||+.||++||+||.++.
T Consensus 190 ~~i~~NSwG~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 190 AFIVKNSWGTDWGDNGYGRISYED 213 (223)
T ss_pred EEEEEeCCCCccccCCEEEEehhh
Confidence 999999999999999999999864
No 29
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.80 E-value=1e-19 Score=192.21 Aligned_cols=85 Identities=31% Similarity=0.618 Sum_probs=71.1
Q ss_pred HHHHHHcCCeEEEEeecccccccc-cccccc-CCCCCccceEEEEeeeccc-----CCccEEEEecCCCCCCCCCCceee
Q psy1911 77 MRQIYEHGPLVAIFSVYADFLQYK-SGVYQH-NFGDSIGLHAVRVLGWGVE-----NDIPYWLVANSWNDHWGDHGTFKI 149 (342)
Q Consensus 77 ~~~i~~~GPV~v~i~v~~~f~~y~-~Gi~~~-~~~~~~~~Hav~ivGyg~~-----~g~~ywiikNSWG~~WG~~Gy~~i 149 (342)
+++|+.+|||+|+|++. +|+.|. +|||.. .|+...++|||+|||||.. ++.+|||||||||+.||++|||||
T Consensus 684 K~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI 762 (1004)
T PTZ00462 684 KDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKV 762 (1004)
T ss_pred HHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEE
Confidence 77899999999999985 688885 898654 4655567999999999964 256999999999999999999999
Q ss_pred ec-CCcchhhcccc
Q psy1911 150 LR-GENEADIEMGF 162 (342)
Q Consensus 150 ~~-~~~~~gi~~~~ 162 (342)
.| +.+.||+....
T Consensus 763 ~r~g~n~CGin~i~ 776 (1004)
T PTZ00462 763 DMYGPSHCEDNFIH 776 (1004)
T ss_pred EeCCCCCCccchhe
Confidence 98 67999985543
No 30
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.78 E-value=3.6e-19 Score=161.03 Aligned_cols=89 Identities=28% Similarity=0.449 Sum_probs=78.3
Q ss_pred ccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCC-----CCCCCCchhhHH
Q psy1911 190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-----WGCNGGWPQLAW 264 (342)
Q Consensus 190 sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~-----~GC~GG~~~~a~ 264 (342)
.||||+. + ++||+||+.||+|||||++++||++++++.+....+.||+|+|++|.... .||.||.+..++
T Consensus 1 ~~d~r~~----~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~ 75 (223)
T cd02619 1 SVDLRPL----R-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSAL 75 (223)
T ss_pred CCcchhc----C-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHH
Confidence 4899998 4 89999999999999999999999999998764456899999999998653 489999999999
Q ss_pred H-HhhhcccccCCCCCCCCCcccCCCC
Q psy1911 265 R-FWGHNGVVTGGDYNSQEGCQPYTLA 290 (342)
Q Consensus 265 ~-y~~~~Gi~~e~~Y~~~~~C~PY~~~ 290 (342)
. ++..+||+++.+| ||...
T Consensus 76 ~~~~~~~Gi~~e~~~-------Py~~~ 95 (223)
T cd02619 76 LKLVALKGIPPEEDY-------PYGAE 95 (223)
T ss_pred HHHHHHcCCCccccC-------CCCCC
Confidence 8 8899999997777 88753
No 31
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.75 E-value=1.3e-18 Score=153.87 Aligned_cols=75 Identities=44% Similarity=0.866 Sum_probs=62.6
Q ss_pred EEEEeeccccccccccccccC-CCCCccceEEEEeeeccc-CCccEEEEecCCCCCCCCCCceeeecCC-cchhhcccc
Q psy1911 87 VAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVE-NDIPYWLVANSWNDHWGDHGTFKILRGE-NEADIEMGF 162 (342)
Q Consensus 87 ~v~i~v~~~f~~y~~Gi~~~~-~~~~~~~Hav~ivGyg~~-~g~~ywiikNSWG~~WG~~Gy~~i~~~~-~~~gi~~~~ 162 (342)
++.+.+. +|++|.+|||..+ |.....+|+|+|||||.+ ++.+||||+||||+.||++|||||.++. +.|+++...
T Consensus 93 ~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~ 170 (174)
T smart00645 93 SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASV 170 (174)
T ss_pred EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeee
Confidence 4444444 6999999999874 544447999999999986 7889999999999999999999999987 899985443
No 32
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.69 E-value=3.3e-17 Score=173.41 Aligned_cols=84 Identities=25% Similarity=0.409 Sum_probs=71.8
Q ss_pred CCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC--CCCCCCCc-hhhHHHHhhhcc-c
Q psy1911 197 WPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN--CWGCNGGW-PQLAWRFWGHNG-V 272 (342)
Q Consensus 197 ~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~--~~GC~GG~-~~~a~~y~~~~G-i 272 (342)
+++|..+.||+|||.||||||||++++||++++|+++ ..+.||+|+||||+.. ..||.||+ +..++.|++++| |
T Consensus 538 ~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg--~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgL 615 (1004)
T PTZ00462 538 ENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGY--EPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFL 615 (1004)
T ss_pred CCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcC--CCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCC
Confidence 5899888999999999999999999999999999977 4678999999999864 35999997 555669998885 7
Q ss_pred ccCCCCCCCCCcccCCC
Q psy1911 273 VTGGDYNSQEGCQPYTL 289 (342)
Q Consensus 273 ~~e~~Y~~~~~C~PY~~ 289 (342)
+++.+| ||..
T Consensus 616 ptESdY-------PYt~ 625 (1004)
T PTZ00462 616 PADSNY-------LYNY 625 (1004)
T ss_pred cccccC-------CCcc
Confidence 776666 8864
No 33
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.55 E-value=8.4e-15 Score=146.00 Aligned_cols=85 Identities=26% Similarity=0.390 Sum_probs=70.3
Q ss_pred eeEEcCchHHH----HHHHHcCCeEEEEeeccccccccccccccC---------------------CCCCccceEEEEee
Q psy1911 67 KAHMVPRCNAM----RQIYEHGPLVAIFSVYADFLQYKSGVYQHN---------------------FGDSIGLHAVRVLG 121 (342)
Q Consensus 67 ~~~~v~~~~~~----~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~---------------------~~~~~~~Hav~ivG 121 (342)
.|++|+.++.+ ++|...+||.++++|. .|+.|.+||+... ++....+|||+|||
T Consensus 288 ~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivG 366 (437)
T cd00585 288 LYLNVPMDVLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTG 366 (437)
T ss_pred eEEecCHHHHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEE
Confidence 68999888773 4667777999999996 4789999999543 23445689999999
Q ss_pred ecccC-Cc-cEEEEecCCCCCCCCCCceeeecC
Q psy1911 122 WGVEN-DI-PYWLVANSWNDHWGDHGTFKILRG 152 (342)
Q Consensus 122 yg~~~-g~-~ywiikNSWG~~WG~~Gy~~i~~~ 152 (342)
||.+. |. .||+|+||||+.||++||++|+.+
T Consensus 367 v~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 367 VDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred EEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence 99754 65 699999999999999999999974
No 34
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.39 E-value=8e-13 Score=126.63 Aligned_cols=99 Identities=19% Similarity=0.297 Sum_probs=70.1
Q ss_pred cCCCCCcccceeeeeeEEcCchHH-------HHHHHHcCCeEEE--EeeccccccccccccccCCCCCccceEEEEeeec
Q psy1911 53 LYLPTSIPLSHYFKKAHMVPRCNA-------MRQIYEHGPLVAI--FSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG 123 (342)
Q Consensus 53 ~~~p~s~~~~~~i~~~~~v~~~~~-------~~~i~~~GPV~v~--i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg 123 (342)
...|+..++..+.+....++.+.. .+++...|-++.. |+++..+. ..-+.|..... ...+|||+||||+
T Consensus 197 ~~~~~~~p~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~-~~~~~~~~~s~-~~~gHAv~iVGyD 274 (372)
T COG4870 197 YFSPTNLPVTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLG-ICIPYPYVDSG-ENWGHAVLIVGYD 274 (372)
T ss_pred ccCCcCCchhhccccceecccchhhhcccchHHHHhhhccccceeEEecccccc-cccCCCCCCcc-ccccceEEEEecc
Confidence 345566677777777777776654 4667778866543 44444333 33344433333 4569999999999
Q ss_pred cc----------CCccEEEEecCCCCCCCCCCceeeecCC
Q psy1911 124 VE----------NDIPYWLVANSWNDHWGDHGTFKILRGE 153 (342)
Q Consensus 124 ~~----------~g~~ywiikNSWG~~WG~~Gy~~i~~~~ 153 (342)
+. .|.+.||||||||+.||++|||||++..
T Consensus 275 Ds~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y 314 (372)
T COG4870 275 DSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY 314 (372)
T ss_pred ccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence 75 3567999999999999999999999864
No 35
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.80 E-value=1.5e-08 Score=101.45 Aligned_cols=84 Identities=27% Similarity=0.425 Sum_probs=59.6
Q ss_pred eeEEcCchHH---HHHHHHcC-CeEEEEeeccccccccccccccCC---------------------CCCccceEEEEee
Q psy1911 67 KAHMVPRCNA---MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNF---------------------GDSIGLHAVRVLG 121 (342)
Q Consensus 67 ~~~~v~~~~~---~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~---------------------~~~~~~Hav~ivG 121 (342)
.|++||.++. +.+.+..| ||..+-+|.. +.....||..... .....+|||+|+|
T Consensus 289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k-~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itG 367 (438)
T PF03051_consen 289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGK-FFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITG 367 (438)
T ss_dssp EEEE--HHHHHHHHHHHHHTT--EEEEEETTT-TEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEE
T ss_pred eEeccCHHHHHHHHHHHHHcCCcEEEeccCCc-cccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEE
Confidence 5899999887 34455566 9999999976 4455677763321 1244689999999
Q ss_pred ecc-cCCc-cEEEEecCCCCCCCCCCceeeec
Q psy1911 122 WGV-ENDI-PYWLVANSWNDHWGDHGTFKILR 151 (342)
Q Consensus 122 yg~-~~g~-~ywiikNSWG~~WG~~Gy~~i~~ 151 (342)
.+. ++|. .+|.|+||||+..|.+||+.++.
T Consensus 368 v~~D~~g~p~~wkVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 368 VDLDEDGKPVRWKVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp EEE-TTSSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred EEeccCCCeeEEEEEcCCCCCCCCCcEEEECH
Confidence 996 5565 59999999999999999999985
No 36
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.49 E-value=2.1e-08 Score=96.52 Aligned_cols=86 Identities=24% Similarity=0.387 Sum_probs=62.1
Q ss_pred CCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCC-CC-CCC-----CC
Q psy1911 185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGC-----NG 257 (342)
Q Consensus 185 ~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~-~~-~GC-----~G 257 (342)
..||.-||||+. +.|+||+||+.||+||||+++++||+.+..... ..+|+-.++.=.. .. .+| +|
T Consensus 97 ~s~~~~fd~r~~----g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~~----w~~s~~nm~~ll~~~ye~~fd~~~~d~ 168 (372)
T COG4870 97 ASLPSYFDRRDE----GKVSPVKDQGSGGSCWAFATTRSLESYLNPESA----WDFSENNMKNLLGVPYEKGFDYTSNDG 168 (372)
T ss_pred ccchhheeeecc----CCcccccccCcccceEeeeehhhhhheeccccc----ccccccchhhhcCCCccccCCCccccC
Confidence 458999999999 889999999999999999999999988876542 3445444333111 11 144 37
Q ss_pred CchhhHHHHhhhc-ccccCCCC
Q psy1911 258 GWPQLAWRFWGHN-GVVTGGDY 278 (342)
Q Consensus 258 G~~~~a~~y~~~~-Gi~~e~~Y 278 (342)
|....+..|+.+. |.+.+.+-
T Consensus 169 g~~~m~~a~l~e~sgpv~et~d 190 (372)
T COG4870 169 GNADMSAAYLTEWSGPVYETDD 190 (372)
T ss_pred CccccccccccccCCcchhhcC
Confidence 7777777787665 77765554
No 37
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=97.08 E-value=0.0011 Score=66.95 Aligned_cols=75 Identities=15% Similarity=0.189 Sum_probs=59.8
Q ss_pred cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHh----------------hCC------------CCCCC
Q psy1911 204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA----------------CTP------------NCWGC 255 (342)
Q Consensus 204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lld----------------C~~------------~~~GC 255 (342)
.+|.||+.-|-||.||++.+|+..+...-+. ..+.||+.+|+- ... ...-.
T Consensus 55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~-~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~ 133 (437)
T cd00585 55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNL-KEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQN 133 (437)
T ss_pred CCcccCCCCchhHHHHCHHHHHHHHHHHcCC-CCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcC
Confidence 5899999999999999999999988875554 478899855544 320 12257
Q ss_pred CCCchhhHHHHhhhcccccCCCCC
Q psy1911 256 NGGWPQLAWRFWGHNGVVTGGDYN 279 (342)
Q Consensus 256 ~GG~~~~a~~y~~~~Gi~~e~~Y~ 279 (342)
+||.-..+...+.+.|||+...||
T Consensus 134 DGGqw~m~~~li~KYGvVPk~~~p 157 (437)
T cd00585 134 DGGQWDMLVNLIEKYGLVPKSVMP 157 (437)
T ss_pred CCCchHHHHHHHHHcCCCcccccC
Confidence 899999999999999999977663
No 38
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=96.89 E-value=0.00086 Score=67.60 Aligned_cols=74 Identities=16% Similarity=0.223 Sum_probs=46.3
Q ss_pred cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHH----------------hhCCC------------CCCC
Q psy1911 204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV----------------ACTPN------------CWGC 255 (342)
Q Consensus 204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~ll----------------dC~~~------------~~GC 255 (342)
.+|.||..-|-||.||++.+|+..+....+. ....||+-+|. +.... ..-.
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l-~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~ 134 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNL-KDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVS 134 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT--SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCC-CceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCC
Confidence 5899999999999999999999988877653 46889986654 33210 1246
Q ss_pred CCCchhhHHHHhhhcccccCCCC
Q psy1911 256 NGGWPQLAWRFWGHNGVVTGGDY 278 (342)
Q Consensus 256 ~GG~~~~a~~y~~~~Gi~~e~~Y 278 (342)
+||.-..+..-+.++|||+...|
T Consensus 135 DGGqw~~~~nli~KYGvVPk~~m 157 (438)
T PF03051_consen 135 DGGQWDMVVNLIKKYGVVPKSVM 157 (438)
T ss_dssp S-B-HHHHHHHHHHH---BGGGS
T ss_pred CCCchHHHHHHHHHcCcCcHhhC
Confidence 89999999999999999997777
No 39
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=96.76 E-value=0.00093 Score=64.21 Aligned_cols=70 Identities=24% Similarity=0.425 Sum_probs=49.0
Q ss_pred HHcC-CeEEEEeeccccccccccccccC---------------------CCCCccceEEEEeeeccc-CCc-cEEEEecC
Q psy1911 81 YEHG-PLVAIFSVYADFLQYKSGVYQHN---------------------FGDSIGLHAVRVLGWGVE-NDI-PYWLVANS 136 (342)
Q Consensus 81 ~~~G-PV~v~i~v~~~f~~y~~Gi~~~~---------------------~~~~~~~Hav~ivGyg~~-~g~-~ywiikNS 136 (342)
++.| ||-.+-+|.. +..-+.||+... .+.....|||+|.|.+.+ +|. --|-|.||
T Consensus 308 ~qagetVwFG~dvgq-~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~p~rwkVENS 386 (444)
T COG3579 308 MQAGETVWFGCDVGQ-LSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGNPLRWKVENS 386 (444)
T ss_pred HhcCCcEEeecCchh-hcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCCceeeEeecc
Confidence 4455 7877777643 555566655211 012346899999999954 333 56999999
Q ss_pred CCCCCCCCCceeeec
Q psy1911 137 WNDHWGDHGTFKILR 151 (342)
Q Consensus 137 WG~~WG~~Gy~~i~~ 151 (342)
||..-|.+|||-++-
T Consensus 387 WG~d~G~~GyfvaSd 401 (444)
T COG3579 387 WGKDVGKKGYFVASD 401 (444)
T ss_pred cccccCCCceEeehH
Confidence 999999999998773
No 40
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=84.03 E-value=1.5 Score=35.76 Aligned_cols=52 Identities=21% Similarity=0.431 Sum_probs=26.5
Q ss_pred HHHHHHcC-CeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCC
Q psy1911 77 MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW 137 (342)
Q Consensus 77 ~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSW 137 (342)
+...++.| ||.+.+...-.- .....+. .....|.|+|+||+.+. +++|..+|
T Consensus 92 i~~~i~~G~Pvi~~~~~~~~~--~~~~~~~----~~~~~H~vvi~Gy~~~~---~~~v~DP~ 144 (144)
T PF13529_consen 92 IKQEIDAGRPVIVSVNSGWRP--PNGDGYD----GTYGGHYVVIIGYDEDG---YVYVNDPW 144 (144)
T ss_dssp HHHHHHTT--EEEEEETTSS----TTEEEE----E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred HHHHHHCCCcEEEEEEccccc--CCCCCcC----CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence 44444445 999988632100 0111111 12358999999998753 67777766
No 41
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=82.53 E-value=1.9 Score=41.99 Aligned_cols=74 Identities=19% Similarity=0.331 Sum_probs=50.2
Q ss_pred cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHH----------------HHHhhCC------------CCCCC
Q psy1911 204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ----------------HIVACTP------------NCWGC 255 (342)
Q Consensus 204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q----------------~lldC~~------------~~~GC 255 (342)
.+|.||..-|-||.|||+..+-..+.-.-+ ...+.||.. |||.-.. ...--
T Consensus 58 d~vtNQk~SGRCWmFAAlNtfRhk~~~el~-le~fElSQaytfFwDKlEKaN~FleqIi~tadq~ldsRlv~~LL~~Pqq 136 (444)
T COG3579 58 DKVTNQKQSGRCWMFAALNTFRHKLISELK-LEDFELSQAYTFFWDKLEKANWFLEQIIETADQELDSRLVSFLLATPQQ 136 (444)
T ss_pred CccccccccceehHHHHHHHHHHHHHHhcC-cceeehhhHHHHHHHHHHHhhHHHHHHHhhcccchHHHHHHHHHcCccc
Confidence 478999999999999999988554443322 245667752 3333111 01135
Q ss_pred CCCchhhHHHHhhhcccccCCCC
Q psy1911 256 NGGWPQLAWRFWGHNGVVTGGDY 278 (342)
Q Consensus 256 ~GG~~~~a~~y~~~~Gi~~e~~Y 278 (342)
+||--.....-+.+.|||+-+.|
T Consensus 137 DGGQwdM~v~l~eKYGvVpK~~y 159 (444)
T COG3579 137 DGGQWDMFVSLFEKYGVVPKSVY 159 (444)
T ss_pred cCchHHHHHHHHHHhCCCchhhc
Confidence 78888888888899999986666
No 42
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=80.78 E-value=3.5 Score=39.25 Aligned_cols=52 Identities=19% Similarity=0.520 Sum_probs=34.0
Q ss_pred HHHHHHcC-CeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEec
Q psy1911 77 MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN 135 (342)
Q Consensus 77 ~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikN 135 (342)
+++.+..| ||.+.++++ ++.|...-| ......|.|+|+||++++ ..|.++.+
T Consensus 81 l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~~-~~~~v~D~ 133 (317)
T PF14399_consen 81 LKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEEE-DVFYVSDP 133 (317)
T ss_pred HHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence 66677777 999998764 444443322 122358999999999653 34666644
No 43
>PF10500 SR-25: Nuclear RNA-splicing-associated protein; InterPro: IPR019532 SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissue types. At the N terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing []. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1; also see IPR019093 from INTERPRO) signalling pathway [].
Probab=79.83 E-value=2.3 Score=38.91 Aligned_cols=6 Identities=50% Similarity=1.210 Sum_probs=3.3
Q ss_pred HcCCeE
Q psy1911 82 EHGPLV 87 (342)
Q Consensus 82 ~~GPV~ 87 (342)
+.|||.
T Consensus 141 ~~~Pv~ 146 (225)
T PF10500_consen 141 EDGPVL 146 (225)
T ss_pred ccCCcc
Confidence 456754
No 44
>PF10500 SR-25: Nuclear RNA-splicing-associated protein; InterPro: IPR019532 SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissue types. At the N terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing []. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1; also see IPR019093 from INTERPRO) signalling pathway [].
Probab=66.82 E-value=7.5 Score=35.69 Aligned_cols=11 Identities=18% Similarity=0.359 Sum_probs=6.2
Q ss_pred HHHHHHcCCeE
Q psy1911 77 MRQIYEHGPLV 87 (342)
Q Consensus 77 ~~~i~~~GPV~ 87 (342)
+..|-..+|++
T Consensus 151 ksr~~am~PmT 161 (225)
T PF10500_consen 151 KSRIQAMAPMT 161 (225)
T ss_pred hhhhhhcCCCC
Confidence 44455667763
No 45
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=65.85 E-value=24 Score=31.30 Aligned_cols=37 Identities=19% Similarity=0.267 Sum_probs=23.0
Q ss_pred CccceEEEEeeecc-cCCccEEEEecCCCCCCCCCCceeeecC
Q psy1911 111 SIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKILRG 152 (342)
Q Consensus 111 ~~~~Hav~ivGyg~-~~g~~ywiikNSWG~~WG~~Gy~~i~~~ 152 (342)
...+||++||||-. .+|.++.+|=|-|- ++++-++..
T Consensus 117 ~~~gHAlavvGya~~~~g~~~y~~WNPW~-----~~~~~~sa~ 154 (175)
T PF05543_consen 117 PHAGHALAVVGYAKPNNGQKTYYFWNPWW-----NDVMIQSAK 154 (175)
T ss_dssp B--EEEEEEEEEEEETTSEEEEEEE-TT------SS-EEEETT
T ss_pred CccceeEEEEeeeecCCCCeEEEEeCCcc-----CCcEEEecC
Confidence 34689999999986 45678888867665 345555543
No 46
>KOG4364|consensus
Probab=65.84 E-value=11 Score=39.94 Aligned_cols=14 Identities=14% Similarity=0.266 Sum_probs=6.8
Q ss_pred HHHHHcCCeEEEEe
Q psy1911 78 RQIYEHGPLVAIFS 91 (342)
Q Consensus 78 ~~i~~~GPV~v~i~ 91 (342)
..|...+|+...-+
T Consensus 427 d~Lf~r~pls~~~d 440 (811)
T KOG4364|consen 427 DELFPRLPLSTNSD 440 (811)
T ss_pred hhhcccCCCccccc
Confidence 34555556554433
No 47
>KOG4128|consensus
Probab=62.27 E-value=1.2 Score=43.35 Aligned_cols=39 Identities=26% Similarity=0.419 Sum_probs=30.8
Q ss_pred ccceEEEEeeecc----cCCccEEEEecCCCCCCCCCCceeee
Q psy1911 112 IGLHAVRVLGWGV----ENDIPYWLVANSWNDHWGDHGTFKIL 150 (342)
Q Consensus 112 ~~~Hav~ivGyg~----~~g~~ywiikNSWG~~WG~~Gy~~i~ 150 (342)
...||++|.|-|. +++-.-|-|.||||.+-|..|+..+.
T Consensus 370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt 412 (457)
T KOG4128|consen 370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMT 412 (457)
T ss_pred HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhh
Confidence 3589999999882 23445699999999999999995544
No 48
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.94 E-value=24 Score=31.56 Aligned_cols=50 Identities=14% Similarity=0.209 Sum_probs=30.6
Q ss_pred EEcCchHH---HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCC
Q psy1911 69 HMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN 138 (342)
Q Consensus 69 ~~v~~~~~---~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG 138 (342)
.+++..+. +..|.+..||.+-.-. |-. -.-|+|+|+||++. ++..-+.||
T Consensus 116 ~d~tGksl~~ik~ql~kg~PV~iw~T~---~~~-------------~s~H~v~itgyDk~----n~yynDpyG 168 (195)
T COG4990 116 VDLTGKSLSDIKGQLLKGRPVVIWVTN---FHS-------------YSIHSVLITGYDKY----NIYYNDPYG 168 (195)
T ss_pred ccCcCCcHHHHHHHHhcCCcEEEEEec---ccc-------------cceeeeEeeccccc----ceEeccccc
Confidence 34444443 4455555598776533 222 12599999999976 566667775
No 49
>KOG4364|consensus
Probab=50.09 E-value=28 Score=36.98 Aligned_cols=10 Identities=10% Similarity=-0.144 Sum_probs=5.4
Q ss_pred ceEEEEeeec
Q psy1911 114 LHAVRVLGWG 123 (342)
Q Consensus 114 ~Hav~ivGyg 123 (342)
+-++++|--+
T Consensus 463 ~sa~kpve~e 472 (811)
T KOG4364|consen 463 GSACKPVESE 472 (811)
T ss_pred cccceeeccC
Confidence 4555565554
No 50
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=48.65 E-value=49 Score=26.94 Aligned_cols=22 Identities=23% Similarity=0.249 Sum_probs=14.8
Q ss_pred cceEEEEeeecccCCccEEEEecCC
Q psy1911 113 GLHAVRVLGWGVENDIPYWLVANSW 137 (342)
Q Consensus 113 ~~Hav~ivGyg~~~g~~ywiikNSW 137 (342)
.+|.|+|+||+.+ ...+|.+.|
T Consensus 93 ~gH~vVv~g~~~~---~~~~i~DP~ 114 (141)
T cd02549 93 SGHAMVVIGYDRK---GNVYVNDPG 114 (141)
T ss_pred CCeEEEEEEEcCC---CCEEEECCC
Confidence 4799999999811 234555655
No 51
>PF01754 zf-A20: A20-like zinc finger; InterPro: IPR002653 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the zinc finger domain found in A20. A20 is an inhibitor of cell death that inhibits NF-kappaB activation via the tumour necrosis factor receptor associated factor pathway []. The zinc finger domains appear to mediate self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding, 0008270 zinc ion binding; PDB: 2FIF_F 2FID_B 2C7N_C 2C7M_A 2L00_A 2KZY_A 2EQG_A 2EQE_A 3OJ3_J 3OJ4_C ....
Probab=47.16 E-value=14 Score=22.14 Aligned_cols=19 Identities=21% Similarity=0.522 Sum_probs=15.5
Q ss_pred CCCCCCcccccccccchHH
Q psy1911 198 PECPSLRHIADQSNCGSCW 216 (342)
Q Consensus 198 ~~c~~i~~v~dQg~CGsCw 216 (342)
++|++.+.+..++.|.-||
T Consensus 6 ~gCgf~Gs~~~~~~Cs~C~ 24 (25)
T PF01754_consen 6 NGCGFYGSPATNGLCSKCY 24 (25)
T ss_dssp TTSSSB-BGGGTTS-HHHH
T ss_pred CCCCCcccccccCcchhhc
Confidence 6899999999999999997
No 52
>KOG3925|consensus
Probab=42.51 E-value=40 Score=33.20 Aligned_cols=14 Identities=29% Similarity=0.214 Sum_probs=9.8
Q ss_pred CCCCCchhhHHHHh
Q psy1911 254 GCNGGWPQLAWRFW 267 (342)
Q Consensus 254 GC~GG~~~~a~~y~ 267 (342)
=|.||+...++.+-
T Consensus 306 lvfg~l~~l~~a~e 319 (371)
T KOG3925|consen 306 LVFGGLAGLVRAFE 319 (371)
T ss_pred EEeechhhHHHHHH
Confidence 47788877766663
No 53
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=34.85 E-value=1.3e+02 Score=27.67 Aligned_cols=14 Identities=21% Similarity=0.275 Sum_probs=11.9
Q ss_pred cceEEEEeeecccC
Q psy1911 113 GLHAVRVLGWGVEN 126 (342)
Q Consensus 113 ~~Hav~ivGyg~~~ 126 (342)
.+|=|+|+||+...
T Consensus 160 ~GHYVVlcGyd~~~ 173 (212)
T PF09778_consen 160 QGHYVVLCGYDAAT 173 (212)
T ss_pred cEEEEEEEeecCCC
Confidence 48999999998754
No 54
>smart00259 ZnF_A20 A20-like zinc fingers. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappaB activation.
Probab=32.85 E-value=24 Score=21.32 Aligned_cols=19 Identities=26% Similarity=0.739 Sum_probs=17.0
Q ss_pred CCCCCCcccccccccchHH
Q psy1911 198 PECPSLRHIADQSNCGSCW 216 (342)
Q Consensus 198 ~~c~~i~~v~dQg~CGsCw 216 (342)
++|++.+.+..|+.|.-||
T Consensus 7 ~~CgF~G~~~t~~~CskCy 25 (26)
T smart00259 7 PGCGFFGNPATEGLCSKCF 25 (26)
T ss_pred CCCCCcCChhhcccCHhhc
Confidence 5799999999999999887
No 55
>PF15246 NCKAP5: Nck-associated protein 5, Peripheral clock protein
Probab=31.16 E-value=50 Score=31.95 Aligned_cols=36 Identities=8% Similarity=0.061 Sum_probs=18.1
Q ss_pred hcccccCCCCCcccceeeeeeEEcCchHHHHHHHHc
Q psy1911 48 KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEH 83 (342)
Q Consensus 48 ~~kk~~~~p~s~~~~~~i~~~~~v~~~~~~~~i~~~ 83 (342)
++-..|+..........-..|..+..+..|++|+++
T Consensus 125 ~~~~~~~~qm~~~~~q~~~~~~~~~kD~FM~eLLnR 160 (318)
T PF15246_consen 125 KRPQDCYNQMGQEQRQLPSMYQCSGKDTFMQELLNR 160 (318)
T ss_pred cCCCccccccccccccCcCcccccchhHHHHHHHHh
Confidence 344566666533232222223334444448888875
No 56
>KOG4128|consensus
Probab=31.11 E-value=21 Score=35.02 Aligned_cols=76 Identities=20% Similarity=0.215 Sum_probs=50.1
Q ss_pred CcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHh--------------------hCCCCC---------
Q psy1911 203 LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA--------------------CTPNCW--------- 253 (342)
Q Consensus 203 i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lld--------------------C~~~~~--------- 253 (342)
-.||.+|..-|-||.|+++..|---+...-+- ....||..+|+- |-...+
T Consensus 62 ~~pvtnqkssGrcWift~ln~lrl~~~~kLnl-~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~n 140 (457)
T KOG4128|consen 62 RQPVTNQKSSGRCWIFTGLNLLRLEMDRKLNL-PEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKN 140 (457)
T ss_pred CcccccCcCCCceEEEechhHHHHHHHhcCCc-chhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhC
Confidence 37899999999999999998874333322221 245677655432 111111
Q ss_pred -CCCCCchhhHHHHhhhcccccCCCCC
Q psy1911 254 -GCNGGWPQLAWRFWGHNGVVTGGDYN 279 (342)
Q Consensus 254 -GC~GG~~~~a~~y~~~~Gi~~e~~Y~ 279 (342)
-=+||.-..-..-+++.|+++-.-|+
T Consensus 141 P~~DGGqw~MfvNlVkKYGviPKkcy~ 167 (457)
T KOG4128|consen 141 PVPDGGQWQMFVNLVKKYGVIPKKCYL 167 (457)
T ss_pred CCCCCchHHHHHHHHHHhCCCcHHhcc
Confidence 24688877777888888998866663
No 57
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=24.97 E-value=75 Score=30.48 Aligned_cols=27 Identities=22% Similarity=0.231 Sum_probs=23.2
Q ss_pred cceEEEEeeecccC--CccEEEEecCCCC
Q psy1911 113 GLHAVRVLGWGVEN--DIPYWLVANSWND 139 (342)
Q Consensus 113 ~~Hav~ivGyg~~~--g~~ywiikNSWG~ 139 (342)
.+|+-.|++...-+ +.....++|-||.
T Consensus 235 ~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 235 KGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred cCcceEEeEEEEEccCceEEEEecCCccC
Confidence 58999999998655 7788999999995
No 58
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=24.61 E-value=1.6e+02 Score=25.89 Aligned_cols=34 Identities=29% Similarity=0.364 Sum_probs=23.8
Q ss_pred HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeeccc
Q psy1911 77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE 125 (342)
Q Consensus 77 ~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~ 125 (342)
...|.++|||-+++.. + .+....|+++|+|...+
T Consensus 102 ~~LL~~yGPLwv~~~~--------------P-~~~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 102 ANLLREYGPLWVAWEA--------------P-GDSWVAHASVITGIDGD 135 (166)
T ss_pred HHHHHHcCCeEEEecC--------------C-CCcceeeEEEEEeecCC
Confidence 4568889999998543 2 22223699999998755
No 59
>KOG1144|consensus
Probab=23.41 E-value=1.4e+02 Score=32.71 Aligned_cols=10 Identities=20% Similarity=0.381 Sum_probs=6.3
Q ss_pred HHHHHHcCCe
Q psy1911 77 MRQIYEHGPL 86 (342)
Q Consensus 77 ~~~i~~~GPV 86 (342)
+++|+..||+
T Consensus 298 l~~ll~sg~~ 307 (1064)
T KOG1144|consen 298 LKQLLASGGG 307 (1064)
T ss_pred HHHHHhcCCC
Confidence 5667777743
No 60
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=22.42 E-value=78 Score=32.94 Aligned_cols=9 Identities=22% Similarity=0.663 Sum_probs=4.8
Q ss_pred HHHHHHcCC
Q psy1911 77 MRQIYEHGP 85 (342)
Q Consensus 77 ~~~i~~~GP 85 (342)
++-|.+.+|
T Consensus 480 l~pL~~~a~ 488 (517)
T PF12569_consen 480 LKPLLELAP 488 (517)
T ss_pred HHHHHHhCc
Confidence 344555555
No 61
>KOG3925|consensus
Probab=21.44 E-value=1.4e+02 Score=29.46 Aligned_cols=8 Identities=25% Similarity=0.505 Sum_probs=3.5
Q ss_pred CeEEEEee
Q psy1911 85 PLVAIFSV 92 (342)
Q Consensus 85 PV~v~i~v 92 (342)
+++..|.|
T Consensus 97 R~aslf~V 104 (371)
T KOG3925|consen 97 RAASLFRV 104 (371)
T ss_pred hhheeecc
Confidence 34444444
Done!