Query psy15346
Match_columns 280
No_of_seqs 200 out of 1326
Neff 6.1
Searched_HMMs 46136
Date Fri Aug 16 18:34:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15346.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15346hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd02620 Peptidase_C1A_Cathepsi 100.0 3E-38 6.6E-43 284.9 17.4 168 1-240 68-235 (236)
2 cd02698 Peptidase_C1A_Cathepsi 100.0 1.2E-37 2.6E-42 281.4 17.6 162 1-244 70-239 (239)
3 cd02621 Peptidase_C1A_Cathepsi 100.0 2.3E-37 4.9E-42 279.8 15.8 165 1-243 72-242 (243)
4 PTZ00049 cathepsin C-like prot 100.0 1.5E-36 3.2E-41 305.9 18.0 189 1-249 456-682 (693)
5 KOG1543|consensus 100.0 1.3E-35 2.7E-40 280.1 15.3 151 1-244 173-324 (325)
6 PTZ00364 dipeptidyl-peptidase 100.0 2.9E-35 6.2E-40 292.6 17.3 189 1-258 278-474 (548)
7 cd02248 Peptidase_C1A Peptidas 100.0 8E-35 1.7E-39 254.9 16.1 147 1-240 62-209 (210)
8 KOG1542|consensus 100.0 5.4E-35 1.2E-39 272.4 12.7 148 1-242 218-370 (372)
9 PTZ00203 cathepsin L protease; 100.0 2.8E-34 6E-39 273.2 17.5 151 1-243 187-340 (348)
10 PF00112 Peptidase_C1: Papain 100.0 4E-33 8.8E-38 243.6 13.6 150 1-242 65-219 (219)
11 PTZ00200 cysteine proteinase; 100.0 5.1E-33 1.1E-37 271.9 15.5 145 1-243 296-445 (448)
12 PTZ00021 falcipain-2; Provisio 100.0 1.9E-32 4.1E-37 269.5 15.3 147 1-243 327-488 (489)
13 KOG1544|consensus 100.0 4.5E-33 9.8E-38 257.7 8.6 179 1-245 275-462 (470)
14 PTZ00462 Serine-repeat antigen 99.9 6.3E-27 1.4E-31 242.5 16.8 103 91-251 680-789 (1004)
15 cd02619 Peptidase_C1 C1 Peptid 99.9 9.2E-27 2E-31 203.6 14.9 141 1-229 65-213 (223)
16 smart00645 Pept_C1 Papain fami 99.9 1.5E-26 3.2E-31 200.0 11.2 52 188-239 118-171 (174)
17 cd00585 Peptidase_C1B Peptidas 99.5 9.2E-14 2E-18 136.1 8.8 41 188-228 357-399 (437)
18 COG4870 Cysteine protease [Pos 99.4 6.9E-13 1.5E-17 125.7 7.1 42 188-229 263-314 (372)
19 PF03051 Peptidase_C1_2: Pepti 98.3 3.4E-06 7.4E-11 83.2 9.9 41 188-228 358-400 (438)
20 PTZ00203 cathepsin L protease; 95.7 0.0051 1.1E-07 59.2 2.0 29 157-185 288-316 (348)
21 cd02698 Peptidase_C1A_Cathepsi 95.7 0.005 1.1E-07 55.7 1.7 28 157-184 180-208 (239)
22 cd02621 Peptidase_C1A_Cathepsi 95.5 0.0081 1.8E-07 54.3 2.1 30 156-185 187-218 (243)
23 KOG1543|consensus 95.4 0.0083 1.8E-07 57.2 2.2 29 157-185 271-299 (325)
24 smart00645 Pept_C1 Papain fami 95.3 0.0085 1.9E-07 51.7 1.5 45 140-184 104-149 (174)
25 cd02620 Peptidase_C1A_Cathepsi 94.9 0.015 3.2E-07 52.6 2.0 28 157-184 186-213 (236)
26 COG3579 PepC Aminopeptidase C 94.8 0.0084 1.8E-07 57.4 0.2 40 188-227 360-401 (444)
27 KOG1542|consensus 94.5 0.023 5E-07 54.5 2.4 29 157-185 318-347 (372)
28 cd02248 Peptidase_C1A Peptidas 94.4 0.018 4E-07 50.0 1.5 28 157-184 160-187 (210)
29 PTZ00200 cysteine proteinase; 93.6 0.041 8.9E-07 54.8 2.3 28 157-184 388-417 (448)
30 PTZ00364 dipeptidyl-peptidase 93.4 0.043 9.2E-07 55.9 2.0 31 156-186 403-436 (548)
31 KOG1544|consensus 92.9 0.068 1.5E-06 51.1 2.3 84 78-186 346-437 (470)
32 PTZ00049 cathepsin C-like prot 92.5 0.073 1.6E-06 55.4 2.2 30 156-185 619-652 (693)
33 PF13529 Peptidase_C39_2: Pept 92.3 0.49 1.1E-05 37.5 6.4 24 87-110 85-108 (144)
34 PTZ00021 falcipain-2; Provisio 92.2 0.084 1.8E-06 53.1 2.2 28 157-184 422-459 (489)
35 PF00112 Peptidase_C1: Papain 92.0 0.056 1.2E-06 46.8 0.7 29 157-185 167-195 (219)
36 cd02619 Peptidase_C1 C1 Peptid 91.7 0.13 2.7E-06 44.6 2.5 29 157-185 173-203 (223)
37 PTZ00462 Serine-repeat antigen 90.5 0.15 3.3E-06 55.0 2.2 31 157-187 723-758 (1004)
38 PF05543 Peptidase_C47: Stapho 66.4 12 0.00026 32.9 4.9 26 188-213 118-144 (175)
39 KOG4128|consensus 64.4 0.83 1.8E-05 44.0 -2.8 54 188-245 370-427 (457)
40 PF14399 Transpep_BrtH: NlpC/p 64.1 16 0.00035 33.6 5.7 23 89-111 76-98 (317)
41 cd00585 Peptidase_C1B Peptidas 57.0 7.1 0.00015 39.0 2.1 73 84-182 289-387 (437)
42 COG4990 Uncharacterized protei 52.6 32 0.0007 30.5 5.2 21 190-214 148-168 (195)
43 PF09778 Guanylate_cyc_2: Guan 44.0 87 0.0019 28.4 6.8 21 90-110 112-132 (212)
44 cd00044 CysPc Calpains, domain 37.8 43 0.00093 31.4 4.0 28 188-215 234-263 (315)
45 PF12385 Peptidase_C70: Papain 34.7 71 0.0015 27.8 4.5 23 90-112 97-119 (166)
46 cd02549 Peptidase_C39A A sub-f 33.1 1.4E+02 0.0029 23.6 5.8 22 189-213 93-114 (141)
47 PF01357 Pollen_allerg_1: Poll 31.0 78 0.0017 23.9 3.8 43 170-218 10-52 (82)
No 1
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=3e-38 Score=284.89 Aligned_cols=168 Identities=39% Similarity=0.788 Sum_probs=130.8
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~ 80 (280)
||+||++..||+||+++|+++|.+| ||+...|..+... ...|.. ++.|...|.. .....+....++
T Consensus 68 gC~GG~~~~a~~~i~~~G~~~e~~y-------PY~~~~~~~~~~~--~~~~~~----~~~~~~~C~~-~~~~~~~~~~~~ 133 (236)
T cd02620 68 GCNGGYPDAAWKYLTTTGVVTGGCQ-------PYTIPPCGHHPEG--PPPCCG----TPYCTPKCQD-GCEKTYEEDKHK 133 (236)
T ss_pred CCCCCCHHHHHHHHHhcCCCcCCEe-------cCcCCCCccCCCC--CCCCCC----CCCCCCCCCc-CCccccceeeee
Confidence 7999999999999999999997666 9996544331111 122322 1233344541 111123445566
Q ss_pred eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346 81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK 160 (280)
Q Consensus 81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~ 160 (280)
+..++.+..++++||++|+++|||+++|.++++|+. |++|||+.....
T Consensus 134 ~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~-----------------------Y~~Giy~~~~~~--------- 181 (236)
T cd02620 134 GKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-----------------------YKSGVYQHTSGK--------- 181 (236)
T ss_pred ecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhh-----------------------cCCcEEeecCCC---------
Confidence 777787766789999999999999999999888999 999999865322
Q ss_pred eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeee
Q psy15346 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240 (280)
Q Consensus 161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~ 240 (280)
. .++|||+|||||++++++|||||||||++||++|||||+||.|.|||+++++.
T Consensus 182 ~--------------------------~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~ 235 (236)
T cd02620 182 Q--------------------------LGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA 235 (236)
T ss_pred C--------------------------cCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence 1 56899999999999999999999999999999999999999999999998875
No 2
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=1.2e-37 Score=281.44 Aligned_cols=162 Identities=22% Similarity=0.457 Sum_probs=129.2
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCccc--ccccCCCCCcccccce
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCH--TRCTNDNYGRGFFQDK 78 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~--~~c~~~~~~~~~~~~~ 78 (280)
||+||++..||+|++++|++++.+| ||.. ....|+... ...+|. ..|.... ....
T Consensus 70 gC~GG~~~~a~~~~~~~Gl~~e~~y-------PY~~----------~~~~C~~~~-~~~~c~~~~~c~~~~-----~~~~ 126 (239)
T cd02698 70 SCHGGDPGGVYEYAHKHGIPDETCN-------PYQA----------KDGECNPFN-RCGTCNPFGECFAIK-----NYTL 126 (239)
T ss_pred CccCcCHHHHHHHHHHcCcCCCCee-------CCcC----------CCCCCcCCC-CCCCcccCccccccc-----ccce
Confidence 7999999999999999999996666 9983 344564321 111221 1222100 1234
Q ss_pred eeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhe
Q psy15346 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT 158 (280)
Q Consensus 79 ~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~ 158 (280)
++++++..+. ++++||++|+++|||+++|.++.+|+. |++|||++..+.
T Consensus 127 ~~i~~~~~~~-~~~~i~~~l~~~GPV~v~i~~~~~f~~-----------------------Y~~GIy~~~~~~------- 175 (239)
T cd02698 127 YFVSDYGSVS-GRDKMMAEIYARGPISCGIMATEALEN-----------------------YTGGVYKEYVQD------- 175 (239)
T ss_pred EEeeeceecC-CHHHHHHHHHHcCCEEEEEEecccccc-----------------------cCCeEEccCCCC-------
Confidence 5677776674 578999999999999999999988999 999999876544
Q ss_pred eeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-CccEEEEEcCCCCCCCCCceEEEEccC-----Ccc
Q psy15346 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR-----NEA 232 (280)
Q Consensus 159 ~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-g~~YWiirNSWG~~WG~~Gy~kI~rg~-----n~c 232 (280)
. .++|||+|||||+++ +++|||||||||++||++|||||+||. |+|
T Consensus 176 --~--------------------------~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~ 227 (239)
T cd02698 176 --P--------------------------LINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNL 227 (239)
T ss_pred --C--------------------------cCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCccccccc
Confidence 1 569999999999886 999999999999999999999999999 999
Q ss_pred cccceeeeEeec
Q psy15346 233 IIESLVNGALPK 244 (280)
Q Consensus 233 gIe~~~~~~~p~ 244 (280)
|||+.+++++|.
T Consensus 228 ~i~~~~~~~~~~ 239 (239)
T cd02698 228 AIEEDCAWADPI 239 (239)
T ss_pred ccccceEEEeeC
Confidence 999999999983
No 3
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=2.3e-37 Score=279.78 Aligned_cols=165 Identities=32% Similarity=0.544 Sum_probs=124.7
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~ 80 (280)
||+||++.+|++|++++||+++..| ||+. .....|.... ..|. +.+..+..+
T Consensus 72 GC~GG~~~~a~~~~~~~Gi~~e~~y-------PY~~---------~~~~~C~~~~-------~~~~-----~~~~~~~~~ 123 (243)
T cd02621 72 GCDGGFPFLVGKFAEDFGIVTEDYF-------PYTA---------DDDRPCKASP-------SECR-----RYYFSDYNY 123 (243)
T ss_pred CCCCCCHHHHHHHHHhcCcCCCcee-------CCCC---------CCCCCCCCCc-------cccc-----cccccceeE
Confidence 7999999999999999999996666 9983 1345565421 0011 111112223
Q ss_pred eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCC----chhhhhh
Q psy15346 81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA----SAEIVAY 156 (280)
Q Consensus 81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~----~~~~~~~ 156 (280)
+.+++.. .++++||++|+++|||+++|.++++|+. |++|||+... |..
T Consensus 124 i~~~~~~-~~~~~ik~~i~~~GPv~v~~~~~~~F~~-----------------------Y~~GIy~~~~~~~~C~~---- 175 (243)
T cd02621 124 VGGCYGC-TNEDEMKWEIYRNGPIVVAFEVYSDFDF-----------------------YKEGVYHHTDNDEVSDG---- 175 (243)
T ss_pred ccccccc-CCHHHHHHHHHHcCCEEEEEEecccccc-----------------------cCCeEECcCCccccccc----
Confidence 3333333 4789999999999999999999988999 9999998763 320
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC--CccEEEEEcCCCCCCCCCceEEEEccCCcccc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~--g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgI 234 (280)
..+. ......++|||+|||||+++ +++|||||||||++||++|||||+||.|.|||
T Consensus 176 -~~~~---------------------~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi 233 (243)
T cd02621 176 -DNDN---------------------FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGI 233 (243)
T ss_pred -cccc---------------------ccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCc
Confidence 0000 00011569999999999986 89999999999999999999999999999999
Q ss_pred cceeeeEee
Q psy15346 235 ESLVNGALP 243 (280)
Q Consensus 235 e~~~~~~~p 243 (280)
++++++++|
T Consensus 234 ~~~~~~~~~ 242 (243)
T cd02621 234 ESQAVFAYP 242 (243)
T ss_pred ccceEeecc
Confidence 999999988
No 4
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=1.5e-36 Score=305.89 Aligned_cols=189 Identities=25% Similarity=0.421 Sum_probs=134.0
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCC-------------------Ccc
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-------------------PKC 61 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~-------------------p~c 61 (280)
||+||++..|++|++++||+++..| ||+. ..+.|+...... +.|
T Consensus 456 GC~GG~~~~A~kya~~~GI~tEscY-------PY~a----------~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 518 (693)
T PTZ00049 456 GCNGGFPYLVSKMAKLQGIPLDKVF-------PYTA----------TEQTCPYQVDQSANSMNGSANLRQINAVFFSSET 518 (693)
T ss_pred CcCCCcHHHHHHHHHHCCCCcCCcc-------CCcC----------CCCCCCCCCCCccccccccccccccccccccccc
Confidence 7999999999999999999996665 9983 234454321100 112
Q ss_pred cccccC-------CCCCcccccceeeeEEEEEcC--chHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCccccccc
Q psy15346 62 HTRCTN-------DNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132 (280)
Q Consensus 62 ~~~c~~-------~~~~~~~~~~~~~i~~~y~~~--~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~ 132 (280)
...|.. ..+.++|..+..++.++|.+. .++++||++|+++|||+++|.++.+|+.
T Consensus 519 ~~~~~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~---------------- 582 (693)
T PTZ00049 519 QSDMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYD---------------- 582 (693)
T ss_pred cccccccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhc----------------
Confidence 222211 122334444444455555542 3689999999999999999999888998
Q ss_pred ccccccccccceeecCC------chhhhhhheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeecc--CCc
Q psy15346 133 LYSDIFSYKSGVYAVSA------SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGR 204 (280)
Q Consensus 133 ~~~~~~~Y~~GVy~~~~------~~~~~~~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e--~g~ 204 (280)
|++|||+... |. ... | .+.+.....++..++|||+|||||++ ++.
T Consensus 583 -------YksGVY~~~~~~h~~~C~-------~d~---------~----~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~ 635 (693)
T PTZ00049 583 -------YADGVYYVEDFPHARRCT-------VDL---------P----KHNGVYNITGWEKVNHAIVLVGWGEEEINGK 635 (693)
T ss_pred -------CCCccccCcccccccccC-------Ccc---------c----cccccccccccccCceEEEEEEeccccCCCc
Confidence 9999998632 32 000 0 00000000112257999999999985 463
Q ss_pred --cEEEEEcCCCCCCCCCceEEEEccCCcccccceeeeEeeccCCCC
Q psy15346 205 --PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGV 249 (280)
Q Consensus 205 --~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~~~p~~~~~~ 249 (280)
+|||||||||++||++|||||+||.|.||||+++++++|+++||.
T Consensus 636 ~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~pd~~rg~ 682 (693)
T PTZ00049 636 LYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEPDFSRGA 682 (693)
T ss_pred ccCEEEEECCCCCCcccCceEEEEcCCCccCCccceeEEeeeccccH
Confidence 899999999999999999999999999999999999999999985
No 5
>KOG1543|consensus
Probab=100.00 E-value=1.3e-35 Score=280.14 Aligned_cols=151 Identities=28% Similarity=0.501 Sum_probs=130.6
Q ss_pred CCCCCchHHHHHHHHhcCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVT-GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY 79 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~t-e~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~ 79 (280)
||+||++..||+|++++|+++ +.+| ||. +.+..|..+.. .+.+
T Consensus 173 GC~GG~~~~A~~yi~~~G~~t~~~~Y-------py~----------~~~~~C~~~~~-------------------~~~~ 216 (325)
T KOG1543|consen 173 GCNGGEPKNAFKYIKKNGGVTECENY-------PYI----------GKDGTCKSNKK-------------------DKTV 216 (325)
T ss_pred CcCCCCHHHHHHHHHHhCCCCCCcCC-------CCc----------CCCCCccCCCc-------------------ccee
Confidence 799999999999999999998 8888 888 34446665421 2355
Q ss_pred eeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346 80 RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159 (280)
Q Consensus 80 ~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~ 159 (280)
.+.+++.++.++.+||.+|+.+|||.++|.++.+|+. |++|||.+..+.
T Consensus 217 ~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~-----------------------Y~~GVy~~~~~~-------- 265 (325)
T KOG1543|consen 217 TIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSL-----------------------YKGGVYAEEKGD-------- 265 (325)
T ss_pred EeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhh-----------------------ccCceEeCCCCC--------
Confidence 6778888887899999999999999999999999999 999999998776
Q ss_pred eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceee
Q psy15346 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239 (280)
Q Consensus 160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~ 239 (280)
. . .++|||+|||||+.++.+|||||||||++||++|||||+|++|.|+|++.+.
T Consensus 266 -~------------------------~-~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~ 319 (325)
T KOG1543|consen 266 -D------------------------K-EGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIASEAS 319 (325)
T ss_pred -C------------------------C-CCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhhcccc
Confidence 1 0 3699999999999667899999999999999999999999999999999988
Q ss_pred eEeec
Q psy15346 240 GALPK 244 (280)
Q Consensus 240 ~~~p~ 244 (280)
++.|+
T Consensus 320 ~~p~~ 324 (325)
T KOG1543|consen 320 YGPIK 324 (325)
T ss_pred cCCCC
Confidence 86554
No 6
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=2.9e-35 Score=292.57 Aligned_cols=189 Identities=29% Similarity=0.345 Sum_probs=137.1
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~ 80 (280)
||+||++..|++|++++||++|++| |.||+.. ++..+.|+.. + ...+.+..+..+
T Consensus 278 GCdGG~p~~A~~yi~~~GI~tE~dY-----~~PY~~~-------dg~~~~Ck~~----------~---~~~~y~~~~~~~ 332 (548)
T PTZ00364 278 GCAGGFPEEVGKFAETFGILTTDSY-----YIPYDSG-------DGVERACKTR----------R---PSRRYYFTNYGP 332 (548)
T ss_pred CCCCCcHHHHHHHHHhCCccccccc-----CCCCCCC-------CCCCCCCCCC----------c---ccceeeeeeeEE
Confidence 7999999999999999999997766 6699831 1222235432 1 112223334456
Q ss_pred eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecC-----Cchhhhh
Q psy15346 81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS-----ASAEIVA 155 (280)
Q Consensus 81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~-----~~~~~~~ 155 (280)
+.++|.+..++++||++|+++|||+++|+++.+|+. |++|||.+. .... ..
T Consensus 333 I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~-----------------------YksGiy~gi~~~~~~~~~-~~ 388 (548)
T PTZ00364 333 LGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYN-----------------------CDENSTEDVRYVSLDDYS-TA 388 (548)
T ss_pred ecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHh-----------------------cCCCCccCeecccccccc-cc
Confidence 667777666788999999999999999999989999 999888632 1000 00
Q ss_pred hheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeec-cCCccEEEEEcCCCC--CCCCCceEEEEccCCcc
Q psy15346 156 YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGE--QFGDKGTIKILRGRNEA 232 (280)
Q Consensus 156 ~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~-e~g~~YWiirNSWG~--~WG~~Gy~kI~rg~n~c 232 (280)
.++. ....+ ....++|||+|||||+ +++++|||||||||+ +|||+|||||+||.|+|
T Consensus 389 ~~~~--------~~~~~------------~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~C 448 (548)
T PTZ00364 389 SADR--------PLRHY------------FASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAY 448 (548)
T ss_pred ccCC--------ccccc------------ccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcc
Confidence 0000 00000 0014699999999997 478999999999999 99999999999999999
Q ss_pred cccceeeeEeeccCCCCccCcccccc
Q psy15346 233 IIESLVNGALPKDNYGVEFGEESGER 258 (280)
Q Consensus 233 gIe~~~~~~~p~~~~~~~~~~~~~~~ 258 (280)
|||+.++.+.|.....+...++.-.+
T Consensus 449 GIes~~v~~~~~~~~~~~~~~~~~~~ 474 (548)
T PTZ00364 449 NIESEVVVMYWAPYPDVLHPEEYFLV 474 (548)
T ss_pred cccceeeeeeeecCCCccCCCceEEE
Confidence 99999999999776666666655444
No 7
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=8e-35 Score=254.94 Aligned_cols=147 Identities=26% Similarity=0.483 Sum_probs=123.7
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~ 80 (280)
+|+||.+..||++++++|++++++| ||. .....|+... . ...++
T Consensus 62 gC~GG~~~~a~~~~~~~Gi~~e~~y-------PY~----------~~~~~C~~~~----------~---------~~~~~ 105 (210)
T cd02248 62 GCNGGNPDNAFEYVKNGGLASESDY-------PYT----------GKDGTCKYNS----------S---------KVGAK 105 (210)
T ss_pred CCCCCCHHHhHHHHHHCCcCccccC-------Ccc----------CCCCCccCCC----------C---------cccEE
Confidence 6999999999999999999997777 998 2334554421 0 23566
Q ss_pred eEEEEEcCc-hHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346 81 FKRYYWVND-EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159 (280)
Q Consensus 81 i~~~y~~~~-~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~ 159 (280)
+.+++.+.. +.++||++|+++|||+++|.++++|.. |++|||.+..+..
T Consensus 106 i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~-----------------------y~~Giy~~~~~~~------- 155 (210)
T cd02248 106 ITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQF-----------------------YKGGIYSGPCCSN------- 155 (210)
T ss_pred EeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCccccc-----------------------CCCCceeCCCCCC-------
Confidence 778777753 488999999999999999999989999 9999999876520
Q ss_pred eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceee
Q psy15346 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239 (280)
Q Consensus 160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~ 239 (280)
. .++|||+|||||++.+.+|||||||||++||++|||||.|+.|.|||++.+.
T Consensus 156 -~--------------------------~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~~cgi~~~~~ 208 (210)
T cd02248 156 -T--------------------------NLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYAS 208 (210)
T ss_pred -C--------------------------cCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCCccCceeeee
Confidence 1 6799999999999989999999999999999999999999999999998765
Q ss_pred e
Q psy15346 240 G 240 (280)
Q Consensus 240 ~ 240 (280)
+
T Consensus 209 ~ 209 (210)
T cd02248 209 Y 209 (210)
T ss_pred c
Confidence 3
No 8
>KOG1542|consensus
Probab=100.00 E-value=5.4e-35 Score=272.44 Aligned_cols=148 Identities=22% Similarity=0.454 Sum_probs=126.9
Q ss_pred CCCCCchHHHHHHHH-hcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CccCCCCCCCcccccccCCCCCcccccce
Q psy15346 1 VCSSGISSSTWVWVH-KRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP-ECKTLATPQPKCHTRCTNDNYGRGFFQDK 78 (280)
Q Consensus 1 gC~GG~~~~A~~yi~-~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~-~C~~~~~~~p~c~~~c~~~~~~~~~~~~~ 78 (280)
||+||.+..||+|++ ..||..|.+| ||+ ++.. .|..+.. ...
T Consensus 218 gC~GGl~~nA~~~~~~~gGL~~E~dY-------PY~----------g~~~~~C~~~~~-------------------~~~ 261 (372)
T KOG1542|consen 218 GCNGGLMDNAFKYIKKAGGLEKEKDY-------PYT----------GKKGNQCHFDKS-------------------KIV 261 (372)
T ss_pred cCCCCChhHHHHHHHHhCCccccccC-------Ccc----------ccCCCccccchh-------------------hce
Confidence 799999999999954 5689999999 999 4444 7766421 235
Q ss_pred eeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecC--Cchhhhhh
Q psy15346 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS--ASAEIVAY 156 (280)
Q Consensus 79 ~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~--~~~~~~~~ 156 (280)
.+|.+++-++.++++|.+.|.++|||+|+|.+ ..++. |++||+.+. .|+
T Consensus 262 v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~-----------------------YrgGV~~P~~~~Cs----- 312 (372)
T KOG1542|consen 262 VSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQF-----------------------YRGGVSCPSKYICS----- 312 (372)
T ss_pred EEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHH-----------------------hcccccCCCcccCC-----
Confidence 67889999989999999999999999999996 44666 999999983 354
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-CccEEEEEcCCCCCCCCCceEEEEccCCccccc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe 235 (280)
. . .++|+|+|||||.+. ..||||||||||++||++||+|+.||.|.|||+
T Consensus 313 --~-~--------------------------~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~ 363 (372)
T KOG1542|consen 313 --P-K--------------------------LLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIA 363 (372)
T ss_pred --c-c--------------------------ccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccc
Confidence 1 1 479999999999998 899999999999999999999999999999999
Q ss_pred ceeeeEe
Q psy15346 236 SLVNGAL 242 (280)
Q Consensus 236 ~~~~~~~ 242 (280)
+.+.+++
T Consensus 364 ~mvss~~ 370 (372)
T KOG1542|consen 364 DMVSSAA 370 (372)
T ss_pred cchhhhh
Confidence 9988765
No 9
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=2.8e-34 Score=273.23 Aligned_cols=151 Identities=23% Similarity=0.421 Sum_probs=119.2
Q ss_pred CCCCCchHHHHHHHHhc---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccc
Q psy15346 1 VCSSGISSSTWVWVHKR---GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~---Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~ 77 (280)
||+||++..||+|++++ ||++|.+| ||+.. ++..+.|... +. . ..
T Consensus 187 GC~GG~~~~a~~yi~~~~~ggi~~e~~Y-------PY~~~-------~~~~~~C~~~----------~~---~-----~~ 234 (348)
T PTZ00203 187 GCGGGLMLQAFEWVLRNMNGTVFTEKSY-------PYVSG-------NGDVPECSNS----------SE---L-----AP 234 (348)
T ss_pred CCCCCCHHHHHHHHHHhcCCCCCccccC-------CCccC-------CCCCCcCCCC----------cc---c-----cc
Confidence 79999999999999864 58898888 99831 1112234321 10 0 01
Q ss_pred eeeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhh
Q psy15346 78 KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYA 157 (280)
Q Consensus 78 ~~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~ 157 (280)
.+++.++..+..++++||++|+++|||+++|.+. +|+. |++|||++ |..
T Consensus 235 ~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~-----------------------Y~~GIy~~--c~~----- 283 (348)
T PTZ00203 235 GARIDGYVSMESSERVMAAWLAKNGPISIAVDAS-SFMS-----------------------YHSGVLTS--CIG----- 283 (348)
T ss_pred ceEecceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcC-----------------------ccCceeec--cCC-----
Confidence 2345666666667888999999999999999984 7988 99999974 330
Q ss_pred eeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccce
Q psy15346 158 TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237 (280)
Q Consensus 158 ~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~ 237 (280)
. ..+|||+|||||++++++|||||||||++||++|||||+||.|.|||+++
T Consensus 284 ---~--------------------------~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cgi~~~ 334 (348)
T PTZ00203 284 ---E--------------------------QLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGY 334 (348)
T ss_pred ---C--------------------------CCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCcccccce
Confidence 1 35999999999999999999999999999999999999999999999998
Q ss_pred eeeEee
Q psy15346 238 VNGALP 243 (280)
Q Consensus 238 ~~~~~p 243 (280)
++.+..
T Consensus 335 ~~~~~~ 340 (348)
T PTZ00203 335 PVSVHV 340 (348)
T ss_pred EEEEec
Confidence 887744
No 10
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity. Members of the papain family are widespread, found in baculovirus, eubacteria, yeast, and practically all protozoa, plants and mammals. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides are their ability to inhibit the activity of their cognate enzymes and the high selectivity that certain propeptides exhibit for inhibition of the peptidases from which they originate.
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=4e-33 Score=243.56 Aligned_cols=150 Identities=31% Similarity=0.559 Sum_probs=121.3
Q ss_pred CCCCCchHHHHHHHHh-cCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCccCCCCCCCcccccccCCCCCcccccce
Q psy15346 1 VCSSGISSSTWVWVHK-RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDK 78 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~-~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~-~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~ 78 (280)
+|+||++..||+++++ +||+++..| ||.. .. +.|.... +. ...
T Consensus 65 ~c~gg~~~~a~~~~~~~~Gi~~e~~~-------pY~~----------~~~~~c~~~~---------~~---------~~~ 109 (219)
T PF00112_consen 65 GCDGGSPFDALKYIKNNNGIVTEEDY-------PYNG----------NENPTCKSKK---------SN---------SYY 109 (219)
T ss_dssp TTBBBEHHHHHHHHHHHTSBEBTTTS---------SS----------SSSCSSCHSG---------GG---------EEE
T ss_pred ccccCcccccceeecccCcccccccc-------cccc----------cccccccccc---------cc---------ccc
Confidence 6999999999999999 999997777 9992 22 4554421 00 012
Q ss_pred eeeEEEEEcCc-hHHHHHHHHHhCCcEEEEEEeCc-cccccccCccCCCcccccccccccccccccceeecCCchhhhhh
Q psy15346 79 YRFKRYYWVND-EVADIQQEIMKNGPVVANMYLYS-DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156 (280)
Q Consensus 79 ~~i~~~y~~~~-~~~~Ik~~I~~~GPV~v~~~v~~-~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~ 156 (280)
+++..+..+.. ++++||++|+++|||+++|.+.. +|.. |++|||....+..
T Consensus 110 ~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~-----------------------~~~gi~~~~~~~~---- 162 (219)
T PF00112_consen 110 VKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQN-----------------------YKSGIYDPPDCSN---- 162 (219)
T ss_dssp BEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHT-----------------------EESSEECSTSSSS----
T ss_pred ccccccccccccchhHHHHHHhhCceeeeeeecccccccc-----------------------ccceeeecccccc----
Confidence 45556666543 58999999999999999999988 5988 9999999875440
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCC-ccccc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-EAIIE 235 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n-~cgIe 235 (280)
..++|||+|||||++.+.+|||||||||++||++|||||.|+.| +||||
T Consensus 163 ------------------------------~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~ 212 (219)
T PF00112_consen 163 ------------------------------ESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIE 212 (219)
T ss_dssp ------------------------------SSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTT
T ss_pred ------------------------------ccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccC
Confidence 16799999999999999999999999999999999999999997 99999
Q ss_pred ceeeeEe
Q psy15346 236 SLVNGAL 242 (280)
Q Consensus 236 ~~~~~~~ 242 (280)
+++++++
T Consensus 213 ~~~~~~~ 219 (219)
T PF00112_consen 213 SQAVYPI 219 (219)
T ss_dssp SSEEEEE
T ss_pred ceeeecC
Confidence 9999875
No 11
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=5.1e-33 Score=271.86 Aligned_cols=145 Identities=21% Similarity=0.423 Sum_probs=114.6
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~ 80 (280)
||+||++..||+|++++||+++++| ||+ +..+.|.... ...++
T Consensus 296 GC~GG~~~~A~~yi~~~Gi~~e~~Y-------PY~----------~~~~~C~~~~--------------------~~~~~ 338 (448)
T PTZ00200 296 GCSGGYPDTALEYVKNKGLSSSSDV-------PYL----------AKDGKCVVSS--------------------TKKVY 338 (448)
T ss_pred CCCCCcHHHHHHHHhhcCccccccC-------CCC----------CCCCCCcCCC--------------------CCeeE
Confidence 7999999999999999999997777 998 4455675421 01223
Q ss_pred eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346 81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK 160 (280)
Q Consensus 81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~ 160 (280)
+.++..+. . .+++++++.+|||+++|.++.+|+. |++|||++. |.
T Consensus 339 i~~y~~~~-~-~~~l~~~l~~GPV~v~i~~~~~f~~-----------------------Yk~GIy~~~-C~--------- 383 (448)
T PTZ00200 339 IDSYLVAK-G-KDVLNKSLVISPTVVYIAVSRELLK-----------------------YKSGVYNGE-CG--------- 383 (448)
T ss_pred ecceEecC-H-HHHHHHHHhcCCEEEEeeccccccc-----------------------CCCCccccc-cC---------
Confidence 44444343 3 3455566678999999999888999 999999864 33
Q ss_pred eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeecc--CCccEEEEEcCCCCCCCCCceEEEEcc---CCccccc
Q psy15346 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRG---RNEAIIE 235 (280)
Q Consensus 161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e--~g~~YWiirNSWG~~WG~~Gy~kI~rg---~n~cgIe 235 (280)
. .++|||+|||||.+ ++.+|||||||||++||++|||||+|+ .|.|||+
T Consensus 384 ~--------------------------~~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~ 437 (448)
T PTZ00200 384 K--------------------------SLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGIL 437 (448)
T ss_pred C--------------------------CCcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCcc
Confidence 1 35999999999953 688999999999999999999999995 5899999
Q ss_pred ceeeeEee
Q psy15346 236 SLVNGALP 243 (280)
Q Consensus 236 ~~~~~~~p 243 (280)
+.+.+++.
T Consensus 438 ~~~~~P~~ 445 (448)
T PTZ00200 438 TVGLTPVF 445 (448)
T ss_pred ccceeeEE
Confidence 98887653
No 12
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=1.9e-32 Score=269.55 Aligned_cols=147 Identities=23% Similarity=0.419 Sum_probs=118.2
Q ss_pred CCCCCchHHHHHHHHhc-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346 1 VCSSGISSSTWVWVHKR-GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY 79 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~-Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~ 79 (280)
||+||++..||+|++++ ||++|++| ||+. ...+.|... . |. ..+
T Consensus 327 GC~GG~~~~Af~yi~~~gGl~tE~~Y-------PY~~---------~~~~~C~~~-----~----~~----------~~~ 371 (489)
T PTZ00021 327 GCYGGLIPNAFEDMIELGGLCSEDDY-------PYVS---------DTPELCNID-----R----CK----------EKY 371 (489)
T ss_pred CCCCcchHhhhhhhhhccccCccccc-------CccC---------CCCCccccc-----c----cc----------ccc
Confidence 79999999999999766 89998888 9983 112456432 1 21 134
Q ss_pred eeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346 80 RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159 (280)
Q Consensus 80 ~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~ 159 (280)
.+.++..++ ..+|+++|+.+|||+|+|.+..+|+. |++|||++. |.
T Consensus 372 ~i~~y~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~-----------------------YkgGIy~~~-C~-------- 417 (489)
T PTZ00021 372 KIKSYVSIP--EDKFKEAIRFLGPISVSIAVSDDFAF-----------------------YKGGIFDGE-CG-------- 417 (489)
T ss_pred eeeeEEEec--HHHHHHHHHhcCCeEEEEEeeccccc-----------------------CCCCcCCCC-CC--------
Confidence 566776664 57899999999999999999888999 999999864 43
Q ss_pred eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCC----------ccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENG----------RPYWTIVSTFGEQFGDKGTIKILRGR 229 (280)
Q Consensus 160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g----------~~YWiirNSWG~~WG~~Gy~kI~rg~ 229 (280)
. .++|||+|||||++++ .+|||||||||++||++|||||+|+.
T Consensus 418 -~--------------------------~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~ 470 (489)
T PTZ00021 418 -E--------------------------EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDE 470 (489)
T ss_pred -C--------------------------ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCC
Confidence 1 4599999999997642 57999999999999999999999996
Q ss_pred ----CcccccceeeeEee
Q psy15346 230 ----NEAIIESLVNGALP 243 (280)
Q Consensus 230 ----n~cgIe~~~~~~~p 243 (280)
|.|||..++.+++.
T Consensus 471 ~g~~n~CGI~t~a~yP~~ 488 (489)
T PTZ00021 471 NGLMKTCSLGTEAYVPLI 488 (489)
T ss_pred CCCCCCCCCcccceeEec
Confidence 58999998887653
No 13
>KOG1544|consensus
Probab=99.98 E-value=4.5e-33 Score=257.69 Aligned_cols=179 Identities=34% Similarity=0.625 Sum_probs=142.2
Q ss_pred CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCC----CCCcccccccCCCCCccccc
Q psy15346 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT----PQPKCHTRCTNDNYGRGFFQ 76 (280)
Q Consensus 1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~----~~p~c~~~c~~~~~~~~~~~ 76 (280)
||+||+.+.||-||.+.|+|. +.|+||.. ...+..+.|...+. ......+.|++. +. -..
T Consensus 275 GC~gG~lDRAWWYlRKrGvVs-------dhCYP~~~------dQ~~~~~~C~m~sR~~grgkRqat~~CPn~-~~--~Sn 338 (470)
T KOG1544|consen 275 GCRGGRLDRAWWYLRKRGVVS-------DHCYPFSG------DQAGPAPPCMMHSRAMGRGKRQATAHCPNS-YV--NSN 338 (470)
T ss_pred cCccCcccchheeeecccccc-------cccccccC------CCCCCCCCceeeccccCcccccccCcCCCc-cc--ccC
Confidence 799999999999999999999 78889985 22345667766443 112223446532 11 124
Q ss_pred ceeeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhh
Q psy15346 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156 (280)
Q Consensus 77 ~~~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~ 156 (280)
+.|+.+..|.+++++++||++||++|||.+.|.|.+||+. |++|||.|....
T Consensus 339 ~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~-----------------------YkgGiY~H~~~~----- 390 (470)
T KOG1544|consen 339 DIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFL-----------------------YKGGIYSHTPVS----- 390 (470)
T ss_pred ceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhh-----------------------hccceeeccccc-----
Confidence 6788899999999999999999999999999999999999 999999997643
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-----CccEEEEEcCCCCCCCCCceEEEEccCCc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-----GRPYWTIVSTFGEQFGDKGTIKILRGRNE 231 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-----g~~YWiirNSWG~~WG~~Gy~kI~rg~n~ 231 (280)
. + .+......+.|+|.|.|||++. ..+|||..||||+.||++|||||.||+|+
T Consensus 391 ----~-------~-----------~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNe 448 (470)
T KOG1544|consen 391 ----L-------G-----------RPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNE 448 (470)
T ss_pred ----c-------C-----------CchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccc
Confidence 0 0 0111123678999999999973 36799999999999999999999999999
Q ss_pred ccccceeeeEeecc
Q psy15346 232 AIIESLVNGALPKD 245 (280)
Q Consensus 232 cgIe~~~~~~~p~~ 245 (280)
|.||+.+++|+-.+
T Consensus 449 cdIEsfvIgAWGr~ 462 (470)
T KOG1544|consen 449 CDIESFVIGAWGRV 462 (470)
T ss_pred hhhhHhhhhhhhcc
Confidence 99999999887644
No 14
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.95 E-value=6.3e-27 Score=242.50 Aligned_cols=103 Identities=20% Similarity=0.385 Sum_probs=86.8
Q ss_pred HHHHHHHHHhCCcEEEEEEeCccccccccCccCCCccccccccccccccc-ccceeecCCchhhhhhheeeeeccCcCCC
Q psy15346 91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEENG 169 (280)
Q Consensus 91 ~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y-~~GVy~~~~~~~~~~~~~~~~~gwg~~~~ 169 (280)
++.||++|+.+|||+|+|.+. +|+. | .+|||....|.. .
T Consensus 680 i~~IK~eI~~kGPVaV~IdAs-df~~-----------------------Y~~sGIyv~~~Cgs--------~-------- 719 (1004)
T PTZ00462 680 IKIIKDEIMNKGSVIAYIKAE-NVLG-----------------------YEFNGKKVQNLCGD--------D-------- 719 (1004)
T ss_pred HHHHHHHHHhcCCEEEEEEee-hHHh-----------------------hhcCCccccCCCCC--------C--------
Confidence 468999999999999999985 6888 7 489876654540 1
Q ss_pred CCceeeeeeeecccCccccCCeEEEEEEeecc-----CCccEEEEEcCCCCCCCCCceEEEEc-cCCcccccceeeeEee
Q psy15346 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEE-----NGRPYWTIVSTFGEQFGDKGTIKILR-GRNEAIIESLVNGALP 243 (280)
Q Consensus 170 ~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e-----~g~~YWiirNSWG~~WG~~Gy~kI~r-g~n~cgIe~~~~~~~p 243 (280)
..+|||+|||||.+ .+++|||||||||+.||++|||||+| |.|.|||.....++++
T Consensus 720 ------------------~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t~~~f 781 (1004)
T PTZ00462 720 ------------------TADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHSVVIF 781 (1004)
T ss_pred ------------------cCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchheeeeeE
Confidence 45899999999974 25799999999999999999999998 7899999999999999
Q ss_pred ccCCCCcc
Q psy15346 244 KDNYGVEF 251 (280)
Q Consensus 244 ~~~~~~~~ 251 (280)
++.-++.-
T Consensus 782 n~d~~~~~ 789 (1004)
T PTZ00462 782 NIDLPKNK 789 (1004)
T ss_pred eecccccc
Confidence 88766653
No 15
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.94 E-value=9.2e-27 Score=203.61 Aligned_cols=141 Identities=21% Similarity=0.335 Sum_probs=106.1
Q ss_pred CCCCCchHHHHH-HHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346 1 VCSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY 79 (280)
Q Consensus 1 gC~GG~~~~A~~-yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~ 79 (280)
+|.||.+..|+. +++++||+++..| ||.. ....|... |.... ....+
T Consensus 65 ~c~gG~~~~~~~~~~~~~Gi~~e~~~-------Py~~----------~~~~~~~~----------~~~~~-----~~~~~ 112 (223)
T cd02619 65 SCDGGGPLSALLKLVALKGIPPEEDY-------PYGA----------ESDGEEPK----------SEAAL-----NAAKV 112 (223)
T ss_pred CCCCCcHHHHHHHHHHHcCCCccccC-------CCCC----------CCCCCCCC----------Cccch-----hhcce
Confidence 699999999998 9999999997777 9983 22223221 00000 11234
Q ss_pred eeEEEEEcC-chHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeec----CCchhhh
Q psy15346 80 RFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV----SASAEIV 154 (280)
Q Consensus 80 ~i~~~y~~~-~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~----~~~~~~~ 154 (280)
++..+..+. .++++||++|+++|||+++|.+..+|+. |++|+|.. ....
T Consensus 113 ~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~--- 166 (223)
T cd02619 113 KLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDR-----------------------LKEGIIYEEIVYLLYE--- 166 (223)
T ss_pred eecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhc-----------------------ccCccccccccccccC---
Confidence 455665554 3578999999999999999999988988 88888631 1111
Q ss_pred hhheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC--CccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346 155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGR 229 (280)
Q Consensus 155 ~~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~--g~~YWiirNSWG~~WG~~Gy~kI~rg~ 229 (280)
....++|||+|||||++. +++|||||||||+.||++||+||.++.
T Consensus 167 ------------------------------~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 167 ------------------------------DGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED 213 (223)
T ss_pred ------------------------------CCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence 011679999999999987 889999999999999999999999974
No 16
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.94 E-value=1.5e-26 Score=199.99 Aligned_cols=52 Identities=37% Similarity=0.797 Sum_probs=48.0
Q ss_pred cCCeEEEEEEeecc-CCccEEEEEcCCCCCCCCCceEEEEccC-Ccccccceee
Q psy15346 188 VAYATVKLIGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGR-NEAIIESLVN 239 (280)
Q Consensus 188 ~~~HaV~IVGwG~e-~g~~YWiirNSWG~~WG~~Gy~kI~rg~-n~cgIe~~~~ 239 (280)
.++|+|+|||||++ ++++|||||||||+.||++|||||+|+. |+||||....
T Consensus 118 ~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~~ 171 (174)
T smart00645 118 TLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA 171 (174)
T ss_pred cccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeeee
Confidence 35999999999987 8999999999999999999999999998 9999987654
No 17
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.48 E-value=9.2e-14 Score=136.07 Aligned_cols=41 Identities=22% Similarity=0.492 Sum_probs=37.0
Q ss_pred cCCeEEEEEEeeccC-Cc-cEEEEEcCCCCCCCCCceEEEEcc
Q psy15346 188 VAYATVKLIGWGEEN-GR-PYWTIVSTFGEQFGDKGTIKILRG 228 (280)
Q Consensus 188 ~~~HaV~IVGwG~e~-g~-~YWiirNSWG~~WG~~Gy~kI~rg 228 (280)
..+|||+||||+.+. |. .||+|+||||+.||++||++|.+.
T Consensus 357 ~~tHAM~ivGv~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 357 LMTHAMVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred cCCeEEEEEEEEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence 468999999999864 76 599999999999999999999986
No 18
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.38 E-value=6.9e-13 Score=125.66 Aligned_cols=42 Identities=19% Similarity=0.311 Sum_probs=36.9
Q ss_pred cCCeEEEEEEeeccC----------CccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346 188 VAYATVKLIGWGEEN----------GRPYWTIVSTFGEQFGDKGTIKILRGR 229 (280)
Q Consensus 188 ~~~HaV~IVGwG~e~----------g~~YWiirNSWG~~WG~~Gy~kI~rg~ 229 (280)
..+|||+||||++.. +...||||||||++||++|||||....
T Consensus 263 ~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y 314 (372)
T COG4870 263 NWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY 314 (372)
T ss_pred cccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence 469999999999851 345999999999999999999999875
No 19
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.30 E-value=3.4e-06 Score=83.17 Aligned_cols=41 Identities=24% Similarity=0.534 Sum_probs=33.9
Q ss_pred cCCeEEEEEEeec-cCCcc-EEEEEcCCCCCCCCCceEEEEcc
Q psy15346 188 VAYATVKLIGWGE-ENGRP-YWTIVSTFGEQFGDKGTIKILRG 228 (280)
Q Consensus 188 ~~~HaV~IVGwG~-e~g~~-YWiirNSWG~~WG~~Gy~kI~rg 228 (280)
..+|||+|+|... ++|.+ +|+|+||||++.|.+|||.|...
T Consensus 358 ~~tHAM~itGv~~D~~g~p~~wkVeNSWG~~~g~kGy~~msd~ 400 (438)
T PF03051_consen 358 TMTHAMVITGVDLDEDGKPVRWKVENSWGTDNGDKGYFYMSDD 400 (438)
T ss_dssp --EEEEEEEEEEE-TTSSEEEEEEE-SBTTTSTBTTEEEEEHH
T ss_pred CCceeEEEEEEEeccCCCeeEEEEEcCCCCCCCCCcEEEECHH
Confidence 5689999999997 46664 99999999999999999999853
No 20
>PTZ00203 cathepsin L protease; Provisional
Probab=95.75 E-value=0.0051 Score=59.20 Aligned_cols=29 Identities=24% Similarity=0.500 Sum_probs=25.0
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASA 185 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~ 185 (280)
|+|.++|||++++.+||++.|||...|..
T Consensus 288 HaVliVGYG~~~g~~YWiikNSWG~~WGe 316 (348)
T PTZ00203 288 HGVLLVGYNMTGEVPYWVIKNSWGEDWGE 316 (348)
T ss_pred eEEEEEEEecCCCceEEEEEcCCCCCcCc
Confidence 37889999999999999999999775553
No 21
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=95.70 E-value=0.005 Score=55.71 Aligned_cols=28 Identities=21% Similarity=0.713 Sum_probs=24.3
Q ss_pred heeeeeccCcCC-CCCceeeeeeeecccC
Q psy15346 157 ATVKIVGWGEEN-GRPYWTIVRVYAVSAS 184 (280)
Q Consensus 157 ~~~~~~gwg~~~-~~~~w~~~~~~~~~~~ 184 (280)
|+|.++|||+++ +++||++.|||...|.
T Consensus 180 HaV~IVGyG~~~~g~~YWiikNSWG~~WG 208 (239)
T cd02698 180 HIISVAGWGVDENGVEYWIVRNSWGEPWG 208 (239)
T ss_pred eEEEEEEEEecCCCCEEEEEEcCCCcccC
Confidence 388899999886 8999999999977655
No 22
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopt the papain fold and form the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=95.46 E-value=0.0081 Score=54.32 Aligned_cols=30 Identities=33% Similarity=0.796 Sum_probs=25.5
Q ss_pred hheeeeeccCcCC--CCCceeeeeeeecccCc
Q psy15346 156 YATVKIVGWGEEN--GRPYWTIVRVYAVSASA 185 (280)
Q Consensus 156 ~~~~~~~gwg~~~--~~~~w~~~~~~~~~~~~ 185 (280)
.|+|.++|||.++ +.+||+++|||...|..
T Consensus 187 ~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe 218 (243)
T cd02621 187 NHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGE 218 (243)
T ss_pred CeEEEEEEeeccCCCCCcEEEEEcCCCCCCCc
Confidence 4689999999886 88999999999776653
No 23
>KOG1543|consensus
Probab=95.44 E-value=0.0083 Score=57.20 Aligned_cols=29 Identities=28% Similarity=0.633 Sum_probs=24.5
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASA 185 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~ 185 (280)
|+|+++|||+.++.+||+++|||...|..
T Consensus 271 Hav~iVGyG~~~~~~YWivkNSWG~~WGe 299 (325)
T KOG1543|consen 271 HAVLIVGYGTGDGVDYWIVKNSWGTDWGE 299 (325)
T ss_pred ceEEEEEEcCCCCceeEEEEcCCCCCccc
Confidence 48889999996668999999999776653
No 24
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=95.26 E-value=0.0085 Score=51.68 Aligned_cols=45 Identities=33% Similarity=0.647 Sum_probs=36.6
Q ss_pred cccceeecCCchhhhhhheeeeeccCcC-CCCCceeeeeeeecccC
Q psy15346 140 YKSGVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVRVYAVSAS 184 (280)
Q Consensus 140 Y~~GVy~~~~~~~~~~~~~~~~~gwg~~-~~~~~w~~~~~~~~~~~ 184 (280)
|++|||++..+......|.|.++|||++ ++++||+++|||...|.
T Consensus 104 Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG 149 (174)
T smart00645 104 YKSGIYDHPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWG 149 (174)
T ss_pred CcCeEECCCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcc
Confidence 9999998864433335799999999987 88999999999966444
No 25
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=94.88 E-value=0.015 Score=52.57 Aligned_cols=28 Identities=43% Similarity=0.949 Sum_probs=24.5
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccC
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSAS 184 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~ 184 (280)
|+|.++|||.+++.+||++.|||...|.
T Consensus 186 HaV~iVGyg~~~g~~YWivrNSWG~~WG 213 (236)
T cd02620 186 HAVKIIGWGVENGVPYWLAANSWGTDWG 213 (236)
T ss_pred eEEEEEEEeccCCeeEEEEEeCCCCCCC
Confidence 3788999999999999999999977554
No 26
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=94.80 E-value=0.0084 Score=57.42 Aligned_cols=40 Identities=20% Similarity=0.416 Sum_probs=33.8
Q ss_pred cCCeEEEEEEeecc-CC-ccEEEEEcCCCCCCCCCceEEEEc
Q psy15346 188 VAYATVKLIGWGEE-NG-RPYWTIVSTFGEQFGDKGTIKILR 227 (280)
Q Consensus 188 ~~~HaV~IVGwG~e-~g-~~YWiirNSWG~~WG~~Gy~kI~r 227 (280)
...|||+|.|...+ +| .-=|.|.||||.+=|.+|||-+.-
T Consensus 360 LmTHAMvlTGvd~d~~g~p~rwkVENSWG~d~G~~GyfvaSd 401 (444)
T COG3579 360 LMTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVASD 401 (444)
T ss_pred HHHHHHHhhccccccCCCceeeEeecccccccCCCceEeehH
Confidence 56899999999865 43 347999999999999999998753
No 27
>KOG1542|consensus
Probab=94.52 E-value=0.023 Score=54.50 Aligned_cols=29 Identities=28% Similarity=0.697 Sum_probs=25.6
Q ss_pred heeeeeccCcCC-CCCceeeeeeeecccCc
Q psy15346 157 ATVKIVGWGEEN-GRPYWTIVRVYAVSASA 185 (280)
Q Consensus 157 ~~~~~~gwg~~~-~~~~w~~~~~~~~~~~~ 185 (280)
|+|.++|+|..+ +.|||+++|||...|..
T Consensus 318 HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE 347 (372)
T KOG1542|consen 318 HAVLLVGYGSSGYEKPYWIVKNSWGTSWGE 347 (372)
T ss_pred ceEEEEeecCCCCCCceEEEECCccccccc
Confidence 488899999998 99999999999887764
No 28
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=94.44 E-value=0.018 Score=50.05 Aligned_cols=28 Identities=36% Similarity=0.732 Sum_probs=24.7
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccC
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSAS 184 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~ 184 (280)
|+|.++|||.+.+++||++.|||...|.
T Consensus 160 Hav~iVGy~~~~~~~ywiv~NSWG~~WG 187 (210)
T cd02248 160 HAVLLVGYGTENGVDYWIVKNSWGTSWG 187 (210)
T ss_pred EEEEEEEEeecCCceEEEEEcCCCCccc
Confidence 3888999999989999999999977655
No 29
>PTZ00200 cysteine proteinase; Provisional
Probab=93.61 E-value=0.041 Score=54.77 Aligned_cols=28 Identities=25% Similarity=0.487 Sum_probs=23.1
Q ss_pred heeeeeccCc--CCCCCceeeeeeeecccC
Q psy15346 157 ATVKIVGWGE--ENGRPYWTIVRVYAVSAS 184 (280)
Q Consensus 157 ~~~~~~gwg~--~~~~~~w~~~~~~~~~~~ 184 (280)
|+|.++|||. +++.+||+++|||...|.
T Consensus 388 HaV~lVGyG~d~~~g~~YWIIkNSWG~~WG 417 (448)
T PTZ00200 388 HAVLLVGEGYDEKTKKRYWIIKNSWGTDWG 417 (448)
T ss_pred EEEEEEEecccCCCCCceEEEEcCCCCCcc
Confidence 3888999995 367899999999977554
No 30
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=93.40 E-value=0.043 Score=55.92 Aligned_cols=31 Identities=35% Similarity=0.775 Sum_probs=26.5
Q ss_pred hheeeeeccCcC-CCCCceeeeeeeec--ccCcc
Q psy15346 156 YATVKIVGWGEE-NGRPYWTIVRVYAV--SASAE 186 (280)
Q Consensus 156 ~~~~~~~gwg~~-~~~~~w~~~~~~~~--~~~~~ 186 (280)
.|+|.++|||++ ++.+||+++|||.. .|...
T Consensus 403 nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~ 436 (548)
T PTZ00364 403 NHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDG 436 (548)
T ss_pred CeEEEEEEecccCCCceEEEEECCCCCCCCcccC
Confidence 469999999974 78899999999988 77653
No 31
>KOG1544|consensus
Probab=92.86 E-value=0.068 Score=51.13 Aligned_cols=84 Identities=26% Similarity=0.503 Sum_probs=48.7
Q ss_pred eeeeEEEEEcCchHHHHHHH--HHhCCcEEEEEEeCcc-ccccccCccCCCcccccccccccccccccceeecCCchhhh
Q psy15346 78 KYRFKRYYWVNDEVADIQQE--IMKNGPVVANMYLYSD-IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV 154 (280)
Q Consensus 78 ~~~i~~~y~~~~~~~~Ik~~--I~~~GPV~v~~~v~~~-f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~ 154 (280)
-|++++ +-...+.+||++ +..---|-..|..|.. .+.|. |+-+ =++.-|....
T Consensus 346 PYrVSS--nE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~-------~~~~----------~~~e~yr~~g----- 401 (470)
T KOG1544|consen 346 PYRVSS--NEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHT-------PVSL----------GRPERYRRHG----- 401 (470)
T ss_pred CeeccC--CHHHHHHHHHhCCChhhhhhhhhhhhhhccceeecc-------cccc----------CCchhhhhcc-----
Confidence 455554 223457788776 3333335566766654 55531 1100 1122233333
Q ss_pred hhheeeeeccCcC---CC--CCceeeeeeeecccCcc
Q psy15346 155 AYATVKIVGWGEE---NG--RPYWTIVRVYAVSASAE 186 (280)
Q Consensus 155 ~~~~~~~~gwg~~---~~--~~~w~~~~~~~~~~~~~ 186 (280)
.|+||+.|||++ .+ .+||+.+|||...|...
T Consensus 402 -tHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~ 437 (470)
T KOG1544|consen 402 -THSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 437 (470)
T ss_pred -cceEEEeecccccCCCCCeeEEEEeecccccccccC
Confidence 359999999987 22 48999999999888754
No 32
>PTZ00049 cathepsin C-like protein; Provisional
Probab=92.52 E-value=0.073 Score=55.45 Aligned_cols=30 Identities=33% Similarity=0.750 Sum_probs=24.1
Q ss_pred hheeeeeccCcC--CCC--CceeeeeeeecccCc
Q psy15346 156 YATVKIVGWGEE--NGR--PYWTIVRVYAVSASA 185 (280)
Q Consensus 156 ~~~~~~~gwg~~--~~~--~~w~~~~~~~~~~~~ 185 (280)
.|+|.++|||++ ++. +||++.|||...|..
T Consensus 619 NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGe 652 (693)
T PTZ00049 619 NHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGK 652 (693)
T ss_pred ceEEEEEEeccccCCCcccCEEEEECCCCCCccc
Confidence 469999999985 453 899999999776653
No 33
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=92.26 E-value=0.49 Score=37.51 Aligned_cols=24 Identities=29% Similarity=0.426 Sum_probs=17.7
Q ss_pred cCchHHHHHHHHHhCCcEEEEEEe
Q psy15346 87 VNDEVADIQQEIMKNGPVVANMYL 110 (280)
Q Consensus 87 ~~~~~~~Ik~~I~~~GPV~v~~~v 110 (280)
...+..+|+++|.+..||++.+..
T Consensus 85 ~~~~~~~i~~~i~~G~Pvi~~~~~ 108 (144)
T PF13529_consen 85 SDASFDDIKQEIDAGRPVIVSVNS 108 (144)
T ss_dssp TTS-HHHHHHHHHTT--EEEEEET
T ss_pred cCCcHHHHHHHHHCCCcEEEEEEc
Confidence 346789999999999999999874
No 34
>PTZ00021 falcipain-2; Provisional
Probab=92.23 E-value=0.084 Score=53.11 Aligned_cols=28 Identities=32% Similarity=0.503 Sum_probs=22.3
Q ss_pred heeeeeccCcCC----------CCCceeeeeeeecccC
Q psy15346 157 ATVKIVGWGEEN----------GRPYWTIVRVYAVSAS 184 (280)
Q Consensus 157 ~~~~~~gwg~~~----------~~~~w~~~~~~~~~~~ 184 (280)
|+|.+||||.++ +.+||++.|||...|.
T Consensus 422 HAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WG 459 (489)
T PTZ00021 422 HAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWG 459 (489)
T ss_pred eEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcc
Confidence 388899999763 2589999999977554
No 35
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. 
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=92.03 E-value=0.056 Score=46.78 Aligned_cols=29 Identities=31% Similarity=0.699 Sum_probs=24.6
Q ss_pred heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346 157 ATVKIVGWGEENGRPYWTIVRVYAVSASA 185 (280)
Q Consensus 157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~ 185 (280)
|++.++||+.+.++.||+++|||...|..
T Consensus 167 Hav~iVGy~~~~~~~~wiv~NSWG~~WG~ 195 (219)
T PF00112_consen 167 HAVLIVGYDDENGKGYWIVKNSWGTDWGD 195 (219)
T ss_dssp EEEEEEEEEEETTEEEEEEE-SBTTTSTB
T ss_pred ccccccccccccceeeEeeehhhCCccCC
Confidence 37889999999999999999999887664
No 36
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=91.65 E-value=0.13 Score=44.61 Aligned_cols=29 Identities=17% Similarity=0.377 Sum_probs=25.2
Q ss_pred heeeeeccCcCC--CCCceeeeeeeecccCc
Q psy15346 157 ATVKIVGWGEEN--GRPYWTIVRVYAVSASA 185 (280)
Q Consensus 157 ~~~~~~gwg~~~--~~~~w~~~~~~~~~~~~ 185 (280)
|+|.++||+.+. +.+||++.|||...|..
T Consensus 173 Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~ 203 (223)
T cd02619 173 HAVVIVGYDDNYVEGKGAFIVKNSWGTDWGD 203 (223)
T ss_pred eEEEEEeecCCCCCCCCEEEEEeCCCCcccc
Confidence 588899999887 89999999999876654
No 37
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=90.52 E-value=0.15 Score=54.95 Aligned_cols=31 Identities=26% Similarity=0.519 Sum_probs=25.2
Q ss_pred heeeeeccCcC-----CCCCceeeeeeeecccCccc
Q psy15346 157 ATVKIVGWGEE-----NGRPYWTIVRVYAVSASAEI 187 (280)
Q Consensus 157 ~~~~~~gwg~~-----~~~~~w~~~~~~~~~~~~~~ 187 (280)
|+|.++|||.+ .+.+||++.|||...|..++
T Consensus 723 HAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnG 758 (1004)
T PTZ00462 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEG 758 (1004)
T ss_pred ceEEEEEecccccccCCCCceEEEEcCCCCCcCCCe
Confidence 48889999974 25799999999999886544
No 38
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=66.43 E-value=12 Score=32.93 Aligned_cols=26 Identities=15% Similarity=0.391 Sum_probs=20.2
Q ss_pred cCCeEEEEEEeec-cCCccEEEEEcCC
Q psy15346 188 VAYATVKLIGWGE-ENGRPYWTIVSTF 213 (280)
Q Consensus 188 ~~~HaV~IVGwG~-e~g~~YWiirNSW 213 (280)
..+|||+||||-. .+|.++.++=|-|
T Consensus 118 ~~gHAlavvGya~~~~g~~~y~~WNPW 144 (175)
T PF05543_consen 118 HAGHALAVVGYAKPNNGQKTYYFWNPW 144 (175)
T ss_dssp --EEEEEEEEEEEETTSEEEEEEE-TT
T ss_pred ccceeEEEEeeeecCCCCeEEEEeCCc
Confidence 5789999999976 4578899999999
No 39
>KOG4128|consensus
Probab=64.43 E-value=0.83 Score=44.05 Aligned_cols=54 Identities=15% Similarity=0.247 Sum_probs=39.4
Q ss_pred cCCeEEEEEEeec-c---CCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeeeEeecc
Q psy15346 188 VAYATVKLIGWGE-E---NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 245 (280)
Q Consensus 188 ~~~HaV~IVGwG~-e---~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~~~p~~ 245 (280)
.-.|||++.+-|. + .+..=|.|.||||.+-|.+|+.+|..- -.+.++.-.+.+.
T Consensus 370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt~e----wf~EY~feiVVd~ 427 (457)
T KOG4128|consen 370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMTAE----WFREYAFEIVVDE 427 (457)
T ss_pred HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhhHH----HHHhhheeEEeec
Confidence 4579999999983 2 455679999999999999999766532 2455555555544
No 40
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=64.06 E-value=16 Score=33.63 Aligned_cols=23 Identities=13% Similarity=0.401 Sum_probs=17.6
Q ss_pred chHHHHHHHHHhCCcEEEEEEeC
Q psy15346 89 DEVADIQQEIMKNGPVVANMYLY 111 (280)
Q Consensus 89 ~~~~~Ik~~I~~~GPV~v~~~v~ 111 (280)
...+.|++.|.++.||++.++.+
T Consensus 76 ~~~~~l~~~l~~g~pv~~~~D~~ 98 (317)
T PF14399_consen 76 EAWEELKEALDAGRPVIVWVDMY 98 (317)
T ss_pred HHHHHHHHHHhCCCceEEEeccc
Confidence 34557888888888999998764
No 41
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=56.98 E-value=7.1 Score=38.96 Aligned_cols=73 Identities=15% Similarity=0.258 Sum_probs=51.2
Q ss_pred EEEcCchHHHHH----HHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCC----------
Q psy15346 84 YYWVNDEVADIQ----QEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA---------- 149 (280)
Q Consensus 84 ~y~~~~~~~~Ik----~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~---------- 149 (280)
++++ .+++|+ +.|..++||.++.++. .|+. |++||+....
T Consensus 289 y~Nv--p~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~-----------------------~k~GI~d~~~~~~~~~f~~~ 342 (437)
T cd00585 289 YLNV--PMDVLKKAAIAQLKDGEPVWFGCDVG-KFSD-----------------------RKSGILDTDLFDYELLFGID 342 (437)
T ss_pred EEec--CHHHHHHHHHHHHhcCCCEEEEEEcC-hhhc-----------------------cCCccccCcccchhhhcCcc
Confidence 4455 344555 5678899999999996 4666 7888875421
Q ss_pred ----------chhhhhhheeeeeccCcC-CCC-Cceeeeeeeecc
Q psy15346 150 ----------SAEIVAYATVKIVGWGEE-NGR-PYWTIVRVYAVS 182 (280)
Q Consensus 150 ----------~~~~~~~~~~~~~gwg~~-~~~-~~w~~~~~~~~~ 182 (280)
+.+....|++.++|++.+ +++ .||++.|||...
T Consensus 343 ~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g~p~yw~VkNSWG~~ 387 (437)
T cd00585 343 FGLNKAERLDYGESLMTHAMVLTGVDLDEDGKPVKWKVENSWGEK 387 (437)
T ss_pred ccCCHHHHHhhcCCcCCeEEEEEEEEecCCCCcceEEEEcccCCC
Confidence 112234578999999986 476 599999999653
No 42
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.59 E-value=32 Score=30.54 Aligned_cols=21 Identities=14% Similarity=0.273 Sum_probs=15.3
Q ss_pred CeEEEEEEeeccCCccEEEEEcCCC
Q psy15346 190 YATVKLIGWGEENGRPYWTIVSTFG 214 (280)
Q Consensus 190 ~HaV~IVGwG~e~g~~YWiirNSWG 214 (280)
-|+|+|+||++. |...-++||
T Consensus 148 ~H~v~itgyDk~----n~yynDpyG 168 (195)
T COG4990 148 IHSVLITGYDKY----NIYYNDPYG 168 (195)
T ss_pred eeeeEeeccccc----ceEeccccc
Confidence 599999999764 555566663
No 43
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=44.02 E-value=87 Score=28.37 Aligned_cols=21 Identities=14% Similarity=0.382 Sum_probs=17.6
Q ss_pred hHHHHHHHHHhCCcEEEEEEe
Q psy15346 90 EVADIQQEIMKNGPVVANMYL 110 (280)
Q Consensus 90 ~~~~Ik~~I~~~GPV~v~~~v 110 (280)
++++|..+|..+||+++-++.
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~ 132 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDA 132 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEcc
Confidence 688999999999988777765
No 44
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=37.78 E-value=43 Score=31.40 Aligned_cols=28 Identities=14% Similarity=0.169 Sum_probs=24.0
Q ss_pred cCCeEEEEEEeeccC--CccEEEEEcCCCC
Q psy15346 188 VAYATVKLIGWGEEN--GRPYWTIVSTFGE 215 (280)
Q Consensus 188 ~~~HaV~IVGwG~e~--g~~YWiirNSWG~ 215 (280)
..+||=.|++.-+-+ +...-.+||.||.
T Consensus 234 ~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 234 VKGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred ccCcceEEeEEEEEccCceEEEEecCCccC
Confidence 568999999998765 7889999999993
No 45
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=34.65 E-value=71 Score=27.82 Aligned_cols=23 Identities=9% Similarity=0.105 Sum_probs=16.5
Q ss_pred hHHHHHHHHHhCCcEEEEEEeCc
Q psy15346 90 EVADIQQEIMKNGPVVANMYLYS 112 (280)
Q Consensus 90 ~~~~Ik~~I~~~GPV~v~~~v~~ 112 (280)
+.+.+.+.|.++||+-++.....
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~P~ 119 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEAPG 119 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecCCC
Confidence 45677777888899888865543
No 46
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=33.08 E-value=1.4e+02 Score=23.61 Aligned_cols=22 Identities=9% Similarity=0.083 Sum_probs=15.5
Q ss_pred CCeEEEEEEeeccCCccEEEEEcCC
Q psy15346 189 AYATVKLIGWGEENGRPYWTIVSTF 213 (280)
Q Consensus 189 ~~HaV~IVGwG~e~g~~YWiirNSW 213 (280)
.+|.|+|+||.. ....+|.+.|
T Consensus 93 ~gH~vVv~g~~~---~~~~~i~DP~ 114 (141)
T cd02549 93 SGHAMVVIGYDR---KGNVYVNDPG 114 (141)
T ss_pred CCeEEEEEEEcC---CCCEEEECCC
Confidence 489999999971 1235667765
No 47
>PF01357 Pollen_allerg_1: Pollen allergen; InterPro: IPR007117 Expansins are unusual proteins that mediate cell wall extension in plants []. They are believed to act as a sort of chemical grease, allowing polymers to slide past one another by disrupting non-covalent hydrogen bonds that hold many wall polymers to one another. This process is not degradative and hence does not weaken the wall, which could otherwise rupture under internal pressure during growth. Sequence comparisons indicate at least four distinct expansin cDNAs in rice and at least six in Arabidopsis. The proteins are highly conserved in size and sequence (75-95% amino acid sequence similarity between any pairwise comparison), and phylogenetic trees indicate that this multigene family formed before the evolutionary divergence of monocotyledons and dicotyledons []. Sequence and motif analyses show no similarities to known functional domains that might account for expansin action on wall extension. It is thought that several highly-conserved tryptophans may function in expansin binding to cellulose, or other glycans. The high conservation of the family indicates that the mechanism by which expansins promote wall extension tolerates little variation in protein structure. Grass pollens, such as pollen from timothy grass, represent a major cause of type I allergy []. Interestingly, expansins share a high degree of sequence similarity with the Lol p I family of allergens. This entry represents the C-terminal domain.; PDB: 2VXQ_A 1WHP_A 1BMW_A 1WHO_A 2HCZ_X 2JNZ_A 3FT9_A 3FT1_C 1N10_B.
Probab=31.02 E-value=78 Score=23.91 Aligned_cols=43 Identities=16% Similarity=0.300 Sum_probs=23.1
Q ss_pred CCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCC
Q psy15346 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG 218 (280)
Q Consensus 170 ~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG 218 (280)
+|||+.+-...+ .+.+.|.-|=--..+....--++++||..|=
T Consensus 10 ~~~~l~v~v~n~------gG~gdi~~Vevk~~~s~~W~~m~r~wGa~W~ 52 (82)
T PF01357_consen 10 NPYYLAVLVKNV------GGDGDIKAVEVKQSGSGNWIPMKRSWGAVWQ 52 (82)
T ss_dssp BTTEEEEEEEEC------CTTS-EEEEEEEETTSSS-EE-EEECTTEEE
T ss_pred CCcEEEEEEEEc------CCCccEEEEEEEeCCCCCceEeecCcCceEE
Confidence 588888877766 3344444332221222234467889998884
Done!