Query psy1664
Match_columns 524
No_of_seqs 409 out of 2951
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 18:13:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1664.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1664hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 5.7E-70 1.2E-74 527.3 21.5 294 4-350 66-366 (372)
2 PTZ00203 cathepsin L protease; 100.0 4.6E-66 1E-70 525.8 27.0 298 1-347 30-332 (348)
3 PTZ00021 falcipain-2; Provisio 100.0 6.9E-64 1.5E-68 523.5 25.1 307 4-353 164-486 (489)
4 PTZ00200 cysteine proteinase; 100.0 7.3E-63 1.6E-67 514.7 25.0 298 4-353 121-443 (448)
5 KOG1543|consensus 100.0 4.3E-61 9.2E-66 486.1 24.5 285 13-348 30-316 (325)
6 cd02620 Peptidase_C1A_Cathepsi 100.0 1.8E-54 4E-59 422.4 23.8 235 98-492 1-235 (236)
7 PTZ00049 cathepsin C-like prot 100.0 2.9E-51 6.3E-56 436.7 24.9 271 94-500 378-681 (693)
8 cd02621 Peptidase_C1A_Cathepsi 100.0 2.4E-51 5.2E-56 402.6 21.9 218 97-349 1-236 (243)
9 cd02698 Peptidase_C1A_Cathepsi 100.0 3.4E-51 7.4E-56 400.1 21.6 214 97-340 1-220 (239)
10 PTZ00364 dipeptidyl-peptidase 100.0 3.3E-49 7.2E-54 417.6 22.8 231 94-496 202-460 (548)
11 cd02248 Peptidase_C1A Peptidas 100.0 1.1E-48 2.3E-53 375.8 21.1 204 98-349 1-206 (210)
12 KOG1544|consensus 100.0 1.8E-48 3.9E-53 371.6 7.9 302 39-498 152-463 (470)
13 PF00112 Peptidase_C1: Papain 100.0 2.8E-46 6E-51 360.7 16.7 210 97-350 1-215 (219)
14 smart00645 Pept_C1 Papain fami 100.0 1.9E-43 4.1E-48 328.7 16.7 164 97-346 1-166 (174)
15 cd02619 Peptidase_C1 C1 Peptid 100.0 4.3E-41 9.4E-46 325.2 18.9 202 100-340 1-214 (223)
16 PTZ00462 Serine-repeat antigen 100.0 4.8E-40 1E-44 359.8 21.4 223 107-348 538-774 (1004)
17 cd02698 Peptidase_C1A_Cathepsi 99.9 3E-27 6.5E-32 231.2 14.1 99 392-496 136-239 (239)
18 KOG1543|consensus 99.9 2.1E-27 4.6E-32 240.6 13.1 111 379-496 213-324 (325)
19 cd02621 Peptidase_C1A_Cathepsi 99.9 9.3E-27 2E-31 228.5 13.2 101 391-496 130-243 (243)
20 KOG1542|consensus 99.9 1E-26 2.2E-31 226.4 9.7 105 382-493 262-369 (372)
21 PTZ00203 cathepsin L protease; 99.9 5.7E-26 1.2E-30 231.8 12.0 99 386-493 240-338 (348)
22 COG4870 Cysteine protease [Pos 99.9 5.7E-26 1.2E-30 223.8 5.9 205 95-340 97-315 (372)
23 cd02248 Peptidase_C1A Peptidas 99.9 1E-24 2.2E-29 209.3 14.2 101 384-491 106-208 (210)
24 cd02620 Peptidase_C1A_Cathepsi 99.9 3.7E-25 8.1E-30 216.0 11.2 94 247-349 139-232 (236)
25 PTZ00021 falcipain-2; Provisio 99.9 7.1E-24 1.5E-28 222.7 12.0 107 386-497 375-488 (489)
26 PTZ00200 cysteine proteinase; 99.9 1.6E-23 3.5E-28 219.3 12.9 95 395-497 348-445 (448)
27 PF00112 Peptidase_C1: Papain 99.9 2.8E-23 6E-28 200.2 11.2 94 393-493 122-218 (219)
28 PTZ00462 Serine-repeat antigen 99.9 3.5E-23 7.6E-28 227.8 11.2 125 394-523 680-812 (1004)
29 PTZ00049 cathepsin C-like prot 99.9 7.8E-22 1.7E-26 211.6 9.3 94 250-350 555-671 (693)
30 PTZ00364 dipeptidyl-peptidase 99.8 3.1E-21 6.7E-26 205.1 9.5 95 248-350 339-454 (548)
31 cd00585 Peptidase_C1B Peptidas 99.8 3.4E-20 7.5E-25 193.0 16.3 213 114-337 55-398 (437)
32 KOG1544|consensus 99.8 3.6E-20 7.7E-25 178.0 6.8 98 246-345 347-452 (470)
33 smart00645 Pept_C1 Papain fami 99.8 1.3E-19 2.9E-24 168.5 8.7 75 408-490 93-170 (174)
34 cd02619 Peptidase_C1 C1 Peptid 99.8 4E-18 8.6E-23 164.6 12.7 84 393-481 124-213 (223)
35 cd00585 Peptidase_C1B Peptidas 99.5 9.6E-14 2.1E-18 145.0 8.5 86 386-480 288-399 (437)
36 PF03051 Peptidase_C1_2: Pepti 99.5 8.5E-13 1.8E-17 137.9 14.4 76 114-190 56-158 (438)
37 COG4870 Cysteine protease [Pos 99.3 1.8E-12 3.9E-17 128.8 8.2 124 394-524 224-351 (372)
38 PF08246 Inhibitor_I29: Cathep 99.3 1.5E-12 3.2E-17 98.3 2.4 58 9-67 1-58 (58)
39 smart00848 Inhibitor_I29 Cathe 99.0 1E-10 2.3E-15 87.8 0.7 57 9-66 1-57 (57)
40 PF03051 Peptidase_C1_2: Pepti 98.4 9.3E-07 2E-11 92.9 9.1 87 386-479 289-399 (438)
41 COG3579 PepC Aminopeptidase C 98.3 3.9E-06 8.4E-11 82.6 10.3 80 251-336 296-400 (444)
42 PF08127 Propeptide_C1: Peptid 97.2 0.00019 4.1E-09 49.5 1.7 36 38-76 4-39 (41)
43 PF05543 Peptidase_C47: Stapho 96.2 0.041 8.8E-07 50.2 9.8 120 118-324 18-145 (175)
44 KOG4128|consensus 95.4 0.023 5E-07 56.4 5.0 75 114-189 63-166 (457)
45 COG3579 PepC Aminopeptidase C 95.3 0.014 3E-07 58.2 3.2 72 401-478 308-400 (444)
46 PF13529 Peptidase_C39_2: Pept 94.6 0.44 9.5E-06 41.6 10.9 57 250-323 87-144 (144)
47 PF13529 Peptidase_C39_2: Pept 82.5 5.7 0.00012 34.3 7.5 60 390-465 85-144 (144)
48 PF14399 Transpep_BrtH: NlpC/p 76.2 6.1 0.00013 40.0 6.3 47 252-304 78-124 (317)
49 PF05543 Peptidase_C47: Stapho 75.5 7 0.00015 35.9 5.7 56 392-465 89-144 (175)
50 PF09778 Guanylate_cyc_2: Guan 74.7 8.8 0.00019 36.6 6.4 54 251-304 112-172 (212)
51 COG4990 Uncharacterized protei 70.1 8.9 0.00019 35.4 5.0 39 250-304 121-159 (195)
52 PF12385 Peptidase_C70: Papain 66.4 57 0.0012 29.6 9.2 38 252-304 98-135 (166)
53 PF14399 Transpep_BrtH: NlpC/p 65.9 13 0.00029 37.4 6.2 47 395-447 79-125 (317)
54 KOG4128|consensus 55.8 1.8 3.9E-05 43.5 -2.2 42 434-478 371-412 (457)
55 PF09778 Guanylate_cyc_2: Guan 50.4 56 0.0012 31.2 6.9 53 393-447 112-173 (212)
56 cd02549 Peptidase_C39A A sub-f 43.6 43 0.00094 28.9 4.9 34 255-302 70-103 (141)
57 COG4990 Uncharacterized protei 33.7 98 0.0021 28.7 5.4 40 392-447 121-160 (195)
58 PF12385 Peptidase_C70: Papain 31.5 1.1E+02 0.0023 27.9 5.2 39 394-447 98-136 (166)
59 cd02549 Peptidase_C39A A sub-f 29.9 1.9E+02 0.0042 24.7 6.8 34 397-444 70-103 (141)
60 PF01640 Peptidase_C10: Peptid 29.5 1.5E+02 0.0032 27.8 6.2 52 253-334 141-192 (192)
61 cd00044 CysPc Calpains, domain 29.4 68 0.0015 32.6 4.2 29 292-325 235-263 (315)
No 1
>KOG1542|consensus
Probab=100.00 E-value=5.7e-70 Score=527.35 Aligned_cols=294 Identities=26% Similarity=0.413 Sum_probs=244.5
Q ss_pred cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCC--CCCCC
Q psy1664 4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPD--SKLPQ 81 (524)
Q Consensus 4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~--~~~~~ 81 (524)
...+.|..|+.+|.|+|.+.+|...|+.+|++|+..++++++... .+-..|+|+|||||.|||+++++..+. .+.+.
T Consensus 66 ~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~-gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~ 144 (372)
T KOG1542|consen 66 GLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDP-GSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG 144 (372)
T ss_pred chHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCc-cccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence 347899999999999999999999999999999999999987652 477889999999999999997654332 21111
Q ss_pred CCCCcccccCCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCC
Q psy1664 82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD 161 (524)
Q Consensus 82 ~~~~~~~~~~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~ 161 (524)
. .......+...||++||||++ |+||||||||+||||||||+++++|.+++|+++ ++++||||+||||+ .
T Consensus 145 ~---~~~~~~~~~~~lP~~fDWR~k----gaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g--~LvsLSEQeLvDCD-~ 214 (372)
T KOG1542|consen 145 D---AAEAPIEPGESLPESFDWRDK----GAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATG--KLVSLSEQELVDCD-S 214 (372)
T ss_pred c---cccCcCCCCCCCCcccchhcc----CCccccccCCcCcchhhhhhhhhhhhHHHhhcC--cccccchhhhhccc-C
Confidence 1 111112445689999999999 999999999999999999999999999999986 68999999999995 5
Q ss_pred CCCCCCCCChHHHHHHHHH-hCCccCCccCCCCCccccccCcccccCCCCC-CCCCCCCCCccccccccCCCcccccccc
Q psy1664 162 CGNGCQGGFHGKAWKYWVT-TGIVSGGTYASKQGCRPYEIPCERYMNGSHS-SCQDNEPNTPECIRKCQPGYDVSYEDDL 239 (524)
Q Consensus 162 ~~~gC~GG~~~~a~~~~~~-~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~-~C~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (524)
.++||+||.+..||+|+++ .|+..|.+| ||++. .+ .|...+.....
T Consensus 215 ~d~gC~GGl~~nA~~~~~~~gGL~~E~dY-------PY~g~--------~~~~C~~~~~~~~v----------------- 262 (372)
T KOG1542|consen 215 CDNGCNGGLMDNAFKYIKKAGGLEKEKDY-------PYTGK--------KGNQCHFDKSKIVV----------------- 262 (372)
T ss_pred cCCcCCCCChhHHHHHHHHhCCccccccC-------Ccccc--------CCCccccchhhceE-----------------
Confidence 6889999999999999655 489999999 99987 44 78765533221
Q ss_pred ccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEc--CCCCCC-CCcEEEEEEeccCCCCCCCccceeE
Q psy1664 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--VAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKY 316 (524)
Q Consensus 240 ~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~--~~~~~~-~~HaV~iVGyg~~~~~~g~~~g~~Y 316 (524)
+......++.|+++|.+.|+++|||+|+|++ ..+|+|++||..+ ..|++. ++|+|+|||||... -.++|
T Consensus 263 -~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g------~~~PY 334 (372)
T KOG1542|consen 263 -SIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG------YEKPY 334 (372)
T ss_pred -EEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCCcccCCccccCceEEEEeecCCC------CCCce
Confidence 1111245677999999999999999999995 6799999999987 457664 99999999999862 25899
Q ss_pred EEEeCCCCCcccccCccccccccCccCCcCcCCc
Q psy1664 317 WLVANSFNTNWGENGLFRIGCRPYEIPCERYMNG 350 (524)
Q Consensus 317 WivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~ 350 (524)
||||||||++|||+||+||.|+.+ .||+....
T Consensus 335 WIVKNSWG~~WGE~GY~~l~RG~N--~CGi~~mv 366 (372)
T KOG1542|consen 335 WIVKNSWGTSWGEKGYYKLCRGSN--ACGIADMV 366 (372)
T ss_pred EEEECCccccccccceEEEecccc--ccccccch
Confidence 999999999999999999999977 68886544
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=4.6e-66 Score=525.81 Aligned_cols=298 Identities=21% Similarity=0.424 Sum_probs=231.8
Q ss_pred CCCcHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHh-CCCCCCCC
Q psy1664 1 MGKSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRM-GVHPDSKL 79 (524)
Q Consensus 1 ~~~st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~-~~~~~~~~ 79 (524)
|+++..+.|++|+++|+|+|.+.+|...|+.+|.+|+++|++||++. .+|++++|+|+|||.|||++++ +.......
T Consensus 30 ~~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~--~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~ 107 (348)
T PTZ00203 30 VGTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARN--PHARFGITKFFDLSEAEFAARYLNGAAYFAA 107 (348)
T ss_pred cccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccC--CCeEEeccccccCCHHHHHHHhcCCCccccc
Confidence 46678899999999999999998888899999999999999999875 6899999999999999999754 22111100
Q ss_pred CCCCCCc-ccccCCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhh
Q psy1664 80 PQNRLPL-LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSC 158 (524)
Q Consensus 80 ~~~~~~~-~~~~~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC 158 (524)
+...... .........+||++||||++ |+|+||||||.||||||||++++||++++|+++ ..+.||+|||+||
T Consensus 108 ~~~~~~~~~~~~~~~~~~lP~~~DWR~~----g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~--~~~~LSeQqLvdC 181 (348)
T PTZ00203 108 AKQHAGQHYRKARADLSAVPDAVDWREK----GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGH--KLVRLSEQQLVSC 181 (348)
T ss_pred ccccccccccccccccccCCCCCcCCcC----CCCCCccccCCCccHHHHhhHHHHHHHHHHhcC--CCccCCHHHHHhc
Confidence 0000000 00011122369999999998 899999999999999999999999999999975 4689999999999
Q ss_pred cCCCCCCCCCCChHHHHHHHHHh---CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCcccc
Q psy1664 159 CKDCGNGCQGGFHGKAWKYWVTT---GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235 (524)
Q Consensus 159 ~~~~~~gC~GG~~~~a~~~~~~~---Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~ 235 (524)
+. .+.||+||++..||+|++++ |+++|++| ||... .+....|....... ...
T Consensus 182 ~~-~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~Y-------PY~~~-----~~~~~~C~~~~~~~--------~~~---- 236 (348)
T PTZ00203 182 DH-VDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY-------PYVSG-----NGDVPECSNSSELA--------PGA---- 236 (348)
T ss_pred cC-CCCCCCCCCHHHHHHHHHHhcCCCCCccccC-------CCccC-----CCCCCcCCCCcccc--------cce----
Confidence 75 36799999999999999865 47888888 99865 11112454211000 000
Q ss_pred ccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCcccee
Q psy1664 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK 315 (524)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~ 315 (524)
.... ...++.++++|+.+|+++|||+|+|++. +|++|++|||+. +....++|||+|||||.+ ++++
T Consensus 237 -~i~~----~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~-------~g~~ 302 (348)
T PTZ00203 237 -RIDG----YVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMT-------GEVP 302 (348)
T ss_pred -Eecc----eeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecC-------CCce
Confidence 1111 1334557888999999999999999985 899999999985 333457999999999976 4689
Q ss_pred EEEEeCCCCCcccccCccccccccCccCCcCc
Q psy1664 316 YWLVANSFNTNWGENGLFRIGCRPYEIPCERY 347 (524)
Q Consensus 316 YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~ 347 (524)
|||||||||++|||+|||||+++.. .|++.
T Consensus 303 YWiikNSWG~~WGe~GY~ri~rg~n--~Cgi~ 332 (348)
T PTZ00203 303 YWVIKNSWGEDWGEKGYVRVTMGVN--ACLLT 332 (348)
T ss_pred EEEEEcCCCCCcCcCceEEEEcCCC--ccccc
Confidence 9999999999999999999998754 57764
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=6.9e-64 Score=523.54 Aligned_cols=307 Identities=21% Similarity=0.423 Sum_probs=235.9
Q ss_pred cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCCCCCCC--
Q psy1664 4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQ-- 81 (524)
Q Consensus 4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~~~~~~-- 81 (524)
-++.+|++|+++|+|+|.+.+|...|+.+|.+|+++|++||++.+ .+|++++|+|+|||.|||++++...+......
T Consensus 164 e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~-~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~~ 242 (489)
T PTZ00021 164 ENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKEN-VLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSNG 242 (489)
T ss_pred HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCC-CCEEEeccccccCCHHHHHHHhcccccccccccc
Confidence 457889999999999999999999999999999999999998754 79999999999999999998543221100000
Q ss_pred ---CCCCcc----ccc-CCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHH
Q psy1664 82 ---NRLPLL----VQL-SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD 153 (524)
Q Consensus 82 ---~~~~~~----~~~-~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q 153 (524)
...... ... +.....+|++||||+. |.|+||||||.||||||||++++||++++|+++ ..+.||+|
T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~----g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g--~~v~LSeQ 316 (489)
T PTZ00021 243 KKSPRVINYDDVIKKYKPKDATFDHAKYDWRLH----NGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKN--ELVSLSEQ 316 (489)
T ss_pred ccccccccccccccccccccccCCccccccccC----CCCCCcccccccccHHHHHHHHHHHHHHHHHcC--CCcccCHH
Confidence 000000 000 0011125999999998 899999999999999999999999999999976 46899999
Q ss_pred HHHhhcCCCCCCCCCCChHHHHHHHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCc
Q psy1664 154 DLVSCCKDCGNGCQGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232 (524)
Q Consensus 154 ~lvdC~~~~~~gC~GG~~~~a~~~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~ 232 (524)
||+||+. .+.||+||++..||+|+.++ |+++|++| ||.+. ..+.|.... |...+.
T Consensus 317 qLVDCs~-~n~GC~GG~~~~Af~yi~~~gGl~tE~~Y-------PY~~~-------~~~~C~~~~---------~~~~~~ 372 (489)
T PTZ00021 317 ELVDCSF-KNNGCYGGLIPNAFEDMIELGGLCSEDDY-------PYVSD-------TPELCNIDR---------CKEKYK 372 (489)
T ss_pred HHhhhcc-CCCCCCCcchHhhhhhhhhccccCccccc-------CccCC-------CCCcccccc---------ccccce
Confidence 9999975 36799999999999999876 89998888 99864 125565321 111111
Q ss_pred cccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCC---C
Q psy1664 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE---G 309 (524)
Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~---g 309 (524)
+ ..| ..++ +++|+++|+.+|||+|+|+++.+|++|++|||+.. |...++|||+|||||.+...+ +
T Consensus 373 i-----~~y----~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~-C~~~~nHAVlIVGYG~e~~~~~~~~ 440 (489)
T PTZ00021 373 I-----KSY----VSIP--EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE-CGEEPNHAVILVGYGMEEIYNSDTK 440 (489)
T ss_pred e-----eeE----EEec--HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCC-CCCccceEEEEEEecCcCCcccccc
Confidence 1 111 2333 57899999999999999999889999999999874 655689999999999763100 0
Q ss_pred CccceeEEEEeCCCCCcccccCccccccccC--ccCCcCcCCcCCC
Q psy1664 310 TSSVVKYWLVANSFNTNWGENGLFRIGCRPY--EIPCERYMNGSRS 353 (524)
Q Consensus 310 ~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~--~~~c~~~~~~~~~ 353 (524)
...+.+|||||||||++|||+|||||+++.. ...||+.+.+.+|
T Consensus 441 ~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP 486 (489)
T PTZ00021 441 KMEKRYYYIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVP 486 (489)
T ss_pred cCCCCCEEEEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccceeE
Confidence 0123579999999999999999999998753 2479998766554
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=7.3e-63 Score=514.71 Aligned_cols=298 Identities=24% Similarity=0.418 Sum_probs=230.3
Q ss_pred cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhC-CCCCCCC---
Q psy1664 4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG-VHPDSKL--- 79 (524)
Q Consensus 4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~-~~~~~~~--- 79 (524)
.+...|++|+++|+|+|.+.+|...|+.+|.+|++.|++||.. .+|++|+|+|+|||.|||.+++. ...+...
T Consensus 121 e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~---~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~ 197 (448)
T PTZ00200 121 EVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD---EPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNST 197 (448)
T ss_pred HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc---CCeEEeccccccCCHHHHHHHhccCCCccccccc
Confidence 3567899999999999999999999999999999999999963 58999999999999999998653 2211100
Q ss_pred -CCCC-------CCcccc---c-----C---CCCCCCCCccccccCCCCCCCCccCCCCC-CCccHHHHHHHHHHHHHHH
Q psy1664 80 -PQNR-------LPLLVQ---L-----S---DPLEELPEGFDARINWPYCPTIQEIRDQG-SCGSGWALGAVEAMSDRVC 139 (524)
Q Consensus 80 -~~~~-------~~~~~~---~-----~---~~~~~lP~s~DwR~~~~~~g~vtpvkdQg-~CGsCwAfA~~~~le~~~~ 139 (524)
+... .+.... . . .....+|++||||+. +.|+|||||| .||||||||++++||++++
T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~ 273 (448)
T PTZ00200 198 SHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYK 273 (448)
T ss_pred ccccccccccccccccccccccccccccccccccccCCCCccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHH
Confidence 0000 000000 0 0 001236999999998 8899999999 9999999999999999999
Q ss_pred HHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCC
Q psy1664 140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPN 219 (524)
Q Consensus 140 i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~ 219 (524)
|+++ ..+.||+|||+||+. .++||+||++..||+|++++|+++|++| ||.+. .+.|......
T Consensus 274 i~~~--~~~~LSeQqLvDC~~-~~~GC~GG~~~~A~~yi~~~Gi~~e~~Y-------PY~~~--------~~~C~~~~~~ 335 (448)
T PTZ00200 274 IYRD--KSVDLSEQELVNCDT-KSQGCSGGYPDTALEYVKNKGLSSSSDV-------PYLAK--------DGKCVVSSTK 335 (448)
T ss_pred HhcC--CCeecCHHHHhhccC-ccCCCCCCcHHHHHHHHhhcCccccccC-------CCCCC--------CCCCcCCCCC
Confidence 9865 468999999999975 3679999999999999999999998888 99876 6677643211
Q ss_pred CccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEE
Q psy1664 220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299 (524)
Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iV 299 (524)
. . .... +.+..+. +++++++.+|||+|+|.++.+|+.|++|||+++ |...++|||+||
T Consensus 336 ~----------~-----~i~~-----y~~~~~~-~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~-C~~~~nHaV~lV 393 (448)
T PTZ00200 336 K----------V-----YIDS-----YLVAKGK-DVLNKSLVISPTVVYIAVSRELLKYKSGVYNGE-CGKSLNHAVLLV 393 (448)
T ss_pred e----------e-----Eecc-----eEecCHH-HHHHHHHhcCCEEEEeecccccccCCCCccccc-cCCCCcEEEEEE
Confidence 0 0 0111 1222333 455556678999999999989999999999875 555589999999
Q ss_pred EeccCCCCCCCccceeEEEEeCCCCCcccccCccccccccC-ccCCcCcCCcCCC
Q psy1664 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPY-EIPCERYMNGSRS 353 (524)
Q Consensus 300 Gyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~-~~~c~~~~~~~~~ 353 (524)
|||.+. ++|.+|||||||||++|||+|||||++... ...||+.+.+.+|
T Consensus 394 GyG~d~-----~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~~~P 443 (448)
T PTZ00200 394 GEGYDE-----KTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVGLTP 443 (448)
T ss_pred EecccC-----CCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccceee
Confidence 999752 246899999999999999999999998642 2368887665444
No 5
>KOG1543|consensus
Probab=100.00 E-value=4.3e-61 Score=486.13 Aligned_cols=285 Identities=31% Similarity=0.550 Sum_probs=235.2
Q ss_pred HHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCCCCCCCCCCCcccccCC
Q psy1664 13 LKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSD 92 (524)
Q Consensus 13 ~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (524)
+..|.+.|.+..+...|+.+|.+|++.|+.||.... .+|++++|+|+|++.+|+++.......... .........
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~-~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~----~~~~~~~~~ 104 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYV-LSFLMGVNQFADLTTEEFKRKKTGKKPPEI----KRDKFTEKL 104 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhc-eeeeeccccccccchHHHHHhhccccCccc----ccccccccc
Confidence 445667777777788889999999999999999853 899999999999999999986544322210 111111223
Q ss_pred CCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664 93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172 (524)
Q Consensus 93 ~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~ 172 (524)
...+||++||||++| .+++||||||.||||||||++++||++++|++++ .++.||+|+|+||+..+++||+||++.
T Consensus 105 ~~~~~p~s~DwR~~~---~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~-~l~sLSeq~lvdC~~~~~~GC~GG~~~ 180 (325)
T KOG1543|consen 105 DGDDLPDSFDWRDKG---AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGG-KLLSLSEQDLVDCCGECGDGCNGGEPK 180 (325)
T ss_pred chhhCCCCccccccC---CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCC-ccCccChhhhhhccCCCCCCcCCCCHH
Confidence 345899999999996 4567799999999999999999999999999976 689999999999987767899999999
Q ss_pred HHHHHHHHhCCcc-CCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCC
Q psy1664 173 KAWKYWVTTGIVS-GGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN 251 (524)
Q Consensus 173 ~a~~~~~~~Gi~~-e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (524)
.|++|++++|+++ +.+| ||... .+.|...... ...+....+.++.+
T Consensus 181 ~A~~yi~~~G~~t~~~~Y-------py~~~--------~~~C~~~~~~------------------~~~~~~~~~~~~~~ 227 (325)
T KOG1543|consen 181 NAFKYIKKNGGVTECENY-------PYIGK--------DGTCKSNKKD------------------KTVTIKGFYNVPAN 227 (325)
T ss_pred HHHHHHHHhCCCCCCcCC-------CCcCC--------CCCccCCCcc------------------ceeEeeeeeecCcC
Confidence 9999999999888 8888 99877 5577654420 11122224567888
Q ss_pred HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCC-CCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCccccc
Q psy1664 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330 (524)
Q Consensus 252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~-~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~ 330 (524)
+++|+.+|+.+|||+|+|+++.+|+.|++|||.+++|.. .++|||+|||||+ . ++.+|||||||||++|||+
T Consensus 228 e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~-~------~~~~YWivkNSWG~~WGe~ 300 (325)
T KOG1543|consen 228 EEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGT-G------DGVDYWIVKNSWGTDWGEK 300 (325)
T ss_pred HHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcC-C------CCceeEEEEcCCCCCcccC
Confidence 999999999999999999999999999999999999887 4999999999998 3 5689999999999999999
Q ss_pred CccccccccCccCCcCcC
Q psy1664 331 GLFRIGCRPYEIPCERYM 348 (524)
Q Consensus 331 Gy~ri~~~~~~~~c~~~~ 348 (524)
|||||.++.. .|++..
T Consensus 301 Gy~ri~r~~~--~~~I~~ 316 (325)
T KOG1543|consen 301 GYFRIARGVN--KCGIAS 316 (325)
T ss_pred ceEEEecCCC--chhhhc
Confidence 9999999877 355443
No 6
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=1.8e-54 Score=422.38 Aligned_cols=235 Identities=55% Similarity=1.119 Sum_probs=186.0
Q ss_pred CCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHH
Q psy1664 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177 (524)
Q Consensus 98 P~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~ 177 (524)
|++||||++|+++..|+||+|||.||||||||++++||++++|++++...+.||+|+|+||+...+.||+||++..||+|
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~ 80 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY 80 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence 89999999988776678999999999999999999999999998764457899999999997654679999999999999
Q ss_pred HHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHH
Q psy1664 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMR 257 (524)
Q Consensus 178 ~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~ 257 (524)
++++|+++|++| ||... ...|...
T Consensus 81 i~~~G~~~e~~y-------PY~~~--------~~~~~~~----------------------------------------- 104 (236)
T cd02620 81 LTTTGVVTGGCQ-------PYTIP--------PCGHHPE----------------------------------------- 104 (236)
T ss_pred HHhcCCCcCCEe-------cCcCC--------CCccCCC-----------------------------------------
Confidence 999999998877 99754 1111000
Q ss_pred HHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCcccccc
Q psy1664 258 EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGC 337 (524)
Q Consensus 258 ~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~ 337 (524)
.|
T Consensus 105 -----------------------------------~~------------------------------------------- 106 (236)
T cd02620 105 -----------------------------------GP------------------------------------------- 106 (236)
T ss_pred -----------------------------------CC-------------------------------------------
Confidence 00
Q ss_pred ccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEeccccc
Q psy1664 338 RPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM 417 (524)
Q Consensus 338 ~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f 417 (524)
..|.. ...|+..|.......|..+.+.....+.+..++++||.+|+++|||+++|.++++|
T Consensus 107 ---------------~~~~~----~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f 167 (236)
T cd02620 107 ---------------PPCCG----TPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDF 167 (236)
T ss_pred ---------------CCCCC----CCCCCCCCCcCCccccceeeeeecceeeeCCHHHHHHHHHHHCCCeEEEEEechhh
Confidence 00100 11111223222111133334444556666667899999999999999999998899
Q ss_pred ccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCCCccCcccccee
Q psy1664 418 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA 492 (524)
Q Consensus 418 ~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~ 492 (524)
+.|++|||...+....++|||+|||||+++ +++|||||||||++||++|||||+||.|.|||+++++.
T Consensus 168 ~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~~-------g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~ 235 (236)
T cd02620 168 LYYKSGVYQHTSGKQLGGHAVKIIGWGVEN-------GVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA 235 (236)
T ss_pred hhcCCcEEeecCCCCcCCeEEEEEEEeccC-------CeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence 999999998765555679999999999886 88999999999999999999999999999999998764
No 7
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=2.9e-51 Score=436.65 Aligned_cols=271 Identities=26% Similarity=0.435 Sum_probs=191.4
Q ss_pred CCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCC--------ccccCCHHHHHhhcCCCCCC
Q psy1664 94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK--------RHVRLSSDDLVSCCKDCGNG 165 (524)
Q Consensus 94 ~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~--------~~~~LS~q~lvdC~~~~~~g 165 (524)
..+||++||||+.|+.++.++||+|||.||||||||++++||++++|+++.. ....||+|+||||+. .++|
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~-~nqG 456 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSF-YDQG 456 (693)
T ss_pred cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCC-CCCC
Confidence 4589999999999999999999999999999999999999999999986421 123799999999975 4679
Q ss_pred CCCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceee
Q psy1664 166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA 245 (524)
Q Consensus 166 C~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (524)
|+||++..|++|++++||++|..| ||++. .+.|+.........
T Consensus 457 C~GG~~~~A~kya~~~GI~tEscY-------PY~a~--------~g~C~~~~~~~~~~---------------------- 499 (693)
T PTZ00049 457 CNGGFPYLVSKMAKLQGIPLDKVF-------PYTAT--------EQTCPYQVDQSANS---------------------- 499 (693)
T ss_pred cCCCcHHHHHHHHHHCCCCcCCcc-------CCcCC--------CCCCCCCCCCcccc----------------------
Confidence 999999999999999999997777 99865 55675321100000
Q ss_pred eecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCC
Q psy1664 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325 (524)
Q Consensus 246 ~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~ 325 (524)
+.|+..
T Consensus 500 ----------------------------------------------------~~g~~~---------------------- 505 (693)
T PTZ00049 500 ----------------------------------------------------MNGSAN---------------------- 505 (693)
T ss_pred ----------------------------------------------------cccccc----------------------
Confidence 000000
Q ss_pred cccccCccccccccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEc--CCCHHHHHHHHHh
Q psy1664 326 NWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL--PANEETIMREIFR 403 (524)
Q Consensus 326 ~WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~--~~~~~~~~~~~~~ 403 (524)
+. +..+.+......+.|. ...|...|... ...|.++..+...+|.+ ..++++||.+|++
T Consensus 506 ---------~~----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~-~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~ 566 (693)
T PTZ00049 506 ---------LR----QINAVFFSSETQSDMH-----ADFEAPISSEP-ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYR 566 (693)
T ss_pred ---------cc----cccccccccccccccc-----ccccccccccc-cceeeeeeEEecccccccCCCCHHHHHHHHHh
Confidence 00 0000000000000010 00011111111 11122223333333443 2478999999999
Q ss_pred CCCEEEEEecccccccccccEEeCC-------CCC--------------CccCeeEEEeeecCCCCCCCccCC--ccEEE
Q psy1664 404 HGPVEGSMTIYADMILYKTGIYKHV-------AGG--------------PLGEHAIRIIGWGQEPLGEGTSSV--VKYWL 460 (524)
Q Consensus 404 ~gPv~~~~~~~~~f~~y~~gi~~~~-------~~~--------------~~~~H~v~ivG~g~~~~~~~~~~~--~~ywi 460 (524)
+|||+|+|+++++|++|++|||+.+ |.. ..++|||+|||||.+.. .| ++|||
T Consensus 567 ~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~e-----nG~~~~YWI 641 (693)
T PTZ00049 567 NGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEI-----NGKLYKYWI 641 (693)
T ss_pred cCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccC-----CCcccCEEE
Confidence 9999999999889999999999853 211 13699999999998631 14 48999
Q ss_pred EEcCCCCCCCCCcEEEEEeCCCccCccccceeccceeccc
Q psy1664 461 VANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE 500 (524)
Q Consensus 461 v~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~~~~~ 500 (524)
||||||++||++|||||+||.|.||||++++++.|++..-
T Consensus 642 VRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~pd~~rg 681 (693)
T PTZ00049 642 GRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEPDFSRG 681 (693)
T ss_pred EECCCCCCcccCceEEEEcCCCccCCccceeEEeeecccc
Confidence 9999999999999999999999999999999999998754
No 8
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=2.4e-51 Score=402.63 Aligned_cols=218 Identities=31% Similarity=0.629 Sum_probs=173.9
Q ss_pred CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCC----CccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG----KRHVRLSSDDLVSCCKDCGNGCQGGFHG 172 (524)
Q Consensus 97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~----~~~~~LS~q~lvdC~~~~~~gC~GG~~~ 172 (524)
||++||||+.++++.+|+|||||+.||||||||++++||++++|+++. ...+.||+|||+||+. .++||+||++.
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~-~~~GC~GG~~~ 79 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQ-YSQGCDGGFPF 79 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcC-CCCCCCCCCHH
Confidence 799999999977777999999999999999999999999999998764 2368999999999974 46799999999
Q ss_pred HHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCcccccccccccee-eeecCCC
Q psy1664 173 KAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI-AYSLPAN 251 (524)
Q Consensus 173 ~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 251 (524)
.|++|++++|+++|++| ||... ..+.|...... +.... ...|... .+....+
T Consensus 80 ~a~~~~~~~Gi~~e~~y-------PY~~~-------~~~~C~~~~~~-------~~~~~------~~~~~~i~~~~~~~~ 132 (243)
T cd02621 80 LVGKFAEDFGIVTEDYF-------PYTAD-------DDRPCKASPSE-------CRRYY------FSDYNYVGGCYGCTN 132 (243)
T ss_pred HHHHHHHhcCcCCCcee-------CCCCC-------CCCCCCCCccc-------ccccc------ccceeEcccccccCC
Confidence 99999999999998777 99861 15667643200 00000 0011111 1112357
Q ss_pred HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCC----CCC---------CCCcEEEEEEeccCCCCCCCccceeEEE
Q psy1664 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGP---------LGEHAIRIIGWGQEPLGEGTSSVVKYWL 318 (524)
Q Consensus 252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~----~~~---------~~~HaV~iVGyg~~~~~~g~~~g~~YWi 318 (524)
+++||++|+++|||+|+|.++++|++|++|||+... |.. .++|||+|||||++.. ++++|||
T Consensus 133 ~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-----~g~~YWi 207 (243)
T cd02621 133 EDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-----KGEKYWI 207 (243)
T ss_pred HHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-----CCCcEEE
Confidence 889999999999999999999999999999998763 421 4799999999998631 3689999
Q ss_pred EeCCCCCcccccCccccccccCccCCcCcCC
Q psy1664 319 VANSFNTNWGENGLFRIGCRPYEIPCERYMN 349 (524)
Q Consensus 319 vkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~ 349 (524)
||||||++|||+|||||+|+.. .|++...
T Consensus 208 irNSWG~~WGe~Gy~~i~~~~~--~cgi~~~ 236 (243)
T cd02621 208 VKNSWGSSWGEKGYFKIRRGTN--ECGIESQ 236 (243)
T ss_pred EEcCCCCCCCcCCeEEEecCCc--ccCcccc
Confidence 9999999999999999999764 6887654
No 9
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=3.4e-51 Score=400.11 Aligned_cols=214 Identities=31% Similarity=0.600 Sum_probs=168.7
Q ss_pred CCCccccccCCCCCCCCccCCCCC---CCccHHHHHHHHHHHHHHHHHcCCC-ccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664 97 LPEGFDARINWPYCPTIQEIRDQG---SCGSGWALGAVEAMSDRVCIASRGK-RHVRLSSDDLVSCCKDCGNGCQGGFHG 172 (524)
Q Consensus 97 lP~s~DwR~~~~~~g~vtpvkdQg---~CGsCwAfA~~~~le~~~~i~~~~~-~~~~LS~q~lvdC~~~~~~gC~GG~~~ 172 (524)
||++||||+++.. .+|+|||||| .||||||||++++||++++|++++. ..+.||+|||+||+. +.||+||++.
T Consensus 1 lP~~~Dwr~~~~~-~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~--~~gC~GG~~~ 77 (239)
T cd02698 1 LPKSWDWRNVNGV-NYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG--GGSCHGGDPG 77 (239)
T ss_pred CCCCcccccCCCC-cccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC--CCCccCcCHH
Confidence 7999999998322 2799999998 8999999999999999999987653 357899999999975 6799999999
Q ss_pred HHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCcccc--ccccCCCccccccccccceeeeecCC
Q psy1664 173 KAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI--RKCQPGYDVSYEDDLNFGRIAYSLPA 250 (524)
Q Consensus 173 ~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (524)
.|++|++++|+++|++| ||... ...|...... ..|. ..|.............| ..+ .
T Consensus 78 ~a~~~~~~~Gl~~e~~y-------PY~~~--------~~~C~~~~~~-~~c~~~~~c~~~~~~~~~~i~~~----~~~-~ 136 (239)
T cd02698 78 GVYEYAHKHGIPDETCN-------PYQAK--------DGECNPFNRC-GTCNPFGECFAIKNYTLYFVSDY----GSV-S 136 (239)
T ss_pred HHHHHHHHcCcCCCCee-------CCcCC--------CCCCcCCCCC-CCcccCcccccccccceEEeeec----eec-C
Confidence 99999999999998877 99865 4556432110 1111 11211100000011111 122 3
Q ss_pred CHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCccccc
Q psy1664 251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN 330 (524)
Q Consensus 251 ~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~ 330 (524)
++++||++|+++|||+|+|.++++|+.|++|||+..+|...++|||+|||||++. ++++|||||||||++|||+
T Consensus 137 ~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~------~g~~YWiikNSWG~~WGe~ 210 (239)
T cd02698 137 GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDE------NGVEYWIVRNSWGEPWGER 210 (239)
T ss_pred CHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecC------CCCEEEEEEcCCCcccCcC
Confidence 5789999999999999999999999999999999888877889999999999863 2689999999999999999
Q ss_pred CccccccccC
Q psy1664 331 GLFRIGCRPY 340 (524)
Q Consensus 331 Gy~ri~~~~~ 340 (524)
|||||+++.+
T Consensus 211 Gy~~i~rg~~ 220 (239)
T cd02698 211 GWFRIVTSSY 220 (239)
T ss_pred ceEEEEccCC
Confidence 9999999873
No 10
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=3.3e-49 Score=417.57 Aligned_cols=231 Identities=26% Similarity=0.462 Sum_probs=178.5
Q ss_pred CCCCCCccccccCCCCCCCCccCCCCCC---CccHHHHHHHHHHHHHHHHHcCCC----ccccCCHHHHHhhcCCCCCCC
Q psy1664 94 LEELPEGFDARINWPYCPTIQEIRDQGS---CGSGWALGAVEAMSDRVCIASRGK----RHVRLSSDDLVSCCKDCGNGC 166 (524)
Q Consensus 94 ~~~lP~s~DwR~~~~~~g~vtpvkdQg~---CGsCwAfA~~~~le~~~~i~~~~~----~~~~LS~q~lvdC~~~~~~gC 166 (524)
..+||++||||++. .+.+|+||||||. ||||||||++++||++++|++++. ..+.||+|+|+||+. .++||
T Consensus 202 ~~~LP~sfDWR~~g-g~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~-~n~GC 279 (548)
T PTZ00364 202 GDPPPAAWSWGDVG-GASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQ-YGQGC 279 (548)
T ss_pred ccCCCCccccCcCC-CCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccC-CCCCC
Confidence 35799999999982 2247899999999 999999999999999999998542 358899999999974 47899
Q ss_pred CCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeee
Q psy1664 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY 246 (524)
Q Consensus 167 ~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (524)
+||++..|++|++++|+++|++| |.||... .+....|+.....
T Consensus 280 dGG~p~~A~~yi~~~GI~tE~dY-----~~PY~~~-----dg~~~~Ck~~~~~--------------------------- 322 (548)
T PTZ00364 280 AGGFPEEVGKFAETFGILTTDSY-----YIPYDSG-----DGVERACKTRRPS--------------------------- 322 (548)
T ss_pred CCCcHHHHHHHHHhCCccccccc-----CCCCCCC-----CCCCCCCCCCccc---------------------------
Confidence 99999999999999999997766 4588653 0001112210000
Q ss_pred ecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCc
Q psy1664 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326 (524)
Q Consensus 247 ~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~ 326 (524)
T Consensus 323 -------------------------------------------------------------------------------- 322 (548)
T PTZ00364 323 -------------------------------------------------------------------------------- 322 (548)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccccCccccccccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCC
Q psy1664 327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP 406 (524)
Q Consensus 327 WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gP 406 (524)
...+..+..+..+++.+..++++|+.+|+++||
T Consensus 323 -----------------------------------------------~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GP 355 (548)
T PTZ00364 323 -----------------------------------------------RRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGP 355 (548)
T ss_pred -----------------------------------------------ceeeeeeeEEecceeecCCcHHHHHHHHHHcCC
Confidence 000000111112233344578899999999999
Q ss_pred EEEEEecccccccccccEEeCC---------CC----------CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCC
Q psy1664 407 VEGSMTIYADMILYKTGIYKHV---------AG----------GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 467 (524)
Q Consensus 407 v~~~~~~~~~f~~y~~gi~~~~---------~~----------~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~ 467 (524)
|+|+|+++.+|+.|++|||... ++ ...++|+|+|||||++.+ |++|||||||||+
T Consensus 356 VsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~------G~~YWIVKNSWGt 429 (548)
T PTZ00364 356 VPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDEN------GGDYWLVLDPWGS 429 (548)
T ss_pred eEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCC------CceEEEEECCCCC
Confidence 9999999889999999998521 11 134799999999997542 7899999999999
Q ss_pred --CCCCCcEEEEEeCCCccCccccceeccce
Q psy1664 468 --NWGENGLFRIVRGQNECGIEADITAGLPK 496 (524)
Q Consensus 468 --~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~ 496 (524)
+||++|||||+||.|+||||+.++++.|.
T Consensus 430 ~~~WGE~GYfRI~RG~N~CGIes~~v~~~~~ 460 (548)
T PTZ00364 430 RRSWCDGGTRKIARGVNAYNIESEVVVMYWA 460 (548)
T ss_pred CCCcccCCeEEEEcCCCcccccceeeeeeee
Confidence 99999999999999999999999988884
No 11
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=1.1e-48 Score=375.76 Aligned_cols=204 Identities=33% Similarity=0.616 Sum_probs=170.4
Q ss_pred CCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHH
Q psy1664 98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY 177 (524)
Q Consensus 98 P~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~ 177 (524)
|++||||+. +.++||+|||.||+|||||++++||++++++++ ..+.||+|+|++|....+.+|.||++..|+++
T Consensus 1 P~~~d~r~~----~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~ 74 (210)
T cd02248 1 PESVDWREK----GAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEY 74 (210)
T ss_pred CCcccCCcC----CCCCCCccCCCCcchHHhHHHHHHHHHHHHHcC--CCcccCHHHHhccCCCCCCCCCCCCHHHhHHH
Confidence 889999998 669999999999999999999999999999876 46889999999997644579999999999999
Q ss_pred HHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCC-CHHHHH
Q psy1664 178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA-NEETIM 256 (524)
Q Consensus 178 ~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ik 256 (524)
+++.|+++|++| ||... ...|...... ..+ +... ...+.. +.++||
T Consensus 75 ~~~~Gi~~e~~y-------PY~~~--------~~~C~~~~~~---------~~~-----~i~~----~~~i~~~~~~~ik 121 (210)
T cd02248 75 VKNGGLASESDY-------PYTGK--------DGTCKYNSSK---------VGA-----KITG----YSNVPPGDEEALK 121 (210)
T ss_pred HHHCCcCccccC-------CccCC--------CCCccCCCCc---------ccE-----EEee----EEEcCCCcHHHHH
Confidence 999999998888 99864 4556543210 000 1111 123333 478999
Q ss_pred HHHHHcCCeEEEEEecccccccCCceEEcCCC-CCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCcccc
Q psy1664 257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335 (524)
Q Consensus 257 ~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~-~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri 335 (524)
++|+++|||+++|.++++|+.|++|||..+.+ ...++|||+|||||++ .+.+|||||||||++||++|||||
T Consensus 122 ~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~-------~~~~ywiv~NSWG~~WG~~Gy~~i 194 (210)
T cd02248 122 AALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTE-------NGVDYWIVKNSWGTSWGEKGYIRI 194 (210)
T ss_pred HHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeec-------CCceEEEEEcCCCCccccCcEEEE
Confidence 99999999999999999999999999998877 4568999999999987 368999999999999999999999
Q ss_pred ccccCccCCcCcCC
Q psy1664 336 GCRPYEIPCERYMN 349 (524)
Q Consensus 336 ~~~~~~~~c~~~~~ 349 (524)
+++.. .|++...
T Consensus 195 ~~~~~--~cgi~~~ 206 (210)
T cd02248 195 ARGSN--LCGIASY 206 (210)
T ss_pred EcCCC--ccCceee
Confidence 98773 6887643
No 12
>KOG1544|consensus
Probab=100.00 E-value=1.8e-48 Score=371.58 Aligned_cols=302 Identities=35% Similarity=0.649 Sum_probs=232.1
Q ss_pred hhhhhhcCCCCcccccccc-ccccchHHH-HHHHhCCCCCCCCCCCCCCcccccCCCCCCCCCccccccCCCCCCCCccC
Q psy1664 39 RVDHSILLPKLPFYGAEKN-ALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116 (524)
Q Consensus 39 ~I~~~N~~~~~~~~~~g~N-~fsd~t~eE-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~s~DwR~~~~~~g~vtpv 116 (524)
+|+++|+.. .+|+++.. +|=.||.++ |+-+||..+++....+.. +....-.+...||+.||.|.+|| +++.++
T Consensus 152 ~iE~in~G~--YgW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~sv~nMN-Ei~~~l~p~~~LPE~F~As~KWp--~liH~p 226 (470)
T KOG1544|consen 152 MIEAINQGN--YGWQAGNYSAFWGMTLDDGIKYRLGTLRPSSSVMNMN-EIYTVLNPGEVLPEAFEASEKWP--NLIHEP 226 (470)
T ss_pred HHHHHhcCC--ccccccchhhhhcccccccceeeecccCchhhhhhHH-hHhhccCcccccchhhhhhhcCC--ccccCc
Confidence 499999877 79999844 688888876 666788766543211111 11112233468999999999999 679999
Q ss_pred CCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCccCCCCCcc
Q psy1664 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR 196 (524)
Q Consensus 117 kdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~ 196 (524)
.|||.|++.|||+++++..++++|.+.|+....||+|+|++|.....+||+||....|+-||++.|++. ..|+
T Consensus 227 lDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKrGvVs-------dhCY 299 (470)
T KOG1544|consen 227 LDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKRGVVS-------DHCY 299 (470)
T ss_pred cccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecccccc-------cccc
Confidence 999999999999999999999999999988899999999999876567999999999999999999997 5566
Q ss_pred ccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEeccccc
Q psy1664 197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI 276 (524)
Q Consensus 197 PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~ 276 (524)
||.... +...+.|...+
T Consensus 300 P~~~dQ----~~~~~~C~m~s----------------------------------------------------------- 316 (470)
T KOG1544|consen 300 PFSGDQ----AGPAPPCMMHS----------------------------------------------------------- 316 (470)
T ss_pred cccCCC----CCCCCCceeec-----------------------------------------------------------
Confidence 997540 00111111000
Q ss_pred ccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCccccccccCccCCcCcCCcCCCCCC
Q psy1664 277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQ 356 (524)
Q Consensus 277 ~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~ 356 (524)
...|+|
T Consensus 317 --------------------R~~grg------------------------------------------------------ 322 (470)
T KOG1544|consen 317 --------------------RAMGRG------------------------------------------------------ 322 (470)
T ss_pred --------------------cccCcc------------------------------------------------------
Confidence 001222
Q ss_pred CCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCC------
Q psy1664 357 ANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG------ 430 (524)
Q Consensus 357 ~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~------ 430 (524)
+.+.++-|++++.. .++.+.....|++.++|++|+++||++|||.+.|.|.++|+.|++|||.+.+.
T Consensus 323 -----kRqat~~CPn~~~~--Sn~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e 395 (470)
T KOG1544|consen 323 -----KRQATAHCPNSYVN--SNDIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPE 395 (470)
T ss_pred -----cccccCcCCCcccc--cCceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCch
Confidence 12222334333321 12455666778999999999999999999999999999999999999987642
Q ss_pred --CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccceec
Q psy1664 431 --GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG 498 (524)
Q Consensus 431 --~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~~~ 498 (524)
...+.|+|.|.|||++....|. ..+|||..||||+.||++|||||.||.|+|.||+.++++.-.++
T Consensus 396 ~yr~~gtHsVk~tGWG~~~~~~G~--~~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgAWGr~~ 463 (470)
T KOG1544|consen 396 RYRRHGTHSVKITGWGEETLPDGR--TLKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGAWGRVG 463 (470)
T ss_pred hhhhcccceEEEeecccccCCCCC--eeEEEEeecccccccccCceEEEeccccchhhhHhhhhhhhccc
Confidence 1347999999999998755555 78999999999999999999999999999999999988876554
No 13
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=2.8e-46 Score=360.66 Aligned_cols=210 Identities=35% Similarity=0.649 Sum_probs=165.2
Q ss_pred CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHH
Q psy1664 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176 (524)
Q Consensus 97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~ 176 (524)
||++||||+.+ +.++||+||+.||+|||||++++||++++++.+ ...+.||+|+|++|....+.+|+||++..|++
T Consensus 1 lP~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~-~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~ 76 (219)
T PF00112_consen 1 LPKSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNN-GKNVDLSEQYLIDCSNKYNKGCDGGSPFDALK 76 (219)
T ss_dssp STSSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHT-SSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHH
T ss_pred CCCCEecccCC---CCcCccccCCcccccccchhccceecccccccc-ccccccccccccccccccccccccCcccccce
Confidence 89999999962 368999999999999999999999999999985 46799999999999864457999999999999
Q ss_pred HHHH-hCCccCCccCCCCCccccccCcccccCCCC-CCCCCCCCCCccccccccCCCccccccccccceeeeecC-CCHH
Q psy1664 177 YWVT-TGIVSGGTYASKQGCRPYEIPCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANEE 253 (524)
Q Consensus 177 ~~~~-~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~-~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 253 (524)
++++ +|+++|+.| ||... . ..|....... ...+...| ..+. .+.+
T Consensus 77 ~~~~~~Gi~~e~~~-------pY~~~--------~~~~c~~~~~~~-------------~~~~i~~~----~~~~~~~~~ 124 (219)
T PF00112_consen 77 YIKNNNGIVTEEDY-------PYNGN--------ENPTCKSKKSNS-------------YYVKIKGY----GKVKDNDIE 124 (219)
T ss_dssp HHHHHTSBEBTTTS---------SSS--------SSCSSCHSGGGE-------------EEBEESEE----EEEESTCHH
T ss_pred eecccCcccccccc-------ccccc--------cccccccccccc-------------cccccccc----ccccccchh
Confidence 9999 899998888 99865 2 3454321100 00111111 1222 2589
Q ss_pred HHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCC-CCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccC
Q psy1664 254 TIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331 (524)
Q Consensus 254 ~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~-~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~G 331 (524)
+||++|+++|||+++|.+.+ +|+.|++|||..+.+. ..++|||+|||||++ .+++|||||||||++||++|
T Consensus 125 ~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~-------~~~~~wiv~NSWG~~WG~~G 197 (219)
T PF00112_consen 125 DIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDE-------NGKGYWIVKNSWGTDWGDNG 197 (219)
T ss_dssp HHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEE-------TTEEEEEEE-SBTTTSTBTT
T ss_pred HHHHHHhhCceeeeeeeccccccccccceeeeccccccccccccccccccccc-------cceeeEeeehhhCCccCCCe
Confidence 99999999999999999998 6999999999998665 478999999999987 47899999999999999999
Q ss_pred ccccccccCccCCcCcCCc
Q psy1664 332 LFRIGCRPYEIPCERYMNG 350 (524)
Q Consensus 332 y~ri~~~~~~~~c~~~~~~ 350 (524)
||||+++.. ..|++....
T Consensus 198 y~~i~~~~~-~~c~i~~~~ 215 (219)
T PF00112_consen 198 YFRISYDYN-NECGIESQA 215 (219)
T ss_dssp EEEEESSSS-SGGGTTSSE
T ss_pred EEEEeeCCC-CcCccCcee
Confidence 999999765 257776544
No 14
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=1.9e-43 Score=328.69 Aligned_cols=164 Identities=40% Similarity=0.780 Sum_probs=140.0
Q ss_pred CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHH
Q psy1664 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK 176 (524)
Q Consensus 97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~ 176 (524)
||++||||++ ++++||+||+.||+|||||++++||++++++++. .+.||+|+|++|....++||.||++..|++
T Consensus 1 lP~~~D~R~~----~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~--~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~ 74 (174)
T smart00645 1 LPESFDWRKK----GAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGK--LVSLSEQQLVDCSTGGNNGCNGGLPDNAFE 74 (174)
T ss_pred CCCcCccccc----CCCCccccCcccchHHHHHHHHHHHHHHHHhcCC--ccccCHHHHhhhcCCCCCCCCCcCHHHHHH
Confidence 7999999998 5789999999999999999999999999999863 689999999999765345999999999999
Q ss_pred HHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHH
Q psy1664 177 YWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI 255 (524)
Q Consensus 177 ~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 255 (524)
|++++ |+++|+.| ||+.
T Consensus 75 ~~~~~~Gi~~e~~~-------PY~~------------------------------------------------------- 92 (174)
T smart00645 75 YIKKNGGLETESCY-------PYTG------------------------------------------------------- 92 (174)
T ss_pred HHHHcCCccccccc-------Cccc-------------------------------------------------------
Confidence 99998 99997777 9841
Q ss_pred HHHHHHcCCeEEEEEecccccccCCceEEcCCCCC-CCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCccc
Q psy1664 256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334 (524)
Q Consensus 256 k~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~-~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~r 334 (524)
++.+.+. +|+.|++|||+.+.|.. .++|+|+|||||.+. ++++|||||||||+.|||+||||
T Consensus 93 ----------~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~------~g~~yWii~NSwG~~WG~~G~~~ 155 (174)
T smart00645 93 ----------SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEE------NGKDYWIVKNSWGTDWGENGYFR 155 (174)
T ss_pred ----------EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecC------CCeeEEEEECCCCCCcccCeEEE
Confidence 3444443 69999999998865543 479999999999752 46899999999999999999999
Q ss_pred cccccCccCCcC
Q psy1664 335 IGCRPYEIPCER 346 (524)
Q Consensus 335 i~~~~~~~~c~~ 346 (524)
|+++.+ ..|++
T Consensus 156 i~~~~~-~~c~i 166 (174)
T smart00645 156 IARGKN-NECGI 166 (174)
T ss_pred EEcCCC-CccCc
Confidence 998752 25666
No 15
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00 E-value=4.3e-41 Score=325.17 Aligned_cols=202 Identities=26% Similarity=0.396 Sum_probs=157.2
Q ss_pred ccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCC----CCCCCCChHHHH
Q psy1664 100 GFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG----NGCQGGFHGKAW 175 (524)
Q Consensus 100 s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~----~gC~GG~~~~a~ 175 (524)
.||||+. + ++||+|||.||+|||||++++||++++++......+.||+|+|++|..... .+|.||.+..++
T Consensus 1 ~~d~r~~----~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~ 75 (223)
T cd02619 1 SVDLRPL----R-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSAL 75 (223)
T ss_pred CCcchhc----C-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHH
Confidence 4899998 6 899999999999999999999999999987533468999999999976532 699999999999
Q ss_pred H-HHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecC-CCHH
Q psy1664 176 K-YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANEE 253 (524)
Q Consensus 176 ~-~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 253 (524)
. +++++|+++|++| ||... ...|........ ... ......| ..+. .+++
T Consensus 76 ~~~~~~~Gi~~e~~~-------Py~~~--------~~~~~~~~~~~~-------~~~---~~~~~~y----~~~~~~~~~ 126 (223)
T cd02619 76 LKLVALKGIPPEEDY-------PYGAE--------SDGEEPKSEAAL-------NAA---KVKLKDY----RRVLKNNIE 126 (223)
T ss_pred HHHHHHcCCCccccC-------CCCCC--------CCCCCCCCccch-------hhc---ceeecce----eEeCchhHH
Confidence 8 8889999998888 99865 333322110000 000 0011111 1222 3478
Q ss_pred HHHHHHHHcCCeEEEEEecccccccCCceEE-----c-CCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcc
Q psy1664 254 TIMREIFRHGPVEGSMTIYADMILYKTGIYK-----H-VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW 327 (524)
Q Consensus 254 ~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~-----~-~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~W 327 (524)
+||++|+++|||+++|.++.+|+.|++|+|. . .++...++|||+|||||++.. .+++|||||||||+.|
T Consensus 127 ~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-----~~~~~~i~~NSwG~~w 201 (223)
T cd02619 127 DIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-----EGKGAFIVKNSWGTDW 201 (223)
T ss_pred HHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCC-----CCCCEEEEEeCCCCcc
Confidence 9999999999999999999999999999873 2 223446899999999998731 2689999999999999
Q ss_pred cccCccccccccC
Q psy1664 328 GENGLFRIGCRPY 340 (524)
Q Consensus 328 Ge~Gy~ri~~~~~ 340 (524)
|++||+||++...
T Consensus 202 g~~Gy~~i~~~~~ 214 (223)
T cd02619 202 GDNGYGRISYEDV 214 (223)
T ss_pred ccCCEEEEehhhh
Confidence 9999999998765
No 16
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=4.8e-40 Score=359.82 Aligned_cols=223 Identities=21% Similarity=0.379 Sum_probs=155.6
Q ss_pred CCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCC-CCCCCCCC-hHHHHHHHHHhC-C
Q psy1664 107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC-GNGCQGGF-HGKAWKYWVTTG-I 183 (524)
Q Consensus 107 ~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~-~~gC~GG~-~~~a~~~~~~~G-i 183 (524)
++.|....||||||.||+|||||++++||++++|+++ ..+.||+|+|+||+... ..||.||+ +..++.|++++| +
T Consensus 538 ~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg--~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgL 615 (1004)
T PTZ00462 538 ENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGY--EPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFL 615 (1004)
T ss_pred CCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcC--CCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCC
Confidence 5777777899999999999999999999999999875 46899999999998643 46999997 556679998885 7
Q ss_pred ccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccC---C-Cccccccccccceeeee-----cCCCHHH
Q psy1664 184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP---G-YDVSYEDDLNFGRIAYS-----LPANEET 254 (524)
Q Consensus 184 ~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~-----~~~~~~~ 254 (524)
++|++| ||... ...+.|+........+-..... . ..........|....-. +..-++.
T Consensus 616 ptESdY-------PYt~k------~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~ 682 (1004)
T PTZ00462 616 PADSNY-------LYNYT------KVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKI 682 (1004)
T ss_pred cccccC-------CCccC------CCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHH
Confidence 777777 99753 1245676432211111000000 0 00000011112111000 0011468
Q ss_pred HHHHHHHcCCeEEEEEecccccccC-CceEEcCCCC-CCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCc
Q psy1664 255 IMREIFRHGPVEGSMTIYADMILYK-TGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332 (524)
Q Consensus 255 ik~~l~~~GPV~v~i~v~~~f~~Y~-sGIy~~~~~~-~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy 332 (524)
|+++|+.+|||+|+|++. +|+.|. +|||....|. ..++|||+|||||.+.+.++ .+++|||||||||+.|||+||
T Consensus 683 IK~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg--~gk~YWIVRNSWGt~WGEnGY 759 (1004)
T PTZ00462 683 IKDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDED--EKKSYWIVRNSWGKYWGDEGY 759 (1004)
T ss_pred HHHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccC--CCCceEEEEcCCCCCcCCCeE
Confidence 999999999999999985 788885 8987766565 45799999999997532111 357999999999999999999
Q ss_pred cccccccCccCCcCcC
Q psy1664 333 FRIGCRPYEIPCERYM 348 (524)
Q Consensus 333 ~ri~~~~~~~~c~~~~ 348 (524)
|||+|+.. ..|++..
T Consensus 760 FKI~r~g~-n~CGin~ 774 (1004)
T PTZ00462 760 FKVDMYGP-SHCEDNF 774 (1004)
T ss_pred EEEEeCCC-CCCccch
Confidence 99999532 2688754
No 17
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.95 E-value=3e-27 Score=231.18 Aligned_cols=99 Identities=34% Similarity=0.682 Sum_probs=90.7
Q ss_pred CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCC
Q psy1664 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 471 (524)
Q Consensus 392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~ 471 (524)
.++++||.+|+++|||+++|.++.+|+.|++|||+..++...++|+|+|||||++.. +++|||||||||++||+
T Consensus 136 ~~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~------g~~YWiikNSWG~~WGe 209 (239)
T cd02698 136 SGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDEN------GVEYWIVRNSWGEPWGE 209 (239)
T ss_pred CCHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecCC------CCEEEEEEcCCCcccCc
Confidence 468899999999999999999988999999999988776667899999999998652 78999999999999999
Q ss_pred CcEEEEEeCC-----CccCccccceeccce
Q psy1664 472 NGLFRIVRGQ-----NECGIEADITAGLPK 496 (524)
Q Consensus 472 ~Gy~~i~~g~-----~~cgi~~~~~~~~p~ 496 (524)
+|||||+||. |+||||++++++.|.
T Consensus 210 ~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~~ 239 (239)
T cd02698 210 RGWFRIVTSSYKGARYNLAIEEDCAWADPI 239 (239)
T ss_pred CceEEEEccCCcccccccccccceEEEeeC
Confidence 9999999999 999999999988873
No 18
>KOG1543|consensus
Probab=99.95 E-value=2.1e-27 Score=240.58 Aligned_cols=111 Identities=42% Similarity=0.809 Sum_probs=100.4
Q ss_pred CcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCC-ccCeeEEEeeecCCCCCCCccCCcc
Q psy1664 379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVK 457 (524)
Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~-~~~H~v~ivG~g~~~~~~~~~~~~~ 457 (524)
.+.++..+.+.++.+|++|+++|+.+|||+|+|++..+|+.|++|||.++++.. .++|+|+|||||+.+ +.+
T Consensus 213 ~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~~~-------~~~ 285 (325)
T KOG1543|consen 213 DKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGTGD-------GVD 285 (325)
T ss_pred cceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcCCC-------Cce
Confidence 456777888889999999999999999999999998899999999999988776 499999999999933 789
Q ss_pred EEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccce
Q psy1664 458 YWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496 (524)
Q Consensus 458 ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~ 496 (524)
|||||||||+.||++|||||.|+.+.|+|++.+.++.|.
T Consensus 286 YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~~~p~~ 324 (325)
T KOG1543|consen 286 YWIVKNSWGTDWGEKGYFRIARGVNKCGIASEASYGPIK 324 (325)
T ss_pred eEEEEcCCCCCcccCceEEEecCCCchhhhcccccCCCC
Confidence 999999999999999999999999999999987765543
No 19
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.94 E-value=9.3e-27 Score=228.50 Aligned_cols=101 Identities=42% Similarity=0.931 Sum_probs=89.4
Q ss_pred CCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCC----C-C--------CccCeeEEEeeecCCCCCCCccCCcc
Q psy1664 391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----G-G--------PLGEHAIRIIGWGQEPLGEGTSSVVK 457 (524)
Q Consensus 391 ~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~----~-~--------~~~~H~v~ivG~g~~~~~~~~~~~~~ 457 (524)
..++++||.+|+++|||+++|++.++|++|++|||+... | . ..++|+|+|||||++.. ++++
T Consensus 130 ~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-----~g~~ 204 (243)
T cd02621 130 CTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-----KGEK 204 (243)
T ss_pred cCCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-----CCCc
Confidence 357899999999999999999998899999999998752 1 1 24799999999998751 2779
Q ss_pred EEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccce
Q psy1664 458 YWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK 496 (524)
Q Consensus 458 ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~ 496 (524)
|||||||||++||++|||||+||.|.|||++.+++++|.
T Consensus 205 YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~~ 243 (243)
T cd02621 205 YWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAYPI 243 (243)
T ss_pred EEEEEcCCCCCCCcCCeEEEecCCcccCcccceEeeccC
Confidence 999999999999999999999999999999999998884
No 20
>KOG1542|consensus
Probab=99.94 E-value=1e-26 Score=226.44 Aligned_cols=105 Identities=29% Similarity=0.581 Sum_probs=91.9
Q ss_pred eeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeC---CCCCCccCeeEEEeeecCCCCCCCccCCccE
Q psy1664 382 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY 458 (524)
Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~---~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~y 458 (524)
.+....+.++.||++|.+.|.++|||+|+|++. .+++|.+||+.+ .|....++|+|||||||.+.. ..+|
T Consensus 262 v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~-~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~------~~PY 334 (372)
T KOG1542|consen 262 VSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAK-PMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGY------EKPY 334 (372)
T ss_pred EEEeccEecCCCHHHHHHHHHhcCCeEEEEchH-HHHHhcccccCCCcccCCccccCceEEEEeecCCCC------CCce
Confidence 344556677889999999999999999999974 599999999987 344455999999999998752 6899
Q ss_pred EEEEcCCCCCCCCCcEEEEEeCCCccCccccceec
Q psy1664 459 WLVANSFNTNWGENGLFRIVRGQNECGIEADITAG 493 (524)
Q Consensus 459 wiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~ 493 (524)
||||||||++||++||+||.||.|.|||++.++++
T Consensus 335 WIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~ 369 (372)
T KOG1542|consen 335 WIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSA 369 (372)
T ss_pred EEEECCccccccccceEEEeccccccccccchhhh
Confidence 99999999999999999999999999999987654
No 21
>PTZ00203 cathepsin L protease; Provisional
Probab=99.93 E-value=5.7e-26 Score=231.81 Aligned_cols=99 Identities=22% Similarity=0.495 Sum_probs=86.8
Q ss_pred EEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 465 (524)
Q Consensus 386 ~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW 465 (524)
.+..++.++++|+.+|+++|||+++|++. +|++|++|||.. |....++|+|+|||||+++ +++||||||||
T Consensus 240 ~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~~-------g~~YWiikNSW 310 (348)
T PTZ00203 240 GYVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMTG-------EVPYWVIKNSW 310 (348)
T ss_pred ceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecCC-------CceEEEEEcCC
Confidence 44556668899999999999999999995 899999999975 4444579999999999876 88999999999
Q ss_pred CCCCCCCcEEEEEeCCCccCccccceec
Q psy1664 466 NTNWGENGLFRIVRGQNECGIEADITAG 493 (524)
Q Consensus 466 G~~WG~~Gy~~i~~g~~~cgi~~~~~~~ 493 (524)
|++||++|||||+||.|.|||++.++.+
T Consensus 311 G~~WGe~GY~ri~rg~n~Cgi~~~~~~~ 338 (348)
T PTZ00203 311 GEDWGEKGYVRVTMGVNACLLTGYPVSV 338 (348)
T ss_pred CCCcCcCceEEEEcCCCcccccceEEEE
Confidence 9999999999999999999999776543
No 22
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.92 E-value=5.7e-26 Score=223.77 Aligned_cols=205 Identities=24% Similarity=0.251 Sum_probs=128.1
Q ss_pred CCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhc-CCCCCCC-----CC
Q psy1664 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGC-----QG 168 (524)
Q Consensus 95 ~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~-~~~~~gC-----~G 168 (524)
..||+.||||.. |.|+||||||.||+||||+++++||+.+.-.. ...+|+-.+..-. ..+..+| +|
T Consensus 97 ~s~~~~fd~r~~----g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~----~w~~s~~nm~~ll~~~ye~~fd~~~~d~ 168 (372)
T COG4870 97 ASLPSYFDRRDE----GKVSPVKDQGSGGSCWAFATTRSLESYLNPES----AWDFSENNMKNLLGVPYEKGFDYTSNDG 168 (372)
T ss_pred ccchhheeeecc----CCcccccccCcccceEeeeehhhhhheecccc----cccccccchhhhcCCCccccCCCccccC
Confidence 358999999999 88999999999999999999999999885532 3455554443221 1111222 36
Q ss_pred CChHHHHHHHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCC---CCccccccccCCCcccccccccccee
Q psy1664 169 GFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEP---NTPECIRKCQPGYDVSYEDDLNFGRI 244 (524)
Q Consensus 169 G~~~~a~~~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (524)
|....+.-|+.+. |-+.+.+- ||... ...|....+ ....|...+..
T Consensus 169 g~~~m~~a~l~e~sgpv~et~d-------~y~~~--------s~~~~~~~p~~k~~~~~~~i~~~--------------- 218 (372)
T COG4870 169 GNADMSAAYLTEWSGPVYETDD-------PYSEN--------SYFSPTNLPVTKHVQEAQIIPSR--------------- 218 (372)
T ss_pred CccccccccccccCCcchhhcC-------ccccc--------cccCCcCCchhhccccceecccc---------------
Confidence 7766666666554 76666555 66543 222222111 11111100000
Q ss_pred eeecCCCHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCCCCCCcEEEEEEeccCCCC---CCCccceeEEEEe
Q psy1664 245 AYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG---EGTSSVVKYWLVA 320 (524)
Q Consensus 245 ~~~~~~~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~---~g~~~g~~YWivk 320 (524)
.-..+...|++++...|-++.+|.+.. .+.....+.|..... ...+|||+||||++.... +....|.+.||||
T Consensus 219 --~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~s~-~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiik 295 (372)
T COG4870 219 --KKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSG-ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIK 295 (372)
T ss_pred --hhhhcccchHHHHhhhccccceeEEecccccccccCCCCCCcc-ccccceEEEEeccccccccccccCCCCCceEEEE
Confidence 001123347888888888876666542 222233344433322 567999999999986431 1234567799999
Q ss_pred CCCCCcccccCccccccccC
Q psy1664 321 NSFNTNWGENGLFRIGCRPY 340 (524)
Q Consensus 321 NSWG~~WGe~Gy~ri~~~~~ 340 (524)
||||++||++|||||+.+..
T Consensus 296 NSWGt~wG~~GYfwisY~ya 315 (372)
T COG4870 296 NSWGTNWGENGYFWISYYYA 315 (372)
T ss_pred CccccccccCceEEEEeeec
Confidence 99999999999999997643
No 23
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.92 E-value=1e-24 Score=209.25 Aligned_cols=101 Identities=35% Similarity=0.645 Sum_probs=88.6
Q ss_pred eeEEEEcCC-CHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCC-CCccCeeEEEeeecCCCCCCCccCCccEEEE
Q psy1664 384 GRIAYSLPA-NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV 461 (524)
Q Consensus 384 ~~~~~~~~~-~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~-~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv 461 (524)
...+..+.. ++++||++|+++|||+++|.+.++|+.|++|||..+++ ...++|+|+|||||++. +.+||||
T Consensus 106 i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~~-------~~~ywiv 178 (210)
T cd02248 106 ITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTEN-------GVDYWIV 178 (210)
T ss_pred EeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeecC-------CceEEEE
Confidence 334445543 48899999999999999999988999999999988766 45689999999999987 7899999
Q ss_pred EcCCCCCCCCCcEEEEEeCCCccCccccce
Q psy1664 462 ANSFNTNWGENGLFRIVRGQNECGIEADIT 491 (524)
Q Consensus 462 ~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~ 491 (524)
|||||+.||++|||||.++.|.|||++.+.
T Consensus 179 ~NSWG~~WG~~Gy~~i~~~~~~cgi~~~~~ 208 (210)
T cd02248 179 KNSWGTSWGEKGYIRIARGSNLCGIASYAS 208 (210)
T ss_pred EcCCCCccccCcEEEEEcCCCccCceeeee
Confidence 999999999999999999999999997743
No 24
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.92 E-value=3.7e-25 Score=215.98 Aligned_cols=94 Identities=49% Similarity=0.992 Sum_probs=82.5
Q ss_pred ecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCc
Q psy1664 247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN 326 (524)
Q Consensus 247 ~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~ 326 (524)
.+..++++||++|+++|||+|+|.++++|+.|++|||+..++...++|||+|||||++ ++++|||||||||++
T Consensus 139 ~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~-------~g~~YWivrNSWG~~ 211 (236)
T cd02620 139 SVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVE-------NGVPYWLAANSWGTD 211 (236)
T ss_pred eeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEecc-------CCeeEEEEEeCCCCC
Confidence 4455789999999999999999999999999999999876555567999999999986 468999999999999
Q ss_pred ccccCccccccccCccCCcCcCC
Q psy1664 327 WGENGLFRIGCRPYEIPCERYMN 349 (524)
Q Consensus 327 WGe~Gy~ri~~~~~~~~c~~~~~ 349 (524)
|||+|||||+++.. .|++.+.
T Consensus 212 WGe~Gy~ri~~~~~--~cgi~~~ 232 (236)
T cd02620 212 WGENGYFRILRGSN--ECGIESE 232 (236)
T ss_pred CCCCcEEEEEccCc--ccccccc
Confidence 99999999999764 6887653
No 25
>PTZ00021 falcipain-2; Provisional
Probab=99.90 E-value=7.1e-24 Score=222.68 Aligned_cols=107 Identities=28% Similarity=0.583 Sum_probs=86.2
Q ss_pred EEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCC---CccCCccEEEEE
Q psy1664 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE---GTSSVVKYWLVA 462 (524)
Q Consensus 386 ~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~---~~~~~~~ywiv~ 462 (524)
++..++ +.+|+++|+.+|||+|+|++..+|++|++|||..+|.. .++|||+|||||++...+ +...+.+|||||
T Consensus 375 ~y~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVK 451 (489)
T PTZ00021 375 SYVSIP--EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIK 451 (489)
T ss_pred eEEEec--HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCCCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEE
Confidence 334444 57899999999999999999889999999999876543 479999999999764111 111246899999
Q ss_pred cCCCCCCCCCcEEEEEeCC----CccCccccceecccee
Q psy1664 463 NSFNTNWGENGLFRIVRGQ----NECGIEADITAGLPKI 497 (524)
Q Consensus 463 NSWG~~WG~~Gy~~i~~g~----~~cgi~~~~~~~~p~~ 497 (524)
||||++||++|||||+|+. |.|||.+.+. +|.+
T Consensus 452 NSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~--yP~~ 488 (489)
T PTZ00021 452 NSWGESWGEKGFIRIETDENGLMKTCSLGTEAY--VPLI 488 (489)
T ss_pred CCCCCCcccCeEEEEEcCCCCCCCCCCCcccce--eEec
Confidence 9999999999999999986 5899999854 5654
No 26
>PTZ00200 cysteine proteinase; Provisional
Probab=99.90 E-value=1.6e-23 Score=219.32 Aligned_cols=95 Identities=27% Similarity=0.610 Sum_probs=79.6
Q ss_pred HHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcE
Q psy1664 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 474 (524)
Q Consensus 395 ~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy 474 (524)
.+++.+++.+|||+|+|.++.+|+.|++|||.++|.. .++|||+|||||.+. +.|++|||||||||++||++||
T Consensus 348 ~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~-~~nHaV~lVGyG~d~-----~~g~~YWIIkNSWG~~WGe~GY 421 (448)
T PTZ00200 348 KDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGK-SLNHAVLLVGEGYDE-----KTKKRYWIIKNSWGTDWGENGY 421 (448)
T ss_pred HHHHHHHHhcCCEEEEeecccccccCCCCccccccCC-CCcEEEEEEEecccC-----CCCCceEEEEcCCCCCcccCee
Confidence 3455566678999999999889999999999876544 489999999999642 1278999999999999999999
Q ss_pred EEEEeC---CCccCccccceecccee
Q psy1664 475 FRIVRG---QNECGIEADITAGLPKI 497 (524)
Q Consensus 475 ~~i~~g---~~~cgi~~~~~~~~p~~ 497 (524)
|||+|+ .|.|||++.+ .+|.+
T Consensus 422 ~ri~r~~~g~n~CGI~~~~--~~P~~ 445 (448)
T PTZ00200 422 MRLERTNEGTDKCGILTVG--LTPVF 445 (448)
T ss_pred EEEEeCCCCCCcCCccccc--eeeEE
Confidence 999995 5899999984 46765
No 27
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.89 E-value=2.8e-23 Score=200.16 Aligned_cols=94 Identities=39% Similarity=0.747 Sum_probs=85.6
Q ss_pred CHHHHHHHHHhCCCEEEEEeccc-ccccccccEEeCCCC-CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCC
Q psy1664 393 NEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 470 (524)
Q Consensus 393 ~~~~~~~~~~~~gPv~~~~~~~~-~f~~y~~gi~~~~~~-~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG 470 (524)
+.++||++|+++|||+++|.+.+ +|..|++|||..+.+ ...++|+|+|||||++. +..|||||||||+.||
T Consensus 122 ~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~~-------~~~~wiv~NSWG~~WG 194 (219)
T PF00112_consen 122 DIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDEN-------GKGYWIVKNSWGTDWG 194 (219)
T ss_dssp CHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEET-------TEEEEEEE-SBTTTST
T ss_pred chhHHHHHHhhCceeeeeeeccccccccccceeeecccccccccccccccccccccc-------ceeeEeeehhhCCccC
Confidence 58999999999999999999988 699999999998744 45789999999999987 8899999999999999
Q ss_pred CCcEEEEEeCCC-ccCccccceec
Q psy1664 471 ENGLFRIVRGQN-ECGIEADITAG 493 (524)
Q Consensus 471 ~~Gy~~i~~g~~-~cgi~~~~~~~ 493 (524)
++|||||.++.+ +|||+++++++
T Consensus 195 ~~Gy~~i~~~~~~~c~i~~~~~~~ 218 (219)
T PF00112_consen 195 DNGYFRISYDYNNECGIESQAVYP 218 (219)
T ss_dssp BTTEEEEESSSSSGGGTTSSEEEE
T ss_pred CCeEEEEeeCCCCcCccCceeeec
Confidence 999999999987 99999998754
No 28
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.89 E-value=3.5e-23 Score=227.76 Aligned_cols=125 Identities=23% Similarity=0.425 Sum_probs=99.5
Q ss_pred HHHHHHHHHhCCCEEEEEeccccccccc-ccEEeCC-CCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCC
Q psy1664 394 EETIMREIFRHGPVEGSMTIYADMILYK-TGIYKHV-AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 471 (524)
Q Consensus 394 ~~~~~~~~~~~gPv~~~~~~~~~f~~y~-~gi~~~~-~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~ 471 (524)
++.|+.+|+++|||+|+|++. +|+.|. +|||... |+...++|||+|||||.+.+..++ +++|||||||||+.||+
T Consensus 680 i~~IK~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~--gk~YWIVRNSWGt~WGE 756 (1004)
T PTZ00462 680 IKIIKDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDE--KKSYWIVRNSWGKYWGD 756 (1004)
T ss_pred HHHHHHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccCC--CCceEEEEcCCCCCcCC
Confidence 468999999999999999985 688885 8986554 444568999999999986421122 67899999999999999
Q ss_pred CcEEEEEe-CCCccCccccceeccceeccccCCcccccCcc-----cCCCCCCCCCCC
Q psy1664 472 NGLFRIVR-GQNECGIEADITAGLPKIGLEIDSNEINLGKM-----MTLPLTNRDTYT 523 (524)
Q Consensus 472 ~Gy~~i~~-g~~~cgi~~~~~~~~p~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 523 (524)
+|||||.| |.|.|||... ..+|.+..+++-...+..+. .++++.++|+|.
T Consensus 757 nGYFKI~r~g~n~CGin~i--~t~~~fn~d~~~~~~~~~~~~~~~~~y~~k~spdf~~ 812 (1004)
T PTZ00462 757 EGYFKVDMYGPSHCEDNFI--HSVVIFNIDLPKNKKSPKKESFKIYDYYLKASPDFYH 812 (1004)
T ss_pred CeEEEEEeCCCCCCccchh--eeeeeEeeccccccCCccccccchheeeeccChhHhh
Confidence 99999998 7899999775 44677777776666655443 588999999884
No 29
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.85 E-value=7.8e-22 Score=211.63 Aligned_cols=94 Identities=32% Similarity=0.577 Sum_probs=78.6
Q ss_pred CCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcC------CCCC---------------CCCcEEEEEEeccCCCCC
Q psy1664 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV------AGGP---------------LGEHAIRIIGWGQEPLGE 308 (524)
Q Consensus 250 ~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~------~~~~---------------~~~HaV~iVGyg~~~~~~ 308 (524)
.++++||++|+.+|||+|+|+++++|++|++|||+.+ .|.. .++|||+|||||.+.
T Consensus 555 ~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~--- 631 (693)
T PTZ00049 555 NGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEE--- 631 (693)
T ss_pred CCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEecccc---
Confidence 3688999999999999999999989999999999864 2421 369999999999762
Q ss_pred CCccc--eeEEEEeCCCCCcccccCccccccccCccCCcCcCCc
Q psy1664 309 GTSSV--VKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNG 350 (524)
Q Consensus 309 g~~~g--~~YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~ 350 (524)
.+| .+|||||||||++|||+|||||+|+.. .|++...+
T Consensus 632 --enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N--~CGIEs~a 671 (693)
T PTZ00049 632 --INGKLYKYWIGRNSWGKNWGKEGYFKIIRGKN--FSGIESQS 671 (693)
T ss_pred --CCCcccCEEEEECCCCCCcccCceEEEEcCCC--ccCCccce
Confidence 123 479999999999999999999999865 68876543
No 30
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.84 E-value=3.1e-21 Score=205.06 Aligned_cols=95 Identities=25% Similarity=0.471 Sum_probs=79.5
Q ss_pred cCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcC---------CC----------CCCCCcEEEEEEeccCCCCC
Q psy1664 248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---------AG----------GPLGEHAIRIIGWGQEPLGE 308 (524)
Q Consensus 248 ~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~---------~~----------~~~~~HaV~iVGyg~~~~~~ 308 (524)
+..++++||++|+++|||+|+|+++.+|+.|++|||... ++ ...++|||+|||||.++
T Consensus 339 ~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de--- 415 (548)
T PTZ00364 339 AVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE--- 415 (548)
T ss_pred cCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC---
Confidence 345688999999999999999999999999999998631 11 13579999999999853
Q ss_pred CCccceeEEEEeCCCCC--cccccCccccccccCccCCcCcCCc
Q psy1664 309 GTSSVVKYWLVANSFNT--NWGENGLFRIGCRPYEIPCERYMNG 350 (524)
Q Consensus 309 g~~~g~~YWivkNSWG~--~WGe~Gy~ri~~~~~~~~c~~~~~~ 350 (524)
++.+|||||||||+ +|||+|||||+|+.+ .|++.+.+
T Consensus 416 ---~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N--~CGIes~~ 454 (548)
T PTZ00364 416 ---NGGDYWLVLDPWGSRRSWCDGGTRKIARGVN--AYNIESEV 454 (548)
T ss_pred ---CCceEEEEECCCCCCCCcccCCeEEEEcCCC--ccccccee
Confidence 46899999999999 999999999999865 68876543
No 31
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.84 E-value=3.4e-20 Score=192.99 Aligned_cols=213 Identities=17% Similarity=0.196 Sum_probs=132.4
Q ss_pred ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHh----------------hcCC-----------CCCCC
Q psy1664 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS----------------CCKD-----------CGNGC 166 (524)
Q Consensus 114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvd----------------C~~~-----------~~~gC 166 (524)
.||+||++-|.||.||+...|+..+....+. ..+.||+.+|.. +... ...-.
T Consensus 55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~-~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~ 133 (437)
T cd00585 55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNL-KEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQN 133 (437)
T ss_pred CCcccCCCCchhHHHHCHHHHHHHHHHHcCC-CCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcC
Confidence 4899999999999999999999998876554 369999988765 2110 12356
Q ss_pred CCCChHHHHHHHHHhCCccCCccCCCCCccccccC-c------cc-c--------cCC----------------------
Q psy1664 167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C------ER-Y--------MNG---------------------- 208 (524)
Q Consensus 167 ~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~-~------~~-~--------~~~---------------------- 208 (524)
+||.-..+...+++.|+++.+.|+..... ..+.. . .+ + ..+
T Consensus 134 DGGqw~m~~~li~KYGvVPk~~~pet~~s-~~t~~~n~~L~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~ 212 (437)
T cd00585 134 DGGQWDMLVNLIEKYGLVPKSVMPESFNS-ENSRRLNYLLNRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILA 212 (437)
T ss_pred CCCchHHHHHHHHHcCCCcccccCCCcCc-cchHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHH
Confidence 89999999999999999998887421100 00000 0 00 0 000
Q ss_pred -CCCCCC---------------CCCCCCccc-ccc---ccCCCcccccccc----ccce----------------eeeec
Q psy1664 209 -SHSSCQ---------------DNEPNTPEC-IRK---CQPGYDVSYEDDL----NFGR----------------IAYSL 248 (524)
Q Consensus 209 -~~~~C~---------------~~~~~~~~~-~~~---~~~~~~~~~~~~~----~~~~----------------~~~~~ 248 (524)
.-|..+ ....-+|.- ... +...-.+.+.... .|.+ ..+++
T Consensus 213 ~~lG~pP~~F~~~y~dkd~~~~~~~~~TP~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nv 292 (437)
T cd00585 213 IALGEPPEKFDWEYRDKDKKYHEIKELTPLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNV 292 (437)
T ss_pred HHcCCCCceEEEEEEeCCCCeeeCCCcCHHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEec
Confidence 000000 000011100 000 1110001110000 0110 11222
Q ss_pred CCCHHHHH----HHHHHcCCeEEEEEecccccccCCceEEcC---------------------CCCCCCCcEEEEEEecc
Q psy1664 249 PANEETIM----REIFRHGPVEGSMTIYADMILYKTGIYKHV---------------------AGGPLGEHAIRIIGWGQ 303 (524)
Q Consensus 249 ~~~~~~ik----~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~---------------------~~~~~~~HaV~iVGyg~ 303 (524)
..++++ ++|.+++||.++++|. .|+.|++||++.. ++.+..+|||+|||||.
T Consensus 293 --p~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~ 369 (437)
T cd00585 293 --PMDVLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDL 369 (437)
T ss_pred --CHHHHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEe
Confidence 234444 5677889999999996 5789999999653 34456799999999998
Q ss_pred CCCCCCCccce-eEEEEeCCCCCcccccCcccccc
Q psy1664 304 EPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIGC 337 (524)
Q Consensus 304 ~~~~~g~~~g~-~YWivkNSWG~~WGe~Gy~ri~~ 337 (524)
+. +|+ .||+||||||+.||++|||+|+.
T Consensus 370 D~------~g~p~yw~VkNSWG~~~G~~Gy~~ms~ 398 (437)
T cd00585 370 DE------DGKPVKWKVENSWGEKVGKKGYFVMSD 398 (437)
T ss_pred cC------CCCcceEEEEcccCCCCCCCcceehhH
Confidence 74 344 69999999999999999999984
No 32
>KOG1544|consensus
Probab=99.80 E-value=3.6e-20 Score=178.03 Aligned_cols=98 Identities=43% Similarity=0.846 Sum_probs=84.0
Q ss_pred eecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCC--------CCCCcEEEEEEeccCCCCCCCccceeEE
Q psy1664 246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYW 317 (524)
Q Consensus 246 ~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~--------~~~~HaV~iVGyg~~~~~~g~~~g~~YW 317 (524)
|++.+++++||++||++|||.+.|.|.++|+.|++|||.+..-. ..+.|+|.|.|||++....| ...+||
T Consensus 347 YrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G--~~~KyW 424 (470)
T KOG1544|consen 347 YRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG--RTLKYW 424 (470)
T ss_pred eeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCC--CeeEEE
Confidence 56778999999999999999999999999999999999886532 24799999999998753223 557899
Q ss_pred EEeCCCCCcccccCccccccccCccCCc
Q psy1664 318 LVANSFNTNWGENGLFRIGCRPYEIPCE 345 (524)
Q Consensus 318 ivkNSWG~~WGe~Gy~ri~~~~~~~~c~ 345 (524)
|..||||+.|||+|||||.++.++.+.+
T Consensus 425 ~aANSWG~~WGE~GYFriLRGvNecdIE 452 (470)
T KOG1544|consen 425 TAANSWGPAWGERGYFRILRGVNECDIE 452 (470)
T ss_pred EeecccccccccCceEEEeccccchhhh
Confidence 9999999999999999999998854333
No 33
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.79 E-value=1.3e-19 Score=168.49 Aligned_cols=75 Identities=49% Similarity=0.975 Sum_probs=63.9
Q ss_pred EEEEecccccccccccEEeCC-CCCCccCeeEEEeeecCC-CCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCC-Ccc
Q psy1664 408 EGSMTIYADMILYKTGIYKHV-AGGPLGEHAIRIIGWGQE-PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NEC 484 (524)
Q Consensus 408 ~~~~~~~~~f~~y~~gi~~~~-~~~~~~~H~v~ivG~g~~-~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~-~~c 484 (524)
++.+.+. +|+.|++|||+.+ +....++|+|+|||||.+ + +++|||||||||+.||++|||||.|+. |.|
T Consensus 93 ~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~-------g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c 164 (174)
T smart00645 93 SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEEN-------GKDYWIVKNSWGTDWGENGYFRIARGKNNEC 164 (174)
T ss_pred EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecCC-------CeeEEEEECCCCCCcccCeEEEEEcCCCCcc
Confidence 4555554 6999999999875 433447999999999987 4 789999999999999999999999998 999
Q ss_pred Cccccc
Q psy1664 485 GIEADI 490 (524)
Q Consensus 485 gi~~~~ 490 (524)
||+...
T Consensus 165 ~i~~~~ 170 (174)
T smart00645 165 GIEASV 170 (174)
T ss_pred Cceeee
Confidence 997764
No 34
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.76 E-value=4e-18 Score=164.56 Aligned_cols=84 Identities=32% Similarity=0.547 Sum_probs=72.9
Q ss_pred CHHHHHHHHHhCCCEEEEEecccccccccccEEe------CCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCC
Q psy1664 393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYK------HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 466 (524)
Q Consensus 393 ~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~------~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG 466 (524)
++++||++|+++|||+++|.+..+|..|++|++. ..+....++|||+|||||++.. .+++|||||||||
T Consensus 124 ~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-----~~~~~~i~~NSwG 198 (223)
T cd02619 124 NIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-----EGKGAFIVKNSWG 198 (223)
T ss_pred hHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCC-----CCCCEEEEEeCCC
Confidence 4789999999999999999999999999999863 2234456899999999998752 2678999999999
Q ss_pred CCCCCCcEEEEEeCC
Q psy1664 467 TNWGENGLFRIVRGQ 481 (524)
Q Consensus 467 ~~WG~~Gy~~i~~g~ 481 (524)
+.||++||+||.++.
T Consensus 199 ~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 199 TDWGDNGYGRISYED 213 (223)
T ss_pred CccccCCEEEEehhh
Confidence 999999999999984
No 35
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.46 E-value=9.6e-14 Score=144.95 Aligned_cols=86 Identities=21% Similarity=0.279 Sum_probs=68.0
Q ss_pred EEEEcCCCHHHHH----HHHHhCCCEEEEEecccccccccccEEeCC---------------------CCCCccCeeEEE
Q psy1664 386 IAYSLPANEETIM----REIFRHGPVEGSMTIYADMILYKTGIYKHV---------------------AGGPLGEHAIRI 440 (524)
Q Consensus 386 ~~~~~~~~~~~~~----~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~---------------------~~~~~~~H~v~i 440 (524)
.+++++.+ +++ ++|...+||.++++|. .|+.|++||+... ++....+|||+|
T Consensus 288 ~y~Nvp~d--~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~i 364 (437)
T cd00585 288 LYLNVPMD--VLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVL 364 (437)
T ss_pred eEEecCHH--HHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEE
Confidence 44566543 343 5777888999999996 5789999999653 233457899999
Q ss_pred eeecCCCCCCCccCCc-cEEEEEcCCCCCCCCCcEEEEEeC
Q psy1664 441 IGWGQEPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIVRG 480 (524)
Q Consensus 441 vG~g~~~~~~~~~~~~-~ywiv~NSWG~~WG~~Gy~~i~~g 480 (524)
||||.+.+ |. .||+|+||||+.||++||++|+++
T Consensus 365 vGv~~D~~------g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 365 TGVDLDED------GKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred EEEEecCC------CCcceEEEEcccCCCCCCCcceehhHH
Confidence 99998652 54 699999999999999999999875
No 36
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.45 E-value=8.5e-13 Score=137.95 Aligned_cols=76 Identities=16% Similarity=0.245 Sum_probs=48.7
Q ss_pred ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHH----------------hhcCC-----------CCCCC
Q psy1664 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV----------------SCCKD-----------CGNGC 166 (524)
Q Consensus 114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lv----------------dC~~~-----------~~~gC 166 (524)
.||.||.+-|-||.||+...|+..+..+.+. ..+.||+.+|. ++... ...-.
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l-~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~ 134 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNL-KDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVS 134 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT--SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCC-CceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCC
Confidence 4899999999999999999999999887654 47999998875 22211 01246
Q ss_pred CCCChHHHHHHHHHhCCccCCccC
Q psy1664 167 QGGFHGKAWKYWVTTGIVSGGTYA 190 (524)
Q Consensus 167 ~GG~~~~a~~~~~~~Gi~~e~~y~ 190 (524)
+||.-..+...++.+|+++.+.|+
T Consensus 135 DGGqw~~~~nli~KYGvVPk~~mp 158 (438)
T PF03051_consen 135 DGGQWDMVVNLIKKYGVVPKSVMP 158 (438)
T ss_dssp S-B-HHHHHHHHHHH---BGGGST
T ss_pred CCCchHHHHHHHHHcCcCcHhhCC
Confidence 799999999999999999999884
No 37
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.35 E-value=1.8e-12 Score=128.76 Aligned_cols=124 Identities=23% Similarity=0.134 Sum_probs=79.9
Q ss_pred HHHHHHHHHhCCCEEEEEeccc-ccccccccEEeCCCCCCccCeeEEEeeecCCCCC---CCccCCccEEEEEcCCCCCC
Q psy1664 394 EETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG---EGTSSVVKYWLVANSFNTNW 469 (524)
Q Consensus 394 ~~~~~~~~~~~gPv~~~~~~~~-~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~---~~~~~~~~ywiv~NSWG~~W 469 (524)
...|++++..+|-++.+|.+.. .+..-.-+.|..... ...+|||+||||++.-.. .....|...||||||||++|
T Consensus 224 nG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~s~-~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~w 302 (372)
T COG4870 224 NGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSG-ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNW 302 (372)
T ss_pred ccchHHHHhhhccccceeEEecccccccccCCCCCCcc-ccccceEEEEeccccccccccccCCCCCceEEEECcccccc
Confidence 4457888888888776665421 122212233333222 457999999999986321 11222345999999999999
Q ss_pred CCCcEEEEEeCCCccCccccceeccceeccccCCcccccCcccCCCCCCCCCCCC
Q psy1664 470 GENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGKMMTLPLTNRDTYTM 524 (524)
Q Consensus 470 G~~Gy~~i~~g~~~cgi~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (524)
|++|||||....-.-| + +.-++ .-++...+.|+++.....|+..++|.|
T Consensus 303 G~~GYfwisY~ya~~g-~----a~~~D-~y~~i~qydpl~wv~~~~y~~~~~w~~ 351 (372)
T COG4870 303 GENGYFWISYYYALNG-D----AEALD-FYVYIYQYDPLGWVITSGYGLNTAWMA 351 (372)
T ss_pred ccCceEEEEeeecccc-c----ccccC-cceEEeeccCcceEeecCcCcchhhhh
Confidence 9999999999743233 1 11111 234456778899998888888777653
No 38
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.28 E-value=1.5e-12 Score=98.31 Aligned_cols=58 Identities=19% Similarity=0.203 Sum_probs=50.5
Q ss_pred HHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHH
Q psy1664 9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSEL 67 (524)
Q Consensus 9 f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~ 67 (524)
|+.|+++|+|.|.+.++...|+.+|.+|++.|++||+... .+|++++|+|+|||.+||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~-~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGN-NTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTS-SSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCC-CCeEEeCccccCcChhhC
Confidence 6899999999999999999999999999999999995543 899999999999999997
No 39
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=98.98 E-value=1e-10 Score=87.76 Aligned_cols=57 Identities=18% Similarity=0.110 Sum_probs=52.6
Q ss_pred HHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHH
Q psy1664 9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSE 66 (524)
Q Consensus 9 f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE 66 (524)
|..|+.+|+|.|.+.++...|+.+|.+|++.|+.||+... .+|++++|+|+|||.+|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~-~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKND-HSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCC-CCeEecCcccccCCCCC
Confidence 5689999999999999999999999999999999998654 79999999999999876
No 40
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.39 E-value=9.3e-07 Score=92.93 Aligned_cols=87 Identities=23% Similarity=0.286 Sum_probs=57.2
Q ss_pred EEEEcCCCH--HHHHHHHHhCCCEEEEEecccccccccccEEeCCC---------------------CCCccCeeEEEee
Q psy1664 386 IAYSLPANE--ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIG 442 (524)
Q Consensus 386 ~~~~~~~~~--~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~---------------------~~~~~~H~v~ivG 442 (524)
.+++++.++ ..++++|...-||..+-+|.. +..-+.||.+... .....+|||+|||
T Consensus 289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k-~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itG 367 (438)
T PF03051_consen 289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGK-FFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITG 367 (438)
T ss_dssp EEEE--HHHHHHHHHHHHHTT--EEEEEETTT-TEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEE
T ss_pred eEeccCHHHHHHHHHHHHHcCCcEEEeccCCc-cccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEE
Confidence 346666543 334444544559999999975 5566788875431 1223689999999
Q ss_pred ecCCCCCCCccCCc-cEEEEEcCCCCCCCCCcEEEEEe
Q psy1664 443 WGQEPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIVR 479 (524)
Q Consensus 443 ~g~~~~~~~~~~~~-~ywiv~NSWG~~WG~~Gy~~i~~ 479 (524)
...+.. |. .+|+|+||||+..|.+||+.++.
T Consensus 368 v~~D~~------g~p~~wkVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 368 VDLDED------GKPVRWKVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp EEE-TT------SSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred EEeccC------CCeeEEEEEcCCCCCCCCCcEEEECH
Confidence 998652 55 59999999999999999999874
No 41
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.30 E-value=3.9e-06 Score=82.61 Aligned_cols=80 Identities=24% Similarity=0.287 Sum_probs=55.0
Q ss_pred CHHHHHHHHH----HcCCeEEEEEecccccccCCceEEcCC---------------------CCCCCCcEEEEEEeccCC
Q psy1664 251 NEETIMREIF----RHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIGWGQEP 305 (524)
Q Consensus 251 ~~~~ik~~l~----~~GPV~v~i~v~~~f~~Y~sGIy~~~~---------------------~~~~~~HaV~iVGyg~~~ 305 (524)
..+.++++.+ .+-||=.+-+|. .+..-+.||.+..- ..+...|||+|.|.+.++
T Consensus 296 ~me~lkkl~~~q~qagetVwFG~dvg-q~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~ 374 (444)
T COG3579 296 DMERLKKLAIKQMQAGETVWFGCDVG-QLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDE 374 (444)
T ss_pred cHHHHHHHHHHHHhcCCcEEeecCch-hhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhcccccc
Confidence 3455555433 345787777763 45566667653210 112358999999999886
Q ss_pred CCCCCccceeEEEEeCCCCCcccccCccccc
Q psy1664 306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIG 336 (524)
Q Consensus 306 ~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~ 336 (524)
.| ..--|.|.||||.+=|.+|||-++
T Consensus 375 ~g-----~p~rwkVENSWG~d~G~~GyfvaS 400 (444)
T COG3579 375 TG-----NPLRWKVENSWGKDVGKKGYFVAS 400 (444)
T ss_pred CC-----CceeeEeecccccccCCCceEeeh
Confidence 43 245799999999999999999887
No 42
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=97.19 E-value=0.00019 Score=49.52 Aligned_cols=36 Identities=19% Similarity=0.290 Sum_probs=24.6
Q ss_pred hhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCC
Q psy1664 38 DRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPD 76 (524)
Q Consensus 38 ~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~ 76 (524)
++|+.+|+.+ .+|+||.| |.+++.+.++.++|..+.
T Consensus 4 e~I~~IN~~~--~tWkAG~N-F~~~~~~~ik~LlGv~~~ 39 (41)
T PF08127_consen 4 EFIDYINSKN--TTWKAGRN-FENTSIEYIKRLLGVLPD 39 (41)
T ss_dssp HHHHHHHHCT---SEEE-----SSB-HHHHHHCS-B-TT
T ss_pred HHHHHHHcCC--CcccCCCC-CCCCCHHHHHHHcCCCCC
Confidence 3499999986 89999999 799999999999998653
No 43
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=96.22 E-value=0.041 Score=50.23 Aligned_cols=120 Identities=14% Similarity=0.105 Sum_probs=72.8
Q ss_pred CCCCCccHHHHHHHHHHHHHHH--------HHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCcc
Q psy1664 118 DQGSCGSGWALGAVEAMSDRVC--------IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY 189 (524)
Q Consensus 118 dQg~CGsCwAfA~~~~le~~~~--------i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y 189 (524)
.||.=+=|-+||.++.|-.... |... ..+.+|+++|.+++. .+...++|.+..|...
T Consensus 18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~--~yPn~s~~~l~~~~~---------~~~~~i~y~ks~g~~~---- 82 (175)
T PF05543_consen 18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRY--LYPNVSEEQLKFTSL---------TPNQMIKYAKSQGRNP---- 82 (175)
T ss_dssp --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHH--HSTTS-CCCHHH--B----------HHHHHHHHHHTTEEE----
T ss_pred ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHH--HCCCCCHHHHhhcCC---------CHHHHHHHHHHcCcch----
Confidence 3788888999999988765421 1110 246788888877742 4678999988888653
Q ss_pred CCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEE
Q psy1664 190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM 269 (524)
Q Consensus 190 ~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i 269 (524)
-|.. -..+-+++++.+-++.|+.+..
T Consensus 83 -------~~~n-----------------------------------------------~~~s~~eV~~~~~~nk~i~i~~ 108 (175)
T PF05543_consen 83 -------QYNN-----------------------------------------------RMPSFDEVKKLIDNNKGIAILA 108 (175)
T ss_dssp -------EEEC-----------------------------------------------S---HHHHHHHHHTT-EEEEEE
T ss_pred -------hHhc-----------------------------------------------CCCCHHHHHHHHHcCCCeEEEe
Confidence 1110 0124678999999999998776
Q ss_pred EecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCC
Q psy1664 270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN 324 (524)
Q Consensus 270 ~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG 324 (524)
...+ .......+||++||||-.-. +|.++.++=|=|-
T Consensus 109 ~~v~------------~~~~~~~gHAlavvGya~~~------~g~~~y~~WNPW~ 145 (175)
T PF05543_consen 109 DRVE------------QTNGPHAGHALAVVGYAKPN------NGQKTYYFWNPWW 145 (175)
T ss_dssp EETT------------SCTTB--EEEEEEEEEEEET------TSEEEEEEE-TT-
T ss_pred cccc------------cCCCCccceeEEEEeeeecC------CCCeEEEEeCCcc
Confidence 6421 11234568999999998642 4688999988774
No 44
>KOG4128|consensus
Probab=95.35 E-value=0.023 Score=56.42 Aligned_cols=75 Identities=17% Similarity=0.160 Sum_probs=52.2
Q ss_pred ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHh--------------------hcCC---------CCC
Q psy1664 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS--------------------CCKD---------CGN 164 (524)
Q Consensus 114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvd--------------------C~~~---------~~~ 164 (524)
.||-+|.+-|-||.|+....|--.+..+-+- ....||..+|+- |-.. .+.
T Consensus 63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl-~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP 141 (457)
T KOG4128|consen 63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNL-PEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNP 141 (457)
T ss_pred cccccCcCCCceEEEechhHHHHHHHhcCCc-chhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCC
Confidence 6999999999999999998776555544432 247888877752 1110 012
Q ss_pred CCCCCChHHHHHHHHHhCCccCCcc
Q psy1664 165 GCQGGFHGKAWKYWVTTGIVSGGTY 189 (524)
Q Consensus 165 gC~GG~~~~a~~~~~~~Gi~~e~~y 189 (524)
.-+||.-..-.+.++..|+.+..-|
T Consensus 142 ~~DGGqw~MfvNlVkKYGviPKkcy 166 (457)
T KOG4128|consen 142 VPDGGQWQMFVNLVKKYGVIPKKCY 166 (457)
T ss_pred CCCCchHHHHHHHHHHhCCCcHHhc
Confidence 3358888888888888998875555
No 45
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=95.28 E-value=0.014 Score=58.16 Aligned_cols=72 Identities=25% Similarity=0.283 Sum_probs=50.6
Q ss_pred HHhCCCEEEEEecccccccccccEEeCCC---------------------CCCccCeeEEEeeecCCCCCCCccCCccEE
Q psy1664 401 IFRHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIGWGQEPLGEGTSSVVKYW 459 (524)
Q Consensus 401 ~~~~gPv~~~~~~~~~f~~y~~gi~~~~~---------------------~~~~~~H~v~ivG~g~~~~~~~~~~~~~yw 459 (524)
+...-||-.+-+|.. +..-+.||.+-.. +.....|||+|.|.+.+..+ ..-=|
T Consensus 308 ~qagetVwFG~dvgq-~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g-----~p~rw 381 (444)
T COG3579 308 MQAGETVWFGCDVGQ-LSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETG-----NPLRW 381 (444)
T ss_pred HhcCCcEEeecCchh-hcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCC-----Cceee
Confidence 333448888877743 6667777753210 11236799999999877631 23379
Q ss_pred EEEcCCCCCCCCCcEEEEE
Q psy1664 460 LVANSFNTNWGENGLFRIV 478 (524)
Q Consensus 460 iv~NSWG~~WG~~Gy~~i~ 478 (524)
.|.||||..=|.+|||-.+
T Consensus 382 kVENSWG~d~G~~GyfvaS 400 (444)
T COG3579 382 KVENSWGKDVGKKGYFVAS 400 (444)
T ss_pred EeecccccccCCCceEeeh
Confidence 9999999999999999765
No 46
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=94.62 E-value=0.44 Score=41.59 Aligned_cols=57 Identities=28% Similarity=0.343 Sum_probs=33.5
Q ss_pred CCHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCC
Q psy1664 250 ANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 323 (524)
Q Consensus 250 ~~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSW 323 (524)
.+.+.|+++|.++.||.+.+...- .. ....+. .....|.|+|+||..+ . +++|-.+|
T Consensus 87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~----~~~~~H~vvi~Gy~~~---------~-~~~v~DP~ 144 (144)
T PF13529_consen 87 ASFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYD----GTYGGHYVVIIGYDED---------G-YVYVNDPW 144 (144)
T ss_dssp S-HHHHHHHHHTT--EEEEEETTSS-----TTEEEE----E-TTEEEEEEEEE-SS---------E--EEEE-TT
T ss_pred CcHHHHHHHHHCCCcEEEEEEcccccC---CCCCcC----CCcCCEEEEEEEEeCC---------C-EEEEeCCC
Confidence 356889999999999999997421 01 111111 1246899999999974 2 77777766
No 47
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=82.49 E-value=5.7 Score=34.34 Aligned_cols=60 Identities=27% Similarity=0.285 Sum_probs=34.1
Q ss_pred cCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664 390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 465 (524)
Q Consensus 390 ~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW 465 (524)
...+..+|+++|.+..||.+.+.... .......+. .....|.|+|+||..+ + +++|..+|
T Consensus 85 ~~~~~~~i~~~i~~G~Pvi~~~~~~~--~~~~~~~~~----~~~~~H~vvi~Gy~~~--------~--~~~v~DP~ 144 (144)
T PF13529_consen 85 SDASFDDIKQEIDAGRPVIVSVNSGW--RPPNGDGYD----GTYGGHYVVIIGYDED--------G--YVYVNDPW 144 (144)
T ss_dssp TTS-HHHHHHHHHTT--EEEEEETTS--S--TTEEEE----E-TTEEEEEEEEE-SS--------E---EEEE-TT
T ss_pred cCCcHHHHHHHHHCCCcEEEEEEccc--ccCCCCCcC----CCcCCEEEEEEEEeCC--------C--EEEEeCCC
Confidence 34567899999998889999987421 000111111 1236899999999763 2 67776666
No 48
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=76.23 E-value=6.1 Score=39.99 Aligned_cols=47 Identities=21% Similarity=0.360 Sum_probs=31.9
Q ss_pred HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304 (524)
Q Consensus 252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~ 304 (524)
.+.|+++|.++.||.+.++++ +..|...-| ......|.|+|+||+.+
T Consensus 78 ~~~l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~ 124 (317)
T PF14399_consen 78 WEELKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEE 124 (317)
T ss_pred HHHHHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCC
Confidence 457777888888999998874 222332211 12346899999999975
No 49
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=75.50 E-value=7 Score=35.93 Aligned_cols=56 Identities=14% Similarity=0.202 Sum_probs=36.8
Q ss_pred CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF 465 (524)
Q Consensus 392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW 465 (524)
.+.+++++++-++-||.+..+.-+ ........||++||||-.-.+ |.++.++=|=|
T Consensus 89 ~s~~eV~~~~~~nk~i~i~~~~v~------------~~~~~~~gHAlavvGya~~~~------g~~~y~~WNPW 144 (175)
T PF05543_consen 89 PSFDEVKKLIDNNKGIAILADRVE------------QTNGPHAGHALAVVGYAKPNN------GQKTYYFWNPW 144 (175)
T ss_dssp --HHHHHHHHHTT-EEEEEEEETT------------SCTTB--EEEEEEEEEEEETT------SEEEEEEE-TT
T ss_pred CCHHHHHHHHHcCCCeEEEecccc------------cCCCCccceeEEEEeeeecCC------CCeEEEEeCCc
Confidence 357889999999889887665321 112234789999999976542 68899997766
No 50
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=74.70 E-value=8.8 Score=36.61 Aligned_cols=54 Identities=15% Similarity=0.222 Sum_probs=34.2
Q ss_pred CHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEc---CC-C--CCCCCcEEEEEEeccC
Q psy1664 251 NEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKH---VA-G--GPLGEHAIRIIGWGQE 304 (524)
Q Consensus 251 ~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~---~~-~--~~~~~HaV~iVGyg~~ 304 (524)
..++|...|.++||++|-++..- .-..-++-.... .+ + ....+|-|+|+||+.+
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~ 172 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA 172 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence 57899999999999998888641 000002222111 11 1 2346899999999976
No 51
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=70.14 E-value=8.9 Score=35.37 Aligned_cols=39 Identities=18% Similarity=0.244 Sum_probs=30.7
Q ss_pred CCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664 250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304 (524)
Q Consensus 250 ~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~ 304 (524)
.+.++|+..|.+..||.+-.... -. ..-|+|+|+||++.
T Consensus 121 ksl~~ik~ql~kg~PV~iw~T~~---~~-------------~s~H~v~itgyDk~ 159 (195)
T COG4990 121 KSLSDIKGQLLKGRPVVIWVTNF---HS-------------YSIHSVLITGYDKY 159 (195)
T ss_pred CcHHHHHHHHhcCCcEEEEEecc---cc-------------cceeeeEeeccccc
Confidence 46899999999999998766652 11 23699999999964
No 52
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=66.43 E-value=57 Score=29.61 Aligned_cols=38 Identities=21% Similarity=0.196 Sum_probs=29.0
Q ss_pred HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664 252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 304 (524)
Q Consensus 252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~ 304 (524)
.+.+...|.++||+-++.... ......|+++|.|-..+
T Consensus 98 ~e~~~~LL~~yGPLwv~~~~P---------------~~~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 98 AEGLANLLREYGPLWVAWEAP---------------GDSWVAHASVITGIDGD 135 (166)
T ss_pred HHHHHHHHHHcCCeEEEecCC---------------CCcceeeEEEEEeecCC
Confidence 578889999999999886542 12234699999998865
No 53
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=65.95 E-value=13 Score=37.44 Aligned_cols=47 Identities=21% Similarity=0.387 Sum_probs=30.6
Q ss_pred HHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664 395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447 (524)
Q Consensus 395 ~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~ 447 (524)
+.|+++|.+.-||.+.++++ +..|...-| ......|.|+|+||+++.
T Consensus 79 ~~l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~~ 125 (317)
T PF14399_consen 79 EELKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEEE 125 (317)
T ss_pred HHHHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCCC
Confidence 46666666666999998763 333433322 223468999999998754
No 54
>KOG4128|consensus
Probab=55.75 E-value=1.8 Score=43.46 Aligned_cols=42 Identities=19% Similarity=0.307 Sum_probs=31.6
Q ss_pred cCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEE
Q psy1664 434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV 478 (524)
Q Consensus 434 ~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~ 478 (524)
..||++|.|-|.-.. .+.+-.=|-|.||||.+-|.+|+..+.
T Consensus 371 mthAml~T~v~~kd~---~~g~~~~~rVenswgkd~gkkg~~~mt 412 (457)
T KOG4128|consen 371 MTHAMLLTSVGLKDP---ATGGLNEHRVENSWGKDLGKKGVNKMT 412 (457)
T ss_pred HHHHHHhhhccccCc---ccCCchhhhhhchhhhhccccchhhhh
Confidence 579999999984221 122445799999999999999996553
No 55
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=50.38 E-value=56 Score=31.24 Aligned_cols=53 Identities=11% Similarity=0.220 Sum_probs=33.7
Q ss_pred CHHHHHHHHHhCCCEEEEEeccccccc---ccccEEeC---CC---CCCccCeeEEEeeecCCC
Q psy1664 393 NEETIMREIFRHGPVEGSMTIYADMIL---YKTGIYKH---VA---GGPLGEHAIRIIGWGQEP 447 (524)
Q Consensus 393 ~~~~~~~~~~~~gPv~~~~~~~~~f~~---y~~gi~~~---~~---~~~~~~H~v~ivG~g~~~ 447 (524)
..++|...|...||+.+.++.. +.. -+.-.... .+ .....+|-|+|+||+...
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~--~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~ 173 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDAS--LLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAAT 173 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccc--cccChhhcccccccccccccCCCCCccEEEEEEEeecCCC
Confidence 4789999999999888877653 222 02222111 11 123468999999998754
No 56
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=43.63 E-value=43 Score=28.94 Aligned_cols=34 Identities=24% Similarity=0.335 Sum_probs=25.2
Q ss_pred HHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEec
Q psy1664 255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 302 (524)
Q Consensus 255 ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg 302 (524)
+++.|....||.+.+... . .....+|.|+|+||.
T Consensus 70 ~~~~l~~~~Pvi~~~~~~--------~------~~~~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG--------V------SITPSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEecC--------c------ccCCCCeEEEEEEEc
Confidence 677788888999887751 0 112358999999998
No 57
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.67 E-value=98 Score=28.74 Aligned_cols=40 Identities=18% Similarity=0.202 Sum_probs=29.7
Q ss_pred CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664 392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447 (524)
Q Consensus 392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~ 447 (524)
.+..+|+..|.+..||.+-... |.. ..-|+|+|+||++..
T Consensus 121 ksl~~ik~ql~kg~PV~iw~T~---~~~-------------~s~H~v~itgyDk~n 160 (195)
T COG4990 121 KSLSDIKGQLLKGRPVVIWVTN---FHS-------------YSIHSVLITGYDKYN 160 (195)
T ss_pred CcHHHHHHHHhcCCcEEEEEec---ccc-------------cceeeeEeecccccc
Confidence 3678999999999999776543 322 235999999997653
No 58
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=31.50 E-value=1.1e+02 Score=27.89 Aligned_cols=39 Identities=21% Similarity=0.177 Sum_probs=27.8
Q ss_pred HHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664 394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP 447 (524)
Q Consensus 394 ~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~ 447 (524)
.+.+...|.++|||-++.... ......|+++|.|-..+.
T Consensus 98 ~e~~~~LL~~yGPLwv~~~~P---------------~~~~~~H~~ViTGI~~dg 136 (166)
T PF12385_consen 98 AEGLANLLREYGPLWVAWEAP---------------GDSWVAHASVITGIDGDG 136 (166)
T ss_pred HHHHHHHHHHcCCeEEEecCC---------------CCcceeeEEEEEeecCCC
Confidence 578899999999999885442 112235888888886543
No 59
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=29.89 E-value=1.9e+02 Score=24.71 Aligned_cols=34 Identities=24% Similarity=0.335 Sum_probs=24.2
Q ss_pred HHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeec
Q psy1664 397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG 444 (524)
Q Consensus 397 ~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g 444 (524)
++..+...-||.+.+... . .....+|.|+|+||.
T Consensus 70 ~~~~l~~~~Pvi~~~~~~--------~------~~~~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG--------V------SITPSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEecC--------c------ccCCCCeEEEEEEEc
Confidence 667777777998877640 0 112368999999997
No 60
>PF01640 Peptidase_C10: Peptidase C10 family classification.; InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=29.47 E-value=1.5e+02 Score=27.80 Aligned_cols=52 Identities=27% Similarity=0.356 Sum_probs=31.5
Q ss_pred HHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCc
Q psy1664 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332 (524)
Q Consensus 253 ~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy 332 (524)
+.|+.+|.+..||...-.-. ...||-+|=||..+ .|+-+==.||-. .+||
T Consensus 141 ~~i~~el~~~rPV~~~g~~~------------------~~GHawViDGy~~~----------~~~H~NwGW~G~--~nGy 190 (192)
T PF01640_consen 141 DMIRNELDNGRPVLYSGNSK------------------SGGHAWVIDGYDSD----------GYFHCNWGWGGS--SNGY 190 (192)
T ss_dssp HHHHHHHHTT--EEEEEEET------------------TEEEEEEEEEEESS----------SEEEEE-SSTTT--T-EE
T ss_pred HHHHHHHHcCCCEEEEEecC------------------CCCeEEEEcCccCC----------CeEEEeeCccCC--CCCc
Confidence 56888899999997554421 12899999999643 466554344422 4677
Q ss_pred cc
Q psy1664 333 FR 334 (524)
Q Consensus 333 ~r 334 (524)
|+
T Consensus 191 y~ 192 (192)
T PF01640_consen 191 YR 192 (192)
T ss_dssp EE
T ss_pred cC
Confidence 64
No 61
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=29.38 E-value=68 Score=32.57 Aligned_cols=29 Identities=10% Similarity=0.182 Sum_probs=22.8
Q ss_pred CCcEEEEEEeccCCCCCCCccceeEEEEeCCCCC
Q psy1664 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT 325 (524)
Q Consensus 292 ~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~ 325 (524)
.+||=.|++.-.-. ..+.+...+||-||.
T Consensus 235 ~~HaY~Vl~~~~~~-----~~~~~lv~lrNPWg~ 263 (315)
T cd00044 235 KGHAYSVLDVREVQ-----EEGLRLLRLRNPWGV 263 (315)
T ss_pred cCcceEEeEEEEEc-----cCceEEEEecCCccC
Confidence 58999999998641 026789999999994
Done!