Query psy11694
Match_columns 655
No_of_seqs 552 out of 3057
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 16:29:44 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11694.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11694hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 1.2E-69 2.7E-74 539.7 23.3 272 191-484 69-359 (372)
2 PTZ00203 cathepsin L protease; 100.0 5.9E-63 1.3E-67 519.4 28.0 283 189-489 34-332 (348)
3 PTZ00021 falcipain-2; Provisio 100.0 7.3E-62 1.6E-66 524.4 27.0 292 191-495 167-486 (489)
4 PTZ00200 cysteine proteinase; 100.0 2.6E-59 5.6E-64 503.8 27.6 296 190-495 123-443 (448)
5 KOG1543|consensus 100.0 1.6E-55 3.5E-60 460.1 26.2 264 197-485 30-312 (325)
6 cd02621 Peptidase_C1A_Cathepsi 100.0 2.3E-46 4.9E-51 380.0 20.3 192 296-491 1-236 (243)
7 cd02620 Peptidase_C1A_Cathepsi 100.0 7.5E-46 1.6E-50 373.9 19.5 190 297-489 1-230 (236)
8 cd02698 Peptidase_C1A_Cathepsi 100.0 1.3E-45 2.8E-50 372.9 20.1 181 296-482 1-219 (239)
9 cd02248 Peptidase_C1A Peptidas 100.0 7.8E-45 1.7E-49 361.2 20.9 189 297-490 1-205 (210)
10 PTZ00364 dipeptidyl-peptidase 100.0 2.4E-44 5.3E-49 392.6 20.5 206 293-502 202-466 (548)
11 PTZ00049 cathepsin C-like prot 100.0 1.2E-43 2.5E-48 390.1 20.5 197 293-493 378-672 (693)
12 PF00112 Peptidase_C1: Papain 100.0 1.1E-42 2.3E-47 347.7 14.9 199 296-495 1-218 (219)
13 smart00645 Pept_C1 Papain fami 100.0 1.4E-39 3E-44 312.8 16.0 150 296-488 1-166 (174)
14 cd02619 Peptidase_C1 C1 Peptid 100.0 1E-37 2.2E-42 312.6 18.8 180 299-482 1-213 (223)
15 PTZ00462 Serine-repeat antigen 100.0 1E-36 2.2E-41 343.9 19.6 182 308-489 544-773 (1004)
16 KOG1542|consensus 100.0 3.3E-35 7.1E-40 293.6 13.4 154 495-655 198-351 (372)
17 KOG1544|consensus 100.0 1.1E-35 2.5E-40 291.4 7.6 246 222-488 151-451 (470)
18 PTZ00203 cathepsin L protease; 100.0 2.2E-32 4.8E-37 287.9 13.8 148 38-201 31-179 (348)
19 cd02620 Peptidase_C1A_Cathepsi 100.0 5.7E-31 1.2E-35 265.7 13.1 149 497-655 49-218 (236)
20 cd02698 Peptidase_C1A_Cathepsi 100.0 1.2E-30 2.5E-35 263.9 14.1 145 498-655 54-213 (239)
21 cd02621 Peptidase_C1A_Cathepsi 100.0 1.4E-30 3E-35 264.3 13.8 152 498-655 55-222 (243)
22 cd02248 Peptidase_C1A Peptidas 100.0 2.2E-30 4.8E-35 257.4 14.8 148 497-655 43-192 (210)
23 PTZ00021 falcipain-2; Provisio 100.0 1.4E-30 3.1E-35 281.6 13.1 150 497-655 309-464 (489)
24 KOG1543|consensus 100.0 8.3E-30 1.8E-34 266.6 14.4 146 497-655 154-303 (325)
25 PTZ00200 cysteine proteinase; 100.0 2.6E-29 5.7E-34 271.3 10.4 185 9-200 95-287 (448)
26 PTZ00049 cathepsin C-like prot 100.0 3.7E-28 8E-33 268.0 14.7 152 500-655 441-656 (693)
27 PF00112 Peptidase_C1: Papain 99.9 1.6E-28 3.4E-33 245.3 8.5 152 496-655 45-199 (219)
28 PTZ00364 dipeptidyl-peptidase 99.9 1.5E-27 3.2E-32 261.1 13.8 153 497-655 260-439 (548)
29 cd02619 Peptidase_C1 C1 Peptid 99.9 1.5E-26 3.2E-31 231.7 13.1 153 497-655 42-207 (223)
30 PTZ00462 Serine-repeat antigen 99.9 1.2E-26 2.5E-31 262.8 13.4 157 497-655 575-760 (1004)
31 KOG1544|consensus 99.9 5.3E-26 1.2E-30 223.6 8.8 155 499-655 258-440 (470)
32 COG4870 Cysteine protease [Pos 99.9 4.8E-25 1E-29 223.6 7.7 205 294-655 97-308 (372)
33 smart00645 Pept_C1 Papain fami 99.9 2.2E-24 4.7E-29 207.3 8.8 108 498-655 45-154 (174)
34 cd00585 Peptidase_C1B Peptidas 99.9 2.6E-23 5.6E-28 223.2 11.8 280 309-655 55-394 (437)
35 PF03051 Peptidase_C1_2: Pepti 99.7 3.1E-19 6.7E-24 191.9 2.8 281 309-655 56-395 (438)
36 COG4870 Cysteine protease [Pos 99.7 4.7E-18 1E-22 173.0 7.7 39 147-185 98-136 (372)
37 PF08246 Inhibitor_I29: Cathep 99.7 1E-17 2.2E-22 130.4 7.6 58 45-102 1-58 (58)
38 smart00848 Inhibitor_I29 Cathe 99.6 2.9E-15 6.3E-20 116.3 5.0 57 45-101 1-57 (57)
39 COG3579 PepC Aminopeptidase C 99.4 1.1E-13 2.3E-18 138.3 3.8 90 563-655 289-397 (444)
40 PF08246 Inhibitor_I29: Cathep 99.4 5.9E-13 1.3E-17 103.6 6.7 58 193-250 1-58 (58)
41 cd00585 Peptidase_C1B Peptidas 99.3 2.3E-12 4.9E-17 139.1 8.8 75 404-481 285-399 (437)
42 smart00848 Inhibitor_I29 Cathe 99.2 4.7E-11 1E-15 92.5 5.3 57 193-249 1-57 (57)
43 KOG4128|consensus 98.1 6.3E-07 1.4E-11 90.2 0.0 76 309-385 63-169 (457)
44 PF03051 Peptidase_C1_2: Pepti 97.9 2.1E-05 4.6E-10 85.5 7.2 64 417-480 300-399 (438)
45 PF08127 Propeptide_C1: Peptid 96.3 0.0037 8E-08 44.4 2.8 36 74-111 4-39 (41)
46 PF08127 Propeptide_C1: Peptid 95.0 0.023 5E-07 40.4 2.8 35 222-258 4-38 (41)
47 PF13529 Peptidase_C39_2: Pept 92.0 0.58 1.2E-05 42.4 7.4 58 572-645 87-144 (144)
48 COG3579 PepC Aminopeptidase C 90.5 0.31 6.7E-06 50.3 4.2 57 423-479 308-400 (444)
49 PF14399 Transpep_BrtH: NlpC/p 78.6 5 0.00011 42.1 6.8 49 573-629 77-125 (317)
50 PF09778 Guanylate_cyc_2: Guan 76.7 9.2 0.0002 37.7 7.3 57 573-629 112-173 (212)
51 PF05543 Peptidase_C47: Stapho 75.9 8.1 0.00018 36.6 6.4 55 573-645 90-144 (175)
52 PF13529 Peptidase_C39_2: Pept 66.8 65 0.0014 28.5 10.4 50 415-466 88-144 (144)
53 COG4990 Uncharacterized protei 64.2 17 0.00036 34.7 5.7 51 569-646 118-168 (195)
54 cd02549 Peptidase_C39A A sub-f 58.5 29 0.00062 31.3 6.4 34 577-626 70-103 (141)
55 PF01640 Peptidase_C10: Peptid 51.5 28 0.00061 33.8 5.3 50 574-655 140-191 (192)
56 PF12385 Peptidase_C70: Papain 47.5 42 0.0009 31.4 5.3 40 573-629 97-136 (166)
57 KOG4128|consensus 42.6 9.3 0.0002 39.7 0.4 64 415-478 305-411 (457)
58 PF05543 Peptidase_C47: Stapho 41.8 96 0.0021 29.6 6.9 113 313-467 18-145 (175)
59 PF11567 PfUIS3: Plasmodium fa 39.1 27 0.00059 28.9 2.4 32 62-106 20-51 (101)
60 PF14663 RasGEF_N_2: Rapamycin 30.6 30 0.00064 30.7 1.6 58 3-65 49-106 (115)
61 PF12385 Peptidase_C70: Papain 30.3 3.4E+02 0.0074 25.6 8.3 23 415-437 97-119 (166)
62 KOG4621|consensus 29.7 1.5E+02 0.0033 26.6 5.7 57 573-629 58-124 (167)
63 PF07172 GRP: Glycine rich pro 28.4 36 0.00077 29.1 1.6 20 11-30 3-22 (95)
64 KOG4702|consensus 26.1 3.2E+02 0.0069 21.8 6.3 36 40-76 26-61 (77)
No 1
>KOG1542|consensus
Probab=100.00 E-value=1.2e-69 Score=539.67 Aligned_cols=272 Identities=40% Similarity=0.777 Sum_probs=243.6
Q ss_pred chhhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHHHh-hcCCCCC-CcccccCCC
Q psy11694 191 NLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLD-STLEDIQPS 268 (655)
Q Consensus 191 ~l~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~-~~~~~~~-~~~~~~~~~ 268 (655)
+.|.+|..+|+|+|.+.+|...|+.||+.|+..++.++....+++.+|+|+|||||+|||++ +++.+.. ..+.
T Consensus 69 ~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~----- 143 (372)
T KOG1542|consen 69 DSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLP----- 143 (372)
T ss_pred HHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccccCc-----
Confidence 46889999999999999999999999999999999999888889999999999999999999 7765442 1110
Q ss_pred CCCCCCCCCcchhhhhhhcccCCCCCCCCCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHH
Q psy11694 269 LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQL 348 (655)
Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l 348 (655)
... ... .+.....||++||||++|.||||||||+||||||||+++++|+.++|+++++++||||+|
T Consensus 144 ----~~~---------~~~-~~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeL 209 (372)
T KOG1542|consen 144 ----GDA---------AEA-PIEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQEL 209 (372)
T ss_pred ----ccc---------ccC-cCCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhh
Confidence 000 000 012246899999999999999999999999999999999999999999999999999999
Q ss_pred HHccCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCC
Q psy11694 349 VDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGP 428 (655)
Q Consensus 349 ~dC~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gP 428 (655)
|||+..++||+||.+..||+|+.+.+|+..|.+|||++..+. .|..++.. ..+.|++|..++. ||++|.+.|.++||
T Consensus 210 vDCD~~d~gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~-~C~~~~~~-~~v~I~~f~~l~~-nE~~ia~wLv~~GP 286 (372)
T KOG1542|consen 210 VDCDSCDNGCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGN-QCHFDKSK-IVVSIKDFSMLSN-NEDQIAAWLVTFGP 286 (372)
T ss_pred hcccCcCCcCCCCChhHHHHHHHHhCCccccccCCccccCCC-ccccchhh-ceEEEeccEecCC-CHHHHHHHHHhcCC
Confidence 999999999999999999999888889999999999998875 89988866 8899999999998 99999999999999
Q ss_pred eEEEEEcCCcccccccEEe----------CCceE----EcC---CccEEEEecCCCCccCCCceEEecccCCc
Q psy11694 429 LSVGMNANGLFYYSGGVID----------LNQRL----YGT---SIPYWIVKNSWGSDWGEKVEDKVGSSGNR 484 (655)
Q Consensus 429 V~v~i~~~~~~~Y~~Gi~~----------~~Hav----yg~---g~~yWivkNSWG~~WGe~Gy~~i~~~~~~ 484 (655)
|+|+|++..+++|.+||.. ++|+| ||. ..+|||||||||++|||+||+|+.|+.|.
T Consensus 287 i~vgiNa~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~ 359 (372)
T KOG1542|consen 287 LSVGINAKPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNA 359 (372)
T ss_pred eEEEEchHHHHHhcccccCCCcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccc
Confidence 9999998889999999986 79999 776 58999999999999999999999998664
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=5.9e-63 Score=519.37 Aligned_cols=283 Identities=34% Similarity=0.661 Sum_probs=231.7
Q ss_pred ccchhhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHHHh-hcCCCCCCcccccCC
Q psy11694 189 GNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLDSTLEDIQP 267 (655)
Q Consensus 189 ~~~l~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~-~~~~~~~~~~~~~~~ 267 (655)
...+|+.|+++|+|.|.+.+|+.+|+.||++|+++|++||++. .+|++|+|+|+|||+|||.+ +++.... .. .+
T Consensus 34 ~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~-~~~~lg~N~FaDlT~eEf~~~~l~~~~~--~~--~~ 108 (348)
T PTZ00203 34 AAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARN-PHARFGITKFFDLSEAEFAARYLNGAAY--FA--AA 108 (348)
T ss_pred HHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccC-CCeEEeccccccCCHHHHHHHhcCCCcc--cc--cc
Confidence 4457999999999999998888999999999999999999864 48999999999999999998 6542111 00 00
Q ss_pred CCCCCCCCCCcchhhhhhhcccCCCCCCCCCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHH
Q psy11694 268 SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQ 347 (655)
Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~ 347 (655)
...... .... ......+||++||||++|+|+||||||.||||||||++++||++++|++++.+.||+|+
T Consensus 109 --~~~~~~-----~~~~----~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQq 177 (348)
T PTZ00203 109 --KQHAGQ-----HYRK----ARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQ 177 (348)
T ss_pred --cccccc-----cccc----cccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHH
Confidence 000000 0000 00012368999999999999999999999999999999999999999999999999999
Q ss_pred HHHccCCCCCCCCCcHHHHHHHHHHc--CCCCCCCCCCCCCCCCC-CccccCCCCCceeeeeceEEcCCCCHHHHHHHHh
Q psy11694 348 LVDCDMSNGGCNGGRMDDALQYIIDN--GGVVSDQAYPYKASESE-RGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA 424 (655)
Q Consensus 348 l~dC~~~~~gC~GG~~~~a~~~~~~~--~Gi~~e~~yPY~~~~~~-~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~ 424 (655)
|+||+..+.||+||++..||+|+.++ +|+++|++|||.+..+. ..|.........+++.+|..++. +++.|+.+|+
T Consensus 178 LvdC~~~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~~-~e~~~~~~l~ 256 (348)
T PTZ00203 178 LVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMES-SERVMAAWLA 256 (348)
T ss_pred HHhccCCCCCCCCCCHHHHHHHHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecCc-CHHHHHHHHH
Confidence 99999878899999999999999864 67999999999987652 25764332213567889988876 7899999999
Q ss_pred hCCCeEEEEEcCCcccccccEEe------CCceE----EcC--CccEEEEecCCCCccCCCceEEecccCCcccccc
Q psy11694 425 TRGPLSVGMNANGLFYYSGGVID------LNQRL----YGT--SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489 (655)
Q Consensus 425 ~~gPV~v~i~~~~~~~Y~~Gi~~------~~Hav----yg~--g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~ 489 (655)
++|||+|+|++..|++|++|||. ++||| ||. +.+|||||||||++|||+|||||+|+.|. |++.
T Consensus 257 ~~GPv~v~i~a~~f~~Y~~GIy~~c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~-Cgi~ 332 (348)
T PTZ00203 257 KNGPISIAVDASSFMSYHSGVLTSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNA-CLLT 332 (348)
T ss_pred hCCCEEEEEEhhhhcCccCceeeccCCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCc-cccc
Confidence 99999999999779999999997 68999 776 67999999999999999999999997553 4443
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=7.3e-62 Score=524.38 Aligned_cols=292 Identities=33% Similarity=0.599 Sum_probs=234.7
Q ss_pred chhhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHHHh-hcCCCCCCcccccCCCC
Q psy11694 191 NLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLDSTLEDIQPSL 269 (655)
Q Consensus 191 ~l~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~-~~~~~~~~~~~~~~~~~ 269 (655)
..|+.|+++|+|+|.+.+|+..|+.||++|+++|++||++++.+|++|+|+|+|||.|||++ +++++... ... ...
T Consensus 167 ~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~-~~~--~~~ 243 (489)
T PTZ00021 167 NSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFD-FKS--NGK 243 (489)
T ss_pred HHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccc-ccc--ccc
Confidence 46999999999999999999999999999999999999876679999999999999999998 76643210 000 000
Q ss_pred CCCCCCCCcchhhhhhhcccCCCCCCCCCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHHH
Q psy11694 270 QAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLV 349 (655)
Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~ 349 (655)
......... ........ .....+|++||||++|.|+||||||.||||||||++++||++++|+++..+.||+|+|+
T Consensus 244 ~~~~~~~~~-~~~~~~~~---~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLV 319 (489)
T PTZ00021 244 KSPRVINYD-DVIKKYKP---KDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELV 319 (489)
T ss_pred ccccccccc-cccccccc---ccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHh
Confidence 000000000 00000000 00113599999999999999999999999999999999999999999999999999999
Q ss_pred HccCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCCCCCC-CCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCC
Q psy11694 350 DCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE-SERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGP 428 (655)
Q Consensus 350 dC~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~-~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gP 428 (655)
||+..+.||+||++..||+|+.+++||++|++|||.+.. + .|....+. ..++|.+|..++ +++|+++|+..||
T Consensus 320 DCs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~~--~C~~~~~~-~~~~i~~y~~i~---~~~lk~al~~~GP 393 (489)
T PTZ00021 320 DCSFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTPE--LCNIDRCK-EKYKIKSYVSIP---EDKFKEAIRFLGP 393 (489)
T ss_pred hhccCCCCCCCcchHhhhhhhhhccccCcccccCccCCCCC--cccccccc-ccceeeeEEEec---HHHHHHHHHhcCC
Confidence 999888999999999999999888899999999999874 4 78765443 567889998886 5789999999999
Q ss_pred eEEEEEcCC-cccccccEEe------CCceE----EcCC------------ccEEEEecCCCCccCCCceEEecccCC--
Q psy11694 429 LSVGMNANG-LFYYSGGVID------LNQRL----YGTS------------IPYWIVKNSWGSDWGEKVEDKVGSSGN-- 483 (655)
Q Consensus 429 V~v~i~~~~-~~~Y~~Gi~~------~~Hav----yg~g------------~~yWivkNSWG~~WGe~Gy~~i~~~~~-- 483 (655)
|+|+|++.. |++|++|||+ ++||| ||.. .+|||||||||++|||+|||||+|+.+
T Consensus 394 Vsv~i~a~~~f~~YkgGIy~~~C~~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~ 473 (489)
T PTZ00021 394 ISVSIAVSDDFAFYKGGIFDGECGEEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDENGL 473 (489)
T ss_pred eEEEEEeecccccCCCCcCCCCCCCccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCCCCC
Confidence 999999975 9999999997 69999 6641 369999999999999999999999754
Q ss_pred -ccccccccCCcc
Q psy11694 484 -RTRDLELTGVLP 495 (655)
Q Consensus 484 -~~~~~~~~gv~p 495 (655)
..|++......|
T Consensus 474 ~n~CGI~t~a~yP 486 (489)
T PTZ00021 474 MKTCSLGTEAYVP 486 (489)
T ss_pred CCCCCCcccceeE
Confidence 356665544443
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=2.6e-59 Score=503.78 Aligned_cols=296 Identities=29% Similarity=0.513 Sum_probs=230.4
Q ss_pred cchhhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHHHh-hcCCCCCCcccccCCC
Q psy11694 190 NNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLDSTLEDIQPS 268 (655)
Q Consensus 190 ~~l~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~-~~~~~~~~~~~~~~~~ 268 (655)
...|+.|+++|+|.|.+.+|+..|+.+|++|++.|++||.. .+|++|+|+|+|||+|||.+ +++++.+.......+.
T Consensus 123 ~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~--~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~~~ 200 (448)
T PTZ00200 123 YLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD--EPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHN 200 (448)
T ss_pred HHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc--CCeEEeccccccCCHHHHHHHhccCCCcccccccccc
Confidence 35699999999999999989999999999999999999963 58999999999999999998 6654322100000000
Q ss_pred CC--CCCCCCCc-chhhh-hhhc-ccC-CCCCCCCCceecCCCCCCCcCCCCC-CCccHHHHHHHHHHHHHHHHhcCCCc
Q psy11694 269 LQ--APFSSNQT-DTEMR-AFQF-NSL-RHGDDLPEAFDWRAEGVISKVKEQG-KCACCWAFSAVGVVEAMHAIQGNSLT 341 (655)
Q Consensus 269 ~~--~~~~~~~~-~~~~~-~~~~-~~~-~~~~~lP~~~Dwr~~g~vtpVkdQg-~CGsCwAfa~~~~le~~~~i~~~~~~ 341 (655)
.. ........ ..... ..+. ..+ .....+|++||||+.|.|+|||||| .||||||||+++++|++++|+++..+
T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~~~~ 280 (448)
T PTZ00200 201 NDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSV 280 (448)
T ss_pred cccccccccccccccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHHHHhcCCCe
Confidence 00 00000000 00000 0000 000 0012369999999999999999999 99999999999999999999999999
Q ss_pred CCCHHHHHHccCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHH
Q psy11694 342 ELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKK 421 (655)
Q Consensus 342 ~lS~q~l~dC~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~ 421 (655)
.||+|+|+||+..+.||+||++..||+|+.++ ||++|++|||.+..+ .|..... ..+.|.+|..++ +.+.|++
T Consensus 281 ~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~-Gi~~e~~YPY~~~~~--~C~~~~~--~~~~i~~y~~~~--~~~~l~~ 353 (448)
T PTZ00200 281 DLSEQELVNCDTKSQGCSGGYPDTALEYVKNK-GLSSSSDVPYLAKDG--KCVVSST--KKVYIDSYLVAK--GKDVLNK 353 (448)
T ss_pred ecCHHHHhhccCccCCCCCCcHHHHHHHHhhc-CccccccCCCCCCCC--CCcCCCC--CeeEecceEecC--HHHHHHH
Confidence 99999999999888999999999999999877 999999999999887 8976543 356688887664 3556666
Q ss_pred HHhhCCCeEEEEEcCC-cccccccEEe------CCceE----EcC----CccEEEEecCCCCccCCCceEEeccc--CCc
Q psy11694 422 WVATRGPLSVGMNANG-LFYYSGGVID------LNQRL----YGT----SIPYWIVKNSWGSDWGEKVEDKVGSS--GNR 484 (655)
Q Consensus 422 ~l~~~gPV~v~i~~~~-~~~Y~~Gi~~------~~Hav----yg~----g~~yWivkNSWG~~WGe~Gy~~i~~~--~~~ 484 (655)
++. .|||+|+|++.. |+.|++|||. ++||| ||. |.+|||||||||++|||+|||||+|+ +..
T Consensus 354 ~l~-~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n 432 (448)
T PTZ00200 354 SLV-ISPTVVYIAVSRELLKYKSGVYNGECGKSLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTD 432 (448)
T ss_pred HHh-cCCEEEEeecccccccCCCCccccccCCCCcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCCCCCC
Confidence 665 899999999975 9999999997 69999 663 67899999999999999999999996 345
Q ss_pred cccccccCCcc
Q psy11694 485 TRDLELTGVLP 495 (655)
Q Consensus 485 ~~~~~~~gv~p 495 (655)
.|++...+..|
T Consensus 433 ~CGI~~~~~~P 443 (448)
T PTZ00200 433 KCGILTVGLTP 443 (448)
T ss_pred cCCccccceee
Confidence 67777766655
No 5
>KOG1543|consensus
Probab=100.00 E-value=1.6e-55 Score=460.06 Aligned_cols=264 Identities=37% Similarity=0.687 Sum_probs=228.4
Q ss_pred hhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHHHh-hcCCCCCCcccccCCCCCCCCCC
Q psy11694 197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLDSTLEDIQPSLQAPFSS 275 (655)
Q Consensus 197 ~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~-~~~~~~~~~~~~~~~~~~~~~~~ 275 (655)
+.+|.+.|.+..|...|+.+|.+|+..|..||.....+|.+|+|+|+|+|.+|+.+ +.+.+.+.. ...
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~--------~~~--- 98 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEI--------KRD--- 98 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccc--------ccc---
Confidence 56778888877788899999999999999999976779999999999999999999 666544311 000
Q ss_pred CCcchhhhhhhcccCCCCCCCCCceecCCCC-CCCcCCCCCCCccHHHHHHHHHHHHHHHHhcC-CCcCCCHHHHHHccC
Q psy11694 276 NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGN-SLTELSVQQLVDCDM 353 (655)
Q Consensus 276 ~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g-~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~dC~~ 353 (655)
.........++|++||||+++ .++||||||.||||||||++++||++++|+++ .++.||+|+|+||+.
T Consensus 99 ----------~~~~~~~~~~~p~s~DwR~~~~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~ 168 (325)
T KOG1543|consen 99 ----------KFTEKLDGDDLPDSFDWRDKGAVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCG 168 (325)
T ss_pred ----------ccccccchhhCCCCccccccCCcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccC
Confidence 000111246899999999996 55559999999999999999999999999999 899999999999998
Q ss_pred C-CCCCCCCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEE
Q psy11694 354 S-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVG 432 (655)
Q Consensus 354 ~-~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~ 432 (655)
. +.||+||++..|++|+.+++++.++.+|||.+..+ .|..+... ..+.+.++..++. ++++|+.+|+.+|||+|+
T Consensus 169 ~~~~GC~GG~~~~A~~yi~~~G~~t~~~~Ypy~~~~~--~C~~~~~~-~~~~~~~~~~~~~-~e~~i~~~v~~~GPv~v~ 244 (325)
T KOG1543|consen 169 ECGDGCNGGEPKNAFKYIKKNGGVTECENYPYIGKDG--TCKSNKKD-KTVTIKGFYNVPA-NEEAIAEAVAKNGPVSVA 244 (325)
T ss_pred CCCCCcCCCCHHHHHHHHHHhCCCCCCcCCCCcCCCC--CccCCCcc-ceeEeeeeeecCc-CHHHHHHHHHhcCCeEEE
Confidence 4 88999999999999999995555599999999998 99988764 7788899998888 499999999999999999
Q ss_pred EEcCC-cccccccEEe--------CCceE----EcC--CccEEEEecCCCCccCCCceEEecccCCcc
Q psy11694 433 MNANG-LFYYSGGVID--------LNQRL----YGT--SIPYWIVKNSWGSDWGEKVEDKVGSSGNRT 485 (655)
Q Consensus 433 i~~~~-~~~Y~~Gi~~--------~~Hav----yg~--g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~ 485 (655)
|++.. |+.|++|||. ++||| ||+ +.+|||||||||++|||+|||||.|+.+..
T Consensus 245 ~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~ 312 (325)
T KOG1543|consen 245 IDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKC 312 (325)
T ss_pred EeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCch
Confidence 99998 9999999997 68999 886 579999999999999999999999987764
No 6
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=2.3e-46 Score=379.96 Aligned_cols=192 Identities=33% Similarity=0.617 Sum_probs=163.4
Q ss_pred CCCceecCCCC----CCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCC------CcCCCHHHHHHccCCCCCCCCCcHHH
Q psy11694 296 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNS------LTELSVQQLVDCDMSNGGCNGGRMDD 365 (655)
Q Consensus 296 lP~~~Dwr~~g----~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~------~~~lS~q~l~dC~~~~~gC~GG~~~~ 365 (655)
||++||||+.+ +|+||||||.||||||||++++||++++|+++. .+.||+|+|+||+..+.||+||++..
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~ 80 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL 80 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence 79999999988 999999999999999999999999999998776 68999999999998788999999999
Q ss_pred HHHHHHHcCCCCCCCCCCCCC-CCCCCccccCCCCCceeeeeceEEcC----CCCHHHHHHHHhhCCCeEEEEEcCC-cc
Q psy11694 366 ALQYIIDNGGVVSDQAYPYKA-SESERGCLVGEEEGFKVKVKEYSRIP----YGEEEEMKKWVATRGPLSVGMNANG-LF 439 (655)
Q Consensus 366 a~~~~~~~~Gi~~e~~yPY~~-~~~~~~C~~~~~~~~~~~i~~~~~v~----~~~~~~ik~~l~~~gPV~v~i~~~~-~~ 439 (655)
|++|+.++ |+++|++|||.. ... .|.........+++..|..+. ..++++||++|+++|||+++|++.. |+
T Consensus 81 a~~~~~~~-Gi~~e~~yPY~~~~~~--~C~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~ 157 (243)
T cd02621 81 VGKFAEDF-GIVTEDYFPYTADDDR--PCKASPSECRRYYFSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFD 157 (243)
T ss_pred HHHHHHhc-CcCCCceeCCCCCCCC--CCCCCccccccccccceeEcccccccCCHHHHHHHHHHcCCEEEEEEeccccc
Confidence 99999887 999999999998 555 787554121344455555442 3478999999999999999999976 99
Q ss_pred cccccEEe--------------------CCceE----EcC----CccEEEEecCCCCccCCCceEEecccCCcccccccc
Q psy11694 440 YYSGGVID--------------------LNQRL----YGT----SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELT 491 (655)
Q Consensus 440 ~Y~~Gi~~--------------------~~Hav----yg~----g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~ 491 (655)
+|++|||+ ++|+| ||+ +.+|||||||||++|||+|||||+|+.| .|++...
T Consensus 158 ~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~-~cgi~~~ 236 (243)
T cd02621 158 FYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTN-ECGIESQ 236 (243)
T ss_pred ccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCc-ccCcccc
Confidence 99999997 26998 775 5699999999999999999999999755 3555433
No 7
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=7.5e-46 Score=373.93 Aligned_cols=190 Identities=36% Similarity=0.685 Sum_probs=159.4
Q ss_pred CCceecCCC--CCC--CcCCCCCCCccHHHHHHHHHHHHHHHHhcC--CCcCCCHHHHHHccCC-CCCCCCCcHHHHHHH
Q psy11694 297 PEAFDWRAE--GVI--SKVKEQGKCACCWAFSAVGVVEAMHAIQGN--SLTELSVQQLVDCDMS-NGGCNGGRMDDALQY 369 (655)
Q Consensus 297 P~~~Dwr~~--g~v--tpVkdQg~CGsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~dC~~~-~~gC~GG~~~~a~~~ 369 (655)
|++||||++ +++ +||+|||.||||||||++++||++++|+++ +.+.||+|+|+||+.. +.||+||++..||+|
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~ 80 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY 80 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence 889999996 554 599999999999999999999999999887 7789999999999976 789999999999999
Q ss_pred HHHcCCCCCCCCCCCCCCCCC----------------CccccCCC---CCceeeeeceEEcCCCCHHHHHHHHhhCCCeE
Q psy11694 370 IIDNGGVVSDQAYPYKASESE----------------RGCLVGEE---EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLS 430 (655)
Q Consensus 370 ~~~~~Gi~~e~~yPY~~~~~~----------------~~C~~~~~---~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~ 430 (655)
+.++ |+++|++|||.+.... ..|..... ....+++..+..+.. ++++||.+|+++|||+
T Consensus 81 i~~~-G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~-~~~~ik~~l~~~GPv~ 158 (236)
T cd02620 81 LTTT-GVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVPS-DETDIMKEIMTNGPVQ 158 (236)
T ss_pred HHhc-CCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecceeeeCC-HHHHHHHHHHHCCCeE
Confidence 9987 9999999999876542 12432221 112345556666654 7899999999999999
Q ss_pred EEEEcCC-cccccccEEe-------CCceE----EcC--CccEEEEecCCCCccCCCceEEecccCCcccccc
Q psy11694 431 VGMNANG-LFYYSGGVID-------LNQRL----YGT--SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489 (655)
Q Consensus 431 v~i~~~~-~~~Y~~Gi~~-------~~Hav----yg~--g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~ 489 (655)
++|.+.. |+.|++|||. ++||| ||+ +.+|||||||||++|||+|||||+|+.| .|++.
T Consensus 159 v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~-~cgi~ 230 (236)
T cd02620 159 AAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSN-ECGIE 230 (236)
T ss_pred EEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCc-ccccc
Confidence 9999965 9999999997 58999 775 7799999999999999999999999754 34443
No 8
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=1.3e-45 Score=372.88 Aligned_cols=181 Identities=31% Similarity=0.652 Sum_probs=157.9
Q ss_pred CCCceecCCCC---CCCcCCCCC---CCccHHHHHHHHHHHHHHHHhcC---CCcCCCHHHHHHccCCCCCCCCCcHHHH
Q psy11694 296 LPEAFDWRAEG---VISKVKEQG---KCACCWAFSAVGVVEAMHAIQGN---SLTELSVQQLVDCDMSNGGCNGGRMDDA 366 (655)
Q Consensus 296 lP~~~Dwr~~g---~vtpVkdQg---~CGsCwAfa~~~~le~~~~i~~~---~~~~lS~q~l~dC~~~~~gC~GG~~~~a 366 (655)
||++||||+++ +|+|||||| .||||||||++++||++++|+++ ..+.||+|+|+||+. +.||+||++..|
T Consensus 1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~~~~a 79 (239)
T cd02698 1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGDPGGV 79 (239)
T ss_pred CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcCHHHH
Confidence 69999999987 999999998 89999999999999999999865 357899999999987 789999999999
Q ss_pred HHHHHHcCCCCCCCCCCCCCCCCCCccccC----------CC----CCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEE
Q psy11694 367 LQYIIDNGGVVSDQAYPYKASESERGCLVG----------EE----EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVG 432 (655)
Q Consensus 367 ~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~----------~~----~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~ 432 (655)
++|+.++ |+++|.+|||.+... .|... .+ ....+++++|..++ ++++||++|.++|||+++
T Consensus 80 ~~~~~~~-Gl~~e~~yPY~~~~~--~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~--~~~~i~~~l~~~GPV~v~ 154 (239)
T cd02698 80 YEYAHKH-GIPDETCNPYQAKDG--ECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVS--GRDKMMAEIYARGPISCG 154 (239)
T ss_pred HHHHHHc-CcCCCCeeCCcCCCC--CCcCCCCCCCcccCcccccccccceEEeeeceecC--CHHHHHHHHHHcCCEEEE
Confidence 9999887 999999999988665 44321 00 11346677777775 578999999999999999
Q ss_pred EEcCC-cccccccEEe-------CCceE----EcC---CccEEEEecCCCCccCCCceEEecccC
Q psy11694 433 MNANG-LFYYSGGVID-------LNQRL----YGT---SIPYWIVKNSWGSDWGEKVEDKVGSSG 482 (655)
Q Consensus 433 i~~~~-~~~Y~~Gi~~-------~~Hav----yg~---g~~yWivkNSWG~~WGe~Gy~~i~~~~ 482 (655)
|.+.. |+.|++|||+ ++||| ||+ +.+|||||||||++|||+|||||+|+.
T Consensus 155 i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~ 219 (239)
T cd02698 155 IMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSS 219 (239)
T ss_pred EEecccccccCCeEEccCCCCCcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccC
Confidence 99986 9999999997 68999 765 679999999999999999999999976
No 9
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=7.8e-45 Score=361.22 Aligned_cols=189 Identities=43% Similarity=0.895 Sum_probs=170.8
Q ss_pred CCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHHHHccCC-CCCCCCCcHHHHHHHHHHcCC
Q psy11694 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGG 375 (655)
Q Consensus 297 P~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~-~~gC~GG~~~~a~~~~~~~~G 375 (655)
|++||||+.+.++||+|||.||||||||++++||++++++++....||+|+|++|... +.||+||++..|++++.+. |
T Consensus 1 P~~~d~r~~~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~-G 79 (210)
T cd02248 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG-G 79 (210)
T ss_pred CCcccCCcCCCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC-C
Confidence 7899999999999999999999999999999999999999998899999999999976 7899999999999988877 9
Q ss_pred CCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCC-cccccccEEe-------
Q psy11694 376 VVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG-LFYYSGGVID------- 447 (655)
Q Consensus 376 i~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~-~~~Y~~Gi~~------- 447 (655)
+++|++|||..... .|...... ..+++.+|..++..++++||++|+++|||+++|.+.. |+.|++|||.
T Consensus 80 i~~e~~yPY~~~~~--~C~~~~~~-~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~ 156 (210)
T cd02248 80 LASESDYPYTGKDG--TCKYNSSK-VGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNT 156 (210)
T ss_pred cCccccCCccCCCC--CccCCCCc-ccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCC
Confidence 99999999998766 78766543 6789999999988779999999999999999999986 9999999997
Q ss_pred -CCceE----EcC--CccEEEEecCCCCccCCCceEEecccCCccccccc
Q psy11694 448 -LNQRL----YGT--SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLEL 490 (655)
Q Consensus 448 -~~Hav----yg~--g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~ 490 (655)
.+||| ||+ +.+|||||||||++||++|||||+++.+ .|++..
T Consensus 157 ~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~-~cgi~~ 205 (210)
T cd02248 157 NLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSN-LCGIAS 205 (210)
T ss_pred cCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCC-ccCcee
Confidence 57999 666 5799999999999999999999998764 455543
No 10
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=2.4e-44 Score=392.62 Aligned_cols=206 Identities=22% Similarity=0.462 Sum_probs=168.8
Q ss_pred CCCCCCceecCCCC---CCCcCCCCCC---CccHHHHHHHHHHHHHHHHhcC------CCcCCCHHHHHHccCCCCCCCC
Q psy11694 293 GDDLPEAFDWRAEG---VISKVKEQGK---CACCWAFSAVGVVEAMHAIQGN------SLTELSVQQLVDCDMSNGGCNG 360 (655)
Q Consensus 293 ~~~lP~~~Dwr~~g---~vtpVkdQg~---CGsCwAfa~~~~le~~~~i~~~------~~~~lS~q~l~dC~~~~~gC~G 360 (655)
..+||++||||++| +|+||||||. ||||||||++++||++++|+++ ..+.||+|+|+||+..+.||+|
T Consensus 202 ~~~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdG 281 (548)
T PTZ00364 202 GDPPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAG 281 (548)
T ss_pred ccCCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCC
Confidence 35799999999987 7999999999 9999999999999999999873 4678999999999987899999
Q ss_pred CcHHHHHHHHHHcCCCCCCCCC--CCCCCCCC-CccccCCCCCceeeeec------eEEcCCCCHHHHHHHHhhCCCeEE
Q psy11694 361 GRMDDALQYIIDNGGVVSDQAY--PYKASESE-RGCLVGEEEGFKVKVKE------YSRIPYGEEEEMKKWVATRGPLSV 431 (655)
Q Consensus 361 G~~~~a~~~~~~~~Gi~~e~~y--PY~~~~~~-~~C~~~~~~~~~~~i~~------~~~v~~~~~~~ik~~l~~~gPV~v 431 (655)
|++..|++|+.++ ||++|++| ||.+.++. ..|...... ..+.+++ |..+. .++++|+.+|+++|||+|
T Consensus 282 G~p~~A~~yi~~~-GI~tE~dY~~PY~~~dg~~~~Ck~~~~~-~~y~~~~~~~I~gyy~~~-~~e~~I~~eI~~~GPVsV 358 (548)
T PTZ00364 282 GFPEEVGKFAETF-GILTTDSYYIPYDSGDGVERACKTRRPS-RRYYFTNYGPLGGYYGAV-TDPDEIIWEIYRHGPVPA 358 (548)
T ss_pred CcHHHHHHHHHhC-CcccccccCCCCCCCCCCCCCCCCCccc-ceeeeeeeEEecceeecC-CcHHHHHHHHHHcCCeEE
Confidence 9999999999887 99999999 99876551 247654333 3333433 43333 378899999999999999
Q ss_pred EEEcCC-cccccccEEe--------------------------CCceE----EcC---CccEEEEecCCCC--ccCCCce
Q psy11694 432 GMNANG-LFYYSGGVID--------------------------LNQRL----YGT---SIPYWIVKNSWGS--DWGEKVE 475 (655)
Q Consensus 432 ~i~~~~-~~~Y~~Gi~~--------------------------~~Hav----yg~---g~~yWivkNSWG~--~WGe~Gy 475 (655)
+|++.. |+.|++|||. .+||| ||. |.+|||||||||+ +|||+||
T Consensus 359 aIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GY 438 (548)
T PTZ00364 359 SVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGT 438 (548)
T ss_pred EEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCe
Confidence 999986 8899998863 47999 774 6799999999999 9999999
Q ss_pred EEecccCCccccccccCC--ccccccccc
Q psy11694 476 DKVGSSGNRTRDLELTGV--LPSKLSRLA 502 (655)
Q Consensus 476 ~~i~~~~~~~~~~~~~gv--~p~~~~~ls 502 (655)
|||+|+.| .|+++...+ .|.++.+++
T Consensus 439 fRI~RG~N-~CGIes~~v~~~~~~~~~~~ 466 (548)
T PTZ00364 439 RKIARGVN-AYNIESEVVVMYWAPYPDVL 466 (548)
T ss_pred EEEEcCCC-cccccceeeeeeeecCCCcc
Confidence 99999865 466665544 444444444
No 11
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=1.2e-43 Score=390.13 Aligned_cols=197 Identities=26% Similarity=0.486 Sum_probs=162.3
Q ss_pred CCCCCCceecCCC----CCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCC-----C-----cCCCHHHHHHccCCCCCC
Q psy11694 293 GDDLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNS-----L-----TELSVQQLVDCDMSNGGC 358 (655)
Q Consensus 293 ~~~lP~~~Dwr~~----g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~-----~-----~~lS~q~l~dC~~~~~gC 358 (655)
..+||++||||+. +.++||+|||.||||||||++++||++++|+++. . ..||+|+||||+..+.||
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC 457 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGC 457 (693)
T ss_pred cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCc
Confidence 3579999999984 6899999999999999999999999999998642 1 279999999999888999
Q ss_pred CCCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCC--------------------------------------C
Q psy11694 359 NGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEE--------------------------------------G 400 (655)
Q Consensus 359 ~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~--------------------------------------~ 400 (655)
+||++..|++|+.+. ||++|++|||.+..+ .|...... .
T Consensus 458 ~GG~~~~A~kya~~~-GI~tEscYPY~a~~g--~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (693)
T PTZ00049 458 NGGFPYLVSKMAKLQ-GIPLDKVFPYTATEQ--TCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEP 534 (693)
T ss_pred CCCcHHHHHHHHHHC-CCCcCCccCCcCCCC--CCCCCCCCccccccccccccccccccccccccccccccccccccccc
Confidence 999999999999887 999999999998766 66432110 0
Q ss_pred ceeeeeceEEcC-------CCCHHHHHHHHhhCCCeEEEEEcCC-cccccccEEe-------------------------
Q psy11694 401 FKVKVKEYSRIP-------YGEEEEMKKWVATRGPLSVGMNANG-LFYYSGGVID------------------------- 447 (655)
Q Consensus 401 ~~~~i~~~~~v~-------~~~~~~ik~~l~~~gPV~v~i~~~~-~~~Y~~Gi~~------------------------- 447 (655)
.++.+++|..+. ..++++|+.+|+++|||+|+|++.. |++|++|||.
T Consensus 535 ~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G 614 (693)
T PTZ00049 535 ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITG 614 (693)
T ss_pred cceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccc
Confidence 123455666653 2368999999999999999999975 9999999996
Q ss_pred ---CCceE----EcC----C--ccEEEEecCCCCccCCCceEEecccCCccccccccCC
Q psy11694 448 ---LNQRL----YGT----S--IPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGV 493 (655)
Q Consensus 448 ---~~Hav----yg~----g--~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~gv 493 (655)
++||| ||. | .+|||||||||++||++|||||+|+.| .|+++...+
T Consensus 615 ~e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N-~CGIEs~a~ 672 (693)
T PTZ00049 615 WEKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKN-FSGIESQSL 672 (693)
T ss_pred cccCceEEEEEEeccccCCCcccCEEEEECCCCCCcccCceEEEEcCCC-ccCCcccee
Confidence 37998 665 3 379999999999999999999999865 455554433
No 12
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=1.1e-42 Score=347.75 Aligned_cols=199 Identities=37% Similarity=0.764 Sum_probs=169.0
Q ss_pred CCCceecCCC-CCCCcCCCCCCCccHHHHHHHHHHHHHHHHhc-CCCcCCCHHHHHHccC-CCCCCCCCcHHHHHHHHHH
Q psy11694 296 LPEAFDWRAE-GVISKVKEQGKCACCWAFSAVGVVEAMHAIQG-NSLTELSVQQLVDCDM-SNGGCNGGRMDDALQYIID 372 (655)
Q Consensus 296 lP~~~Dwr~~-g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~-~~~~~lS~q~l~dC~~-~~~gC~GG~~~~a~~~~~~ 372 (655)
||++||||+. +.++||+|||.||+|||||+++++|++++++. ...+.||+|+|++|.. .+.+|+||++..|++++.+
T Consensus 1 lP~~~D~r~~~~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~ 80 (219)
T PF00112_consen 1 LPKSFDWRDKGGRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKN 80 (219)
T ss_dssp STSSEEGGGTTTCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHH
T ss_pred CCCCEecccCCCCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeecc
Confidence 7999999997 48999999999999999999999999999998 7889999999999997 5689999999999999999
Q ss_pred cCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCC--cccccccEEe---
Q psy11694 373 NGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVID--- 447 (655)
Q Consensus 373 ~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~--~~~Y~~Gi~~--- 447 (655)
..|+++|.+|||..... ..|..........++..|..+...++++||++|+++|||+++|.+.. |+.|++|||.
T Consensus 81 ~~Gi~~e~~~pY~~~~~-~~c~~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~ 159 (219)
T PF00112_consen 81 NNGIVTEEDYPYNGNEN-PTCKSKKSNSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPD 159 (219)
T ss_dssp HTSBEBTTTS--SSSSS-CSSCHSGGGEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTS
T ss_pred cCccccccccccccccc-ccccccccccccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeeccc
Confidence 44999999999998762 27876654422478899999988789999999999999999999987 8999999998
Q ss_pred -----CCceE----EcC--CccEEEEecCCCCccCCCceEEecccCCccccccccCCcc
Q psy11694 448 -----LNQRL----YGT--SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLP 495 (655)
Q Consensus 448 -----~~Hav----yg~--g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~gv~p 495 (655)
.+|+| ||+ +.+|||||||||++||++||+||+++.+..|+++...+.|
T Consensus 160 ~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~~~~~~~ 218 (219)
T PF00112_consen 160 CSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIESQAVYP 218 (219)
T ss_dssp SSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTTSSEEEE
T ss_pred cccccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccCceeeec
Confidence 58998 665 5889999999999999999999999888777777665443
No 13
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=1.4e-39 Score=312.79 Aligned_cols=150 Identities=47% Similarity=0.977 Sum_probs=133.6
Q ss_pred CCCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHHHHccCC-CCCCCCCcHHHHHHHHHHcC
Q psy11694 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNG 374 (655)
Q Consensus 296 lP~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~-~~gC~GG~~~~a~~~~~~~~ 374 (655)
||++||||++++++||+|||.||+|||||++++||++++++++..+.||+|+|++|... +.||+||++..|++|+.+++
T Consensus 1 lP~~~D~R~~~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~~ 80 (174)
T smart00645 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKNG 80 (174)
T ss_pred CCCcCcccccCCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHcC
Confidence 69999999999999999999999999999999999999999998999999999999875 67999999999999998876
Q ss_pred CCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCCcccccccEEe-------
Q psy11694 375 GVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVID------- 447 (655)
Q Consensus 375 Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~~~~Y~~Gi~~------- 447 (655)
|+++|++|||.. ++.+.+..|++|++|||+
T Consensus 81 Gi~~e~~~PY~~-------------------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~~~ 117 (174)
T smart00645 81 GLETESCYPYTG-------------------------------------------SVAIDASDFQFYKSGIYDHPGCGSG 117 (174)
T ss_pred CcccccccCccc-------------------------------------------EEEEEcccccCCcCeEECCCCCCCC
Confidence 999999999964 455566669999999997
Q ss_pred -CCceE----EcC---CccEEEEecCCCCccCCCceEEecccCCccccc
Q psy11694 448 -LNQRL----YGT---SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDL 488 (655)
Q Consensus 448 -~~Hav----yg~---g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~ 488 (655)
.+|+| ||. +.+|||||||||+.||++|||||.++....|++
T Consensus 118 ~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i 166 (174)
T smart00645 118 TLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGI 166 (174)
T ss_pred cccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCc
Confidence 48998 663 578999999999999999999999876233444
No 14
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00 E-value=1e-37 Score=312.57 Aligned_cols=180 Identities=31% Similarity=0.550 Sum_probs=157.4
Q ss_pred ceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcC--CCcCCCHHHHHHccCCC-----CCCCCCcHHHHHH-HH
Q psy11694 299 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN--SLTELSVQQLVDCDMSN-----GGCNGGRMDDALQ-YI 370 (655)
Q Consensus 299 ~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~dC~~~~-----~gC~GG~~~~a~~-~~ 370 (655)
.||||+.+ ++||+|||.||+|||||+++++|+++++++. ..+.||+|+|++|.... .||.||.+..++. ++
T Consensus 1 ~~d~r~~~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~ 79 (223)
T cd02619 1 SVDLRPLR-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLV 79 (223)
T ss_pred CCcchhcC-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHH
Confidence 48999988 9999999999999999999999999999987 88999999999998653 6999999999998 77
Q ss_pred HHcCCCCCCCCCCCCCCCCCCccccC---CCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCC-cccccccEE
Q psy11694 371 IDNGGVVSDQAYPYKASESERGCLVG---EEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG-LFYYSGGVI 446 (655)
Q Consensus 371 ~~~~Gi~~e~~yPY~~~~~~~~C~~~---~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~-~~~Y~~Gi~ 446 (655)
..+ |+++|.+|||..... .|... ......+++..|..+...++++||++|+++|||+++|.+.. |+.|++|++
T Consensus 80 ~~~-Gi~~e~~~Py~~~~~--~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~ 156 (223)
T cd02619 80 ALK-GIPPEEDYPYGAESD--GEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGII 156 (223)
T ss_pred HHc-CCCccccCCCCCCCC--CCCCCCccchhhcceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCccc
Confidence 666 999999999998876 44322 11225678899999887778999999999999999999987 889999986
Q ss_pred -----e--------CCceE----EcC----CccEEEEecCCCCccCCCceEEecccC
Q psy11694 447 -----D--------LNQRL----YGT----SIPYWIVKNSWGSDWGEKVEDKVGSSG 482 (655)
Q Consensus 447 -----~--------~~Hav----yg~----g~~yWivkNSWG~~WGe~Gy~~i~~~~ 482 (655)
. .+||| ||+ +.+|||||||||+.||++||+||.++.
T Consensus 157 ~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~ 213 (223)
T cd02619 157 YEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED 213 (223)
T ss_pred cccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence 2 78999 776 469999999999999999999998864
No 15
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=1e-36 Score=343.89 Aligned_cols=182 Identities=25% Similarity=0.462 Sum_probs=145.5
Q ss_pred CCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHHHHccCC--CCCCCCCc-HHHHHHHHHHcCCCCCCCCCCC
Q psy11694 308 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGR-MDDALQYIIDNGGVVSDQAYPY 384 (655)
Q Consensus 308 vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~--~~gC~GG~-~~~a~~~~~~~~Gi~~e~~yPY 384 (655)
..||||||.||||||||++++||++++|+++..+.||+|+|+||+.. +.||.||. +..++.|+.+++|+++|++|||
T Consensus 544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPY 623 (1004)
T PTZ00462 544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLY 623 (1004)
T ss_pred CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCC
Confidence 57999999999999999999999999999999999999999999853 58999997 5556699988877999999999
Q ss_pred CCCCCCCccccCCCC-----------------CceeeeeceEEcCCC----C----HHHHHHHHhhCCCeEEEEEcCCcc
Q psy11694 385 KASESERGCLVGEEE-----------------GFKVKVKEYSRIPYG----E----EEEMKKWVATRGPLSVGMNANGLF 439 (655)
Q Consensus 385 ~~~~~~~~C~~~~~~-----------------~~~~~i~~~~~v~~~----~----~~~ik~~l~~~gPV~v~i~~~~~~ 439 (655)
.+......|...... ...+.+.+|..+... + +++|+++|+..|||+|+|++..|+
T Consensus 624 t~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf~ 703 (1004)
T PTZ00462 624 NYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVL 703 (1004)
T ss_pred ccCCCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehHH
Confidence 863222267643210 012344566665432 1 468999999999999999997777
Q ss_pred cc-cccEEe--------CCceE----EcC-------CccEEEEecCCCCccCCCceEEecccCCcccccc
Q psy11694 440 YY-SGGVID--------LNQRL----YGT-------SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489 (655)
Q Consensus 440 ~Y-~~Gi~~--------~~Hav----yg~-------g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~ 489 (655)
.| .+|||. ++||| ||. +.+|||||||||+.|||+|||||.|.+...|++.
T Consensus 704 ~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin 773 (1004)
T PTZ00462 704 GYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDN 773 (1004)
T ss_pred hhhcCCccccCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccc
Confidence 77 589864 68999 774 3589999999999999999999999665566653
No 16
>KOG1542|consensus
Probab=100.00 E-value=3.3e-35 Score=293.56 Aligned_cols=154 Identities=43% Similarity=0.906 Sum_probs=141.4
Q ss_pred cccccccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCH
Q psy11694 495 PSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 574 (655)
Q Consensus 495 p~~~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~ 574 (655)
..+|++||||+|+||+.++.||+||.+..|++|+++.+|+..|.+|||++.... .|.... ......|.+|..++. ++
T Consensus 198 ~g~LvsLSEQeLvDCD~~d~gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~-~C~~~~-~~~~v~I~~f~~l~~-nE 274 (372)
T KOG1542|consen 198 TGKLVSLSEQELVDCDSCDNGCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGN-QCHFDK-SKIVVSIKDFSMLSN-NE 274 (372)
T ss_pred cCcccccchhhhhcccCcCCcCCCCChhHHHHHHHHhCCccccccCCccccCCC-ccccch-hhceEEEeccEecCC-CH
Confidence 358999999999999999999999999999999888889999999999998774 788776 667789999999986 99
Q ss_pred HHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcCCCCC
Q psy11694 575 EEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEKVM 654 (655)
Q Consensus 575 ~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WGe~Gy 654 (655)
++|.+.|.++|||+|+|++...+.|.+||..|....|.+..++|||+|||||...- ..+|||||||||+.|||+||
T Consensus 275 ~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~----~~PYWIVKNSWG~~WGE~GY 350 (372)
T KOG1542|consen 275 DQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGY----EKPYWIVKNSWGTSWGEKGY 350 (372)
T ss_pred HHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccCCccccCceEEEEeecCCCC----CCceEEEECCccccccccce
Confidence 99999999999999999998889999999997778899888999999999998752 56899999999999999999
Q ss_pred C
Q psy11694 655 L 655 (655)
Q Consensus 655 ~ 655 (655)
|
T Consensus 351 ~ 351 (372)
T KOG1542|consen 351 Y 351 (372)
T ss_pred E
Confidence 7
No 17
>KOG1544|consensus
Probab=100.00 E-value=1.1e-35 Score=291.45 Aligned_cols=246 Identities=29% Similarity=0.514 Sum_probs=186.0
Q ss_pred HHHHHHhcCCCCceeee-eCcCCCCChHHHHh-hcCCCCCCcccccCCCCCCCCCCCCcchhhhhhhcccCCCCCCCCCc
Q psy11694 222 EKAEDYQSEDSGTAVFG-VNKFFDLSESDLQQ-LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEA 299 (655)
Q Consensus 222 ~~I~~~N~~~~~~~~~g-~N~FsDlt~eEf~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~ 299 (655)
+.|++.|.. +-+|+++ ..+|-.||.++-.+ .||.-.+ +.... ++... ...+....+||+.
T Consensus 151 d~iE~in~G-~YgW~A~NYSaFWGmtL~DGiKyRLGTL~P--------s~sv~--------nMNEi-~~~l~p~~~LPE~ 212 (470)
T KOG1544|consen 151 DMIEAINQG-NYGWQAGNYSAFWGMTLDDGIKYRLGTLRP--------SSSVM--------NMNEI-YTVLNPGEVLPEA 212 (470)
T ss_pred HHHHHHhcC-CccccccchhhhhcccccccceeeecccCc--------hhhhh--------hHHhH-hhccCcccccchh
Confidence 467777763 3466654 45899999998777 5553222 11000 01000 0123335789999
Q ss_pred eecCCC--CCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcC-C-CcCCCHHHHHHccCC-CCCCCCCcHHHHHHHHHHcC
Q psy11694 300 FDWRAE--GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN-S-LTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNG 374 (655)
Q Consensus 300 ~Dwr~~--g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~-~-~~~lS~q~l~dC~~~-~~gC~GG~~~~a~~~~~~~~ 374 (655)
||-|++ +.+.|+.|||+|++.|||+++++...+++|+.. + ...||+|+|++|... ..||+||+++.|+=|+.+.
T Consensus 213 F~As~KWp~liH~plDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKr- 291 (470)
T KOG1544|consen 213 FEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKR- 291 (470)
T ss_pred hhhhhcCCccccCccccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecc-
Confidence 999986 899999999999999999999999999999864 3 467999999999854 5899999999999999887
Q ss_pred CCCCCCCCCCCCCCCC--Ccc------------------ccCCCCC-ceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEE
Q psy11694 375 GVVSDQAYPYKASESE--RGC------------------LVGEEEG-FKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433 (655)
Q Consensus 375 Gi~~e~~yPY~~~~~~--~~C------------------~~~~~~~-~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i 433 (655)
|++...||||...+.. ..| .....+. ..++++--..|+. +|++|+++|+++|||.+.|
T Consensus 292 GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~iyq~tPPYrVSS-nE~eImkElM~NGPVQA~m 370 (470)
T KOG1544|consen 292 GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDIYQVTPPYRVSS-NEKEIMKELMENGPVQALM 370 (470)
T ss_pred cccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCceeeecCCeeccC-CHHHHHHHHHhCCChhhhh
Confidence 9999999999765431 233 3222221 2344444445554 7999999999999999999
Q ss_pred EcCC-cccccccEEe---------------CCceE----EcC-------CccEEEEecCCCCccCCCceEEecccCCccc
Q psy11694 434 NANG-LFYYSGGVID---------------LNQRL----YGT-------SIPYWIVKNSWGSDWGEKVEDKVGSSGNRTR 486 (655)
Q Consensus 434 ~~~~-~~~Y~~Gi~~---------------~~Hav----yg~-------g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~ 486 (655)
.|-+ |+.|++|||. ..|+| ||+ ..+|||..||||+.|||+|||||.|+.|. |
T Consensus 371 ~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNe-c 449 (470)
T KOG1544|consen 371 EVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNE-C 449 (470)
T ss_pred hhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccc-h
Confidence 9987 9999999997 67999 665 35799999999999999999999998775 4
Q ss_pred cc
Q psy11694 487 DL 488 (655)
Q Consensus 487 ~~ 488 (655)
++
T Consensus 450 dI 451 (470)
T KOG1544|consen 450 DI 451 (470)
T ss_pred hh
Confidence 44
No 18
>PTZ00203 cathepsin L protease; Provisional
Probab=99.98 E-value=2.2e-32 Score=287.89 Aligned_cols=148 Identities=30% Similarity=0.517 Sum_probs=115.9
Q ss_pred cCchHHHHHHHHHHcCcccCChHHHHHHHHHHHHHHHHHHHHhccCCCCeeeeccCCCCCCHHHHH-HHhCCCCCCCccc
Q psy11694 38 LNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ-QLTGLNLDSTLED 116 (655)
Q Consensus 38 ~~~~~~~f~~~~~~~~k~Y~~~~e~~~r~~~f~~n~~~i~~~n~~~~~~~~~g~N~fsDlt~~E~~-~~~g~~~~~~~~~ 116 (655)
...+..+|++||++|+|+|.+.+|+.+|+.||++|+++|++||++ +.+|+||+|+|||||+|||+ .+++...... ..
T Consensus 31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~-~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~-~~ 108 (348)
T PTZ00203 31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQAR-NPHARFGITKFFDLSEAEFAARYLNGAAYFA-AA 108 (348)
T ss_pred ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhcc-CCCeEEeccccccCCHHHHHHHhcCCCcccc-cc
Confidence 345667899999999999999889999999999999999999976 45999999999999999994 4553211000 00
Q ss_pred cCCCcCCCCCCCccchhhhhhhcccCCCCCCCCccccccccCccccccCCCcceeeecccchhhHHhhhhhcccchhhhh
Q psy11694 117 IQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELS 196 (655)
Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWr~~g~Vt~VkdQg~c~~~~a~~~~~~~e~~~~i~~~~l~~~~ 196 (655)
........... .....++|++||||++|+||||||||.|||||||++++++|++..+..+.++.+|
T Consensus 109 -~~~~~~~~~~~-------------~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LS 174 (348)
T PTZ00203 109 -KQHAGQHYRKA-------------RADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLS 174 (348)
T ss_pred -ccccccccccc-------------ccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCC
Confidence 00000000000 0011268999999999999999999999999999999999999999999999999
Q ss_pred hhccc
Q psy11694 197 VQHHD 201 (655)
Q Consensus 197 ~q~~~ 201 (655)
+|+.-
T Consensus 175 eQqLv 179 (348)
T PTZ00203 175 EQQLV 179 (348)
T ss_pred HHHHH
Confidence 88763
No 19
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.97 E-value=5.7e-31 Score=265.66 Aligned_cols=149 Identities=34% Similarity=0.683 Sum_probs=120.5
Q ss_pred cccccccccccCCCCC-CCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCC----------------CCCCCCCc---
Q psy11694 497 KLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE----------------RGCLVGEE--- 556 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~-~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~----------------~~c~~~~~--- 556 (655)
+.+.||+|+|++|+.. +.||.||++..|++|++++ |+++|++|||...... ..|.....
T Consensus 49 ~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~i~~~-G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~ 127 (236)
T cd02620 49 ENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKYLTTT-GVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTY 127 (236)
T ss_pred CccccCHHHHHhhcCCCCCCCCCCCHHHHHHHHHhc-CCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCcccc
Confidence 4578999999999876 7899999999999999986 9999999999876531 12432211
Q ss_pred CCceEEEeeeEEcCCCCHHHHHHHHHhCCCEEEEEEc-cccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCC
Q psy11694 557 EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTS 635 (655)
Q Consensus 557 ~~~~~~~~~~~~~~~~~~~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~ 635 (655)
....+++..+..+. .++++||.+|+++|||+++|++ ++|+.|++|||. ..++...++|||+|||||+++ +
T Consensus 128 ~~~~~~~~~~~~~~-~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~---~~~~~~~~~HaV~iVGyg~~~-----g 198 (236)
T cd02620 128 EEDKHKGKSAYSVP-SDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQ---HTSGKQLGGHAVKIIGWGVEN-----G 198 (236)
T ss_pred ceeeeeecceeeeC-CHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEe---ecCCCCcCCeEEEEEEEeccC-----C
Confidence 11234455555554 3789999999999999999999 578899999997 335556689999999999875 6
Q ss_pred CCEEEEEcCCCCCcCCCCCC
Q psy11694 636 IPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 636 ~~ywivrNSWG~~WGe~Gy~ 655 (655)
.+|||||||||++|||+|||
T Consensus 199 ~~YWivrNSWG~~WGe~Gy~ 218 (236)
T cd02620 199 VPYWLAANSWGTDWGENGYF 218 (236)
T ss_pred eeEEEEEeCCCCCCCCCcEE
Confidence 78999999999999999997
No 20
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.97 E-value=1.2e-30 Score=263.91 Aligned_cols=145 Identities=30% Similarity=0.624 Sum_probs=118.6
Q ss_pred ccccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCC-----------C---cCCceEEE
Q psy11694 498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVG-----------E---EEGFKVKV 563 (655)
Q Consensus 498 ~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~-----------~---~~~~~~~~ 563 (655)
..+||+|+|++|+. +.||.||++..|++|++++ |+++|++|||..... .|... + .....+++
T Consensus 54 ~~~lS~Q~lldC~~-~~gC~GG~~~~a~~~~~~~-Gl~~e~~yPY~~~~~--~C~~~~~~~~c~~~~~c~~~~~~~~~~i 129 (239)
T cd02698 54 SVYLSVQVVIDCAG-GGSCHGGDPGGVYEYAHKH-GIPDETCNPYQAKDG--ECNPFNRCGTCNPFGECFAIKNYTLYFV 129 (239)
T ss_pred CcccCHHHHHhCCC-CCCccCcCHHHHHHHHHHc-CcCCCCeeCCcCCCC--CCcCCCCCCCcccCcccccccccceEEe
Confidence 57899999999987 7899999999999999985 999999999987644 33210 0 01224567
Q ss_pred eeeEEcCCCCHHHHHHHHHhCCCEEEEEEcc-ccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEE
Q psy11694 564 KEYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVK 642 (655)
Q Consensus 564 ~~~~~~~~~~~~~ik~~i~~~GPV~~~~~~~-~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivr 642 (655)
..|..++ +.++||++|+++|||+++|++. +|+.|++|||+. .+....++|||+|||||++.+ +++|||||
T Consensus 130 ~~~~~~~--~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~---~~~~~~~~HaV~IVGyG~~~~----g~~YWiik 200 (239)
T cd02698 130 SDYGSVS--GRDKMMAEIYARGPISCGIMATEALENYTGGVYKE---YVQDPLINHIISVAGWGVDEN----GVEYWIVR 200 (239)
T ss_pred eeceecC--CHHHHHHHHHHcCCEEEEEEecccccccCCeEEcc---CCCCCcCCeEEEEEEEEecCC----CCEEEEEE
Confidence 7777774 5789999999999999999995 688999999973 233456799999999998642 67899999
Q ss_pred cCCCCCcCCCCCC
Q psy11694 643 NSWGSDWGEKVML 655 (655)
Q Consensus 643 NSWG~~WGe~Gy~ 655 (655)
||||++|||+|||
T Consensus 201 NSWG~~WGe~Gy~ 213 (239)
T cd02698 201 NSWGEPWGERGWF 213 (239)
T ss_pred cCCCcccCcCceE
Confidence 9999999999997
No 21
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.97 E-value=1.4e-30 Score=264.32 Aligned_cols=152 Identities=31% Similarity=0.625 Sum_probs=120.9
Q ss_pred ccccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCcccCC-CCCCCCCCCCCcCCceEEEeeeEEcC----CC
Q psy11694 498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKA-SESERGCLVGEEEGFKVKVKEYSRIP----YG 572 (655)
Q Consensus 498 ~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~-~~~~~~c~~~~~~~~~~~~~~~~~~~----~~ 572 (655)
...||+|+|++|...+.||+||++..|++|+.+. |+++|++|||.. ... .|.........+++..|..+. ..
T Consensus 55 ~~~lS~q~l~dC~~~~~GC~GG~~~~a~~~~~~~-Gi~~e~~yPY~~~~~~--~C~~~~~~~~~~~~~~~~~i~~~~~~~ 131 (243)
T cd02621 55 QPILSPQHVLSCSQYSQGCDGGFPFLVGKFAEDF-GIVTEDYFPYTADDDR--PCKASPSECRRYYFSDYNYVGGCYGCT 131 (243)
T ss_pred CcccCHHHhhhhcCCCCCCCCCCHHHHHHHHHhc-CcCCCceeCCCCCCCC--CCCCCccccccccccceeEcccccccC
Confidence 5789999999998777899999999999999886 999999999987 433 676543122334555555542 35
Q ss_pred CHHHHHHHHHhCCCEEEEEEc-cccccccCCeeecCC--CCCCC--------cccCeEEEEEEEecccCCCCCCCCEEEE
Q psy11694 573 EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQ--RLCNP--------KAQNHALIIVGYGEEEKKDGTSIPYWIV 641 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~~--~~~~~--------~~~~HaV~iVGyg~~~~~~~~~~~ywiv 641 (655)
++++||++|+++|||+++|++ ++|..|++|||..+. ..|+. ..++|||+|||||++.. .+.+||||
T Consensus 132 ~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~---~g~~YWii 208 (243)
T cd02621 132 NEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI---KGEKYWIV 208 (243)
T ss_pred CHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC---CCCcEEEE
Confidence 789999999999999999999 478899999997421 01322 25799999999998641 25689999
Q ss_pred EcCCCCCcCCCCCC
Q psy11694 642 KNSWGSDWGEKVML 655 (655)
Q Consensus 642 rNSWG~~WGe~Gy~ 655 (655)
|||||++|||+|||
T Consensus 209 rNSWG~~WGe~Gy~ 222 (243)
T cd02621 209 KNSWGSSWGEKGYF 222 (243)
T ss_pred EcCCCCCCCcCCeE
Confidence 99999999999997
No 22
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.97 E-value=2.2e-30 Score=257.41 Aligned_cols=148 Identities=42% Similarity=0.887 Sum_probs=128.1
Q ss_pred cccccccccccCCCCC-CCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHH
Q psy11694 497 KLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEE 575 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~-~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~ 575 (655)
...+||+|+|++|... +.+|.||++..|++++.+. |+++|++|||..... .|.... ....+++.+|..+...+++
T Consensus 43 ~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~-Gi~~e~~yPY~~~~~--~C~~~~-~~~~~~i~~~~~i~~~~~~ 118 (210)
T cd02248 43 KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG-GLASESDYPYTGKDG--TCKYNS-SKVGAKITGYSNVPPGDEE 118 (210)
T ss_pred CCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC-CcCccccCCccCCCC--CccCCC-CcccEEEeeEEEcCCCcHH
Confidence 3567999999999875 7899999999999998875 999999999998654 676654 3567899999999877799
Q ss_pred HHHHHHHhCCCEEEEEEcc-ccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcCCCCC
Q psy11694 576 EMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEKVM 654 (655)
Q Consensus 576 ~ik~~i~~~GPV~~~~~~~-~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WGe~Gy 654 (655)
+||++|+++|||+++|.+. +|..|++|||.. ..+....++|||+|||||++. +.+|||||||||++||++||
T Consensus 119 ~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~--~~~~~~~~~Hav~iVGy~~~~-----~~~ywiv~NSWG~~WG~~Gy 191 (210)
T cd02248 119 ALKAALANYGPVSVAIDASSSFQFYKGGIYSG--PCCSNTNLNHAVLLVGYGTEN-----GVDYWIVKNSWGTSWGEKGY 191 (210)
T ss_pred HHHHHHhhcCCEEEEEecCcccccCCCCceeC--CCCCCCcCCEEEEEEEEeecC-----CceEEEEEcCCCCccccCcE
Confidence 9999999999999999995 788999999973 223356689999999999986 66899999999999999999
Q ss_pred C
Q psy11694 655 L 655 (655)
Q Consensus 655 ~ 655 (655)
|
T Consensus 192 ~ 192 (210)
T cd02248 192 I 192 (210)
T ss_pred E
Confidence 6
No 23
>PTZ00021 falcipain-2; Provisional
Probab=99.97 E-value=1.4e-30 Score=281.65 Aligned_cols=150 Identities=39% Similarity=0.780 Sum_probs=124.6
Q ss_pred cccccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHHH
Q psy11694 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEE 576 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~ 576 (655)
.+++||+|+|+||+..+.||.||++..|++|+.++.|+++|++|||.+... ..|.... ....+++.+|..++ .+.
T Consensus 309 ~~v~LSeQqLVDCs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~-~~C~~~~-~~~~~~i~~y~~i~---~~~ 383 (489)
T PTZ00021 309 ELVSLSEQELVDCSFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTP-ELCNIDR-CKEKYKIKSYVSIP---EDK 383 (489)
T ss_pred CCcccCHHHHhhhccCCCCCCCcchHhhhhhhhhccccCcccccCccCCCC-Ccccccc-ccccceeeeEEEec---HHH
Confidence 467899999999987788999999999999998877999999999998632 2686533 23457888998886 478
Q ss_pred HHHHHHhCCCEEEEEEc-cccccccCCeeecCCCCCCCcccCeEEEEEEEecccCC-----CCCCCCEEEEEcCCCCCcC
Q psy11694 577 MKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKK-----DGTSIPYWIVKNSWGSDWG 650 (655)
Q Consensus 577 ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~-----~~~~~~ywivrNSWG~~WG 650 (655)
|+++|+..|||++++++ .+|+.|++|||. ..|+. ..+|||+|||||+++.. .+++.+|||||||||++||
T Consensus 384 lk~al~~~GPVsv~i~a~~~f~~YkgGIy~---~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WG 459 (489)
T PTZ00021 384 FKEAIRFLGPISVSIAVSDDFAFYKGGIFD---GECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWG 459 (489)
T ss_pred HHHHHHhcCCeEEEEEeecccccCCCCcCC---CCCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcc
Confidence 99999999999999999 478899999996 45764 37999999999976421 1234689999999999999
Q ss_pred CCCCC
Q psy11694 651 EKVML 655 (655)
Q Consensus 651 e~Gy~ 655 (655)
|+|||
T Consensus 460 E~GY~ 464 (489)
T PTZ00021 460 EKGFI 464 (489)
T ss_pred cCeEE
Confidence 99997
No 24
>KOG1543|consensus
Probab=99.96 E-value=8.3e-30 Score=266.61 Aligned_cols=146 Identities=40% Similarity=0.781 Sum_probs=125.9
Q ss_pred cccccccccccCCCC-CCCCCCCCCccchHHHHHHcCCCCC-CCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCH
Q psy11694 497 KLSRLATEKLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVS-DQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 574 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~-~~~gC~GG~~~~a~~~~~~~~Gv~~-e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~ 574 (655)
.+++||+|+|+||.. ++.||.||.+..|++|++++ |+++ +.+|||..... .|..... .....+.++..++.. +
T Consensus 154 ~l~sLSeq~lvdC~~~~~~GC~GG~~~~A~~yi~~~-G~~t~~~~Ypy~~~~~--~C~~~~~-~~~~~~~~~~~~~~~-e 228 (325)
T KOG1543|consen 154 KLLSLSEQDLVDCCGECGDGCNGGEPKNAFKYIKKN-GGVTECENYPYIGKDG--TCKSNKK-DKTVTIKGFYNVPAN-E 228 (325)
T ss_pred ccCccChhhhhhccCCCCCCcCCCCHHHHHHHHHHh-CCCCCCcCCCCcCCCC--CccCCCc-cceeEeeeeeecCcC-H
Confidence 578999999999987 48899999999999999998 6666 99999999887 7887663 556677778877765 9
Q ss_pred HHHHHHHHhCCCEEEEEEc-cccccccCCeeecCCCCCCCc-ccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcCCC
Q psy11694 575 EEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPK-AQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652 (655)
Q Consensus 575 ~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~~~~~~~~-~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WGe~ 652 (655)
++|+.+|+.+|||+++|++ .+|+.|++|||. ..++.. .++|||+|||||. . .+.+|||||||||+.|||+
T Consensus 229 ~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~---~~~~~~~~~~Hav~iVGyG~-~----~~~~YWivkNSWG~~WGe~ 300 (325)
T KOG1543|consen 229 EAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYA---EEKGDDKEGDHAVLIVGYGT-G----DGVDYWIVKNSWGTDWGEK 300 (325)
T ss_pred HHHHHHHHhcCCeEEEEeehhhhhhccCceEe---CCCCCCCCCCceEEEEEEcC-C----CCceeEEEEcCCCCCcccC
Confidence 9999999999999999999 568899999998 333333 5899999999999 3 1678999999999999999
Q ss_pred CCC
Q psy11694 653 VML 655 (655)
Q Consensus 653 Gy~ 655 (655)
|||
T Consensus 301 Gy~ 303 (325)
T KOG1543|consen 301 GYF 303 (325)
T ss_pred ceE
Confidence 997
No 25
>PTZ00200 cysteine proteinase; Provisional
Probab=99.96 E-value=2.6e-29 Score=271.30 Aligned_cols=185 Identities=21% Similarity=0.309 Sum_probs=126.2
Q ss_pred eccchhHHHHHHHHHHHhhhcccccccCccCchHHHHHHHHHHcCcccCChHHHHHHHHHHHHHHHHHHHHhccCCCCee
Q psy11694 9 LGEKGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAV 88 (655)
Q Consensus 9 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~k~Y~~~~e~~~r~~~f~~n~~~i~~~n~~~~~~~~ 88 (655)
+.++++.-|..|+-.-.++... ....+...+|++||++|+|+|.+.+|+.+|+.||++|+++|++||. ..+|+
T Consensus 95 ~~~~~~~~~~~~~~~~~i~~~~-----~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~--~~~y~ 167 (448)
T PTZ00200 95 KRDSYVDELTRLFKDGYISDDP-----KLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKG--DEPYS 167 (448)
T ss_pred HHHHHHHHHHHHhhCCCcCCCc-----cchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcC--cCCeE
Confidence 3455666666555444444433 3333556789999999999999999999999999999999999995 45899
Q ss_pred eeccCCCCCCHHHHH-HHhCCCCCCCccccCCCc--CC-CCCCCccchhhhh-h-hcccC-CCCCCCCccccccccCccc
Q psy11694 89 FEVNKFFDLSDSDLQ-QLTGLNLDSTLEDIQPSL--QA-PFSSNQTDTEMRA-F-QFNSL-RHGDDLPEAFDWRAEGVIS 161 (655)
Q Consensus 89 ~g~N~fsDlt~~E~~-~~~g~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~-~-~~~~~-~~~~~~P~~~DWr~~g~Vt 161 (655)
+|+|+|||||+|||+ .+++.+.+.......... .. ............. . ....+ +....+|++||||++|+||
T Consensus 168 lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vt 247 (448)
T PTZ00200 168 KEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVT 247 (448)
T ss_pred EeccccccCCHHHHHHHhccCCCcccccccccccccccccccccccccccccccccccccccccccCCCCccCCCCCCCC
Confidence 999999999999995 455543211100000000 00 0000000000000 0 00000 1112369999999999999
Q ss_pred cccCCC-cceeeecccchhhHHhhhhhcccchhhhhhhcc
Q psy11694 162 KVKEQG-KCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH 200 (655)
Q Consensus 162 ~VkdQg-~c~~~~a~~~~~~~e~~~~i~~~~l~~~~~q~~ 200 (655)
|||||| .|||||||++++++|++..+..+.++.+|+|+.
T Consensus 248 pVkdQG~~CGSCWAFat~~aiEs~~~i~~~~~~~LSeQqL 287 (448)
T PTZ00200 248 KVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQEL 287 (448)
T ss_pred CcccCCCccchHHHHhHHHHHHHHHHHhcCCCeecCHHHH
Confidence 999999 999999999999999999999888888888875
No 26
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.95 E-value=3.7e-28 Score=267.98 Aligned_cols=152 Identities=30% Similarity=0.624 Sum_probs=117.8
Q ss_pred ccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCc-----------------------
Q psy11694 500 RLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEE----------------------- 556 (655)
Q Consensus 500 ~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~----------------------- 556 (655)
.||+|+|++|+..+.||.||++..|++|+.+. ||++|++|||.+... .|+....
T Consensus 441 ~LS~QqLLDCs~~nqGC~GG~~~~A~kya~~~-GI~tEscYPY~a~~g--~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 517 (693)
T PTZ00049 441 LLSIQTVLSCSFYDQGCNGGFPYLVSKMAKLQ-GIPLDKVFPYTATEQ--TCPYQVDQSANSMNGSANLRQINAVFFSSE 517 (693)
T ss_pred CcCHHHhcccCCCCCCcCCCcHHHHHHHHHHC-CCCcCCccCCcCCCC--CCCCCCCCcccccccccccccccccccccc
Confidence 79999999998778899999999999999885 999999999987644 4532110
Q ss_pred ---------------CCceEEEeeeEEcC-------CCCHHHHHHHHHhCCCEEEEEEcc-ccccccCCeeecCC--C--
Q psy11694 557 ---------------EGFKVKVKEYSRIP-------YGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQ--R-- 609 (655)
Q Consensus 557 ---------------~~~~~~~~~~~~~~-------~~~~~~ik~~i~~~GPV~~~~~~~-~~~~y~~Gi~~~~~--~-- 609 (655)
....+.++.|..+. ..++++|+++|+.+|||+++|++. +|+.|++|||..+. +
T Consensus 518 ~~~~~~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~ 597 (693)
T PTZ00049 518 TQSDMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHAR 597 (693)
T ss_pred ccccccccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCccccccc
Confidence 01123456666653 247899999999999999999994 78899999997321 1
Q ss_pred CCCC--------------cccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 610 LCNP--------------KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 610 ~~~~--------------~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
.|.. ...+|||+|||||.+. .+|.+.+|||||||||+.||++|||
T Consensus 598 ~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~-enG~~~~YWIVRNSWGt~WGenGYf 656 (693)
T PTZ00049 598 RCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEE-INGKLYKYWIGRNSWGKNWGKEGYF 656 (693)
T ss_pred ccCCccccccccccccccccCceEEEEEEecccc-CCCcccCEEEEECCCCCCcccCceE
Confidence 2422 1369999999999753 1232357999999999999999997
No 27
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.95 E-value=1.6e-28 Score=245.34 Aligned_cols=152 Identities=36% Similarity=0.767 Sum_probs=126.0
Q ss_pred ccccccccccccCCCC-CCCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCH
Q psy11694 496 SKLSRLATEKLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 574 (655)
Q Consensus 496 ~~~~~ls~q~l~~C~~-~~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~ 574 (655)
....+||+|+|++|.. .+.+|.||++..|+++++++.|+++|++|||..... ..|..........++..|..+...++
T Consensus 45 ~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~~~Gi~~e~~~pY~~~~~-~~c~~~~~~~~~~~i~~~~~~~~~~~ 123 (219)
T PF00112_consen 45 GKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKNNNGIVTEEDYPYNGNEN-PTCKSKKSNSYYVKIKGYGKVKDNDI 123 (219)
T ss_dssp SSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHHHTSBEBTTTS--SSSSS-CSSCHSGGGEEEBEESEEEEEESTCH
T ss_pred ccccccccccccccccccccccccCcccccceeecccCccccccccccccccc-ccccccccccccccccccccccccch
Confidence 3568999999999987 567999999999999999955999999999997762 26776542222478999999988789
Q ss_pred HHHHHHHHhCCCEEEEEEccc--cccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcCCC
Q psy11694 575 EEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652 (655)
Q Consensus 575 ~~ik~~i~~~GPV~~~~~~~~--~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WGe~ 652 (655)
++||++|+++|||++++.+.+ |..|++|||..+ .+....++|||+|||||++. +.+|||||||||++||++
T Consensus 124 ~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~--~~~~~~~~Hav~iVGy~~~~-----~~~~wiv~NSWG~~WG~~ 196 (219)
T PF00112_consen 124 EDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPP--DCSNESGGHAVLIVGYDDEN-----GKGYWIVKNSWGTDWGDN 196 (219)
T ss_dssp HHHHHHHHHHSSEEEEEEEESHHHHTEESSEECST--SSSSSSEEEEEEEEEEEEET-----TEEEEEEE-SBTTTSTBT
T ss_pred hHHHHHHhhCceeeeeeeccccccccccceeeecc--cccccccccccccccccccc-----ceeeEeeehhhCCccCCC
Confidence 999999999999999999943 778999999843 45567899999999999987 678999999999999999
Q ss_pred CCC
Q psy11694 653 VML 655 (655)
Q Consensus 653 Gy~ 655 (655)
|||
T Consensus 197 Gy~ 199 (219)
T PF00112_consen 197 GYF 199 (219)
T ss_dssp TEE
T ss_pred eEE
Confidence 996
No 28
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.95 E-value=1.5e-27 Score=261.12 Aligned_cols=153 Identities=20% Similarity=0.483 Sum_probs=118.0
Q ss_pred cccccccccccCCCCCCCCCCCCCccchHHHHHHcCCCCCCCCc--ccCCCCCC-CCCCCCCcCCceEEEeeeEEcC---
Q psy11694 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAY--PYKASESE-RGCLVGEEEGFKVKVKEYSRIP--- 570 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~Gv~~e~~y--PY~~~~~~-~~c~~~~~~~~~~~~~~~~~~~--- 570 (655)
+...||+|+|+||+..+.||+||++..|++|+.++ |+++|++| ||...+.. ..|.... ....++++.|..+.
T Consensus 260 ~~~~LS~QqLVDCs~~n~GCdGG~p~~A~~yi~~~-GI~tE~dY~~PY~~~dg~~~~Ck~~~-~~~~y~~~~~~~I~gyy 337 (548)
T PTZ00364 260 QQTFLSARHVLDCSQYGQGCAGGFPEEVGKFAETF-GILTTDSYYIPYDSGDGVERACKTRR-PSRRYYFTNYGPLGGYY 337 (548)
T ss_pred cccCcCHHHHhcccCCCCCCCCCcHHHHHHHHHhC-CcccccccCCCCCCCCCCCCCCCCCc-ccceeeeeeeEEeccee
Confidence 36789999999998778899999999999999885 99999999 99876441 2476433 23334444443331
Q ss_pred --CCCHHHHHHHHHhCCCEEEEEEc-cccccccCCeeecC-----C-CCC----------CCcccCeEEEEEEEecccCC
Q psy11694 571 --YGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLN-----Q-RLC----------NPKAQNHALIIVGYGEEEKK 631 (655)
Q Consensus 571 --~~~~~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~-----~-~~~----------~~~~~~HaV~iVGyg~~~~~ 631 (655)
..++++|+.+|+.+|||+++|++ .+|+.|++|||... . ..| .....+|||+|||||.+.+
T Consensus 338 ~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~- 416 (548)
T PTZ00364 338 GAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDEN- 416 (548)
T ss_pred ecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCC-
Confidence 14788999999999999999999 57889999987511 1 011 1235799999999997531
Q ss_pred CCCCCCEEEEEcCCCC--CcCCCCCC
Q psy11694 632 DGTSIPYWIVKNSWGS--DWGEKVML 655 (655)
Q Consensus 632 ~~~~~~ywivrNSWG~--~WGe~Gy~ 655 (655)
+.+|||||||||+ +|||+|||
T Consensus 417 ---G~~YWIVKNSWGt~~~WGE~GYf 439 (548)
T PTZ00364 417 ---GGDYWLVLDPWGSRRSWCDGGTR 439 (548)
T ss_pred ---CceEEEEECCCCCCCCcccCCeE
Confidence 6789999999999 99999997
No 29
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.94 E-value=1.5e-26 Score=231.68 Aligned_cols=153 Identities=30% Similarity=0.530 Sum_probs=124.6
Q ss_pred cccccccccccCCCCCC-----CCCCCCCccchHH-HHHHcCCCCCCCCcccCCCCCCCCCCC---CCcCCceEEEeeeE
Q psy11694 497 KLSRLATEKLVDCDMSN-----GGCNGGRMDDALQ-YIIDNGGVVSDQAYPYKASESERGCLV---GEEEGFKVKVKEYS 567 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~~-----~gC~GG~~~~a~~-~~~~~~Gv~~e~~yPY~~~~~~~~c~~---~~~~~~~~~~~~~~ 567 (655)
..++||+|+|++|.... .+|.||.+..++. ++.++ |+++|.+|||..... .|.. ........++..|.
T Consensus 42 ~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~~~~-Gi~~e~~~Py~~~~~--~~~~~~~~~~~~~~~~~~~y~ 118 (223)
T cd02619 42 EYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLVALK-GIPPEEDYPYGAESD--GEEPKSEAALNAAKVKLKDYR 118 (223)
T ss_pred ccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHHHHc-CCCccccCCCCCCCC--CCCCCCccchhhcceeeccee
Confidence 45789999999997542 6999999999998 77764 999999999998766 3332 12244567899999
Q ss_pred EcCCCCHHHHHHHHHhCCCEEEEEEc-cccccccCCeee---cCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEc
Q psy11694 568 RIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVID---LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKN 643 (655)
Q Consensus 568 ~~~~~~~~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~---~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrN 643 (655)
.+...++++||++|+++|||+++|.+ ..|+.|++|++. .....+....++|||+|||||++.. .+.+||||||
T Consensus 119 ~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~---~~~~~~i~~N 195 (223)
T cd02619 119 RVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYV---EGKGAFIVKN 195 (223)
T ss_pred EeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCC---CCCCEEEEEe
Confidence 99877899999999999999999999 467789999862 1233455667899999999998752 2568999999
Q ss_pred CCCCCcCCCCCC
Q psy11694 644 SWGSDWGEKVML 655 (655)
Q Consensus 644 SWG~~WGe~Gy~ 655 (655)
|||+.||++||+
T Consensus 196 SwG~~wg~~Gy~ 207 (223)
T cd02619 196 SWGTDWGDNGYG 207 (223)
T ss_pred CCCCccccCCEE
Confidence 999999999996
No 30
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.94 E-value=1.2e-26 Score=262.82 Aligned_cols=157 Identities=26% Similarity=0.490 Sum_probs=118.5
Q ss_pred cccccccccccCCCCC--CCCCCCCC-ccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcC----------------
Q psy11694 497 KLSRLATEKLVDCDMS--NGGCNGGR-MDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEE---------------- 557 (655)
Q Consensus 497 ~~~~ls~q~l~~C~~~--~~gC~GG~-~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~---------------- 557 (655)
.+..||+|+|++|+.. +.||.||. +..++.|+.+++|+++|++|||........|+.....
T Consensus 575 ~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPYt~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~ 654 (1004)
T PTZ00462 575 EPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKK 654 (1004)
T ss_pred CCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCCccCCCCCCCCCCccccccccccccccccccc
Confidence 4678999999999753 57999997 4566699988767999999999864222257642210
Q ss_pred -CceEEEeeeEEcCCC----C----HHHHHHHHHhCCCEEEEEEccccccc-cCCeeecCCCCCCCcccCeEEEEEEEec
Q psy11694 558 -GFKVKVKEYSRIPYG----E----EEEMKKWVATRGPLSVGMNANGLFYY-SGGVIDLNQRLCNPKAQNHALIIVGYGE 627 (655)
Q Consensus 558 -~~~~~~~~~~~~~~~----~----~~~ik~~i~~~GPV~~~~~~~~~~~y-~~Gi~~~~~~~~~~~~~~HaV~iVGyg~ 627 (655)
...+.+.+|..+... + ++.||++|+..|||+++|++.+|+.| .+|||. ...|+....+|||+|||||.
T Consensus 655 ~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv--~~~Cgs~~~nHAVlIVGYGt 732 (1004)
T PTZ00462 655 EPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVLGYEFNGKKV--QNLCGDDTADHAVNIVGYGN 732 (1004)
T ss_pred ccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehHHhhhcCCccc--cCCCCCCcCCceEEEEEecc
Confidence 112334566666432 1 47899999999999999999888888 589875 34577666799999999997
Q ss_pred ccCCCCCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 628 EEKKDGTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 628 ~~~~~~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
+.+.++.+.+|||||||||++|||+|||
T Consensus 733 ~in~eg~gk~YWIVRNSWGt~WGEnGYF 760 (1004)
T PTZ00462 733 YINDEDEKKSYWIVRNSWGKYWGDEGYF 760 (1004)
T ss_pred cccccCCCCceEEEEcCCCCCcCCCeEE
Confidence 6433344678999999999999999997
No 31
>KOG1544|consensus
Probab=99.93 E-value=5.3e-26 Score=223.62 Aligned_cols=155 Identities=32% Similarity=0.649 Sum_probs=122.6
Q ss_pred cccccccccCCCCC-CCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCC--CCC------------------CCCCCcC
Q psy11694 499 SRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES--ERG------------------CLVGEEE 557 (655)
Q Consensus 499 ~~ls~q~l~~C~~~-~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~--~~~------------------c~~~~~~ 557 (655)
..||.|+|++|..- ..||.||.+.+|+=||.+. |++...||||..... +.+ |+....+
T Consensus 258 ~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKr-GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~ 336 (470)
T KOG1544|consen 258 PVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKR-GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 336 (470)
T ss_pred cccChHHhcchhhhhhccCccCcccchheeeecc-cccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccc
Confidence 47899999999765 5699999999999999986 999999999986532 122 4432211
Q ss_pred C-ceEEEeeeEEcCCCCHHHHHHHHHhCCCEEEEEEc-cccccccCCeeecCCCCCC-----CcccCeEEEEEEEecccC
Q psy11694 558 G-FKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCN-----PKAQNHALIIVGYGEEEK 630 (655)
Q Consensus 558 ~-~~~~~~~~~~~~~~~~~~ik~~i~~~GPV~~~~~~-~~~~~y~~Gi~~~~~~~~~-----~~~~~HaV~iVGyg~~~~ 630 (655)
. ..++.+.-.++ ++++++|+++||++|||.+.|.| ++||.|++|||.+...... ...+.|+|.|.|||.+..
T Consensus 337 Sn~iyq~tPPYrV-SSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~ 415 (470)
T KOG1544|consen 337 SNDIYQVTPPYRV-SSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL 415 (470)
T ss_pred cCceeeecCCeec-cCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccC
Confidence 1 12222222223 25899999999999999999999 8899999999985432221 256899999999999988
Q ss_pred CCCCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 631 KDGTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 631 ~~~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
++|...+|||..||||+.|||+|||
T Consensus 416 ~~G~~~KyW~aANSWG~~WGE~GYF 440 (470)
T KOG1544|consen 416 PDGRTLKYWTAANSWGPAWGERGYF 440 (470)
T ss_pred CCCCeeEEEEeecccccccccCceE
Confidence 8899999999999999999999998
No 32
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.91 E-value=4.8e-25 Score=223.57 Aligned_cols=205 Identities=21% Similarity=0.334 Sum_probs=129.2
Q ss_pred CCCCCceecCCCCCCCcCCCCCCCccHHHHHHHHHHHHHHHHhcCCCcCCCHHHHHHccCCCCCCCCCcHHHHHHHHHHc
Q psy11694 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373 (655)
Q Consensus 294 ~~lP~~~Dwr~~g~vtpVkdQg~CGsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~dC~~~~~gC~GG~~~~a~~~~~~~ 373 (655)
..+|+.||||+.|.|+||||||.|||||||++++++|+.+.-.. ...+|+..+ +..
T Consensus 97 ~s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~--~w~~s~~nm----------------------~~l 152 (372)
T COG4870 97 ASLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPES--AWDFSENNM----------------------KNL 152 (372)
T ss_pred ccchhheeeeccCCcccccccCcccceEeeeehhhhhheecccc--cccccccch----------------------hhh
Confidence 45899999999999999999999999999999999999754221 222332211 112
Q ss_pred CCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCCcccccccEEeCCceEE
Q psy11694 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLY 453 (655)
Q Consensus 374 ~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~~~~Y~~Gi~~~~Havy 453 (655)
.|++.|+.+||...++...|-..
T Consensus 153 l~~~ye~~fd~~~~d~g~~~m~~--------------------------------------------------------- 175 (372)
T COG4870 153 LGVPYEKGFDYTSNDGGNADMSA--------------------------------------------------------- 175 (372)
T ss_pred cCCCccccCCCccccCCcccccc---------------------------------------------------------
Confidence 27788888888754441111100
Q ss_pred cCCccEEEEecCCCCccCCCceEEecccCCccccccccCCccccccccccccccCCCCCCCCCCCCCccchHHHHHHcCC
Q psy11694 454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGG 533 (655)
Q Consensus 454 g~g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~gv~p~~~~~ls~q~l~~C~~~~~gC~GG~~~~a~~~~~~~~G 533 (655)
...-|.+|.+..+.+.+..+ +
T Consensus 176 ----------------------------------------------------------a~l~e~sgpv~et~d~y~~~-s 196 (372)
T COG4870 176 ----------------------------------------------------------AYLTEWSGPVYETDDPYSEN-S 196 (372)
T ss_pred ----------------------------------------------------------ccccccCCcchhhcCccccc-c
Confidence 00013334444343444343 6
Q ss_pred CCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHHHHHHHHHhCCCEEEEEEcccccccc--CCeeecCCCCC
Q psy11694 534 VVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYS--GGVIDLNQRLC 611 (655)
Q Consensus 534 v~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~ik~~i~~~GPV~~~~~~~~~~~y~--~Gi~~~~~~~~ 611 (655)
...++++|+..... .|...+ .....+ +.-.||+++..+|.+..+|.+.....+. -+.+ .+.
T Consensus 197 ~~~~~~~p~~k~~~--~~~~i~--------~~~~~L---dnG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~----~~~ 259 (372)
T COG4870 197 YFSPTNLPVTKHVQ--EAQIIP--------SRKKYL---DNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYP----YVD 259 (372)
T ss_pred ccCCcCCchhhccc--cceecc--------cchhhh---cccchHHHHhhhccccceeEEecccccccccCCC----CCC
Confidence 66667777776654 333222 111222 3334788888889888777764322222 2222 222
Q ss_pred CCcccCeEEEEEEEecccCCC-----CCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 612 NPKAQNHALIIVGYGEEEKKD-----GTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 612 ~~~~~~HaV~iVGyg~~~~~~-----~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
....++|||+||||||....+ .++.++||||||||++||++|||
T Consensus 260 s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYf 308 (372)
T COG4870 260 SGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYF 308 (372)
T ss_pred ccccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceE
Confidence 226789999999999976432 34678999999999999999997
No 33
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.90 E-value=2.2e-24 Score=207.34 Aligned_cols=108 Identities=46% Similarity=0.978 Sum_probs=91.9
Q ss_pred ccccccccccCCCCC-CCCCCCCCccchHHHHHHcCCCCCCCCcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHHH
Q psy11694 498 LSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEE 576 (655)
Q Consensus 498 ~~~ls~q~l~~C~~~-~~gC~GG~~~~a~~~~~~~~Gv~~e~~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~ 576 (655)
..+||+|+|++|... +.||.||++..|++|+.++.|+++|++|||..
T Consensus 45 ~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~~PY~~-------------------------------- 92 (174)
T smart00645 45 LVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKNGGLETESCYPYTG-------------------------------- 92 (174)
T ss_pred ccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHcCCcccccccCccc--------------------------------
Confidence 578999999999864 66999999999999999855999999999964
Q ss_pred HHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecc-cCCCCCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 577 MKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEE-EKKDGTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 577 ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~-~~~~~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
++.+.+.+|+.|++|||.. ..|+...++|||+|||||.+ + +++|||||||||+.|||+|||
T Consensus 93 -----------~~~~~~~~f~~Y~~Gi~~~--~~~~~~~~~Hav~ivGyg~~~~-----g~~yWii~NSwG~~WG~~G~~ 154 (174)
T smart00645 93 -----------SVAIDASDFQFYKSGIYDH--PGCGSGTLDHAVLIVGYGTEEN-----GKDYWIVKNSWGTDWGENGYF 154 (174)
T ss_pred -----------EEEEEcccccCCcCeEECC--CCCCCCcccEEEEEEEEeecCC-----CeeEEEEECCCCCCcccCeEE
Confidence 4556666788999999973 24554457999999999976 3 668999999999999999996
No 34
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.89 E-value=2.6e-23 Score=223.16 Aligned_cols=280 Identities=19% Similarity=0.212 Sum_probs=157.1
Q ss_pred CcCCCCCCCccHHHHHHHHHHHHHHHHh-cCCCcCCCHHHHHH----------------cc--C----------CCCCCC
Q psy11694 309 SKVKEQGKCACCWAFSAVGVVEAMHAIQ-GNSLTELSVQQLVD----------------CD--M----------SNGGCN 359 (655)
Q Consensus 309 tpVkdQg~CGsCwAfa~~~~le~~~~i~-~~~~~~lS~q~l~d----------------C~--~----------~~~gC~ 359 (655)
.||+||+.-|.||.||+...||+.+..+ +...+.||+.+|+- +. . .....+
T Consensus 55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D 134 (437)
T cd00585 55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND 134 (437)
T ss_pred CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence 4899999999999999999999988774 45678999976664 22 1 134578
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCCcc
Q psy11694 360 GGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439 (655)
Q Consensus 360 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~~~ 439 (655)
||....+...|.++ |+++.+.||-+..... +-..... ...++..+ ...|..++.. +.-.-.+... -.
T Consensus 135 GGqw~m~~~li~KY-GvVPk~~~pet~~s~~-t~~~n~~--L~~kLr~~-------a~~lr~~~~~-~~~~~~l~~~-~~ 201 (437)
T cd00585 135 GGQWDMLVNLIEKY-GLVPKSVMPESFNSEN-SRRLNYL--LNRKLRED-------ALELRKLVAK-GASKEEIEAK-KE 201 (437)
T ss_pred CCchHHHHHHHHHc-CCCcccccCCCcCccc-hHHHHHH--HHHHHHHH-------HHHHHHHHhc-CCcHHHHHHH-HH
Confidence 99999999999888 9999999995433220 0000000 00000000 0011111110 0000000000 00
Q ss_pred cccccEEeCCceEEcCCccEEEEecCCCCccCCCceEEecccCCccccccccCCccccccccccccccCCCCCCCCCCCC
Q psy11694 440 YYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGG 519 (655)
Q Consensus 440 ~Y~~Gi~~~~Havyg~g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~gv~p~~~~~ls~q~l~~C~~~~~gC~GG 519 (655)
-+...| .+|. ..-.|..|. ..++. ..|=+ ...-|.
T Consensus 202 ~~~~~i------------------------------y~il--------~~~lG~pP~-~F~~~---y~dkd-~~~~~~-- 236 (437)
T cd00585 202 EMLKEV------------------------------YRIL--------AIALGEPPE-KFDWE---YRDKD-KKYHEI-- 236 (437)
T ss_pred HHHHHH------------------------------HHHH--------HHHcCCCCc-eEEEE---EEeCC-CCeeeC--
Confidence 000000 0000 011222231 11110 00100 011122
Q ss_pred CccchHHHHHHcCCCCCCCCc-----------ccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHHHHH-HHHHhCCCE
Q psy11694 520 RMDDALQYIIDNGGVVSDQAY-----------PYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMK-KWVATRGPL 587 (655)
Q Consensus 520 ~~~~a~~~~~~~~Gv~~e~~y-----------PY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~ik-~~i~~~GPV 587 (655)
-..++.+|.++..|+.. .+| ||..... .....+....+...|.+++.....++. ++|..++||
T Consensus 237 ~~~TP~~F~~~yv~~~~-~dyV~l~~~p~~~~p~~~~y~----ve~~~Nv~~g~~~~y~Nvp~d~l~~~~~~~L~~g~pV 311 (437)
T cd00585 237 KELTPLEFYKKYVKFDL-DDYVSLINDPRPDKPYNKLYT----VEYLGNVVGGRPILYLNVPMDVLKKAAIAQLKDGEPV 311 (437)
T ss_pred CCcCHHHHHHHhcCCCc-cceEEEEeCCCCCCCCCceEE----EecCCcccccccceEEecCHHHHHHHHHHHHhcCCCE
Confidence 23588888888767554 333 2321111 111112333355678888655544433 577788899
Q ss_pred EEEEEccccccccCCeeecCCC------------------CCCCcccCeEEEEEEEecccCCCCCCC-CEEEEEcCCCCC
Q psy11694 588 SVGMNANGLFYYSGGVIDLNQR------------------LCNPKAQNHALIIVGYGEEEKKDGTSI-PYWIVKNSWGSD 648 (655)
Q Consensus 588 ~~~~~~~~~~~y~~Gi~~~~~~------------------~~~~~~~~HaV~iVGyg~~~~~~~~~~-~ywivrNSWG~~ 648 (655)
.+++++..++.|++||+++... .+..+..+|||+|||||.+.. |. .||+|+||||+.
T Consensus 312 ~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~----g~p~yw~VkNSWG~~ 387 (437)
T cd00585 312 WFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDED----GKPVKWKVENSWGEK 387 (437)
T ss_pred EEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCC----CCcceEEEEcccCCC
Confidence 9999998888999999964321 133456789999999997642 33 599999999999
Q ss_pred cCCCCCC
Q psy11694 649 WGEKVML 655 (655)
Q Consensus 649 WGe~Gy~ 655 (655)
||++|||
T Consensus 388 ~G~~Gy~ 394 (437)
T cd00585 388 VGKKGYF 394 (437)
T ss_pred CCCCcce
Confidence 9999997
No 35
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.75 E-value=3.1e-19 Score=191.88 Aligned_cols=281 Identities=23% Similarity=0.296 Sum_probs=132.0
Q ss_pred CcCCCCCCCccHHHHHHHHHHHHHHHHhcC-CCcCCCHHHHH----------------HccCC------------CCCCC
Q psy11694 309 SKVKEQGKCACCWAFSAVGVVEAMHAIQGN-SLTELSVQQLV----------------DCDMS------------NGGCN 359 (655)
Q Consensus 309 tpVkdQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~----------------dC~~~------------~~gC~ 359 (655)
.||.||..-|.||.||+...++..+..+.+ +...||+-+|. ++... ....+
T Consensus 56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D 135 (438)
T PF03051_consen 56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD 135 (438)
T ss_dssp -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence 499999999999999999999999988765 67899997765 33211 23478
Q ss_pred CCcHHHHHHHHHHcCCCCCCCCCCCCCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCCcc
Q psy11694 360 GGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439 (655)
Q Consensus 360 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~~~ 439 (655)
||....+...|.++ |||+.+.||-+....+ +-....- ...+++.+ ...|.+++.. |.-.-.+... ..
T Consensus 136 GGqw~~~~nli~KY-GvVPk~~mpet~~s~~-t~~~n~~--l~~~Lr~~-------a~~LR~~~~~-~~~~~~l~~~-k~ 202 (438)
T PF03051_consen 136 GGQWDMVVNLIKKY-GVVPKSVMPETFSSSN-TSEMNEM--LNTKLREY-------ALELRKLVKA-GKSEEELRKL-KE 202 (438)
T ss_dssp -B-HHHHHHHHHHH----BGGGSTTGCGCHB-HHHHHHH--HHHHHHHH-------HHHHHHHHHT-TTTCHHHHHH-HH
T ss_pred CCchHHHHHHHHHc-CcCcHhhCCCCCCCCC-hHHHHHH--HHHHHHHH-------HHHHHHHHHc-CCCHHHHHHH-HH
Confidence 99999999999888 9999999996543220 0000000 00000000 0011111110 0000000000 00
Q ss_pred cccccEEeCCceEEcCCccEEEEecCCCCccCCCceEEecccCCccccccccCCccccccccccccccCCCCCCCCCCCC
Q psy11694 440 YYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGG 519 (655)
Q Consensus 440 ~Y~~Gi~~~~Havyg~g~~yWivkNSWG~~WGe~Gy~~i~~~~~~~~~~~~~gv~p~~~~~ls~q~l~~C~~~~~gC~GG 519 (655)
.....| .+|. ..-.|..| +..++. ..+ .+.-=.-+
T Consensus 203 ~~l~~i------------------------------y~il--------~~~lG~PP-~~F~~e---y~d---kd~~~~~~ 237 (438)
T PF03051_consen 203 EMLAEI------------------------------YRIL--------AIYLGEPP-EKFTWE---YRD---KDKKYHRG 237 (438)
T ss_dssp HHHHHH------------------------------HHHH--------HHHH---S-SSEEEE---EE----TTS-EEEE
T ss_pred HHHHHH------------------------------HHHH--------HHHcCCCC-hheeEE---Eec---cccccccc
Confidence 000000 0000 00122222 111111 000 00000001
Q ss_pred CccchHHHHHHcCCCCCCC----------CcccCCCCCCCCCCCCCcCCceEEEeeeEEcCCCCHH-HHHHHHHhCCCEE
Q psy11694 520 RMDDALQYIIDNGGVVSDQ----------AYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEE-EMKKWVATRGPLS 588 (655)
Q Consensus 520 ~~~~a~~~~~~~~Gv~~e~----------~yPY~~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~-~ik~~i~~~GPV~ 588 (655)
-..++.+|+.+..|+..+. ..||..... -.. ..+....+...|.+++..... .+.+.|..+-||.
T Consensus 238 ~~~TP~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~---ve~-~~Nv~~g~~~~ylNvpid~lk~~~i~~Lk~G~~Vw 313 (438)
T PF03051_consen 238 KNYTPLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYT---VEY-LGNVVGGRPVRYLNVPIDELKDAAIKSLKAGYPVW 313 (438)
T ss_dssp EEE-HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEE---ETT-TTSSTT-EEEEEEE--HHHHHHHHHHHHHTT--EE
T ss_pred cccCchhHHHHHhCCCCcceEEEeeCCCccCccceeEE---Ecc-CCCEECCcceeEeccCHHHHHHHHHHHHHcCCcEE
Confidence 2357899998877765543 223332211 001 112223344567888765433 3444555555999
Q ss_pred EEEEccccccccCCeeecCCCC------------------CCCcccCeEEEEEEEecccCCCCCCC-CEEEEEcCCCCCc
Q psy11694 589 VGMNANGLFYYSGGVIDLNQRL------------------CNPKAQNHALIIVGYGEEEKKDGTSI-PYWIVKNSWGSDW 649 (655)
Q Consensus 589 ~~~~~~~~~~y~~Gi~~~~~~~------------------~~~~~~~HaV~iVGyg~~~~~~~~~~-~ywivrNSWG~~W 649 (655)
.+.+|..+++.+.||.+..... ...+..+|||+|||++.+.+ |. .+|+|+||||+..
T Consensus 314 fgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~----g~p~~wkVeNSWG~~~ 389 (438)
T PF03051_consen 314 FGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDED----GKPVRWKVENSWGTDN 389 (438)
T ss_dssp EEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TT----SSEEEEEEE-SBTTTS
T ss_pred EeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccC----CCeeEEEEEcCCCCCC
Confidence 9999988888899998743211 11256789999999998652 43 4899999999999
Q ss_pred CCCCCC
Q psy11694 650 GEKVML 655 (655)
Q Consensus 650 Ge~Gy~ 655 (655)
|.+|||
T Consensus 390 g~kGy~ 395 (438)
T PF03051_consen 390 GDKGYF 395 (438)
T ss_dssp TBTTEE
T ss_pred CCCcEE
Confidence 999996
No 36
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.73 E-value=4.7e-18 Score=172.96 Aligned_cols=39 Identities=44% Similarity=0.806 Sum_probs=37.2
Q ss_pred CCCccccccccCccccccCCCcceeeecccchhhHHhhh
Q psy11694 147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMH 185 (655)
Q Consensus 147 ~~P~~~DWr~~g~Vt~VkdQg~c~~~~a~~~~~~~e~~~ 185 (655)
.+|.-||||+.|-||+|||||.||+||+|+++.++|+..
T Consensus 98 s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l 136 (372)
T COG4870 98 SLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYL 136 (372)
T ss_pred cchhheeeeccCCcccccccCcccceEeeeehhhhhhee
Confidence 589999999999999999999999999999999998875
No 37
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.72 E-value=1e-17 Score=130.42 Aligned_cols=58 Identities=26% Similarity=0.442 Sum_probs=52.3
Q ss_pred HHHHHHHcCcccCChHHHHHHHHHHHHHHHHHHHHhccCCCCeeeeccCCCCCCHHHH
Q psy11694 45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102 (655)
Q Consensus 45 f~~~~~~~~k~Y~~~~e~~~r~~~f~~n~~~i~~~n~~~~~~~~~g~N~fsDlt~~E~ 102 (655)
|++||++|+|+|.+.+|+++|+.||++|++.|++||+....+|++|||+|||||++||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence 8999999999999999999999999999999999998788899999999999999998
No 38
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.56 E-value=2.9e-15 Score=116.29 Aligned_cols=57 Identities=30% Similarity=0.491 Sum_probs=54.3
Q ss_pred HHHHHHHcCcccCChHHHHHHHHHHHHHHHHHHHHhccCCCCeeeeccCCCCCCHHH
Q psy11694 45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD 101 (655)
Q Consensus 45 f~~~~~~~~k~Y~~~~e~~~r~~~f~~n~~~i~~~n~~~~~~~~~g~N~fsDlt~~E 101 (655)
|..||++|+|.|.+.+|+.+|+.+|++|++.|..||+....+|++|||+|||||.+|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence 689999999999999999999999999999999999877789999999999999886
No 39
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=99.41 E-value=1.1e-13 Score=138.28 Aligned_cols=90 Identities=22% Similarity=0.398 Sum_probs=61.5
Q ss_pred EeeeEEcCCCCHHHHHHHHHhCC-CEEEEEEccccccccCCeeecCCC------------------CCCCcccCeEEEEE
Q psy11694 563 VKEYSRIPYGEEEEMKKWVATRG-PLSVGMNANGLFYYSGGVIDLNQR------------------LCNPKAQNHALIIV 623 (655)
Q Consensus 563 ~~~~~~~~~~~~~~ik~~i~~~G-PV~~~~~~~~~~~y~~Gi~~~~~~------------------~~~~~~~~HaV~iV 623 (655)
...|.+++......+...-++.| ||.+|.++..+...+.||.+..-. +.+.+...|||+|.
T Consensus 289 ~v~ylNv~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlT 368 (444)
T COG3579 289 AVKYLNVDMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLT 368 (444)
T ss_pred eeEEecCcHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhh
Confidence 34455554433222222223444 999999998888888888763211 12235568999999
Q ss_pred EEecccCCCCCCCCEEEEEcCCCCCcCCCCCC
Q psy11694 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEKVML 655 (655)
Q Consensus 624 Gyg~~~~~~~~~~~ywivrNSWG~~WGe~Gy~ 655 (655)
|.+.+. +|... -|.|+||||.+=|.+|||
T Consensus 369 Gvd~d~--~g~p~-rwkVENSWG~d~G~~Gyf 397 (444)
T COG3579 369 GVDLDE--TGNPL-RWKVENSWGKDVGKKGYF 397 (444)
T ss_pred cccccc--CCCce-eeEeecccccccCCCceE
Confidence 999775 33333 499999999999999997
No 40
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.40 E-value=5.9e-13 Score=103.57 Aligned_cols=58 Identities=26% Similarity=0.402 Sum_probs=51.1
Q ss_pred hhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHHH
Q psy11694 193 TELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDL 250 (655)
Q Consensus 193 ~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eEf 250 (655)
|+.|+++|+|.|.+.+|+..|+.+|++|++.|.+||+.+..+|++|+|+|+|||.+||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence 5789999999999999999999999999999999997777899999999999999997
No 41
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.33 E-value=2.3e-12 Score=139.06 Aligned_cols=75 Identities=20% Similarity=0.374 Sum_probs=59.9
Q ss_pred eeeceEEcCCCCHHHHH----HHHhhCCCeEEEEEcCCcccccccEEe----------------------------CCce
Q psy11694 404 KVKEYSRIPYGEEEEMK----KWVATRGPLSVGMNANGLFYYSGGVID----------------------------LNQR 451 (655)
Q Consensus 404 ~i~~~~~v~~~~~~~ik----~~l~~~gPV~v~i~~~~~~~Y~~Gi~~----------------------------~~Ha 451 (655)
+...|..++. +.|+ ++|..++||.++.++..++.|++||+. .+||
T Consensus 285 ~~~~y~Nvp~---d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHA 361 (437)
T cd00585 285 RPILYLNVPM---DVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHA 361 (437)
T ss_pred ccceEEecCH---HHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeE
Confidence 3344555543 4444 677889999999999888899999984 4799
Q ss_pred E----EcC---Cc-cEEEEecCCCCccCCCceEEeccc
Q psy11694 452 L----YGT---SI-PYWIVKNSWGSDWGEKVEDKVGSS 481 (655)
Q Consensus 452 v----yg~---g~-~yWivkNSWG~~WGe~Gy~~i~~~ 481 (655)
| ||. |. .||+||||||++||++||++|++.
T Consensus 362 M~ivGv~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 362 MVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred EEEEEEEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence 9 554 44 699999999999999999999864
No 42
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.15 E-value=4.7e-11 Score=92.55 Aligned_cols=57 Identities=32% Similarity=0.433 Sum_probs=53.0
Q ss_pred hhhhhhccccccCCHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeCcCCCCChHH
Q psy11694 193 TELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESD 249 (655)
Q Consensus 193 ~~~~~q~~~k~Y~~~~E~~~R~~iF~~n~~~I~~~N~~~~~~~~~g~N~FsDlt~eE 249 (655)
|..|+++|+|.|.+.+|...|+.+|..|++.|..||..+..+|++|+|+|+|||++|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence 457899999999999999999999999999999999877679999999999999986
No 43
>KOG4128|consensus
Probab=98.10 E-value=6.3e-07 Score=90.17 Aligned_cols=76 Identities=22% Similarity=0.345 Sum_probs=57.2
Q ss_pred CcCCCCCCCccHHHHHHHHHHHHHHHHhcC-CCcCCCHHHHHH--------------------ccCC----------CCC
Q psy11694 309 SKVKEQGKCACCWAFSAVGVVEAMHAIQGN-SLTELSVQQLVD--------------------CDMS----------NGG 357 (655)
Q Consensus 309 tpVkdQg~CGsCwAfa~~~~le~~~~i~~~-~~~~lS~q~l~d--------------------C~~~----------~~g 357 (655)
+||.||.+-|-||.|+.+..+.--+..+-+ .-..||..+|+- |... +.-
T Consensus 63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~ 142 (457)
T KOG4128|consen 63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPV 142 (457)
T ss_pred cccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCC
Confidence 699999999999999999888666555433 346788876652 2211 233
Q ss_pred CCCCcHHHHHHHHHHcCCCCCCCCCCCC
Q psy11694 358 CNGGRMDDALQYIIDNGGVVSDQAYPYK 385 (655)
Q Consensus 358 C~GG~~~~a~~~~~~~~Gi~~e~~yPY~ 385 (655)
-+||.....+..++++ |+.+..|||-.
T Consensus 143 ~DGGqw~MfvNlVkKY-GviPKkcy~~s 169 (457)
T KOG4128|consen 143 PDGGQWQMFVNLVKKY-GVIPKKCYLHS 169 (457)
T ss_pred CCCchHHHHHHHHHHh-CCCcHHhcccc
Confidence 5699888888888887 99999999855
No 44
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=97.90 E-value=2.1e-05 Score=85.53 Aligned_cols=64 Identities=25% Similarity=0.453 Sum_probs=49.4
Q ss_pred HHHHHHHhhCCCeEEEEEcCCcccccccEEe----------------------------CCceE--EcC-----Cc-cEE
Q psy11694 417 EEMKKWVATRGPLSVGMNANGLFYYSGGVID----------------------------LNQRL--YGT-----SI-PYW 460 (655)
Q Consensus 417 ~~ik~~l~~~gPV~v~i~~~~~~~Y~~Gi~~----------------------------~~Hav--yg~-----g~-~yW 460 (655)
+.+..+|..+-||..+-++..+..-+.||.+ .+||| .|. |. .+|
T Consensus 300 ~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~p~~w 379 (438)
T PF03051_consen 300 DAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDEDGKPVRW 379 (438)
T ss_dssp HHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TTSSEEEE
T ss_pred HHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccCCCeeEE
Confidence 3455677778899999999887778889886 58999 554 54 599
Q ss_pred EEecCCCCccCCCceEEecc
Q psy11694 461 IVKNSWGSDWGEKVEDKVGS 480 (655)
Q Consensus 461 ivkNSWG~~WGe~Gy~~i~~ 480 (655)
+|+||||++.|.+||+.|+.
T Consensus 380 kVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 380 KVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp EEE-SBTTTSTBTTEEEEEH
T ss_pred EEEcCCCCCCCCCcEEEECH
Confidence 99999999999999999974
No 45
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=96.29 E-value=0.0037 Score=44.43 Aligned_cols=36 Identities=22% Similarity=0.115 Sum_probs=24.4
Q ss_pred HHHHHHhccCCCCeeeeccCCCCCCHHHHHHHhCCCCC
Q psy11694 74 EKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLD 111 (655)
Q Consensus 74 ~~i~~~n~~~~~~~~~g~N~fsDlt~~E~~~~~g~~~~ 111 (655)
++|+..|+. +.|++.|.| |.++|.+.+++++|..+.
T Consensus 4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~LlGv~~~ 39 (41)
T PF08127_consen 4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLLGVLPD 39 (41)
T ss_dssp HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS-B-TT
T ss_pred HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHcCCCCC
Confidence 578888887 779999999 999999999999997654
No 46
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=95.02 E-value=0.023 Score=40.38 Aligned_cols=35 Identities=26% Similarity=0.177 Sum_probs=23.3
Q ss_pred HHHHHHhcCCCCceeeeeCcCCCCChHHHHhhcCCCC
Q psy11694 222 EKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL 258 (655)
Q Consensus 222 ~~I~~~N~~~~~~~~~g~N~FsDlt~eEf~~~~~~~~ 258 (655)
++|+.+|+. +.+|++|.| |.+.|.+.++.++|...
T Consensus 4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~LlGv~~ 38 (41)
T PF08127_consen 4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLLGVLP 38 (41)
T ss_dssp HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS-B-T
T ss_pred HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHcCCCC
Confidence 578888987 669999999 89999999999888643
No 47
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=91.97 E-value=0.58 Score=42.36 Aligned_cols=58 Identities=22% Similarity=0.429 Sum_probs=34.8
Q ss_pred CCHHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCC
Q psy11694 572 GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSW 645 (655)
Q Consensus 572 ~~~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSW 645 (655)
.+.+.|++.|.++.||.+.+...-... .+..+ ....++|.|+|+||+++. +++|..+|
T Consensus 87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~-~~~~~-------~~~~~~H~vvi~Gy~~~~--------~~~v~DP~ 144 (144)
T PF13529_consen 87 ASFDDIKQEIDAGRPVIVSVNSGWRPP-NGDGY-------DGTYGGHYVVIIGYDEDG--------YVYVNDPW 144 (144)
T ss_dssp S-HHHHHHHHHTT--EEEEEETTSS---TTEEE-------EE-TTEEEEEEEEE-SSE---------EEEE-TT
T ss_pred CcHHHHHHHHHCCCcEEEEEEcccccC-CCCCc-------CCCcCCEEEEEEEEeCCC--------EEEEeCCC
Confidence 478999999998889999997421000 11112 123478999999999842 68888777
No 48
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=90.47 E-value=0.31 Score=50.30 Aligned_cols=57 Identities=25% Similarity=0.404 Sum_probs=43.4
Q ss_pred HhhCCCeEEEEEcCCcccccccEEe----------------------------CCceE--EcC-----C-ccEEEEecCC
Q psy11694 423 VATRGPLSVGMNANGLFYYSGGVID----------------------------LNQRL--YGT-----S-IPYWIVKNSW 466 (655)
Q Consensus 423 l~~~gPV~v~i~~~~~~~Y~~Gi~~----------------------------~~Hav--yg~-----g-~~yWivkNSW 466 (655)
+..+-||-.+-++..+..-+.||.. ..||| -|. | .-=|.|.|||
T Consensus 308 ~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~p~rwkVENSW 387 (444)
T COG3579 308 MQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGNPLRWKVENSW 387 (444)
T ss_pred HhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCCceeeEeeccc
Confidence 3446689888888878777888765 35777 332 2 3369999999
Q ss_pred CCccCCCceEEec
Q psy11694 467 GSDWGEKVEDKVG 479 (655)
Q Consensus 467 G~~WGe~Gy~~i~ 479 (655)
|.+=|.+|||-++
T Consensus 388 G~d~G~~GyfvaS 400 (444)
T COG3579 388 GKDVGKKGYFVAS 400 (444)
T ss_pred ccccCCCceEeeh
Confidence 9999999999875
No 49
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=78.62 E-value=5 Score=42.14 Aligned_cols=49 Identities=27% Similarity=0.381 Sum_probs=33.1
Q ss_pred CHHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEeccc
Q psy11694 573 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEE 629 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~ 629 (655)
-.+.|++.|.++.||.+.+++.-. .|...-+ ......|.|+|+||+++.
T Consensus 77 ~~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~~~~~H~i~v~G~d~~~ 125 (317)
T PF14399_consen 77 AWEELKEALDAGRPVIVWVDMYYL-PYRPNYY-------KKHHADHYIVVYGYDEEE 125 (317)
T ss_pred HHHHHHHHHhCCCceEEEeccccC-CCCcccc-------ccccCCcEEEEEEEeCCC
Confidence 366889988877799999876422 2222211 223468999999999764
No 50
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=76.69 E-value=9.2 Score=37.67 Aligned_cols=57 Identities=21% Similarity=0.332 Sum_probs=35.4
Q ss_pred CHHHHHHHHHhCCCEEEEEEcccccc--ccCCeeec--CCC-CCCCcccCeEEEEEEEeccc
Q psy11694 573 EEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDL--NQR-LCNPKAQNHALIIVGYGEEE 629 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~~~~~~--y~~Gi~~~--~~~-~~~~~~~~HaV~iVGyg~~~ 629 (655)
..++|...|.++||+.+-++..-..- -+...... +.. .......+|-|+|+||+...
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~ 173 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAAT 173 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCC
Confidence 68899999999998877777542220 02222110 001 11245679999999999764
No 51
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=75.86 E-value=8.1 Score=36.65 Aligned_cols=55 Identities=22% Similarity=0.365 Sum_probs=36.2
Q ss_pred CHHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCC
Q psy11694 573 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSW 645 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSW 645 (655)
+.+++++.+.++-|+.+..+.-+ .......+|||+||||-.-.. +.++.+|=|=|
T Consensus 90 s~~eV~~~~~~nk~i~i~~~~v~--------------~~~~~~~gHAlavvGya~~~~----g~~~y~~WNPW 144 (175)
T PF05543_consen 90 SFDEVKKLIDNNKGIAILADRVE--------------QTNGPHAGHALAVVGYAKPNN----GQKTYYFWNPW 144 (175)
T ss_dssp -HHHHHHHHHTT-EEEEEEEETT--------------SCTTB--EEEEEEEEEEEETT----SEEEEEEE-TT
T ss_pred CHHHHHHHHHcCCCeEEEecccc--------------cCCCCccceeEEEEeeeecCC----CCeEEEEeCCc
Confidence 68899999998888887665310 012245689999999965331 57889998887
No 52
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=66.83 E-value=65 Score=28.53 Aligned_cols=50 Identities=20% Similarity=0.283 Sum_probs=28.3
Q ss_pred CHHHHHHHHhhCCCeEEEEEcCCcccccccEEe---CCceE--EcC--CccEEEEecCC
Q psy11694 415 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVID---LNQRL--YGT--SIPYWIVKNSW 466 (655)
Q Consensus 415 ~~~~ik~~l~~~gPV~v~i~~~~~~~Y~~Gi~~---~~Hav--yg~--g~~yWivkNSW 466 (655)
+.+.|++.|.++.||.+.+....-.. .++.+. ..|.| .|. .. +++|-.+|
T Consensus 88 ~~~~i~~~i~~G~Pvi~~~~~~~~~~-~~~~~~~~~~~H~vvi~Gy~~~~-~~~v~DP~ 144 (144)
T PF13529_consen 88 SFDDIKQEIDAGRPVIVSVNSGWRPP-NGDGYDGTYGGHYVVIIGYDEDG-YVYVNDPW 144 (144)
T ss_dssp -HHHHHHHHHTT--EEEEEETTSS---TTEEEEE-TTEEEEEEEEE-SSE--EEEE-TT
T ss_pred cHHHHHHHHHCCCcEEEEEEcccccC-CCCCcCCCcCCEEEEEEEEeCCC-EEEEeCCC
Confidence 67899999999899999997432100 111111 67888 443 33 57777666
No 53
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=64.18 E-value=17 Score=34.66 Aligned_cols=51 Identities=14% Similarity=0.285 Sum_probs=36.1
Q ss_pred cCCCCHHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCCC
Q psy11694 569 IPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWG 646 (655)
Q Consensus 569 ~~~~~~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG 646 (655)
+.-.++++||..|.++.||.+-... |-. ..-|+|+|+|||+.. +..-++||
T Consensus 118 ~tGksl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyDk~n---------~yynDpyG 168 (195)
T COG4990 118 LTGKSLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYDKYN---------IYYNDPYG 168 (195)
T ss_pred CcCCcHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeecccccc---------eEeccccc
Confidence 3445899999999998899954432 111 235999999999863 56666665
No 54
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=58.52 E-value=29 Score=31.25 Aligned_cols=34 Identities=26% Similarity=0.531 Sum_probs=25.2
Q ss_pred HHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEe
Q psy11694 577 MKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYG 626 (655)
Q Consensus 577 ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg 626 (655)
+++.|..+.||.+.+... .. ....+|.|+|+||+
T Consensus 70 ~~~~l~~~~Pvi~~~~~~--------~~--------~~~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG--------VS--------ITPSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEecC--------cc--------cCCCCeEEEEEEEc
Confidence 778888888999887751 11 12368999999998
No 55
>PF01640 Peptidase_C10: Peptidase C10 family classification.; InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=51.53 E-value=28 Score=33.83 Aligned_cols=50 Identities=24% Similarity=0.571 Sum_probs=30.3
Q ss_pred HHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEecccCCCCCCCCEEEEEcCCCCCcC--C
Q psy11694 574 EEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG--E 651 (655)
Q Consensus 574 ~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~~~~~~~~~ywivrNSWG~~WG--e 651 (655)
.+.|+.+|.++.||...-... .++||-+|=||..+. ||- --|| || .
T Consensus 140 ~~~i~~el~~~rPV~~~g~~~--------------------~~GHawViDGy~~~~--------~~H--~NwG--W~G~~ 187 (192)
T PF01640_consen 140 MDMIRNELDNGRPVLYSGNSK--------------------SGGHAWVIDGYDSDG--------YFH--CNWG--WGGSS 187 (192)
T ss_dssp HHHHHHHHHTT--EEEEEEET--------------------TEEEEEEEEEEESSS--------EEE--EE-S--STTTT
T ss_pred HHHHHHHHHcCCCEEEEEecC--------------------CCCeEEEEcCccCCC--------eEE--EeeC--ccCCC
Confidence 467888998888998543211 118999999996543 554 4465 55 4
Q ss_pred CCCC
Q psy11694 652 KVML 655 (655)
Q Consensus 652 ~Gy~ 655 (655)
+|||
T Consensus 188 nGyy 191 (192)
T PF01640_consen 188 NGYY 191 (192)
T ss_dssp -EEE
T ss_pred CCcc
Confidence 6775
No 56
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=47.48 E-value=42 Score=31.41 Aligned_cols=40 Identities=25% Similarity=0.331 Sum_probs=29.6
Q ss_pred CHHHHHHHHHhCCCEEEEEEccccccccCCeeecCCCCCCCcccCeEEEEEEEeccc
Q psy11694 573 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEE 629 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~~~~~~y~~Gi~~~~~~~~~~~~~~HaV~iVGyg~~~ 629 (655)
..+.+++.|.++||+.++.... +..-..|+++|+|-+.+.
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~P-----------------~~~~~~H~~ViTGI~~dg 136 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEAP-----------------GDSWVAHASVITGIDGDG 136 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecCC-----------------CCcceeeEEEEEeecCCC
Confidence 5789999999999999884432 112235999999987653
No 57
>KOG4128|consensus
Probab=42.64 E-value=9.3 Score=39.68 Aligned_cols=64 Identities=25% Similarity=0.403 Sum_probs=44.6
Q ss_pred CHHHHHHHHhh----CCCeEEEEEcCCcccccccEEe-----------------------------CCceE----EcC--
Q psy11694 415 EEEEMKKWVAT----RGPLSVGMNANGLFYYSGGVID-----------------------------LNQRL----YGT-- 455 (655)
Q Consensus 415 ~~~~ik~~l~~----~gPV~v~i~~~~~~~Y~~Gi~~-----------------------------~~Hav----yg~-- 455 (655)
+.+.|++.+.+ .-||-.+-++..+..-++|+.+ ..||+ .|.
T Consensus 305 ~~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~kd 384 (457)
T KOG4128|consen 305 SMDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLKD 384 (457)
T ss_pred CHHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhccccC
Confidence 45666665543 5577777777667777777654 34666 231
Q ss_pred ----CccEEEEecCCCCccCCCceEEe
Q psy11694 456 ----SIPYWIVKNSWGSDWGEKVEDKV 478 (655)
Q Consensus 456 ----g~~yWivkNSWG~~WGe~Gy~~i 478 (655)
+..-|-|.||||.+-|-+||..|
T Consensus 385 ~~~g~~~~~rVenswgkd~gkkg~~~m 411 (457)
T KOG4128|consen 385 PATGGLNEHRVENSWGKDLGKKGVNKM 411 (457)
T ss_pred cccCCchhhhhhchhhhhccccchhhh
Confidence 33459999999999999999766
No 58
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=41.81 E-value=96 Score=29.58 Aligned_cols=113 Identities=16% Similarity=0.208 Sum_probs=63.5
Q ss_pred CCCCCccHHHHHHHHHHHHHHHH--------hcCCCcCCCHHHHHHccCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCC
Q psy11694 313 EQGKCACCWAFSAVGVVEAMHAI--------QGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPY 384 (655)
Q Consensus 313 dQg~CGsCwAfa~~~~le~~~~i--------~~~~~~~lS~q~l~dC~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY 384 (655)
.||.=+=|-+|+.+++|-..... .+.....+|+++|.+++ -.+...++|.+.. |...
T Consensus 18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~--------~~~~~~i~y~ks~-g~~~------ 82 (175)
T PF05543_consen 18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTS--------LTPNQMIKYAKSQ-GRNP------ 82 (175)
T ss_dssp --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH----------B-HHHHHHHHHHT-TEEE------
T ss_pred ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcC--------CCHHHHHHHHHHc-Ccch------
Confidence 58888999999999888764211 11223567888887764 2366888887665 3221
Q ss_pred CCCCCCCccccCCCCCceeeeeceEEcCCCCHHHHHHHHhhCCCeEEEEEcCCcccccccEEeCCceE--EcC-----Cc
Q psy11694 385 KASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL--YGT-----SI 457 (655)
Q Consensus 385 ~~~~~~~~C~~~~~~~~~~~i~~~~~v~~~~~~~ik~~l~~~gPV~v~i~~~~~~~Y~~Gi~~~~Hav--yg~-----g~ 457 (655)
.... .. .+-+++++.+.++-|+.+..+.-.-. .|.. .+||+ .|. |.
T Consensus 83 -------------------~~~n--~~--~s~~eV~~~~~~nk~i~i~~~~v~~~---~~~~-~gHAlavvGya~~~~g~ 135 (175)
T PF05543_consen 83 -------------------QYNN--RM--PSFDEVKKLIDNNKGIAILADRVEQT---NGPH-AGHALAVVGYAKPNNGQ 135 (175)
T ss_dssp -------------------EEEC--S-----HHHHHHHHHTT-EEEEEEEETTSC---TTB---EEEEEEEEEEEETTSE
T ss_pred -------------------hHhc--CC--CCHHHHHHHHHcCCCeEEEecccccC---CCCc-cceeEEEEeeeecCCCC
Confidence 0000 01 14788999999988998877643322 2211 46887 443 68
Q ss_pred cEEEEecCCC
Q psy11694 458 PYWIVKNSWG 467 (655)
Q Consensus 458 ~yWivkNSWG 467 (655)
++.++=|=|-
T Consensus 136 ~~y~~WNPW~ 145 (175)
T PF05543_consen 136 KTYYFWNPWW 145 (175)
T ss_dssp EEEEEE-TT-
T ss_pred eEEEEeCCcc
Confidence 8999988775
No 59
>PF11567 PfUIS3: Plasmodium falciparum UIS3 membrane protein; InterPro: IPR021626 UIS3 is a membrane protein essential for sporozoite development in infected hepatocytes. This family is 130-229 of the Plasmodium falciparum UIS3 protein which is compact and has an all alpha-helical structure.PfUIS3(130-229) interacts with lipids, phospholipid lysosomes, the human liver fatty acid-binding protein and with the lipid phosphatidylethanolamine. The interaction with liver fatty acid-binding protein provides the parasite with a method to import essential fatty acids/lipids during rapid growth phases of sporozoites []. ; PDB: 2VWA_C.
Probab=39.08 E-value=27 Score=28.87 Aligned_cols=32 Identities=25% Similarity=0.381 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCCeeeeccCCCCCCHHHHHHHh
Q psy11694 62 LLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLT 106 (655)
Q Consensus 62 ~~~r~~~f~~n~~~i~~~n~~~~~~~~~g~N~fsDlt~~E~~~~~ 106 (655)
-.+||.+|.+|.+...+|. |+||+.+.-..+.
T Consensus 20 piKrfN~F~Dn~rla~qhH-------------F~~LSn~Qq~y~i 51 (101)
T PF11567_consen 20 PIKRFNIFMDNARLAAQHH-------------FSNLSNEQQKYLI 51 (101)
T ss_dssp -HHHHHHHHHHHHHHHHHH-------------HHHS-HHHHHHHT
T ss_pred cHHHHHHHHHHHHHHHHHH-------------HHhcCcHHHHHHh
Confidence 3589999999999988874 8889887764443
No 60
>PF14663 RasGEF_N_2: Rapamycin-insensitive companion of mTOR RasGEF_N domain
Probab=30.64 E-value=30 Score=30.67 Aligned_cols=58 Identities=16% Similarity=0.247 Sum_probs=35.5
Q ss_pred CCCceeeccchhHHHHHHHHHHHhhhcccccccCccCchHHHHHHHHHHcCcccCChHHHHHH
Q psy11694 3 QPPWVSLGEKGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRR 65 (655)
Q Consensus 3 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~k~Y~~~~e~~~r 65 (655)
+|++.-|++.|-.+|.-++....... -.+..+ -...+.+.|+..+|+.|-..-|..-+
T Consensus 49 ~p~l~~L~~~g~~Ll~~~lS~~~Gf~--~L~~~~---~v~~El~~W~~~~N~~YV~~vE~~l~ 106 (115)
T PF14663_consen 49 RPSLDHLGDIGSPLLLRFLSTPSGFR--YLNEIG---YVEKELDKWFESFNKEYVKLVEEFLS 106 (115)
T ss_pred CcHHHHHHHcCHHHHHHHHcchHHHH--Hhcchh---HHHHHHHHHHHcccHHHHHHHHHHHH
Confidence 47777788888766663332222111 111112 23578999999999999887665544
No 61
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=30.27 E-value=3.4e+02 Score=25.56 Aligned_cols=23 Identities=30% Similarity=0.455 Sum_probs=20.0
Q ss_pred CHHHHHHHHhhCCCeEEEEEcCC
Q psy11694 415 EEEEMKKWVATRGPLSVGMNANG 437 (655)
Q Consensus 415 ~~~~ik~~l~~~gPV~v~i~~~~ 437 (655)
..+.+...|.++||+-+++....
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~P~ 119 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEAPG 119 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecCCC
Confidence 57899999999999999987654
No 62
>KOG4621|consensus
Probab=29.73 E-value=1.5e+02 Score=26.57 Aligned_cols=57 Identities=16% Similarity=0.333 Sum_probs=36.9
Q ss_pred CHHHHHHHHHhCCCEEEEEEccccc-------cccCCeeecCCCCC---CCcccCeEEEEEEEeccc
Q psy11694 573 EEEEMKKWVATRGPLSVGMNANGLF-------YYSGGVIDLNQRLC---NPKAQNHALIIVGYGEEE 629 (655)
Q Consensus 573 ~~~~ik~~i~~~GPV~~~~~~~~~~-------~y~~Gi~~~~~~~~---~~~~~~HaV~iVGyg~~~ 629 (655)
.+.+|+..|+++.-|++.+--++-. -.+++.+.++.+.| .+-..+|-|+|-||+-..
T Consensus 58 Si~dIqahLaqGnhiAIaLVdq~~Lhcdlceeplk~ccfspnghhcfcrtp~YqGHfiVi~GYd~a~ 124 (167)
T KOG4621|consen 58 SIHDIQAHLAQGNHIAIALVDQDKLHCDLCEEPLKSCCFSPNGHHCFCRTPCYQGHFIVICGYDAAR 124 (167)
T ss_pred eHHHHHHHHhcCCeEEEEEecCCceehHHHHhHHHHhccCCCCccccccCCcccccEEEEecccccc
Confidence 5788998888766777766543321 24556665433322 234679999999998643
No 63
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=28.43 E-value=36 Score=29.14 Aligned_cols=20 Identities=20% Similarity=0.286 Sum_probs=11.8
Q ss_pred cchhHHHHHHHHHHHhhhcc
Q psy11694 11 EKGLGYLHTFMIKVALLESN 30 (655)
Q Consensus 11 ~~~~~~l~~~~~~~~~~~~~ 30 (655)
-|.++||.++|+++.|+++.
T Consensus 3 SK~~llL~l~LA~lLlisSe 22 (95)
T PF07172_consen 3 SKAFLLLGLLLAALLLISSE 22 (95)
T ss_pred hhHHHHHHHHHHHHHHHHhh
Confidence 36677777666555555443
No 64
>KOG4702|consensus
Probab=26.07 E-value=3.2e+02 Score=21.84 Aligned_cols=36 Identities=17% Similarity=0.303 Sum_probs=27.1
Q ss_pred chHHHHHHHHHHcCcccCChHHHHHHHHHHHHHHHHH
Q psy11694 40 SPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76 (655)
Q Consensus 40 ~~~~~f~~~~~~~~k~Y~~~~e~~~r~~~f~~n~~~i 76 (655)
...+-|++|+..|.+.-.. .|...|.+-|++-++.-
T Consensus 26 NQpe~Fee~v~~~krel~p-pe~~~~~EE~~~~lRe~ 61 (77)
T KOG4702|consen 26 NQPEIFEEFVRGYKRELSP-PEATKRKEEYENFLREQ 61 (77)
T ss_pred cChHHHHHHHHhccccCCC-hHHHhhHHHHHHHHHHH
Confidence 4456799999999877665 57777888887766654
Done!