Query psy17191
Match_columns 377
No_of_seqs 221 out of 1599
Neff 5.9
Searched_HMMs 46136
Date Sat Aug 17 00:11:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17191.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17191hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03051 Peptidase_C1_2: Pepti 100.0 2E-103 5E-108 798.4 22.1 316 22-373 96-438 (438)
2 COG3579 PepC Aminopeptidase C 100.0 1E-102 2E-107 751.9 18.4 320 22-377 98-444 (444)
3 cd00585 Peptidase_C1B Peptidas 100.0 5E-101 1E-105 780.8 27.0 316 22-373 95-437 (437)
4 KOG4128|consensus 100.0 1.5E-93 3.2E-98 687.3 13.3 323 22-377 103-457 (457)
5 KOG1543|consensus 99.8 1.4E-20 3.1E-25 186.4 14.2 165 118-318 137-310 (325)
6 KOG1542|consensus 99.8 1.1E-20 2.3E-25 184.7 11.2 157 120-316 186-356 (372)
7 PTZ00203 cathepsin L protease; 99.8 4.5E-20 9.8E-25 184.3 14.5 160 119-316 154-325 (348)
8 cd02620 Peptidase_C1A_Cathepsi 99.8 1E-18 2.2E-23 165.4 14.3 159 122-316 35-223 (236)
9 cd02698 Peptidase_C1A_Cathepsi 99.8 1.3E-18 2.8E-23 165.0 14.6 149 136-322 55-224 (239)
10 PTZ00021 falcipain-2; Provisio 99.8 1E-18 2.2E-23 180.6 14.9 160 119-316 294-469 (489)
11 cd02248 Peptidase_C1A Peptidas 99.8 2.6E-18 5.6E-23 157.8 15.3 157 126-316 35-197 (210)
12 PTZ00200 cysteine proteinase; 99.8 2.2E-18 4.7E-23 177.1 14.1 157 119-316 263-427 (448)
13 cd02621 Peptidase_C1A_Cathepsi 99.8 5.6E-18 1.2E-22 160.7 14.0 156 134-316 54-227 (243)
14 cd02619 Peptidase_C1 C1 Peptid 99.7 3.3E-17 7.1E-22 150.2 15.1 179 120-326 26-222 (223)
15 PTZ00462 Serine-repeat antigen 99.7 2.1E-16 4.5E-21 172.3 14.7 165 118-315 559-764 (1004)
16 PF00112 Peptidase_C1: Papain 99.7 7.1E-17 1.5E-21 147.9 9.3 162 122-318 33-206 (219)
17 PTZ00364 dipeptidyl-peptidase 99.7 2.7E-16 5.9E-21 164.5 13.9 167 133-316 259-444 (548)
18 PTZ00049 cathepsin C-like prot 99.7 8.5E-16 1.9E-20 163.2 13.9 97 215-316 560-661 (693)
19 smart00645 Pept_C1 Papain fami 99.6 1.8E-15 3.9E-20 137.1 12.0 127 119-316 29-159 (174)
20 COG3579 PepC Aminopeptidase C 99.3 4.7E-13 1E-17 131.0 2.7 74 1-92 348-424 (444)
21 PF03051 Peptidase_C1_2: Pepti 99.1 1.2E-11 2.5E-16 127.1 1.1 78 1-96 346-426 (438)
22 cd00585 Peptidase_C1B Peptidas 99.0 2.1E-10 4.4E-15 118.0 2.5 77 1-95 345-424 (437)
23 KOG1544|consensus 98.8 7.1E-09 1.5E-13 101.5 6.5 89 212-316 354-445 (470)
24 COG4870 Cysteine protease [Pos 98.7 1E-08 2.2E-13 101.9 4.3 47 273-321 263-318 (372)
25 KOG4128|consensus 98.6 2.2E-08 4.7E-13 98.4 2.9 70 1-86 358-431 (457)
26 PF14399 Transpep_BrtH: NlpC/p 77.8 7.5 0.00016 37.8 7.2 61 207-299 73-133 (317)
27 PF13529 Peptidase_C39_2: Pept 77.2 7.3 0.00016 32.2 6.0 19 213-231 90-108 (144)
28 PF05543 Peptidase_C47: Stapho 70.8 21 0.00045 32.9 7.6 30 273-304 118-147 (175)
29 PF09778 Guanylate_cyc_2: Guan 65.6 11 0.00025 35.7 5.0 116 149-300 63-181 (212)
30 cd00044 CysPc Calpains, domain 60.5 52 0.0011 32.4 9.0 51 274-326 235-312 (315)
31 PF12385 Peptidase_C70: Papain 48.4 28 0.00061 31.7 4.3 19 213-231 99-117 (166)
32 COG4990 Uncharacterized protei 47.2 26 0.00057 32.6 4.0 24 275-305 148-171 (195)
33 cd02549 Peptidase_C39A A sub-f 45.2 1E+02 0.0022 25.7 7.2 59 215-320 70-128 (141)
34 COG5572 Predicted integral mem 42.8 8.3 0.00018 31.8 0.1 18 5-22 3-20 (104)
35 PF09713 A_thal_3526: Plant pr 35.8 25 0.00054 26.3 1.7 30 1-32 13-44 (54)
36 PF01640 Peptidase_C10: Peptid 35.5 1.5E+02 0.0033 27.1 7.2 59 206-312 134-192 (192)
37 PF09028 Mac-1: Mac 1; InterP 33.2 1.5E+02 0.0032 30.1 7.0 47 206-290 227-273 (333)
38 PF04046 PSP: PSP; InterPro: 27.0 38 0.00082 24.8 1.3 22 348-369 2-23 (48)
39 cd02620 Peptidase_C1A_Cathepsi 26.4 18 0.0004 34.1 -0.5 39 14-71 184-222 (236)
40 smart00581 PSP proline-rich do 25.9 42 0.00091 25.2 1.4 23 347-369 5-27 (54)
41 PF07910 Peptidase_C78: Peptid 25.5 72 0.0016 30.3 3.3 22 274-297 155-176 (218)
42 PTZ00200 cysteine proteinase; 24.3 16 0.00035 38.4 -1.4 57 13-86 385-443 (448)
43 PF00648 Peptidase_C2: Calpain 23.7 4.8E+02 0.01 25.2 8.8 30 274-303 213-243 (298)
44 smart00230 CysPc Calpain-like 23.2 4.8E+02 0.01 25.8 8.8 54 274-328 227-307 (318)
45 COG3423 Nlp Predicted transcri 23.2 71 0.0015 25.7 2.3 44 96-139 15-64 (82)
46 PTZ00203 cathepsin L protease; 20.9 27 0.00058 35.4 -0.6 40 13-71 285-324 (348)
No 1
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=100.00 E-value=2.1e-103 Score=798.44 Aligned_cols=316 Identities=38% Similarity=0.695 Sum_probs=272.7
Q ss_pred hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191 22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY 94 (377)
Q Consensus 22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L 94 (377)
++|||||||||+||++||+|++ +|+|+|+++++++.|++|||+|+|+.++|+||||||+++|| |+.||.+|
T Consensus 96 l~F~DKlEKaN~fLe~ii~~~~--~~~d~R~v~~ll~~~~~DGGqw~~~~nli~KYGvVPk~~mpet~~s~~t~~~n~~l 173 (438)
T PF03051_consen 96 LFFWDKLEKANYFLENIIDTAD--EPLDDRLVRFLLKNPVSDGGQWDMVVNLIKKYGVVPKSVMPETFSSSNTSEMNEML 173 (438)
T ss_dssp HHHHHHHHHHHHHHHHHHHCCT--S-TTSHHHHHHHHSTT-S-B-HHHHHHHHHHH---BGGGSTTGCGCHBHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHhc--CCcchHHHHHHHhcCCCCCCchHHHHHHHHHcCcCcHhhCCCCCCCCChHHHHHHH
Confidence 6999999999999999999986 99999999999999999999999999999999999999999 89999999
Q ss_pred hhhHHHHHHHHHH--hhccChhhh---hHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191 95 NNQPVELLMKLAA--ESIKNNEAV---WFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ 169 (377)
Q Consensus 95 ~~~Lr~~a~~lr~--~~~~~~~~l---~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T 169 (377)
+++||++|++||+ ++|++.++| .++||++|||||++|||+||++|+|+|+||+++||+ .++||
T Consensus 174 ~~~Lr~~a~~LR~~~~~~~~~~~l~~~k~~~l~~iy~il~~~lG~PP~~F~~ey~dkd~~~~~------------~~~~T 241 (438)
T PF03051_consen 174 NTKLREYALELRKLVKAGKSEEELRKLKEEMLAEIYRILAIYLGEPPEKFTWEYRDKDKKYHR------------GKNYT 241 (438)
T ss_dssp HHHHHHHHHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHHHH---SSSEEEEEE-TTS-EEE------------EEEE-
T ss_pred HHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHcCCCChheeEEEecccccccc------------ccccC
Confidence 9999999999999 566654444 499999999999999999999999999999999999 99999
Q ss_pred cHHHHHHhhc-----------c---cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191 170 PVELLMKLAA-----------E---SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR 234 (377)
Q Consensus 170 P~eF~~k~v~-----------D---~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~ 234 (377)
|++||++||+ | +|||++.|++++++||+ |+++.|+|||+++|+++++++|++|+||||||||+++
T Consensus 242 P~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~ve~~~Nv~~g~~~~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~ 321 (438)
T PF03051_consen 242 PLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYTVEYLGNVVGGRPVRYLNVPIDELKDAAIKSLKAGYPVWFGCDVGKF 321 (438)
T ss_dssp HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEEETTTTSSTT-EEEEEEE--HHHHHHHHHHHHHTT--EEEEEETTTT
T ss_pred chhHHHHHhCCCCcceEEEeeCCCccCccceeEEEccCCCEECCcceeEeccCHHHHHHHHHHHHHcCCcEEEeccCCcc
Confidence 9999999997 2 79999999999999999 9999999999999999999999999999999999999
Q ss_pred CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191 235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT 314 (377)
Q Consensus 235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms 314 (377)
++++.||+|.+++|++.+||+++. |+|++|+++++|.+||||+|||+++|++|+ |.+|+||||||++.|++|||+||
T Consensus 322 ~~~k~Gi~D~~~~d~~~~fg~~~~--~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~-p~~wkVeNSWG~~~g~kGy~~ms 398 (438)
T PF03051_consen 322 FDRKNGIMDTDLYDYDSLFGVDFN--MSKAERLDYGESTMTHAMVITGVDLDEDGK-PVRWKVENSWGTDNGDKGYFYMS 398 (438)
T ss_dssp EETTTTEE-TTSB-HHHHHT--S---S-HHHHHHTTSS--EEEEEEEEEEE-TTSS-EEEEEEE-SBTTTSTBTTEEEEE
T ss_pred ccccchhhccchhhhhhhhccccc--cCHHHHHHhCCCCCceeEEEEEEEeccCCC-eeEEEEEcCCCCCCCCCcEEEEC
Confidence 999999999999999999999976 999999999999999999999999999999 89999999999999999999999
Q ss_pred hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccC
Q psy17191 315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG 373 (377)
Q Consensus 315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~ 373 (377)
++||++|||+||||| +|||++++++|++|||+||||||||
T Consensus 399 d~wf~e~v~~vvV~K-------------------~~lp~~~~~~~~~~p~~l~~wdpmg 438 (438)
T PF03051_consen 399 DDWFDEYVYQVVVHK-------------------KYLPKELLKKLEKEPIVLPPWDPMG 438 (438)
T ss_dssp HHHHHHHEEEEEEEG-------------------GGS-HHHHHHHHS--EEE-TTSCCC
T ss_pred HHHHHhhhheEEech-------------------hhCCHHHHHHhcCCCeECCCCCCCC
Confidence 999999999999999 9999999999999999999999998
No 2
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=100.00 E-value=1.1e-102 Score=751.94 Aligned_cols=320 Identities=32% Similarity=0.578 Sum_probs=309.7
Q ss_pred hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191 22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY 94 (377)
Q Consensus 22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L 94 (377)
+||||||||||||||+||+||+ +++++|++.+++..|..|||||+|+.++|+|||+|||++|| |+++|.+|
T Consensus 98 tfFwDKlEKaN~FleqIi~tad--q~ldsRlv~~LL~~PqqDGGQwdM~v~l~eKYGvVpK~~ypes~sSS~Sr~ln~~L 175 (444)
T COG3579 98 TFFWDKLEKANWFLEQIIETAD--QELDSRLVSFLLATPQQDGGQWDMFVSLFEKYGVVPKSVYPESFSSSNSRELNALL 175 (444)
T ss_pred HHHHHHHHHhhHHHHHHHhhcc--cchHHHHHHHHHcCccccCchHHHHHHHHHHhCCCchhhcccccccccHHHHHHHH
Confidence 6999999999999999999997 99999999999999999999999999999999999999999 88999999
Q ss_pred hhhHHHHHHHHHH--hhccC---hhhhhHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191 95 NNQPVELLMKLAA--ESIKN---NEAVWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ 169 (377)
Q Consensus 95 ~~~Lr~~a~~lr~--~~~~~---~~~l~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T 169 (377)
|++||+.|.-||. .+|++ .+++.++++.+||++|+..||.||+.|+|+||||+++||. +..+|
T Consensus 176 n~~LR~dAqiLR~a~~eg~~~~~v~~~kEe~l~eif~~l~~~lg~PP~~Fdf~YrdKd~~~h~------------~k~lT 243 (444)
T COG3579 176 NKLLRQDAQILRDALKEGADDDTVEALKEELLQEIFNFLAMTLGLPPEKFDFAYRDKDNKYHK------------EKGLT 243 (444)
T ss_pred HHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHcCCCchhcceEEeccccchhh------------hcCCC
Confidence 9999999999999 66665 3456699999999999999999999999999999999999 99999
Q ss_pred cHHHHHHhhc-c-------------cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191 170 PVELLMKLAA-E-------------SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR 234 (377)
Q Consensus 170 P~eF~~k~v~-D-------------~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~ 234 (377)
|++||++||+ | ++||++.|++++.+||+ |+.+.|+|||++.|+.++++++++|+||||||||++.
T Consensus 244 P~eFy~kyv~ldl~~yVslInaPtadkPygk~ytV~~LGnVvgg~~v~ylNv~me~lkkl~~~q~qagetVwFG~dvgq~ 323 (444)
T COG3579 244 PQEFYKKYVGLDLKDYVSLINAPTADKPYGKSYTVEFLGNVVGGRAVKYLNVDMERLKKLAIKQMQAGETVWFGCDVGQL 323 (444)
T ss_pred HHHHHHHhcCCCcccceeeccCCcCCCCCcceeehhhhccccCCceeEEecCcHHHHHHHHHHHHhcCCcEEeecCchhh
Confidence 9999999998 1 69999999999999999 9999999999999999999999999999999999999
Q ss_pred CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191 235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT 314 (377)
Q Consensus 235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms 314 (377)
++++.||++.+.++|++.+|+.+. ++||+|++|+||.|+|||+|||++.|++|+ |.+|+||||||++.|.+|||.||
T Consensus 324 s~rk~Gimdtd~~~~~s~~g~~~~--q~KA~RldY~eSLmTHAMvlTGvd~d~~g~-p~rwkVENSWG~d~G~~GyfvaS 400 (444)
T COG3579 324 SDRKTGIMDTDIYDYESSLGINLT--QDKAGRLDYGESLMTHAMVLTGVDLDETGN-PLRWKVENSWGKDVGKKGYFVAS 400 (444)
T ss_pred cccccceeeehhccchhhhCCCcc--cchhhccccchHHHHHHHHhhccccccCCC-ceeeEeecccccccCCCceEeeh
Confidence 999999999999999999999976 999999999999999999999999999998 99999999999999999999999
Q ss_pred hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccCccCC
Q psy17191 315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGTLAQ 377 (377)
Q Consensus 315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~~l~~ 377 (377)
+.||+||+|||||+| +|||+|++++++.|||+|+||||||+||+
T Consensus 401 d~wmdEytyQIvV~k-------------------~~l~~e~~~a~~~epivL~pWDPMGALA~ 444 (444)
T COG3579 401 DAWMDEYTYQIVVDK-------------------KFLPKEELAAYEEEPIVLAPWDPMGALAK 444 (444)
T ss_pred HhHhhhheeEEEEeh-------------------hhCCHHHHHhhcCCCeecCCCCccccccC
Confidence 999999999999999 99999999999999999999999999985
No 3
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=100.00 E-value=5.1e-101 Score=780.85 Aligned_cols=316 Identities=40% Similarity=0.709 Sum_probs=303.7
Q ss_pred hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191 22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY 94 (377)
Q Consensus 22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L 94 (377)
++|||||||||+|||+||+|++ +|+++|++++++..|++|||+|+|++++|+||||||+++|| |++||.+|
T Consensus 95 l~f~dklEkaN~fle~ii~~~~--~~~~~R~v~~ll~~~~~DGGqw~m~~~li~KYGvVPk~~~pet~~s~~t~~~n~~L 172 (437)
T cd00585 95 LFFWDKLEKANYFLENIIETAD--EPLDDRLVQFLLANPQNDGGQWDMLVNLIEKYGLVPKSVMPESFNSENSRRLNYLL 172 (437)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc--CCCccHHHHHHHhCCcCCCCchHHHHHHHHHcCCCcccccCCCcCccchHHHHHHH
Confidence 6999999999999999999996 99999999999999999999999999999999999999999 88999999
Q ss_pred hhhHHHHHHHHHH--hhccChhhh---hHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191 95 NNQPVELLMKLAA--ESIKNNEAV---WFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ 169 (377)
Q Consensus 95 ~~~Lr~~a~~lr~--~~~~~~~~l---~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T 169 (377)
+++||++|++||+ +++++.++| .++|+++||+||++|||+||++|+|+|+||+++||+ .+++|
T Consensus 173 ~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~~~lG~pP~~F~~~y~dkd~~~~~------------~~~~T 240 (437)
T cd00585 173 NRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILAIALGEPPEKFDWEYRDKDKKYHE------------IKELT 240 (437)
T ss_pred HHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHcCCCCceEEEEEEeCCCCeee------------CCCcC
Confidence 9999999999999 555555556 499999999999999999999999999999999999 88999
Q ss_pred cHHHHHHhhc-----------c-c--CCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191 170 PVELLMKLAA-----------E-S--IKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR 234 (377)
Q Consensus 170 P~eF~~k~v~-----------D-~--~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~ 234 (377)
|++||++||+ | + |||++.|++++++||+ |+++.|+|||+++|+++++++|++|+||||||||+++
T Consensus 241 P~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~~~ 320 (437)
T cd00585 241 PLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNVPMDVLKKAAIAQLKDGEPVWFGCDVGKF 320 (437)
T ss_pred HHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEecCHHHHHHHHHHHHhcCCCEEEEEEcChh
Confidence 9999999985 2 5 9999999999999999 9999999999999999999999999999999999988
Q ss_pred CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191 235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT 314 (377)
Q Consensus 235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms 314 (377)
+++++||++..+++++.+||+++. |||++|+++++|.+||||+||||+.|++|+ |.||+||||||++||++||++||
T Consensus 321 ~~~k~GI~d~~~~~~~~~f~~~~~--~~KaeRl~~~es~~tHAM~ivGv~~D~~g~-p~yw~VkNSWG~~~G~~Gy~~ms 397 (437)
T cd00585 321 SDRKSGILDTDLFDYELLFGIDFG--LNKAERLDYGESLMTHAMVLTGVDLDEDGK-PVKWKVENSWGEKVGKKGYFVMS 397 (437)
T ss_pred hccCCccccCcccchhhhcCcccc--CCHHHHHhhcCCcCCeEEEEEEEEecCCCC-cceEEEEcccCCCCCCCcceehh
Confidence 899999999999999999999976 999999999999999999999999998898 88999999999999999999999
Q ss_pred hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccC
Q psy17191 315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG 373 (377)
Q Consensus 315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~ 373 (377)
++||++|||+||||| +|||+++++++++|||+||||||||
T Consensus 398 ~~wf~~y~~~ivV~k-------------------k~l~~~~~~~l~~~p~~L~~WDpmg 437 (437)
T cd00585 398 DDWFDEYVYQVVVDK-------------------KYLPEEVLDLLKQEPIVLPPWDPMG 437 (437)
T ss_pred HHHHHhhheEEEeec-------------------ccCCHHHHHHhcCCCEECCCCCCCC
Confidence 999999999999999 9999999999999999999999998
No 4
>KOG4128|consensus
Probab=100.00 E-value=1.5e-93 Score=687.25 Aligned_cols=323 Identities=38% Similarity=0.639 Sum_probs=306.8
Q ss_pred hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191 22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY 94 (377)
Q Consensus 22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L 94 (377)
+|||||||||||||.+|+++|+|.+|+|+|+|++++.+|..|||+|.|++|+++||||+||++|+ |+.||.+|
T Consensus 103 LFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~~DGGqw~MfvNlVkKYGviPKkcy~~sysT~atrkmN~iL 182 (457)
T KOG4128|consen 103 LFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPVPDGGQWQMFVNLVKKYGVIPKKCYLHSYSTQATRKMNLIL 182 (457)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCCCCCchHHHHHHHHHHhCCCcHHhccccchhhHHHHHHHHH
Confidence 79999999999999999999999999999999999999999999999999999999999999999 78999999
Q ss_pred hhhHHHHHHHHHH--hhccChhhhh---HHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191 95 NNQPVELLMKLAA--ESIKNNEAVW---FGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ 169 (377)
Q Consensus 95 ~~~Lr~~a~~lr~--~~~~~~~~l~---~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T 169 (377)
++|||++|..||+ ..++...+|. .+|+++|+|+++||||+||+.|||+|+||+|+|++ +...|
T Consensus 183 ~~KlREfa~~Lr~~~~~~~~~~~i~~t~~emm~eiFrviciclg~PPe~FTWey~DkdK~yq~------------i~~~T 250 (457)
T KOG4128|consen 183 KSKLREFASMLRAQFTFNGNGCRIPDTIQEMMPEIFRVICICLGEPPEVFTWEYYDKDKRYQC------------INDMT 250 (457)
T ss_pred HHHHHHHHHHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhhhcCCCcceeeEEEeecccceee------------cCCCc
Confidence 9999999999999 4443333444 89999999999999999999999999999999999 99999
Q ss_pred cHHHHHHhhc-------------c---cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC
Q psy17191 170 PVELLMKLAA-------------E---SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS 232 (377)
Q Consensus 170 P~eF~~k~v~-------------D---~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~ 232 (377)
|+|||+++++ | +|||++.|.++..+|+. |++..|+|+|+|.+++++.++|+++.|||||||+.
T Consensus 251 pleFy~~~vk~~~n~~~~vcL~nDPR~~h~Y~kly~v~~lsNm~GG~~~~YnNq~~d~l~k~vv~sl~~~kaVwfgcd~~ 330 (457)
T KOG4128|consen 251 PLEFYEVMVKCTVNLDAFVCLGNDPRISHSYHKLYQVANLSNMMGGKPHRYNNQSMDILMKIVVTSLEGDKAVWFGCDIR 330 (457)
T ss_pred hHHHHHHHhccccCCCcceeecCCCCccccccceeeehhhhcccCCCceeecCCCHHHHHHHHHHHhcCCcceEEecccH
Confidence 9999999987 3 59999999999999999 99999999999999999999999999999999999
Q ss_pred CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe-c-CCCCCCCeEEEEcccCCCCCCCeE
Q psy17191 233 KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI-D-KETEEPTKWRVENSWGEEQNHKGY 310 (377)
Q Consensus 233 ~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~-d-~~G~~~~yW~VkNSWG~~~G~kGY 310 (377)
+.++.|+|++|..+++++.+||.++. .++||||++|++|.+||||+|||++. | ++|- +..|+||||||.+.|.+||
T Consensus 331 k~~~~K~G~~dl~l~~~~l~fG~~l~-~~~KAeRl~y~eSlmthAml~T~v~~kd~~~g~-~~~~rVenswgkd~gkkg~ 408 (457)
T KOG4128|consen 331 KAISLKSGPLDLRLHQFDLLFGFKLG-ESTKAERLDYRESLMTHAMLLTSVGLKDPATGG-LNEHRVENSWGKDLGKKGV 408 (457)
T ss_pred hhhhcccCccchhhccCceeeeeecc-ccchhhhhhHHHHHHHHHHHhhhccccCcccCC-chhhhhhchhhhhccccch
Confidence 99999999999999999999998875 78999999999999999999999994 3 3566 7999999999999999999
Q ss_pred EEEehhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhc-CCCeecCCCCccCccCC
Q psy17191 311 ILMTSPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFN-QEPTILPAWDPMGTLAQ 377 (377)
Q Consensus 311 ~~ms~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~l~~wdpm~~l~~ 377 (377)
..|+++||+||+|+||||. ++||+|++.++. +|||+||+||||||||.
T Consensus 409 ~~mt~ewf~EY~feiVVd~-------------------k~vP~evl~~~~~~epi~LPaWDPMGALa~ 457 (457)
T KOG4128|consen 409 NKMTAEWFREYAFEIVVDE-------------------KFVPSEVLHAARIHEPIELPAWDPMGALAN 457 (457)
T ss_pred hhhhHHHHHhhheeEEeec-------------------ccCcHHHHHHhhhcCCeeccccCchhhhcC
Confidence 9999999999999999999 999999999998 99999999999999984
No 5
>KOG1543|consensus
Probab=99.84 E-value=1.4e-20 Score=186.38 Aligned_cols=165 Identities=17% Similarity=0.179 Sum_probs=121.7
Q ss_pred HHHHHHHHhHHHHhcC-CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHhhc----ccCCCCcce-eec
Q psy17191 118 FGCENRIIRIRIIYNN-QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKLAA----ESIKNNEAV-WFG 191 (377)
Q Consensus 118 ~~~l~~v~~il~i~lG-~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~----D~~py~~~y-~~~ 191 (377)
-.....|..+.+|..| ..+.....++.||...... ||++|. ...-.+|.+++-. +.+||.-.. ...
T Consensus 137 Faa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~---GC~GG~-----~~~A~~yi~~~G~~t~~~~Ypy~~~~~~C~ 208 (325)
T KOG1543|consen 137 FAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGD---GCNGGE-----PKNAFKYIKKNGGVTECENYPYIGKDGTCK 208 (325)
T ss_pred HHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCC---CcCCCC-----HHHHHHHHHHhCCCCCCcCCCCcCCCCCcc
Confidence 4566778888888999 8999999999999886444 999986 2233344444322 268873222 222
Q ss_pred cCC-ceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191 192 CEN-RIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY 269 (377)
Q Consensus 192 ~~~-Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~ 269 (377)
... +-......+.++|.++ ..++.++...+||.+++|++. +..|++||+..+.++
T Consensus 209 ~~~~~~~~~~~~~~~~~~~e--~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~--------------------- 265 (325)
T KOG1543|consen 209 SNKKDKTVTIKGFYNVPANE--EAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGD--------------------- 265 (325)
T ss_pred CCCccceeEeeeeeecCcCH--HHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCC---------------------
Confidence 111 2224445777899883 556667777779999999996 447999999887642
Q ss_pred CCC-CCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhh
Q psy17191 270 GES-SMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWF 318 (377)
Q Consensus 270 ~es-~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf 318 (377)
+ ..+|||+||||+. .+|. +||+|+||||++||++|||+|.++=-
T Consensus 266 --~~~~~Hav~iVGyG~-~~~~--~YWivkNSWG~~WGe~Gy~ri~r~~~ 310 (325)
T KOG1543|consen 266 --DKEGDHAVLIVGYGT-GDGV--DYWIVKNSWGTDWGEKGYFRIARGVN 310 (325)
T ss_pred --CCCCCceEEEEEEcC-CCCc--eeEEEEcCCCCCcccCceEEEecCCC
Confidence 2 4799999999999 6787 99999999999999999999988643
No 6
>KOG1542|consensus
Probab=99.84 E-value=1.1e-20 Score=184.70 Aligned_cols=157 Identities=14% Similarity=0.137 Sum_probs=119.0
Q ss_pred HHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHhhc--------ccCCC---Cc-c
Q psy17191 120 CENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKLAA--------ESIKN---NE-A 187 (377)
Q Consensus 120 ~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~--------D~~py---~~-~ 187 (377)
.-..|...-.|..|+.....++|+.|||..= =|||||. ++ . +-+|+. +.||| .+ .
T Consensus 186 ~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d----~gC~GGl-------~~-n-A~~~~~~~gGL~~E~dYPY~g~~~~~ 252 (372)
T KOG1542|consen 186 TTGAVEGAWAIATGKLVSLSEQELVDCDSCD----NGCNGGL-------MD-N-AFKYIKKAGGLEKEKDYPYTGKKGNQ 252 (372)
T ss_pred hhhhhhhHHHhhcCcccccchhhhhcccCcC----CcCCCCC-------hh-H-HHHHHHHhCCccccccCCccccCCCc
Confidence 4567888889999999999999999999642 2999996 32 2 224432 38999 23 6
Q ss_pred eeeccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeec--ccccccccccCCCCCcCCHHH
Q psy17191 188 VWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDL--EIHNFKAVFDSDVSLPMSKAE 265 (377)
Q Consensus 188 y~~~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~--~~~d~~~~fg~~~~~~~~kae 265 (377)
|.+....++ ..+..+.-+|.++ +.|++.|.+.+||.+|+++.....|..||..+ ..|
T Consensus 253 C~~~~~~~~-v~I~~f~~l~~nE--~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~C------------------ 311 (372)
T KOG1542|consen 253 CHFDKSKIV-VSIKDFSMLSNNE--DQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYIC------------------ 311 (372)
T ss_pred cccchhhce-EEEeccEecCCCH--HHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccC------------------
Confidence 666554444 4555787888877 44666777778999999976777899999865 232
Q ss_pred HhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 266 RMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 266 R~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
.....|||++||||+...-++ +|||||||||+.||++|||++.++
T Consensus 312 ----s~~~~~HaVLlvGyG~~g~~~--PYWIVKNSWG~~WGE~GY~~l~RG 356 (372)
T KOG1542|consen 312 ----SPKLLNHAVLLVGYGSSGYEK--PYWIVKNSWGTSWGEKGYYKLCRG 356 (372)
T ss_pred ----CccccCceEEEEeecCCCCCC--ceEEEECCccccccccceEEEecc
Confidence 122389999999999852245 899999999999999999999875
No 7
>PTZ00203 cathepsin L protease; Provisional
Probab=99.83 E-value=4.5e-20 Score=184.28 Aligned_cols=160 Identities=16% Similarity=0.191 Sum_probs=111.5
Q ss_pred HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh----hc--ccCCCC-ccee-e
Q psy17191 119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL----AA--ESIKNN-EAVW-F 190 (377)
Q Consensus 119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~----v~--D~~py~-~~y~-~ 190 (377)
+....|...+.+.-|..+..+.+++.||+.. . -||++|. .....+|.++. +. +.+||- +.-. -
T Consensus 154 a~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~-~---~GC~GG~-----~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~ 224 (348)
T PTZ00203 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHV-D---NGCGGGL-----MLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVP 224 (348)
T ss_pred hhHHHHHHHHHHhcCCCccCCHHHHHhccCC-C---CCCCCCC-----HHHHHHHHHHhcCCCCCccccCCCccCCCCCC
Confidence 3445566666777899889999999999753 2 3999996 12234455432 22 378983 1110 0
Q ss_pred ccC--Cce-E-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHH
Q psy17191 191 GCE--NRI-I-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAER 266 (377)
Q Consensus 191 ~~~--~Nv-~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR 266 (377)
.|. ... . .....|..++.++ +.++.+|..++||++++|+..+..|++||++. |.
T Consensus 225 ~C~~~~~~~~~~~i~~~~~i~~~e--~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy~~--c~------------------ 282 (348)
T PTZ00203 225 ECSNSSELAPGARIDGYVSMESSE--RVMAAWLAKNGPISIAVDASSFMSYHSGVLTS--CI------------------ 282 (348)
T ss_pred cCCCCcccccceEecceeecCcCH--HHHHHHHHhCCCEEEEEEhhhhcCccCceeec--cC------------------
Confidence 111 111 2 3344676676543 45666777778999999997666899999863 31
Q ss_pred hhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 267 MMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 267 ~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
.+..||||+||||+.+ +|. +|||||||||++||++|||+|+++
T Consensus 283 ----~~~~nHaVliVGYG~~-~g~--~YWiikNSWG~~WGe~GY~ri~rg 325 (348)
T PTZ00203 283 ----GEQLNHGVLLVGYNMT-GEV--PYWVIKNSWGEDWGEKGYVRVTMG 325 (348)
T ss_pred ----CCCCCeEEEEEEEecC-CCc--eEEEEEcCCCCCcCcCceEEEEcC
Confidence 1246999999999975 787 899999999999999999999875
No 8
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.79 E-value=1e-18 Score=165.36 Aligned_cols=159 Identities=16% Similarity=0.150 Sum_probs=102.2
Q ss_pred HHHHhHHHHhcC--CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCc-ceee-----
Q psy17191 122 NRIIRIRIIYNN--QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNE-AVWF----- 190 (377)
Q Consensus 122 ~~v~~il~i~lG--~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~-~y~~----- 190 (377)
..+...+.+..+ ..+..+.+++.||...- ..||++|. ....+++.++. +. +.+||.. ....
T Consensus 35 ~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~---~~gC~GG~-----~~~a~~~i~~~G~~~e~~yPY~~~~~~~~~~~~ 106 (236)
T cd02620 35 EAFSDRLCIQSNGKENVLLSAQDLLSCCSGC---GDGCNGGY-----PDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGP 106 (236)
T ss_pred HHHhhHHHHhcCCCCccccCHHHHHhhcCCC---CCCCCCCC-----HHHHHHHHHhcCCCcCCEecCcCCCCccCCCCC
Confidence 344455555556 66778899999997631 23999886 12334444332 11 3678832 1110
Q ss_pred -----------ccCCce----E---EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccc
Q psy17191 191 -----------GCENRI----I---RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKA 251 (377)
Q Consensus 191 -----------~~~~Nv----~---g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~ 251 (377)
.+.... . .....+..++.++ ..++.+|.+.+||.++++++ .++.|++||++.+.
T Consensus 107 ~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~--~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~----- 179 (236)
T cd02620 107 PPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDE--TDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTS----- 179 (236)
T ss_pred CCCCCCCCCCCCCCcCCccccceeeeeecceeeeCCHH--HHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecC-----
Confidence 011111 1 1222333444322 34666777777999999997 45579999986431
Q ss_pred cccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 252 VFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 252 ~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
.....+|||+||||+.+ +|+ +||+||||||++||++|||+|+++
T Consensus 180 ------------------~~~~~~HaV~iVGyg~~-~g~--~YWivrNSWG~~WGe~Gy~ri~~~ 223 (236)
T cd02620 180 ------------------GKQLGGHAVKIIGWGVE-NGV--PYWLAANSWGTDWGENGYFRILRG 223 (236)
T ss_pred ------------------CCCcCCeEEEEEEEecc-CCe--eEEEEEeCCCCCCCCCcEEEEEcc
Confidence 12235899999999976 787 899999999999999999999875
No 9
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.79 E-value=1.3e-18 Score=164.95 Aligned_cols=149 Identities=14% Similarity=0.194 Sum_probs=100.0
Q ss_pred hhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCcc-eeec-------cC--------Cce
Q psy17191 136 VELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNEA-VWFG-------CE--------NRI 196 (377)
Q Consensus 136 P~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~~-y~~~-------~~--------~Nv 196 (377)
...+.++..||..+ -||++|. .....+|+++. +. ..+||... .... +. .+.
T Consensus 55 ~~lS~Q~lldC~~~-----~gC~GG~-----~~~a~~~~~~~Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~ 124 (239)
T cd02698 55 VYLSVQVVIDCAGG-----GSCHGGD-----PGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNY 124 (239)
T ss_pred cccCHHHHHhCCCC-----CCccCcC-----HHHHHHHHHHcCcCCCCeeCCcCCCCCCcCCCCCCCcccCccccccccc
Confidence 35778888999763 2999996 22334555443 11 26788321 1100 00 111
Q ss_pred E-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCC
Q psy17191 197 I-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSM 274 (377)
Q Consensus 197 ~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~ 274 (377)
. .....|.+++.+ +.++.+|.+.+||.+++++.. ++.|++||++...+ ....
T Consensus 125 ~~~~i~~~~~~~~~---~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~-----------------------~~~~ 178 (239)
T cd02698 125 TLYFVSDYGSVSGR---DKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQ-----------------------DPLI 178 (239)
T ss_pred ceEEeeeceecCCH---HHHHHHHHHcCCEEEEEEecccccccCCeEEccCCC-----------------------CCcC
Confidence 1 233456566532 335555656679999999985 55799999976532 2346
Q ss_pred CeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhhhccc
Q psy17191 275 THAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYV 322 (377)
Q Consensus 275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y~ 322 (377)
+|||+||||+.+.+|+ +||+||||||++||++|||+|+++.+++--
T Consensus 179 ~HaV~IVGyG~~~~g~--~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~ 224 (239)
T cd02698 179 NHIISVAGWGVDENGV--EYWIVRNSWGEPWGERGWFRIVTSSYKGAR 224 (239)
T ss_pred CeEEEEEEEEecCCCC--EEEEEEcCCCcccCcCceEEEEccCCcccc
Confidence 8999999999874487 899999999999999999999998866443
No 10
>PTZ00021 falcipain-2; Provisional
Probab=99.79 E-value=1e-18 Score=180.64 Aligned_cols=160 Identities=19% Similarity=0.177 Sum_probs=109.0
Q ss_pred HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc--ccCCCCc----ceee
Q psy17191 119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA--ESIKNNE----AVWF 190 (377)
Q Consensus 119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~--D~~py~~----~y~~ 190 (377)
+....+...+.|.-|.......+++.||+.. .-||++|.. ....++..+. +. +.+||.. .|..
T Consensus 294 Aa~~alEs~~~I~~g~~v~LSeQqLVDCs~~----n~GC~GG~~-----~~Af~yi~~~gGl~tE~~YPY~~~~~~~C~~ 364 (489)
T PTZ00021 294 STVGVVESQYAIRKNELVSLSEQELVDCSFK----NNGCYGGLI-----PNAFEDMIELGGLCSEDDYPYVSDTPELCNI 364 (489)
T ss_pred HHHHHHHHHHHHHcCCCcccCHHHHhhhccC----CCCCCCcch-----HhhhhhhhhccccCcccccCccCCCCCcccc
Confidence 4556666677788899999999999999853 249999961 1122233222 11 3789932 2322
Q ss_pred ccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191 191 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY 269 (377)
Q Consensus 191 ~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~ 269 (377)
....+. ..+..|..+|.+. +..+|...+||++++|++. +..|++||++.+ |
T Consensus 365 ~~~~~~-~~i~~y~~i~~~~----lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~-C---------------------- 416 (489)
T PTZ00021 365 DRCKEK-YKIKSYVSIPEDK----FKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE-C---------------------- 416 (489)
T ss_pred cccccc-ceeeeEEEecHHH----HHHHHHhcCCeEEEEEeecccccCCCCcCCCC-C----------------------
Confidence 222222 3445777887544 4455666679999999984 557999998753 3
Q ss_pred CCCCCCeEEEEEEEEecCC-------CCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 270 GESSMTHAMVISAVSIDKE-------TEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 270 ~es~~~HAM~ivG~~~d~~-------G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
....+|||+||||+.+.. |+...|||||||||++||++|||+|+++
T Consensus 417 -~~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~ 469 (489)
T PTZ00021 417 -GEEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETD 469 (489)
T ss_pred -CCccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcC
Confidence 123699999999997521 1112699999999999999999999876
No 11
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.78 E-value=2.6e-18 Score=157.76 Aligned_cols=157 Identities=17% Similarity=0.195 Sum_probs=104.6
Q ss_pred hHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-h-c-ccCCCCcc-eeeccCCceE-EEe
Q psy17191 126 RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-A-A-ESIKNNEA-VWFGCENRII-RIR 200 (377)
Q Consensus 126 ~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v-~-D~~py~~~-y~~~~~~Nv~-g~~ 200 (377)
..+.+..|.....+..++.++..... -||++|.. ...++|.+++ + . +.+||... -.......-. .+.
T Consensus 35 ~~~~i~~~~~~~lS~q~l~~c~~~~~---~gC~GG~~-----~~a~~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i 106 (210)
T cd02248 35 GAYAIKTGKLVSLSEQQLVDCSTSGN---NGCNGGNP-----DNAFEYVKNGGLASESDYPYTGKDGTCKYNSSKVGAKI 106 (210)
T ss_pred HHHHHHcCCCcccCHHHHhccCCCCC---CCCCCCCH-----HHhHHHHHHCCcCccccCCccCCCCCccCCCCcccEEE
Confidence 33344455666667777788765412 38998862 1234444443 1 1 37888531 1111111112 445
Q ss_pred eEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEE
Q psy17191 201 IIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMV 279 (377)
Q Consensus 201 ~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ 279 (377)
..|..++..+. +.++.+|..++||.+++++.. +..++.||++.+.++ ....+|||+
T Consensus 107 ~~~~~i~~~~~-~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~----------------------~~~~~Hav~ 163 (210)
T cd02248 107 TGYSNVPPGDE-EALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCS----------------------NTNLNHAVL 163 (210)
T ss_pred eeEEEcCCCcH-HHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCC----------------------CCcCCEEEE
Confidence 56777776433 446777777889999999985 447899999776431 234689999
Q ss_pred EEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 280 ISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 280 ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
||||+.+ .|. .||+||||||++||++||++|+.+
T Consensus 164 iVGy~~~-~~~--~ywiv~NSWG~~WG~~Gy~~i~~~ 197 (210)
T cd02248 164 LVGYGTE-NGV--DYWIVKNSWGTSWGEKGYIRIARG 197 (210)
T ss_pred EEEEeec-CCc--eEEEEEcCCCCccccCcEEEEEcC
Confidence 9999976 577 899999999999999999999863
No 12
>PTZ00200 cysteine proteinase; Provisional
Probab=99.77 E-value=2.2e-18 Score=177.09 Aligned_cols=157 Identities=17% Similarity=0.232 Sum_probs=105.5
Q ss_pred HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCc---ceeecc
Q psy17191 119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNE---AVWFGC 192 (377)
Q Consensus 119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~---~y~~~~ 192 (377)
+....|...+.+.-|.......+++.||+.. .-||++|. .....++.++. +. +.+||.. .|...
T Consensus 263 at~~aiEs~~~i~~~~~~~LSeQqLvDC~~~----~~GC~GG~-----~~~A~~yi~~~Gi~~e~~YPY~~~~~~C~~~- 332 (448)
T PTZ00200 263 SSVGSVESLYKIYRDKSVDLSEQELVNCDTK----SQGCSGGY-----PDTALEYVKNKGLSSSSDVPYLAKDGKCVVS- 332 (448)
T ss_pred hHHHHHHHHHHHhcCCCeecCHHHHhhccCc----cCCCCCCc-----HHHHHHHHhhcCccccccCCCCCCCCCCcCC-
Confidence 3445566666677788888999999999852 24999996 11233444332 11 3789932 22211
Q ss_pred CCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCC
Q psy17191 193 ENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGE 271 (377)
Q Consensus 193 ~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~e 271 (377)
......+..|..++.+ +++.+++. .+||.++++++ .++.|++||++.+ | .
T Consensus 333 -~~~~~~i~~y~~~~~~---~~l~~~l~-~GPV~v~i~~~~~f~~Yk~GIy~~~-C-----------------------~ 383 (448)
T PTZ00200 333 -STKKVYIDSYLVAKGK---DVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNGE-C-----------------------G 383 (448)
T ss_pred -CCCeeEecceEecCHH---HHHHHHHh-cCCEEEEeecccccccCCCCccccc-c-----------------------C
Confidence 1111233355544433 33445554 46999999998 4558999998653 2 1
Q ss_pred CCCCeEEEEEEEEec-CCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 272 SSMTHAMVISAVSID-KETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 272 s~~~HAM~ivG~~~d-~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
+..+|||+||||+.| ++|. +||+||||||++||++|||+|+++
T Consensus 384 ~~~nHaV~lVGyG~d~~~g~--~YWIIkNSWG~~WGe~GY~ri~r~ 427 (448)
T PTZ00200 384 KSLNHAVLLVGEGYDEKTKK--RYWIIKNSWGTDWGENGYMRLERT 427 (448)
T ss_pred CCCcEEEEEEEecccCCCCC--ceEEEEcCCCCCcccCeeEEEEeC
Confidence 236999999999875 3677 899999999999999999999874
No 13
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.76 E-value=5.6e-18 Score=160.70 Aligned_cols=156 Identities=17% Similarity=0.150 Sum_probs=94.0
Q ss_pred CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc-ccCCCCc-ceeeccCCce---E-EEeeEEEe
Q psy17191 134 QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA-ESIKNNE-AVWFGCENRI---I-RIRIIYNN 205 (377)
Q Consensus 134 ~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~-D~~py~~-~y~~~~~~Nv---~-g~~~~y~N 205 (377)
..+....+++.||.. +. -||++|. ...-.++.+++ +. +.+||.. .-. .+..+. . .....|.-
T Consensus 54 ~~~~lS~q~l~dC~~-~~---~GC~GG~-----~~~a~~~~~~~Gi~~e~~yPY~~~~~~-~C~~~~~~~~~~~~~~~~~ 123 (243)
T cd02621 54 QQPILSPQHVLSCSQ-YS---QGCDGGF-----PFLVGKFAEDFGIVTEDYFPYTADDDR-PCKASPSECRRYYFSDYNY 123 (243)
T ss_pred cCcccCHHHhhhhcC-CC---CCCCCCC-----HHHHHHHHHhcCcCCCceeCCCCCCCC-CCCCCccccccccccceeE
Confidence 366777888889874 22 3999996 11223344332 11 3688843 111 111111 0 11111211
Q ss_pred cC----hhHHHHHHHHHhccCCeEEEEEEcCCCC-cCCCCeeeccc----ccccccccCCCCCcCCHHHHhhhCCCCCCe
Q psy17191 206 QP----VELLMKLAAESIKNNEAVWFGCEVSKRF-ANKLGLNDLEI----HNFKAVFDSDVSLPMSKAERMMYGESSMTH 276 (377)
Q Consensus 206 vp----~~~l~~~~~~~L~~g~PV~~g~Dv~~~f-~~k~GI~d~~~----~d~~~~fg~~~~~~~~kaeR~~~~es~~~H 276 (377)
+. ..+. +.++.+|.+.+||.+++++...| .|+.||++.+. |+-.. ... ......+|
T Consensus 124 i~~~~~~~~~-~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~---~~~-----------~~~~~~~H 188 (243)
T cd02621 124 VGGCYGCTNE-DEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDN---DNF-----------NPFELTNH 188 (243)
T ss_pred cccccccCCH-HHHHHHHHHcCCEEEEEEecccccccCCeEECcCCccccccccc---ccc-----------cCcccCCe
Confidence 10 1111 34556676678999999998544 79999998753 21100 000 00124689
Q ss_pred EEEEEEEEecC-CCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 277 AMVISAVSIDK-ETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 277 AM~ivG~~~d~-~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
||+||||+.+. +|. +||+||||||++||++|||+|+++
T Consensus 189 aV~iVGyg~~~~~g~--~YWiirNSWG~~WGe~Gy~~i~~~ 227 (243)
T cd02621 189 AVLLVGWGEDEIKGE--KYWIVKNSWGSSWGEKGYFKIRRG 227 (243)
T ss_pred EEEEEEeeccCCCCC--cEEEEEcCCCCCCCcCCeEEEecC
Confidence 99999999864 377 899999999999999999999874
No 14
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.74 E-value=3.3e-17 Score=150.25 Aligned_cols=179 Identities=18% Similarity=0.093 Sum_probs=113.2
Q ss_pred HHHHHHhHHHHhcC--CChhHHHHHHHhhCCCcce-eeeeccCCccccccCCCcHHHHHHhhc-------ccCCCCccee
Q psy17191 120 CENRIIRIRIIYNN--QPVELLMKLAAESIKNNEA-VWFGCENSRIRIIYNNQPVELLMKLAA-------ESIKNNEAVW 189 (377)
Q Consensus 120 ~l~~v~~il~i~lG--~pP~~F~~~~~d~~k~y~~-~~~Gc~~~~~~~~~~~TP~eF~~k~v~-------D~~py~~~y~ 189 (377)
....+...+.+..+ ++......++.|+...... ..-||++|. |...+..++. +.+||...-.
T Consensus 26 ~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~--------~~~~~~~~~~~~Gi~~e~~~Py~~~~~ 97 (223)
T cd02619 26 SAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGG--------PLSALLKLVALKGIPPEEDYPYGAESD 97 (223)
T ss_pred HHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCc--------HHHHHHHHHHHcCCCccccCCCCCCCC
Confidence 34444455555555 6667777788888765420 013888885 4444443332 3688843322
Q ss_pred ec-c----CCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCC
Q psy17191 190 FG-C----ENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMS 262 (377)
Q Consensus 190 ~~-~----~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~ 262 (377)
.. . ..+.. .+...|..+...++ +.++.+|.+++||.+++.+.. ++.++.|+...+... . .
T Consensus 98 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~-~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~-----~-~------ 164 (223)
T cd02619 98 GEEPKSEAALNAAKVKLKDYRRVLKNNI-EDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVY-----L-L------ 164 (223)
T ss_pred CCCCCCccchhhcceeecceeEeCchhH-HHHHHHHHHCCCEEEEEEcccchhcccCccccccccc-----c-c------
Confidence 11 1 11112 33345555555433 457777888889999999985 446788875310000 0 0
Q ss_pred HHHHhhhCCCCCCeEEEEEEEEecC-CCCCCCeEEEEcccCCCCCCCeEEEEehhhhhcccEEEE
Q psy17191 263 KAERMMYGESSMTHAMVISAVSIDK-ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVV 326 (377)
Q Consensus 263 kaeR~~~~es~~~HAM~ivG~~~d~-~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y~~~vv 326 (377)
.......+|||+||||+.+. .|+ .||+|+||||++||++||++|+.+-+.++++...
T Consensus 165 -----~~~~~~~~Hav~ivGy~~~~~~~~--~~~i~~NSwG~~wg~~Gy~~i~~~~~~~~~~~~~ 222 (223)
T cd02619 165 -----YEDGDLGGHAVVIVGYDDNYVEGK--GAFIVKNSWGTDWGDNGYGRISYEDVYEMTFGAN 222 (223)
T ss_pred -----cCCCccCCeEEEEEeecCCCCCCC--CEEEEEeCCCCccccCCEEEEehhhhhhhhhccc
Confidence 01234568999999999752 166 8999999999999999999999999998877653
No 15
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.69 E-value=2.1e-16 Score=172.30 Aligned_cols=165 Identities=18% Similarity=0.243 Sum_probs=111.1
Q ss_pred HHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHH---Hh--hc--ccCCCCc----
Q psy17191 118 FGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLM---KL--AA--ESIKNNE---- 186 (377)
Q Consensus 118 ~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~---k~--v~--D~~py~~---- 186 (377)
-+....+..+.++.-|.......+++.||...+.. -||++|. .|.+|+. +. +. ..+||..
T Consensus 559 FASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn--~GC~GG~-------~~~efl~yI~e~GgLptESdYPYt~k~~~ 629 (1004)
T PTZ00462 559 FASKYHLETIKCMKGYEPHAISALYIANCSKGEHK--DRCDEGS-------NPLEFLQIIEDNGFLPADSNYLYNYTKVG 629 (1004)
T ss_pred HHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCC--CCCCCCC-------cHHHHHHHHHHcCCCcccccCCCccCCCC
Confidence 34566677777788788888999999999875532 4999997 6777752 22 22 2788831
Q ss_pred -ceeeccC--CceE---------------EEeeEEEecCh-------hHHHHHHHHHhccCCeEEEEEEcCCCCcC-CCC
Q psy17191 187 -AVWFGCE--NRII---------------RIRIIYNNQPV-------ELLMKLAAESIKNNEAVWFGCEVSKRFAN-KLG 240 (377)
Q Consensus 187 -~y~~~~~--~Nv~---------------g~~~~y~Nvp~-------~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~-k~G 240 (377)
.|..... .|.+ .....|..++. +.+.+++..+|.+.+||.++++++.++.| .+|
T Consensus 630 g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf~~Y~~sG 709 (1004)
T PTZ00462 630 EDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVLGYEFNG 709 (1004)
T ss_pred CCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehHHhhhcCC
Confidence 2211100 0111 01123433332 33345677788877799999999766677 489
Q ss_pred eeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecC----CCCCCCeEEEEcccCCCCCCCeEEEEeh
Q psy17191 241 LNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDK----ETEEPTKWRVENSWGEEQNHKGYILMTS 315 (377)
Q Consensus 241 I~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~----~G~~~~yW~VkNSWG~~~G~kGY~~ms~ 315 (377)
|+..+.| +....+|||+||||+.+. +|+ .||+||||||+.||++|||+|.+
T Consensus 710 Iyv~~~C----------------------gs~~~nHAVlIVGYGt~in~eg~gk--~YWIVRNSWGt~WGEnGYFKI~r 764 (1004)
T PTZ00462 710 KKVQNLC----------------------GDDTADHAVNIVGYGNYINDEDEKK--SYWIVRNSWGKYWGDEGYFKVDM 764 (1004)
T ss_pred ccccCCC----------------------CCCcCCceEEEEEecccccccCCCC--ceEEEEcCCCCCcCCCeEEEEEe
Confidence 8755544 123468999999999742 255 89999999999999999999987
No 16
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.69 E-value=7.1e-17 Score=147.87 Aligned_cols=162 Identities=17% Similarity=0.212 Sum_probs=103.7
Q ss_pred HHHHhHHHHhc-CCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHH-h-hc--ccCCCCcc----eeecc
Q psy17191 122 NRIIRIRIIYN-NQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMK-L-AA--ESIKNNEA----VWFGC 192 (377)
Q Consensus 122 ~~v~~il~i~l-G~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k-~-v~--D~~py~~~----y~~~~ 192 (377)
..+...+.+.. +..+.....++.|+...+.. ||+++. ...-.++.++ . +. +.+||... +...
T Consensus 33 ~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~---~c~gg~-----~~~a~~~~~~~~Gi~~e~~~pY~~~~~~~c~~~- 103 (219)
T PF00112_consen 33 AALESRLAIQNNGKNVDLSEQYLIDCSNKYNK---GCDGGS-----PFDALKYIKNNNGIVTEEDYPYNGNENPTCKSK- 103 (219)
T ss_dssp HHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSS---TTBBBE-----HHHHHHHHHHHTSBEBTTTS--SSSSSCSSCHS-
T ss_pred cceecccccccccccccccccccccccccccc---ccccCc-----ccccceeecccCccccccccccccccccccccc-
Confidence 33444444444 67777777778888763333 898885 1233445544 2 11 36888532 2221
Q ss_pred CCce-EEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CC-cCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191 193 ENRI-IRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RF-ANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY 269 (377)
Q Consensus 193 ~~Nv-~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f-~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~ 269 (377)
..+. ..+...|..++...+ +.++++|..++||.+++++.. .| .+++||++.+.++
T Consensus 104 ~~~~~~~~i~~~~~~~~~~~-~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~--------------------- 161 (219)
T PF00112_consen 104 KSNSYYVKIKGYGKVKDNDI-EDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCS--------------------- 161 (219)
T ss_dssp GGGEEEBEESEEEEEESTCH-HHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSS---------------------
T ss_pred ccccccccccccccccccch-hHHHHHHhhCceeeeeeeccccccccccceeeeccccc---------------------
Confidence 1111 123335555554332 446677878789999999996 35 6899999876331
Q ss_pred CCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhh
Q psy17191 270 GESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWF 318 (377)
Q Consensus 270 ~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf 318 (377)
....+|||+||||+.+ .|+ .||+|+||||++||++||++|+++-.
T Consensus 162 -~~~~~Hav~iVGy~~~-~~~--~~wiv~NSWG~~WG~~Gy~~i~~~~~ 206 (219)
T PF00112_consen 162 -NESGGHAVLIVGYDDE-NGK--GYWIVKNSWGTDWGDNGYFRISYDYN 206 (219)
T ss_dssp -SSSEEEEEEEEEEEEE-TTE--EEEEEE-SBTTTSTBTTEEEEESSSS
T ss_pred -cccccccccccccccc-cce--eeEeeehhhCCccCCCeEEEEeeCCC
Confidence 2346899999999987 677 89999999999999999999998754
No 17
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.68 E-value=2.7e-16 Score=164.55 Aligned_cols=167 Identities=12% Similarity=0.063 Sum_probs=98.8
Q ss_pred CCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccC--CCCcc------eeeccCCceE-E--
Q psy17191 133 NQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESI--KNNEA------VWFGCENRII-R-- 198 (377)
Q Consensus 133 G~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~--py~~~------y~~~~~~Nv~-g-- 198 (377)
|..+..+.++..||+. +. -||++|. .+.-.+|.+++ +. +.+ ||... |......... .
T Consensus 259 g~~~~LS~QqLVDCs~-~n---~GCdGG~-----p~~A~~yi~~~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~ 329 (548)
T PTZ00364 259 GQQTFLSARHVLDCSQ-YG---QGCAGGF-----PEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTN 329 (548)
T ss_pred CcccCcCHHHHhcccC-CC---CCCCCCc-----HHHHHHHHHhCCcccccccCCCCCCCCCCCCCCCCCcccceeeeee
Confidence 6677889999999974 33 3999996 12223444333 11 367 98321 2111111111 1
Q ss_pred --EeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCC
Q psy17191 199 --IRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMT 275 (377)
Q Consensus 199 --~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~ 275 (377)
....|..++.++ +.++.+|...+||.+++|+. .++.|++||+..-.+.+..-.... .-+...| .++....|
T Consensus 330 ~~~I~gyy~~~~~e--~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~---~~~~~~~-~~~~~~~n 403 (548)
T PTZ00364 330 YGPLGGYYGAVTDP--DEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTA---SADRPLR-HYFASNVN 403 (548)
T ss_pred eEEecceeecCCcH--HHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeecccccccccc---ccCCccc-ccccccCC
Confidence 111343444333 23455565667999999998 566899998753200000000000 0000000 01123469
Q ss_pred eEEEEEEEEecCCCCCCCeEEEEcccCC--CCCCCeEEEEehh
Q psy17191 276 HAMVISAVSIDKETEEPTKWRVENSWGE--EQNHKGYILMTSP 316 (377)
Q Consensus 276 HAM~ivG~~~d~~G~~~~yW~VkNSWG~--~~G~kGY~~ms~~ 316 (377)
|||+||||+.+++|. +||+||||||+ +||++|||+|.++
T Consensus 404 HAVlIVGYG~de~G~--~YWIVKNSWGt~~~WGE~GYfRI~RG 444 (548)
T PTZ00364 404 HTVLIIGWGTDENGG--DYWLVLDPWGSRRSWCDGGTRKIARG 444 (548)
T ss_pred eEEEEEEecccCCCc--eEEEEECCCCCCCCcccCCeEEEEcC
Confidence 999999999876787 89999999999 9999999999986
No 18
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.65 E-value=8.5e-16 Score=163.16 Aligned_cols=97 Identities=20% Similarity=0.263 Sum_probs=64.8
Q ss_pred HHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHh---hhCCCCCCeEEEEEEEEecC-CC
Q psy17191 215 AAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERM---MYGESSMTHAMVISAVSIDK-ET 289 (377)
Q Consensus 215 ~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~---~~~es~~~HAM~ivG~~~d~-~G 289 (377)
++..|...+||.+++|+. .+..|++||++.+.++.....+.+. ++. .- ..+....+|||+||||+.++ +|
T Consensus 560 Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~----~~~-~~~~~~~G~e~~NHAVlIVGwG~d~enG 634 (693)
T PTZ00049 560 MMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDL----PKH-NGVYNITGWEKVNHAIVLVGWGEEEING 634 (693)
T ss_pred HHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCcc----ccc-cccccccccccCceEEEEEEeccccCCC
Confidence 445565667999999998 4558999999864321111111110 000 00 01222469999999999853 56
Q ss_pred CCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 290 EEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 290 ~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
...+||||+||||+.||++|||+|.++
T Consensus 635 ~~~~YWIVRNSWGt~WGenGYfKI~RG 661 (693)
T PTZ00049 635 KLYKYWIGRNSWGKNWGKEGYFKIIRG 661 (693)
T ss_pred cccCEEEEECCCCCCcccCceEEEEcC
Confidence 412799999999999999999999886
No 19
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.64 E-value=1.8e-15 Score=137.07 Aligned_cols=127 Identities=17% Similarity=0.261 Sum_probs=86.7
Q ss_pred HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc--ccCCCCcceeeccCC
Q psy17191 119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA--ESIKNNEAVWFGCEN 194 (377)
Q Consensus 119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~--D~~py~~~y~~~~~~ 194 (377)
+....+...+.+.-|..+.....+..||.+... -||+++. ..+..+|.++. +. +.+||..
T Consensus 29 a~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~---~gC~GG~-----~~~a~~~~~~~~Gi~~e~~~PY~~-------- 92 (174)
T smart00645 29 SATGALEGRYCIKTGKLVSLSEQQLVDCSTGGN---NGCNGGL-----PDNAFEYIKKNGGLETESCYPYTG-------- 92 (174)
T ss_pred HHHHHHHHHHHHhcCCccccCHHHHhhhcCCCC---CCCCCcC-----HHHHHHHHHHcCCcccccccCccc--------
Confidence 334455555666667777788888888876422 3899886 22334555443 11 2445422
Q ss_pred ceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCC
Q psy17191 195 RIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSM 274 (377)
Q Consensus 195 Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~ 274 (377)
.+..++..+..|+.||++.+.|. ....
T Consensus 93 -------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~----------------------~~~~ 119 (174)
T smart00645 93 -------------------------------SVAIDASDFQFYKSGIYDHPGCG----------------------SGTL 119 (174)
T ss_pred -------------------------------EEEEEcccccCCcCeEECCCCCC----------------------CCcc
Confidence 33345555568999999765331 1225
Q ss_pred CeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 275 THAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
+|||+||||+.+.+|+ .||+|+||||+.||++|||+|+++
T Consensus 120 ~Hav~ivGyg~~~~g~--~yWii~NSwG~~WG~~G~~~i~~~ 159 (174)
T smart00645 120 DHAVLIVGYGTEENGK--DYWIVKNSWGTDWGENGYFRIARG 159 (174)
T ss_pred cEEEEEEEEeecCCCe--eEEEEECCCCCCcccCeEEEEEcC
Confidence 8999999999865788 799999999999999999999876
No 20
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=99.33 E-value=4.7e-13 Score=130.96 Aligned_cols=74 Identities=30% Similarity=0.337 Sum_probs=67.0
Q ss_pred CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhc---
Q psy17191 1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNH--- 77 (377)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kY--- 77 (377)
++||+|++||||.||||||||++..| +.|.|++|| |+++|+...|+.|+|.|++.|+++|
T Consensus 348 q~KA~RldY~eSLmTHAMvlTGvd~d----------------~~g~p~rwk-VENSWG~d~G~~GyfvaSd~wmdEytyQ 410 (444)
T COG3579 348 QDKAGRLDYGESLMTHAMVLTGVDLD----------------ETGNPLRWK-VENSWGKDVGKKGYFVASDAWMDEYTYQ 410 (444)
T ss_pred cchhhccccchHHHHHHHHhhccccc----------------cCCCceeeE-eecccccccCCCceEeehHhHhhhheeE
Confidence 58999999999999999999999999 567999999 8999999999999999999999999
Q ss_pred ceeeCcccchHHHHH
Q psy17191 78 GLMPKNCLVWIRIRI 92 (377)
Q Consensus 78 GvVPk~~~pt~~m~~ 92 (377)
-+|.++..| .+.-+
T Consensus 411 IvV~k~~l~-~e~~~ 424 (444)
T COG3579 411 IVVDKKFLP-KEELA 424 (444)
T ss_pred EEEehhhCC-HHHHH
Confidence 588888888 44333
No 21
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.13 E-value=1.2e-11 Score=127.11 Aligned_cols=78 Identities=38% Similarity=0.493 Sum_probs=64.2
Q ss_pred CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhc---
Q psy17191 1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNH--- 77 (377)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kY--- 77 (377)
|+|++|+.|++|.|||||+|+++..| ++|+|..+| |+++|+...|.+|++.|+.+||++|
T Consensus 346 ~~K~~Rl~~~eS~~tHAM~itGv~~D----------------~~g~p~~wk-VeNSWG~~~g~kGy~~msd~wf~e~v~~ 408 (438)
T PF03051_consen 346 MSKAERLDYGESTMTHAMVITGVDLD----------------EDGKPVRWK-VENSWGTDNGDKGYFYMSDDWFDEYVYQ 408 (438)
T ss_dssp S-HHHHHHTTSS--EEEEEEEEEEE-----------------TTSSEEEEE-EE-SBTTTSTBTTEEEEEHHHHHHHEEE
T ss_pred cCHHHHHHhCCCCCceeEEEEEEEec----------------cCCCeeEEE-EEcCCCCCCCCCcEEEECHHHHHhhhhe
Confidence 68999999999999999999999998 668888999 8999999999999999999999999
Q ss_pred ceeeCcccchHHHHHHHhh
Q psy17191 78 GLMPKNCLVWIRIRIIYNN 96 (377)
Q Consensus 78 GvVPk~~~pt~~m~~~L~~ 96 (377)
-||+|++.| .++..+|..
T Consensus 409 vvV~K~~lp-~~~~~~~~~ 426 (438)
T PF03051_consen 409 VVVHKKYLP-KELLKKLEK 426 (438)
T ss_dssp EEEEGGGS--HHHHHHHHS
T ss_pred EEechhhCC-HHHHHHhcC
Confidence 499999999 665555543
No 22
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=98.97 E-value=2.1e-10 Score=117.99 Aligned_cols=77 Identities=32% Similarity=0.431 Sum_probs=68.0
Q ss_pred CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcc--
Q psy17191 1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHG-- 78 (377)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYG-- 78 (377)
||||+|+.+++|.|||||+|+++..| +.|+|..|+ |+++|+..+|++|++.|..+||++|.
T Consensus 345 ~~KaeRl~~~es~~tHAM~ivGv~~D----------------~~g~p~yw~-VkNSWG~~~G~~Gy~~ms~~wf~~y~~~ 407 (437)
T cd00585 345 LNKAERLDYGESLMTHAMVLTGVDLD----------------EDGKPVKWK-VENSWGEKVGKKGYFVMSDDWFDEYVYQ 407 (437)
T ss_pred CCHHHHHhhcCCcCCeEEEEEEEEec----------------CCCCcceEE-EEcccCCCCCCCcceehhHHHHHhhheE
Confidence 79999999999999999999998887 346778888 79999999999999999999999994
Q ss_pred -eeeCcccchHHHHHHHh
Q psy17191 79 -LMPKNCLVWIRIRIIYN 95 (377)
Q Consensus 79 -vVPk~~~pt~~m~~~L~ 95 (377)
||+|+..| .++..+|.
T Consensus 408 ivV~kk~l~-~~~~~~l~ 424 (437)
T cd00585 408 VVVDKKYLP-EEVLDLLK 424 (437)
T ss_pred EEeecccCC-HHHHHHhc
Confidence 99999999 55555553
No 23
>KOG1544|consensus
Probab=98.80 E-value=7.1e-09 Score=101.48 Aligned_cols=89 Identities=22% Similarity=0.318 Sum_probs=65.8
Q ss_pred HHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecC--C
Q psy17191 212 MKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDK--E 288 (377)
Q Consensus 212 ~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~--~ 288 (377)
++++...+++ +||-.-..|. .+|.|+.||+..... ..-+.+|-+. .-+|+|-|+|.+.+. +
T Consensus 354 ~eImkElM~N-GPVQA~m~VHEDFF~YkgGiY~H~~~------------~~~~~e~yr~---~gtHsVk~tGWG~~~~~~ 417 (470)
T KOG1544|consen 354 KEIMKELMEN-GPVQALMEVHEDFFLYKGGIYSHTPV------------SLGRPERYRR---HGTHSVKITGWGEETLPD 417 (470)
T ss_pred HHHHHHHHhC-CChhhhhhhhhhhhhhccceeecccc------------ccCCchhhhh---cccceEEEeecccccCCC
Confidence 4555555555 5999999999 678999999964321 1223333222 358999999999863 6
Q ss_pred CCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191 289 TEEPTKWRVENSWGEEQNHKGYILMTSP 316 (377)
Q Consensus 289 G~~~~yW~VkNSWG~~~G~kGY~~ms~~ 316 (377)
|+..+||+.-||||+.||++|||.+=++
T Consensus 418 G~~~KyW~aANSWG~~WGE~GYFriLRG 445 (470)
T KOG1544|consen 418 GRTLKYWTAANSWGPAWGERGYFRILRG 445 (470)
T ss_pred CCeeEEEEeecccccccccCceEEEecc
Confidence 7756899999999999999999998654
No 24
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.71 E-value=1e-08 Score=101.93 Aligned_cols=47 Identities=19% Similarity=0.370 Sum_probs=37.7
Q ss_pred CCCeEEEEEEEEecCC---------CCCCCeEEEEcccCCCCCCCeEEEEehhhhhcc
Q psy17191 273 SMTHAMVISAVSIDKE---------TEEPTKWRVENSWGEEQNHKGYILMTSPWFKEY 321 (377)
Q Consensus 273 ~~~HAM~ivG~~~d~~---------G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y 321 (377)
.-+||++||||+..-+ |. .-||||||||+.||++|||+|+..+-..-
T Consensus 263 ~~gHAv~iVGyDDs~~~n~~~~~~~g~--GAfiikNSWGt~wG~~GYfwisY~ya~~g 318 (372)
T COG4870 263 NWGHAVLIVGYDDSFDINNFKYGPPGD--GAFIIKNSWGTNWGENGYFWISYYYALNG 318 (372)
T ss_pred cccceEEEEeccccccccccccCCCCC--ceEEEECccccccccCceEEEEeeecccc
Confidence 4589999999985321 22 47999999999999999999998765543
No 25
>KOG4128|consensus
Probab=98.59 E-value=2.2e-08 Score=98.40 Aligned_cols=70 Identities=30% Similarity=0.350 Sum_probs=60.6
Q ss_pred CChhhhhhcCCcccchhhhhhhhh-hhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcc-
Q psy17191 1 MSKAERMMYGESSMTHAMVISAVS-IDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHG- 78 (377)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~f-~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYG- 78 (377)
|+||||++|+||.|||||+||.+- -| -+ .|.+..+| |++.|+...|..|+..|...||++|-
T Consensus 358 ~~KAeRl~y~eSlmthAml~T~v~~kd--------------~~-~g~~~~~r-Venswgkd~gkkg~~~mt~ewf~EY~f 421 (457)
T KOG4128|consen 358 STKAERLDYRESLMTHAMLLTSVGLKD--------------PA-TGGLNEHR-VENSWGKDLGKKGVNKMTAEWFREYAF 421 (457)
T ss_pred cchhhhhhHHHHHHHHHHHhhhccccC--------------cc-cCCchhhh-hhchhhhhccccchhhhhHHHHHhhhe
Confidence 589999999999999999999986 33 11 35677888 89999999999999999999999994
Q ss_pred --eeeCcccc
Q psy17191 79 --LMPKNCLV 86 (377)
Q Consensus 79 --vVPk~~~p 86 (377)
||...-.|
T Consensus 422 eiVVd~k~vP 431 (457)
T KOG4128|consen 422 EIVVDEKFVP 431 (457)
T ss_pred eEEeecccCc
Confidence 88888888
No 26
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=77.76 E-value=7.5 Score=37.77 Aligned_cols=61 Identities=8% Similarity=0.074 Sum_probs=40.0
Q ss_pred ChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEec
Q psy17191 207 PVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSID 286 (377)
Q Consensus 207 p~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d 286 (377)
..++..+.++++|.+|.||.+.+|+.--. |...- +.....+|.++|+||+.+
T Consensus 73 ~~~~~~~~l~~~l~~g~pv~~~~D~~~lp-y~~~~---------------------------~~~~~~~H~i~v~G~d~~ 124 (317)
T PF14399_consen 73 SPDEAWEELKEALDAGRPVIVWVDMYYLP-YRPNY---------------------------YKKHHADHYIVVYGYDEE 124 (317)
T ss_pred CHHHHHHHHHHHHhCCCceEEEeccccCC-CCccc---------------------------cccccCCcEEEEEEEeCC
Confidence 35667788999999999999999975211 11000 011235899999999853
Q ss_pred CCCCCCCeEEEEc
Q psy17191 287 KETEEPTKWRVEN 299 (377)
Q Consensus 287 ~~G~~~~yW~VkN 299 (377)
++ .++++.+
T Consensus 125 --~~--~~~v~D~ 133 (317)
T PF14399_consen 125 --ED--VFYVSDP 133 (317)
T ss_pred --CC--EEEEEcC
Confidence 33 4666655
No 27
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=77.22 E-value=7.3 Score=32.25 Aligned_cols=19 Identities=11% Similarity=0.081 Sum_probs=13.9
Q ss_pred HHHHHHhccCCeEEEEEEc
Q psy17191 213 KLAAESIKNNEAVWFGCEV 231 (377)
Q Consensus 213 ~~~~~~L~~g~PV~~g~Dv 231 (377)
+.+++.|++|.||.+.++.
T Consensus 90 ~~i~~~i~~G~Pvi~~~~~ 108 (144)
T PF13529_consen 90 DDIKQEIDAGRPVIVSVNS 108 (144)
T ss_dssp HHHHHHHHTT--EEEEEET
T ss_pred HHHHHHHHCCCcEEEEEEc
Confidence 4467889999999999974
No 28
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=70.76 E-value=21 Score=32.95 Aligned_cols=30 Identities=13% Similarity=0.309 Sum_probs=22.7
Q ss_pred CCCeEEEEEEEEecCCCCCCCeEEEEcccCCC
Q psy17191 273 SMTHAMVISAVSIDKETEEPTKWRVENSWGEE 304 (377)
Q Consensus 273 ~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~ 304 (377)
...|||+++|+..-.+|+ .+..|=|-|-.+
T Consensus 118 ~~gHAlavvGya~~~~g~--~~y~~WNPW~~~ 147 (175)
T PF05543_consen 118 HAGHALAVVGYAKPNNGQ--KTYYFWNPWWND 147 (175)
T ss_dssp --EEEEEEEEEEEETTSE--EEEEEE-TT-SS
T ss_pred ccceeEEEEeeeecCCCC--eEEEEeCCccCC
Confidence 357999999999865788 789999999765
No 29
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=65.58 E-value=11 Score=35.67 Aligned_cols=116 Identities=10% Similarity=0.024 Sum_probs=61.1
Q ss_pred CcceeeeeccCCccccccCCCcHHHHHHhhc-ccCCCCcceeeccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEE
Q psy17191 149 NNEAVWFGCENSRIRIIYNNQPVELLMKLAA-ESIKNNEAVWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWF 227 (377)
Q Consensus 149 ~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~-D~~py~~~y~~~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~ 227 (377)
.|..+++|.||+- .-.+||++... |..--++.|.-....++. +.--.|+++++ +.-|..|+|+-+
T Consensus 63 ~f~T~TlGvnp~y-------~~~~FY~~~~~~D~~RV~~lF~~A~~~gi~---V~~rsvs~~ei----~~hl~~g~~aIv 128 (212)
T PF09778_consen 63 SFYTVTLGVNPNY-------SVESFYKKNFDEDENRVNRLFQKAKAAGIN---VEKRSVSIQEI----IEHLSSGGPAIV 128 (212)
T ss_pred eEecCccccCcCc-------cccchHHHhhhhHHHHHHHHHHHHHHcCCc---eEEeeccHHHH----HHHHhCCCcEEE
Confidence 3445668888874 66689998766 422223344333333332 11224777764 444667778888
Q ss_pred EEEcCCCC-c-CCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcc
Q psy17191 228 GCEVSKRF-A-NKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENS 300 (377)
Q Consensus 228 g~Dv~~~f-~-~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNS 300 (377)
=+|.+.-. + -+.-..+.. ....|+.+ .....|-++|+||+.+ . +.+.++|-
T Consensus 129 LVd~~~L~C~~Ck~~~~~~~---~~~~~~~~--------------~~Y~GHYVVlcGyd~~-~----~~~~yrdP 181 (212)
T PF09778_consen 129 LVDASLLHCDLCKSNCFDPI---GSKCFGRS--------------PDYQGHYVVLCGYDAA-T----KEFEYRDP 181 (212)
T ss_pred EEccccccChhhcccccccc---cccccCCC--------------CCccEEEEEEEeecCC-C----CeEEEeCC
Confidence 88865211 1 121111110 11122222 2346899999999875 2 34566654
No 30
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=60.51 E-value=52 Score=32.42 Aligned_cols=51 Identities=22% Similarity=0.200 Sum_probs=36.2
Q ss_pred CCeEEEEEEEEecCC-CCCCCeEEEEcccCCC------------C--------------CCCeEEEEehhhhhcccEEEE
Q psy17191 274 MTHAMVISAVSIDKE-TEEPTKWRVENSWGEE------------Q--------------NHKGYILMTSPWFKEYVFEVV 326 (377)
Q Consensus 274 ~~HAM~ivG~~~d~~-G~~~~yW~VkNSWG~~------------~--------------G~kGY~~ms~~wf~~y~~~vv 326 (377)
.+||=.|+++..-+. |. +.-+++|.||.. | .++|-|.|+-+=|-.+--.+.
T Consensus 235 ~~HaY~Vl~~~~~~~~~~--~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~~~df~~~F~~~~ 312 (315)
T cd00044 235 KGHAYSVLDVREVQEEGL--RLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMSFEDFLRNFDGLY 312 (315)
T ss_pred cCcceEEeEEEEEccCce--EEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEEhHHhheeeCeEE
Confidence 489999999986421 66 789999999863 1 356888888766655544443
No 31
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=48.37 E-value=28 Score=31.72 Aligned_cols=19 Identities=16% Similarity=0.363 Sum_probs=14.2
Q ss_pred HHHHHHhccCCeEEEEEEc
Q psy17191 213 KLAAESIKNNEAVWFGCEV 231 (377)
Q Consensus 213 ~~~~~~L~~g~PV~~g~Dv 231 (377)
+.+.+.|++.+|+|++.+.
T Consensus 99 e~~~~LL~~yGPLwv~~~~ 117 (166)
T PF12385_consen 99 EGLANLLREYGPLWVAWEA 117 (166)
T ss_pred HHHHHHHHHcCCeEEEecC
Confidence 3456678888999999653
No 32
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.21 E-value=26 Score=32.62 Aligned_cols=24 Identities=13% Similarity=0.290 Sum_probs=18.1
Q ss_pred CeEEEEEEEEecCCCCCCCeEEEEcccCCCC
Q psy17191 275 THAMVISAVSIDKETEEPTKWRVENSWGEEQ 305 (377)
Q Consensus 275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~ 305 (377)
-|+++||||+. .++-.-++||-+.
T Consensus 148 ~H~v~itgyDk-------~n~yynDpyG~kn 171 (195)
T COG4990 148 IHSVLITGYDK-------YNIYYNDPYGYKN 171 (195)
T ss_pred eeeeEeecccc-------cceEeccccccch
Confidence 69999999974 3566667887654
No 33
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=45.22 E-value=1e+02 Score=25.71 Aligned_cols=59 Identities=14% Similarity=0.165 Sum_probs=37.2
Q ss_pred HHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCe
Q psy17191 215 AAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTK 294 (377)
Q Consensus 215 ~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~y 294 (377)
+...|+.|.||.+.++.+ + ......|.++|+||+ + + ..
T Consensus 70 ~~~~l~~~~Pvi~~~~~~-------~-----------------------------~~~~~gH~vVv~g~~-~-~----~~ 107 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG-------V-----------------------------SITPSGHAMVVIGYD-R-K----GN 107 (141)
T ss_pred HHHHHHCCCeEEEEEecC-------c-----------------------------ccCCCCeEEEEEEEc-C-C----CC
Confidence 556788899999988751 0 012358999999997 1 1 23
Q ss_pred EEEEcccCCCCCCCeEEEEehhhhhc
Q psy17191 295 WRVENSWGEEQNHKGYILMTSPWFKE 320 (377)
Q Consensus 295 W~VkNSWG~~~G~kGY~~ms~~wf~~ 320 (377)
..|-+.|. .....|+.+.|.+
T Consensus 108 ~~i~DP~~-----~~~~~~~~~~f~~ 128 (141)
T cd02549 108 VYVNDPGG-----GRRLVVSFDEFEK 128 (141)
T ss_pred EEEECCCC-----CcCEEEeHHHHHH
Confidence 55666653 2344666665543
No 34
>COG5572 Predicted integral membrane protein [Function unknown]
Probab=42.79 E-value=8.3 Score=31.80 Aligned_cols=18 Identities=44% Similarity=0.407 Sum_probs=15.5
Q ss_pred hhhhcCCcccchhhhhhh
Q psy17191 5 ERMMYGESSMTHAMVISA 22 (377)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~ 22 (377)
-|++..||+|||||+-..
T Consensus 3 ~rl~a~~s~~~~am~aaA 20 (104)
T COG5572 3 GRLGANESLMTRAMTAAA 20 (104)
T ss_pred ccccccHHHHHHHHHHHH
Confidence 588999999999998765
No 35
>PF09713 A_thal_3526: Plant protein 1589 of unknown function (A_thal_3526); InterPro: IPR006476 This plant-specific family of proteins are defined by an uncharacterised region 57 residues in length. It is found toward the N terminus of most proteins that contain it. Examples include at least several proteins from Arabidopsis thaliana (Mouse-ear cress) and Oryza sativa (Rice). The function of the proteins are unknown.
Probab=35.82 E-value=25 Score=26.31 Aligned_cols=30 Identities=23% Similarity=0.266 Sum_probs=19.4
Q ss_pred CChhhhhhc--CCcccchhhhhhhhhhhhhhhhh
Q psy17191 1 MSKAERMMY--GESSMTHAMVISAVSIDKIERCN 32 (377)
Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~f~DklEkan 32 (377)
|||+|-+.+ ..+...+.++. .-|.+|||.|
T Consensus 13 Msk~E~v~~L~~~a~I~P~~T~--~VW~~Le~eN 44 (54)
T PF09713_consen 13 MSKEECVRALQKQANIEPVFTS--TVWQKLEKEN 44 (54)
T ss_pred CCHHHHHHHHHHHcCCChHHHH--HHHHHHHHHC
Confidence 677776554 24445555444 4599999988
No 36
>PF01640 Peptidase_C10: Peptidase C10 family classification.; InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=35.53 E-value=1.5e+02 Score=27.15 Aligned_cols=59 Identities=22% Similarity=0.372 Sum_probs=38.4
Q ss_pred cChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe
Q psy17191 206 QPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI 285 (377)
Q Consensus 206 vp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~ 285 (377)
...++..+.+...|.+|.||.+..-- .+. .||.+|=||+.
T Consensus 134 y~~~~~~~~i~~el~~~rPV~~~g~~---------------------------------------~~~-GHawViDGy~~ 173 (192)
T PF01640_consen 134 YTDSEWMDMIRNELDNGRPVLYSGNS---------------------------------------KSG-GHAWVIDGYDS 173 (192)
T ss_dssp S-HHHHHHHHHHHHHTT--EEEEEEE---------------------------------------TTE-EEEEEEEEEES
T ss_pred cCHHHHHHHHHHHHHcCCCEEEEEec---------------------------------------CCC-CeEEEEcCccC
Confidence 34456667889999999999987631 011 79999999964
Q ss_pred cCCCCCCCeEEEEcccCCCCCCCeEEE
Q psy17191 286 DKETEEPTKWRVENSWGEEQNHKGYIL 312 (377)
Q Consensus 286 d~~G~~~~yW~VkNSWG~~~G~kGY~~ 312 (377)
+ .|+-+==.||-. .+|||.
T Consensus 174 ~------~~~H~NwGW~G~--~nGyy~ 192 (192)
T PF01640_consen 174 D------GYFHCNWGWGGS--SNGYYR 192 (192)
T ss_dssp S------SEEEEE-SSTTT--T-EEEE
T ss_pred C------CeEEEeeCccCC--CCCccC
Confidence 3 678776666654 568873
No 37
>PF09028 Mac-1: Mac 1; InterPro: IPR015117 The bacterial protein Mac 1 adopts an alpha/beta fold, with 14 beta strands and 9 alpha helices. The N-terminal domain is made up predominantly of alpha helices, whereas the C-terminal domain consists predominantly of beta sheets. Mac 1 blocks polymorphonuclear opsonophagocytosis, inhibits the production of reactive oxygen species and contains IgG endopeptidase activity. ; PDB: 1Y08_A 2AVW_C 2AU1_A.
Probab=33.15 E-value=1.5e+02 Score=30.13 Aligned_cols=47 Identities=19% Similarity=0.258 Sum_probs=32.4
Q ss_pred cChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe
Q psy17191 206 QPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI 285 (377)
Q Consensus 206 vp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~ 285 (377)
-..+++.+.+..+|.+|+.|.+.... +.+..+|+|++=|+..
T Consensus 227 ~~~~~fs~~ik~~L~~g~aigLs~~~--------------------------------------~n~~~~HiItlWGae~ 268 (333)
T PF09028_consen 227 RSYEDFSEKIKEALDNGKAIGLSYSP--------------------------------------GNSSGNHIITLWGAEY 268 (333)
T ss_dssp --HHHHHHHHHHHHHTT-EEEEEE----------------------------------------SSTTS-EEEEEEEEEE
T ss_pred CCHHHHHHHHHHHHhcCCEEEEEeec--------------------------------------CCCCCCcEEEEeeeEE
Confidence 34678889999999999888777632 1233689999999999
Q ss_pred cCCCC
Q psy17191 286 DKETE 290 (377)
Q Consensus 286 d~~G~ 290 (377)
|++|+
T Consensus 269 D~~G~ 273 (333)
T PF09028_consen 269 DDNGN 273 (333)
T ss_dssp -TTS-
T ss_pred cCCCC
Confidence 98887
No 38
>PF04046 PSP: PSP; InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=27.05 E-value=38 Score=24.78 Aligned_cols=22 Identities=9% Similarity=0.302 Sum_probs=19.3
Q ss_pred cCCCCHHHHhhhcCCCeecCCC
Q psy17191 348 KKYVPASVLDVFNQEPTILPAW 369 (377)
Q Consensus 348 ~~~~~~~~~~~~~~~p~~l~~w 369 (377)
|-.+|+++++++.-.+-.+|||
T Consensus 2 PG~lS~~LR~ALg~~~~~~PPw 23 (48)
T PF04046_consen 2 PGKLSDELREALGMQENDPPPW 23 (48)
T ss_pred CcccCHHHHHHcCCCCCCCChH
Confidence 5578999999999888889998
No 39
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=26.41 E-value=18 Score=34.11 Aligned_cols=39 Identities=21% Similarity=0.207 Sum_probs=27.9
Q ss_pred cchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhh
Q psy17191 14 MTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLV 71 (377)
Q Consensus 14 ~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~ 71 (377)
.+||+++.++--+ ...+--+++++|+..+|++|++-+..
T Consensus 184 ~~HaV~iVGyg~~-------------------~g~~YWivrNSWG~~WGe~Gy~ri~~ 222 (236)
T cd02620 184 GGHAVKIIGWGVE-------------------NGVPYWLAANSWGTDWGENGYFRILR 222 (236)
T ss_pred CCeEEEEEEEecc-------------------CCeeEEEEEeCCCCCCCCCcEEEEEc
Confidence 4699999994322 12333468899999999999996543
No 40
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=25.93 E-value=42 Score=25.16 Aligned_cols=23 Identities=13% Similarity=0.400 Sum_probs=20.6
Q ss_pred ccCCCCHHHHhhhcCCCeecCCC
Q psy17191 347 DKKYVPASVLDVFNQEPTILPAW 369 (377)
Q Consensus 347 ~~~~~~~~~~~~~~~~p~~l~~w 369 (377)
+|-.+|+++++||.-.+-.+|||
T Consensus 5 kPG~lS~~LR~ALG~~~~~pPPW 27 (54)
T smart00581 5 KPGRISDELREALGLPPGQPPPW 27 (54)
T ss_pred cCCcCCHHHHHHcCCCCCCCChH
Confidence 47789999999999888889998
No 41
>PF07910 Peptidase_C78: Peptidase family C78; InterPro: IPR012462 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This entry contains UfSP1 and UfSP2, which are cysteine peptidases required for the processing and activation of Ubiquitin fold modifier 1 (Ufm1, IPR005375 from INTERPRO) and for its release from conjugated cellular proteins. UfSP1 and UfSP2 are 217 aa and 461 aa respectively [, ]. The peptidases belong to MEROPS peptidase family C78, clan CA. The UfSP2 family have an N-terminal extension with one or more zinc finger domains of the C2H2 type (IPR007087 from INTERPRO), which have been shown to be involved in protein:protein interaction. UfSP2 is present in most, if not all, multi-cellular organisms including plants, nematodes, flies, and mammals, whereas UfSP1 is not present in plants and nematodes []. ; PDB: 3OQC_B 2Z84_A.
Probab=25.49 E-value=72 Score=30.29 Aligned_cols=22 Identities=23% Similarity=0.268 Sum_probs=15.0
Q ss_pred CCeEEEEEEEEecCCCCCCCeEEE
Q psy17191 274 MTHAMVISAVSIDKETEEPTKWRV 297 (377)
Q Consensus 274 ~~HAM~ivG~~~d~~G~~~~yW~V 297 (377)
..|+++|||+....+|. .+.+|
T Consensus 155 ~ghS~TIvGie~~~~g~--~~LLV 176 (218)
T PF07910_consen 155 DGHSRTIVGIERNKDGE--VNLLV 176 (218)
T ss_dssp TTEEEEEEEEEE-TT----EEEEE
T ss_pred cccceEEEEEEECCCCC--EEEEE
Confidence 58999999999976776 45444
No 42
>PTZ00200 cysteine proteinase; Provisional
Probab=24.32 E-value=16 Score=38.39 Aligned_cols=57 Identities=12% Similarity=0.121 Sum_probs=37.9
Q ss_pred ccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHH--hhhcceeeCcccc
Q psy17191 13 SMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNL--IVNHGLMPKNCLV 86 (377)
Q Consensus 13 ~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~l--i~kYGvVPk~~~p 86 (377)
..+||+++.+.=+| + ....+--+++++|+..+|+.|++-+..+- ..+-||.-...+|
T Consensus 385 ~~nHaV~lVGyG~d----------------~-~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~~~P 443 (448)
T PTZ00200 385 SLNHAVLLVGEGYD----------------E-KTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVGLTP 443 (448)
T ss_pred CCcEEEEEEEeccc----------------C-CCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccceee
Confidence 36899999996555 1 11244557899999999999999776431 2345655444444
No 43
>PF00648 Peptidase_C2: Calpain family cysteine protease This is family C2 in the peptidase classification. ; InterPro: IPR001300 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. Calpains are a family of cytosolic cysteine proteinases (see PDOC00126 from PROSITEDOC). Members of the calpain family are believed to function in various biological processes, including integrin-mediated cell migration, cytoskeletal remodeling, cell differentiation and apoptosis [, ]. The calpain family includes numerous members from C. elegans to mammals and with homologues in yeast and bacteria. The best characterised members are the m- and mu-calpains, both proteins are heterodimer composed of a large catalytic subunit and a small regulatory subunit. The large subunit comprises four domains (dI-dIV) while the small subunit has two domains (dV-dVI). Domain dI is a short region cleaved by autolysis, dII is the catalytic core, dIII is a C2-like domain, dIV consists of five calcium binding EF-hand motifs []. The crystal structure of calpain has been solved [, ]. The catalytic region consists of two distinct structural domains (dIIa and dIIb). dIIa contains a central helix flanked on three faces by a cluster of alpha-helices and is entirely unrelated to the corresponding domain in the typical thiol proteinases. The fold of dIIb is similar to the corresponding domain in other cysteine proteinases and contains two three-stranded anti-parallel beta-sheets. The catalytic triad residues (C,H,N) are located in dIIa and dIIb. The activation of the domain is dependent on the binding of two calcium atoms in two non EF-hand calcium binding sites located in the catalytic core, one close to the Cys active site in dIIa and one at the end of dIIb. Calcium-binding induced conformational changes in the catalytic domain which align the active site [][]. The profile covers the whole catalytic domain.; GO: 0004198 calcium-dependent cysteine-type endopeptidase activity, 0006508 proteolysis, 0005622 intracellular; PDB: 2NQA_A 1KFU_L 1KFX_L 1QXP_B 2R9C_A 1TL9_A 2G8E_A 1KXR_B 2G8J_A 2NQG_A ....
Probab=23.74 E-value=4.8e+02 Score=25.18 Aligned_cols=30 Identities=23% Similarity=0.341 Sum_probs=19.3
Q ss_pred CCeEEEEEEEEecC-CCCCCCeEEEEcccCC
Q psy17191 274 MTHAMVISAVSIDK-ETEEPTKWRVENSWGE 303 (377)
Q Consensus 274 ~~HAM~ivG~~~d~-~G~~~~yW~VkNSWG~ 303 (377)
.+||-.|+++.... +|..-+.-+++|.||.
T Consensus 213 ~~HaY~Vl~~~~~~~~~~~~~lv~LrNPwg~ 243 (298)
T PF00648_consen 213 PGHAYAVLDVREVNGNGEGHRLVKLRNPWGS 243 (298)
T ss_dssp TTS-EEEEEEEEEEETTEEEEEEEEE-TTSS
T ss_pred cceeEEEEEEEeeccccceeEEEEEcCCCcc
Confidence 48999999997631 2321256789999985
No 44
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=23.19 E-value=4.8e+02 Score=25.84 Aligned_cols=54 Identities=22% Similarity=0.281 Sum_probs=37.1
Q ss_pred CCeEEEEEEEEecCCCCCCCeEEEEcccCC------------CC---------------CCCeEEEEehhhhhcccEEEE
Q psy17191 274 MTHAMVISAVSIDKETEEPTKWRVENSWGE------------EQ---------------NHKGYILMTSPWFKEYVFEVV 326 (377)
Q Consensus 274 ~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~------------~~---------------G~kGY~~ms~~wf~~y~~~vv 326 (377)
.+||=.|+++..- +|.+..--+++|-||. .| .++|-|.|+-+=|-.+--.|.
T Consensus 227 ~~HaYsVl~v~~~-~~~~~~Ll~lrNPWg~~eW~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWM~~~df~~~F~~~~ 305 (318)
T smart00230 227 KGHAYSVTDVREV-QGRRQELLRLRNPWGQVEWNGPWSDDSPEWRSVSASEKKNLGLTFDDDGEFWMSFEDFLRHFDKVE 305 (318)
T ss_pred cCccEEEEEEEEE-ecCCeEEEEEECCCCCCCcCCCCCCCCccccccCHHHHHHhCCCCCCCCEEEEEhHHHHhhCCeEE
Confidence 4899999999864 4541137999999984 12 257888888766665555555
Q ss_pred Ee
Q psy17191 327 VD 328 (377)
Q Consensus 327 V~ 328 (377)
|=
T Consensus 306 vc 307 (318)
T smart00230 306 IC 307 (318)
T ss_pred Ee
Confidence 53
No 45
>COG3423 Nlp Predicted transcriptional regulator [Transcription]
Probab=23.18 E-value=71 Score=25.73 Aligned_cols=44 Identities=18% Similarity=-0.119 Sum_probs=25.5
Q ss_pred hhHHHHHHHHHH---hhccChhhhh---HHHHHHHHhHHHHhcCCChhHH
Q psy17191 96 NQPVELLMKLAA---ESIKNNEAVW---FGCENRIIRIRIIYNNQPVELL 139 (377)
Q Consensus 96 ~~Lr~~a~~lr~---~~~~~~~~l~---~~~l~~v~~il~i~lG~pP~~F 139 (377)
.-|++-...||. +.|-+...|. ..--..-.+|++.+||.||++.
T Consensus 15 A~Lkk~G~Sl~~LS~~agls~~tL~n~L~rp~pkgEriIA~algv~P~eI 64 (82)
T COG3423 15 AALKKKGTSLAALSREAGLSSSTLANALDRPWPKGERIIADALGVPPEEI 64 (82)
T ss_pred HHHHHccccHHHHHHHcCCCHHHHHHHHcCCCchHHHHHHHHhCCCHHHh
Confidence 445555555665 3454443332 1111235789999999999763
No 46
>PTZ00203 cathepsin L protease; Provisional
Probab=20.95 E-value=27 Score=35.43 Aligned_cols=40 Identities=10% Similarity=0.136 Sum_probs=29.7
Q ss_pred ccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhh
Q psy17191 13 SMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLV 71 (377)
Q Consensus 13 ~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~ 71 (377)
.++||+++.+ |+ . +...+--+++++|+..+|+.|++-+..
T Consensus 285 ~~nHaVliVG--YG----------------~-~~g~~YWiikNSWG~~WGe~GY~ri~r 324 (348)
T PTZ00203 285 QLNHGVLLVG--YN----------------M-TGEVPYWVIKNSWGEDWGEKGYVRVTM 324 (348)
T ss_pred CCCeEEEEEE--Ee----------------c-CCCceEEEEEcCCCCCcCcCceEEEEc
Confidence 4689999999 54 0 112445568999999999999997754
Done!