Query psy8713
Match_columns 309
No_of_seqs 174 out of 1922
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 17:22:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8713.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8713hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 1.6E-51 3.5E-56 378.7 14.7 219 34-305 108-370 (372)
2 PTZ00203 cathepsin L protease; 100.0 3.6E-47 7.9E-52 361.7 20.1 235 17-304 61-338 (348)
3 KOG1543|consensus 100.0 5E-46 1.1E-50 351.2 18.1 235 17-307 49-324 (325)
4 PTZ00021 falcipain-2; Provisio 100.0 4E-44 8.6E-49 351.3 19.6 233 17-305 192-487 (489)
5 PTZ00200 cysteine proteinase; 100.0 1.1E-43 2.4E-48 346.6 19.5 232 17-306 149-445 (448)
6 cd02621 Peptidase_C1A_Cathepsi 100.0 4.9E-43 1.1E-47 319.0 15.4 187 84-307 1-243 (243)
7 cd02620 Peptidase_C1A_Cathepsi 100.0 4.6E-43 1E-47 318.0 14.6 178 85-304 1-236 (236)
8 cd02698 Peptidase_C1A_Cathepsi 100.0 5.4E-42 1.2E-46 311.5 16.0 180 84-307 1-239 (239)
9 PTZ00049 cathepsin C-like prot 100.0 4.1E-40 9E-45 330.1 17.5 207 81-309 378-679 (693)
10 cd02248 Peptidase_C1A Peptidas 100.0 8.8E-39 1.9E-43 283.4 15.6 168 85-302 1-208 (210)
11 PTZ00364 dipeptidyl-peptidase 100.0 6E-38 1.3E-42 311.0 15.2 199 81-306 202-459 (548)
12 PF00112 Peptidase_C1: Papain 100.0 8.6E-38 1.9E-42 277.2 10.1 177 84-305 1-219 (219)
13 KOG1544|consensus 100.0 5.5E-37 1.2E-41 279.6 7.9 239 29-307 154-461 (470)
14 PTZ00462 Serine-repeat antigen 100.0 5.7E-34 1.2E-38 293.3 13.8 185 86-309 530-784 (1004)
15 smart00645 Pept_C1 Papain fami 100.0 4.8E-33 1E-37 241.3 9.9 151 84-303 1-172 (174)
16 cd02619 Peptidase_C1 C1 Peptid 100.0 2.4E-30 5.3E-35 229.8 12.8 169 87-292 1-213 (223)
17 COG4870 Cysteine protease [Pos 99.9 7.7E-22 1.7E-26 184.2 10.8 45 248-292 262-314 (372)
18 cd00585 Peptidase_C1B Peptidas 99.6 7.5E-15 1.6E-19 143.4 8.3 96 178-291 303-399 (437)
19 PF03051 Peptidase_C1_2: Pepti 98.6 1.7E-07 3.7E-12 92.0 10.7 108 165-290 289-399 (438)
20 COG3579 PepC Aminopeptidase C 96.4 0.0013 2.9E-08 61.9 1.3 44 247-290 358-401 (444)
21 PF08246 Inhibitor_I29: Cathep 95.8 0.0068 1.5E-07 42.6 2.5 35 17-54 24-58 (58)
22 KOG1543|consensus 94.2 0.033 7.2E-07 53.1 2.8 28 9-36 170-198 (325)
23 PF08127 Propeptide_C1: Peptid 93.6 0.034 7.4E-07 36.6 1.2 27 37-64 14-40 (41)
24 PF13529 Peptidase_C39_2: Pept 93.1 0.31 6.7E-06 39.1 6.4 59 170-276 86-144 (144)
25 smart00848 Inhibitor_I29 Cathe 90.6 0.11 2.5E-06 35.8 0.9 34 17-53 24-57 (57)
26 PTZ00364 dipeptidyl-peptidase 87.5 0.37 7.9E-06 49.1 2.3 38 3-40 266-307 (548)
27 cd02698 Peptidase_C1A_Cathepsi 84.2 0.65 1.4E-05 42.1 2.1 36 3-39 59-96 (239)
28 cd02620 Peptidase_C1A_Cathepsi 81.6 0.87 1.9E-05 41.2 1.8 37 3-40 55-95 (236)
29 cd02621 Peptidase_C1A_Cathepsi 80.8 1 2.3E-05 40.7 2.1 37 4-41 61-100 (243)
30 PTZ00049 cathepsin C-like prot 80.2 1.3 2.9E-05 46.1 2.8 38 3-41 444-484 (693)
31 PF14399 Transpep_BrtH: NlpC/p 77.8 6.5 0.00014 36.6 6.5 22 172-193 77-98 (317)
32 KOG1542|consensus 77.6 1.9 4.2E-05 41.2 2.8 37 7-44 213-250 (372)
33 PTZ00200 cysteine proteinase; 75.6 1.8 3.9E-05 43.2 2.1 37 3-40 284-323 (448)
34 KOG4128|consensus 72.7 0.48 1E-05 45.0 -2.5 41 249-289 370-412 (457)
35 COG4990 Uncharacterized protei 71.3 8.1 0.00018 33.7 4.8 20 172-191 122-141 (195)
36 PF05543 Peptidase_C47: Stapho 70.8 15 0.00032 31.9 6.3 36 250-291 119-154 (175)
37 PTZ00203 cathepsin L protease; 70.2 3 6.5E-05 40.2 2.2 38 3-41 175-218 (348)
38 smart00645 Pept_C1 Papain fami 62.6 5.3 0.00011 34.3 2.0 29 11-40 62-91 (174)
39 PTZ00021 falcipain-2; Provisio 57.5 5.9 0.00013 40.0 1.6 37 3-40 315-355 (489)
40 cd02248 Peptidase_C1A Peptidas 53.6 11 0.00023 32.8 2.5 36 4-40 50-89 (210)
41 PF09778 Guanylate_cyc_2: Guan 53.3 53 0.0012 29.5 6.8 21 172-192 112-132 (212)
42 KOG1544|consensus 51.7 5.1 0.00011 38.1 0.1 27 10-36 273-299 (470)
43 cd00044 CysPc Calpains, domain 45.5 64 0.0014 30.3 6.6 31 248-278 233-263 (315)
44 PF00112 Peptidase_C1: Papain 44.1 18 0.00038 31.3 2.4 27 10-36 63-90 (219)
45 cd02549 Peptidase_C39A A sub-f 36.4 92 0.002 24.7 5.4 21 250-276 93-114 (141)
46 PF12385 Peptidase_C70: Papain 30.0 1.1E+02 0.0024 26.3 4.8 21 172-192 97-117 (166)
47 cd02619 Peptidase_C1 C1 Peptid 29.5 44 0.00094 28.8 2.5 26 11-36 64-90 (223)
48 PF00376 MerR: MerR family reg 26.9 47 0.001 21.1 1.6 17 16-32 10-26 (38)
No 1
>KOG1542|consensus
Probab=100.00 E-value=1.6e-51 Score=378.72 Aligned_cols=219 Identities=25% Similarity=0.512 Sum_probs=167.1
Q ss_pred CCCCCcccccccCCCCcHHHHHHHhCCCCCCCCCCCCCCcc-ccc-CCCCCCCCCceecCCCCCCCCCCccccCccCCCC
Q psy8713 34 GAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGY-SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS 111 (309)
Q Consensus 34 ~~~~s~~~g~Nf~~~~~~e~~~~~lg~~~~~~~~~~~~~~~-~~~-~~~~~~lP~~~DwR~~~~~cg~vtpVkdQg~CGS 111 (309)
++.+|-..|+|+|+|+++|||+++++...... . +.+.. ... .....+||++||||++ |+||||||||+|||
T Consensus 108 ~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~--~-~~~~~~~~~~~~~~~~lP~~fDWR~k----gaVTpVKnQG~CGS 180 (372)
T KOG1542|consen 108 NDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRG--S-KLPGDAAEAPIEPGESLPESFDWRDK----GAVTPVKNQGMCGS 180 (372)
T ss_pred cCccccccCccchhhcCHHHHHHHhhcccccc--c-cCccccccCcCCCCCCCCcccchhcc----CCccccccCCcCcc
Confidence 44579999999999999999888765543321 0 11111 111 2347789999999999 99999999999999
Q ss_pred Cc--------------------CCCccccCCCCCCCCCCCCCCCCCCC-------------------CCccccc-ccccC
Q psy8713 112 CW--------------------GCRPYEIAPCEHHVNGTRPSCDASKG-------------------HTPKCVR-ECQEN 151 (309)
Q Consensus 112 CW--------------------~cs~~~~~~C~~~~~g~~~~C~~~~~-------------------~~~~~~~-~c~~~ 151 (309)
|| +.|.|++.+|+.-+ ..|+++.. |+..-.+ .|...
T Consensus 181 CWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d----~gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C~~~ 256 (372)
T KOG1542|consen 181 CWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCD----NGCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGNQCHFD 256 (372)
T ss_pred hhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcC----CcCCCCChhHHHHHHHHhCCccccccCCccccCCCccccc
Confidence 99 66788899997643 34777653 3332222 33211
Q ss_pred cccccccccccee-eeeeecchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCc
Q psy8713 152 YDVPYKKDLNFGA-KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230 (309)
Q Consensus 152 ~~~~~~~~~~~~~-~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~ 230 (309)
. .+....+ ..+.++.||++|.+.|.++|||+|+|++ ..++.|.+||...-
T Consensus 257 ~-----~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~----------------------- 307 (372)
T KOG1542|consen 257 K-----SKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPS----------------------- 307 (372)
T ss_pred h-----hhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCC-----------------------
Confidence 1 1112222 3345677999999999999999999997 57899999999831
Q ss_pred CCcceeccccccccCCCccCCceEEEEEeeccCCC-CccEEEEEcCCCCCCCCCceEEEEccCCccccCcceEEEe
Q psy8713 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS-KEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305 (309)
Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~-g~~YWiikNSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~~ 305 (309)
...|++..++|||+|||||.. . .++|||||||||++|||+|||||.||.|.|||++++++++
T Consensus 308 -----------~~~Cs~~~~~HaVLlvGyG~~--g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~~ 370 (372)
T KOG1542|consen 308 -----------KYICSPKLLNHAVLLVGYGSS--GYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSAA 370 (372)
T ss_pred -----------cccCCccccCceEEEEeecCC--CCCCceEEEECCccccccccceEEEeccccccccccchhhhh
Confidence 047988889999999999988 5 8999999999999999999999999999999999998765
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=3.6e-47 Score=361.66 Aligned_cols=235 Identities=17% Similarity=0.317 Sum_probs=160.9
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHH-hCCCCCCCCCCCCCCcccccC-CCCCCCCCceecCCCC
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIGYS-EVDEDLPANFDSRTKW 94 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~-lg~~~~~~~~~~~~~~~~~~~-~~~~~lP~~~DwR~~~ 94 (309)
++.+..++|.+||.. + .+|++|+|+|+|+++|||+++ ++.......+........... ....+||++||||++
T Consensus 61 iF~~N~~~I~~~N~~---~-~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DWR~~- 135 (348)
T PTZ00203 61 NFERNLELMREHQAR---N-PHARFGITKFFDLSEAEFAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREK- 135 (348)
T ss_pred HHHHHHHHHHHHhcc---C-CCeEEeccccccCCHHHHHHHhcCCCcccccccccccccccccccccccCCCCCcCCcC-
Confidence 467778889888853 2 489999999999999999864 432211100100000001111 112368999999999
Q ss_pred CCCCCCccccCccCCCCCc--------------------CCCccccCCCCC-------CCCC---------CC-CCCCCC
Q psy8713 95 PNCPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEH-------HVNG---------TR-PSCDAS 137 (309)
Q Consensus 95 ~~cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~-------~~~g---------~~-~~C~~~ 137 (309)
|.|+||||||.||||| ..|+|++.+|.. |... .. ..+.+.
T Consensus 136 ---g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~~~GC~GG~~~~a~~yi~~~~~ggi~~e~ 212 (348)
T PTZ00203 136 ---GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEK 212 (348)
T ss_pred ---CCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccCCCCCCCCCCHHHHHHHHHHhcCCCCCccc
Confidence 9999999999999999 345556666643 2100 00 122333
Q ss_pred CCCCccccc---ccccCccccccccccceeee-eeecchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchh
Q psy8713 138 KGHTPKCVR---ECQENYDVPYKKDLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213 (309)
Q Consensus 138 ~~~~~~~~~---~c~~~~~~~~~~~~~~~~~~-~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~ 213 (309)
.+|+..... .|...... ...+.... ..++.++++|+.+|+++|||+|+|++ .+|+.|++|||.
T Consensus 213 ~YPY~~~~~~~~~C~~~~~~----~~~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a-~~f~~Y~~GIy~-------- 279 (348)
T PTZ00203 213 SYPYVSGNGDVPECSNSSEL----APGARIDGYVSMESSERVMAAWLAKNGPISIAVDA-SSFMSYHSGVLT-------- 279 (348)
T ss_pred cCCCccCCCCCCcCCCCccc----ccceEecceeecCcCHHHHHHHHHhCCCEEEEEEh-hhhcCccCceee--------
Confidence 344432221 24321110 01112222 24555788899999999999999998 489999999997
Q ss_pred hhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCceEEEEccCC
Q psy8713 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293 (309)
Q Consensus 214 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~Gy~~i~~g~n 293 (309)
. |.....||||+|||||.+ ++++|||||||||++|||+|||||+||.|
T Consensus 280 -----------------------------~-c~~~~~nHaVliVGYG~~--~g~~YWiikNSWG~~WGe~GY~ri~rg~n 327 (348)
T PTZ00203 280 -----------------------------S-CIGEQLNHGVLLVGYNMT--GEVPYWVIKNSWGEDWGEKGYVRVTMGVN 327 (348)
T ss_pred -----------------------------c-cCCCCCCeEEEEEEEecC--CCceEEEEEcCCCCCcCcCceEEEEcCCC
Confidence 3 544456999999999987 78999999999999999999999999999
Q ss_pred ccccCcceEEE
Q psy8713 294 ECGIESSITAG 304 (309)
Q Consensus 294 ~cgi~~~~~~~ 304 (309)
.|||++.++.+
T Consensus 328 ~Cgi~~~~~~~ 338 (348)
T PTZ00203 328 ACLLTGYPVSV 338 (348)
T ss_pred cccccceEEEE
Confidence 99999887765
No 3
>KOG1543|consensus
Probab=100.00 E-value=5e-46 Score=351.24 Aligned_cols=235 Identities=27% Similarity=0.483 Sum_probs=174.0
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHHhCCCCCCCCCCCCCCcccccCCCCCCCCCceecCCCCCC
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DwR~~~~~ 96 (309)
.+...++++..|++.. ..+|++++|+|++++.||++.+........ . ............+||++||||++|
T Consensus 49 ~f~~n~~~~~~~n~~~---~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~-~---~~~~~~~~~~~~~~p~s~DwR~~~-- 119 (325)
T KOG1543|consen 49 IFKENLQKIESHNLKY---VLSFLMGVNQFADLTTEEFKRKKTGKKPPE-I---KRDKFTEKLDGDDLPDSFDWRDKG-- 119 (325)
T ss_pred HHHHHHHHHHhhhhhh---ceeeeeccccccccchHHHHHhhccccCcc-c---cccccccccchhhCCCCccccccC--
Confidence 4566677777777766 579999999888888777766554433221 1 111111222356899999999994
Q ss_pred CCCCccccCccCCCCCc---------------------CCCccccCCCCCCCCCCCCCCCCCC-----------------
Q psy8713 97 CPTIREIRDQGSCGSCW---------------------GCRPYEIAPCEHHVNGTRPSCDASK----------------- 138 (309)
Q Consensus 97 cg~vtpVkdQg~CGSCW---------------------~cs~~~~~~C~~~~~g~~~~C~~~~----------------- 138 (309)
..+.|||||+.||||| ..|++++.+|... ....|.++.
T Consensus 120 -~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~---~~~GC~GG~~~~A~~yi~~~G~~t~~ 195 (325)
T KOG1543|consen 120 -AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGE---CGDGCNGGEPKNAFKYIKKNGGVTEC 195 (325)
T ss_pred -CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCC---CCCCcCCCCHHHHHHHHHHhCCCCCC
Confidence 3455699999999999 3456666666553 122344432
Q ss_pred --CCCcccccccccCccccccccccceeeeeeecchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhh
Q psy8713 139 --GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216 (309)
Q Consensus 139 --~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~ 216 (309)
||+......|..... ...++....+.++.++++|+.+|+.+|||+|+|.++.+|+.|++|||..
T Consensus 196 ~~Ypy~~~~~~C~~~~~----~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~---------- 261 (325)
T KOG1543|consen 196 ENYPYIGKDGTCKSNKK----DKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAE---------- 261 (325)
T ss_pred cCCCCcCCCCCccCCCc----cceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeC----------
Confidence 343333346654332 3345555667888889999999999999999999999999999999994
Q ss_pred hhhhhcccCcccCcCCcceeccccccccCCCc-cCCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCceEEEEccCCcc
Q psy8713 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA-LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295 (309)
Q Consensus 217 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~Gy~~i~~g~n~c 295 (309)
.+... .++|||+|||||+. ++.+|||||||||++|||+|||||.|+.+.|
T Consensus 262 ---------------------------~~~~~~~~~Hav~iVGyG~~--~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~ 312 (325)
T KOG1543|consen 262 ---------------------------EKGDDKEGDHAVLIVGYGTG--DGVDYWIVKNSWGTDWGEKGYFRIARGVNKC 312 (325)
T ss_pred ---------------------------CCCCCCCCCceEEEEEEcCC--CCceeEEEEcCCCCCcccCceEEEecCCCch
Confidence 33333 57999999999993 8899999999999999999999999999999
Q ss_pred ccCcceEEEeec
Q psy8713 296 GIESSITAGVPK 307 (309)
Q Consensus 296 gi~~~~~~~~~~ 307 (309)
+|++.++++.++
T Consensus 313 ~I~~~~~~~p~~ 324 (325)
T KOG1543|consen 313 GIASEASYGPIK 324 (325)
T ss_pred hhhcccccCCCC
Confidence 999999886664
No 4
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=4e-44 Score=351.29 Aligned_cols=233 Identities=21% Similarity=0.383 Sum_probs=159.4
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHH-hCCCCC-CC---CCCCCCCcc----cccCCC-CCCCCC
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSW-MGVHPD-YN---LPANRLPEL----IGYSEV-DEDLPA 86 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~-lg~~~~-~~---~~~~~~~~~----~~~~~~-~~~lP~ 86 (309)
++.+.+++|.+||... + .+|++|+|+|+|++.|||+++ ++.... .. .+..+.... ....+. ...+|+
T Consensus 192 iF~~Nl~~Ie~hN~~~--~-~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~ 268 (489)
T PTZ00021 192 SFVENLAKINAHNNKE--N-VLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSNGKKSPRVINYDDVIKKYKPKDATFDHA 268 (489)
T ss_pred HHHHHHHHHHHhhccC--C-CCEEEeccccccCCHHHHHHHhccccccccccccccccccccccccccccccccccCCcc
Confidence 4677888999999642 3 589999999999999999875 443211 00 000000000 000011 112499
Q ss_pred ceecCCCCCCCCCCccccCccCCCCCc--------------------CCCccccCCCCCCCCCCCCCCCCC---------
Q psy8713 87 NFDSRTKWPNCPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEHHVNGTRPSCDAS--------- 137 (309)
Q Consensus 87 ~~DwR~~~~~cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~~~~g~~~~C~~~--------- 137 (309)
+||||++ |.|+||||||.||||| ..|+|++.+|... ..+|.++
T Consensus 269 s~DWR~~----g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVDCs~~----n~GC~GG~~~~Af~yi 340 (489)
T PTZ00021 269 KYDWRLH----NGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFK----NNGCYGGLIPNAFEDM 340 (489)
T ss_pred ccccccC----CCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhhhccC----CCCCCCcchHhhhhhh
Confidence 9999999 9999999999999999 4566667677532 2234443
Q ss_pred ----------CCCCccc-ccccccCccccccccccceeeee-eecchHHHHHHHHHhcCCEEEEEecccccccCCCceEe
Q psy8713 138 ----------KGHTPKC-VRECQENYDVPYKKDLNFGAKSY-SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205 (309)
Q Consensus 138 ----------~~~~~~~-~~~c~~~~~~~~~~~~~~~~~~~-~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~ 205 (309)
.+|+... ...|..... ...+.+..| .++ +++|+++|+.+|||+|+|.+.++|+.|++|||.
T Consensus 341 ~~~gGl~tE~~YPY~~~~~~~C~~~~~-----~~~~~i~~y~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~ 413 (489)
T PTZ00021 341 IELGGLCSEDDYPYVSDTPELCNIDRC-----KEKYKIKSYVSIP--EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFD 413 (489)
T ss_pred hhccccCcccccCccCCCCCccccccc-----cccceeeeEEEec--HHHHHHHHHhcCCeEEEEEeecccccCCCCcCC
Confidence 3343222 123321110 112333333 343 468999999999999999998899999999998
Q ss_pred CCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCC--------CCccEEEEEcCCC
Q psy8713 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK--------SKEKYWLIANSWN 277 (309)
Q Consensus 206 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~--------~g~~YWiikNSWG 277 (309)
..|+.. ++|||+|||||++.. .+.+|||||||||
T Consensus 414 -------------------------------------~~C~~~-~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWG 455 (489)
T PTZ00021 414 -------------------------------------GECGEE-PNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWG 455 (489)
T ss_pred -------------------------------------CCCCCc-cceEEEEEEecCcCCcccccccCCCCCEEEEECCCC
Confidence 447654 699999999997521 2357999999999
Q ss_pred CCCCCCceEEEEccC----CccccCcceEEEe
Q psy8713 278 TDWGDNGLFKILRGK----DECGIESSITAGV 305 (309)
Q Consensus 278 ~~WG~~Gy~~i~~g~----n~cgi~~~~~~~~ 305 (309)
++|||+|||||+|+. |.|||++.+++++
T Consensus 456 t~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP~ 487 (489)
T PTZ00021 456 ESWGEKGFIRIETDENGLMKTCSLGTEAYVPL 487 (489)
T ss_pred CCcccCeEEEEEcCCCCCCCCCCCcccceeEe
Confidence 999999999999986 5899999876643
No 5
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=1.1e-43 Score=346.61 Aligned_cols=232 Identities=24% Similarity=0.410 Sum_probs=158.8
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHHh-CCCCCCCC----CCCC-----C-Cccccc--------
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWM-GVHPDYNL----PANR-----L-PELIGY-------- 77 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~l-g~~~~~~~----~~~~-----~-~~~~~~-------- 77 (309)
++....++|.+|+- + .+|++|+|+|+|+++|||++++ +...+... +... . ......
T Consensus 149 iF~~Nl~~I~~hN~----~-~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (448)
T PTZ00200 149 TFRNNYLEVKSHKG----D-EPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARHVSNPTYLKNLKKAKNT 223 (448)
T ss_pred HHHHHHHHHHHhcC----c-CCeEEeccccccCCHHHHHHHhccCCCccccccccccccccccccccccccccccccccc
Confidence 45566777888862 2 5899999999999999997754 33221110 0000 0 000000
Q ss_pred -C---CCCCCCCCceecCCCCCCCCCCccccCcc-CCCCCc--------------------CCCccccCCCCCCCCCCCC
Q psy8713 78 -S---EVDEDLPANFDSRTKWPNCPTIREIRDQG-SCGSCW--------------------GCRPYEIAPCEHHVNGTRP 132 (309)
Q Consensus 78 -~---~~~~~lP~~~DwR~~~~~cg~vtpVkdQg-~CGSCW--------------------~cs~~~~~~C~~~~~g~~~ 132 (309)
. .....+|++||||++ |.|+|||||| .||||| ..|+|++.+|... ..
T Consensus 224 ~~~~~~~~~~~P~~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~~~~~LSeQqLvDC~~~----~~ 295 (448)
T PTZ00200 224 DEDVKDPSKITGEGLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQELVNCDTK----SQ 295 (448)
T ss_pred ccccccccccCCCCccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHHHhcCCCeecCHHHHhhccCc----cC
Confidence 0 001236999999999 9999999999 999999 4566777777542 12
Q ss_pred CCCCC------------------CCCCcccccccccCccccccccccceeeeeeecchHHHHHHHHHhcCCEEEEEeccc
Q psy8713 133 SCDAS------------------KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194 (309)
Q Consensus 133 ~C~~~------------------~~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~ 194 (309)
+|.++ .+|+......|..... ..+.+..|.+..+.+ ++++++.+|||+|+|.+++
T Consensus 296 GC~GG~~~~A~~yi~~~Gi~~e~~YPY~~~~~~C~~~~~------~~~~i~~y~~~~~~~-~l~~~l~~GPV~v~i~~~~ 368 (448)
T PTZ00200 296 GCSGGYPDTALEYVKNKGLSSSSDVPYLAKDGKCVVSST------KKVYIDSYLVAKGKD-VLNKSLVISPTVVYIAVSR 368 (448)
T ss_pred CCCCCcHHHHHHHHhhcCccccccCCCCCCCCCCcCCCC------CeeEecceEecCHHH-HHHHHHhcCCEEEEeeccc
Confidence 34443 3455444445532211 112233344444444 4455556899999999988
Q ss_pred ccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEc
Q psy8713 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274 (309)
Q Consensus 195 ~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikN 274 (309)
+|+.|++|||. ..|+.. ++|||+|||||.+...+.+||||||
T Consensus 369 ~f~~Yk~GIy~-------------------------------------~~C~~~-~nHaV~lVGyG~d~~~g~~YWIIkN 410 (448)
T PTZ00200 369 ELLKYKSGVYN-------------------------------------GECGKS-LNHAVLLVGEGYDEKTKKRYWIIKN 410 (448)
T ss_pred ccccCCCCccc-------------------------------------cccCCC-CcEEEEEEEecccCCCCCceEEEEc
Confidence 99999999998 447654 6999999999965336889999999
Q ss_pred CCCCCCCCCceEEEEcc---CCccccCcceEEEee
Q psy8713 275 SWNTDWGDNGLFKILRG---KDECGIESSITAGVP 306 (309)
Q Consensus 275 SWG~~WG~~Gy~~i~~g---~n~cgi~~~~~~~~~ 306 (309)
|||++|||+|||||+|+ .|.|||++.+..++.
T Consensus 411 SWG~~WGe~GY~ri~r~~~g~n~CGI~~~~~~P~~ 445 (448)
T PTZ00200 411 SWGTDWGENGYMRLERTNEGTDKCGILTVGLTPVF 445 (448)
T ss_pred CCCCCcccCeeEEEEeCCCCCCcCCccccceeeEE
Confidence 99999999999999995 489999998877654
No 6
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=4.9e-43 Score=318.95 Aligned_cols=187 Identities=35% Similarity=0.672 Sum_probs=135.2
Q ss_pred CCCceecCCCCCCCCCCccccCccCCCCCc--------------------------CCCccccCCCCCCCCCCCCCCCCC
Q psy8713 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW--------------------------GCRPYEIAPCEHHVNGTRPSCDAS 137 (309)
Q Consensus 84 lP~~~DwR~~~~~cg~vtpVkdQg~CGSCW--------------------------~cs~~~~~~C~~~~~g~~~~C~~~ 137 (309)
||++||||+.++++..|+|||||+.||||| ..|++++.+|... ...|.++
T Consensus 1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~----~~GC~GG 76 (243)
T cd02621 1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQY----SQGCDGG 76 (243)
T ss_pred CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCC----CCCCCCC
Confidence 799999999987777999999999999999 1346667777542 1234443
Q ss_pred C------------------CCCcc-cccccccCc-c-ccccccccce-eeeeeecchHHHHHHHHHhcCCEEEEEecccc
Q psy8713 138 K------------------GHTPK-CVRECQENY-D-VPYKKDLNFG-AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195 (309)
Q Consensus 138 ~------------------~~~~~-~~~~c~~~~-~-~~~~~~~~~~-~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~ 195 (309)
. +|+.. ....|.... . ..+.. ..+. +..+....++++||++|+++|||+++|.++++
T Consensus 77 ~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~ 155 (243)
T cd02621 77 FPFLVGKFAEDFGIVTEDYFPYTADDDRPCKASPSECRRYYF-SDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSD 155 (243)
T ss_pred CHHHHHHHHHhcCcCCCceeCCCCCCCCCCCCCccccccccc-cceeEcccccccCCHHHHHHHHHHcCCEEEEEEeccc
Confidence 2 34433 334453211 0 00000 0011 11111234788999999999999999999989
Q ss_pred cccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCC--------CccCCceEEEEEeeccCCCCc
Q psy8713 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKE 267 (309)
Q Consensus 196 f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~HaV~iVGyg~~~~~g~ 267 (309)
|+.|++|||..+... ..|. ...++|||+|||||++..+++
T Consensus 156 F~~Y~~GIy~~~~~~--------------------------------~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~ 203 (243)
T cd02621 156 FDFYKEGVYHHTDND--------------------------------EVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGE 203 (243)
T ss_pred ccccCCeEECcCCcc--------------------------------cccccccccccCcccCCeEEEEEEeeccCCCCC
Confidence 999999999843100 1132 235799999999998732388
Q ss_pred cEEEEEcCCCCCCCCCceEEEEccCCccccCcceEEEeec
Q psy8713 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307 (309)
Q Consensus 268 ~YWiikNSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~~~~ 307 (309)
+|||||||||++|||+|||||+|+.|.|||++.+++++|.
T Consensus 204 ~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~~ 243 (243)
T cd02621 204 KYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAYPI 243 (243)
T ss_pred cEEEEEcCCCCCCCcCCeEEEecCCcccCcccceEeeccC
Confidence 9999999999999999999999999999999999999883
No 7
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=4.6e-43 Score=318.01 Aligned_cols=178 Identities=49% Similarity=1.000 Sum_probs=131.9
Q ss_pred CCceecCCCCCCCCCCccccCccCCCCCc----------------------CCCccccCCCCCCCCCCCCCCCCCC----
Q psy8713 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW----------------------GCRPYEIAPCEHHVNGTRPSCDASK---- 138 (309)
Q Consensus 85 P~~~DwR~~~~~cg~vtpVkdQg~CGSCW----------------------~cs~~~~~~C~~~~~g~~~~C~~~~---- 138 (309)
|++||||+++++|..|+||+|||.||||| ..|+|++.+|... ....|.++.
T Consensus 1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~---~~~gC~GG~~~~a 77 (236)
T cd02620 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSG---CGDGCNGGYPDAA 77 (236)
T ss_pred CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCC---CCCCCCCCCHHHH
Confidence 89999999988877778999999999999 3456666666542 122344332
Q ss_pred --------------CCCcccc------------------cccccCccccccccccceeeeeeecchHHHHHHHHHhcCCE
Q psy8713 139 --------------GHTPKCV------------------RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186 (309)
Q Consensus 139 --------------~~~~~~~------------------~~c~~~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv 186 (309)
+|+.... ..|.......+..+.......+.+..++++||++|+++|||
T Consensus 78 ~~~i~~~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPv 157 (236)
T cd02620 78 WKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPV 157 (236)
T ss_pred HHHHHhcCCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecceeeeCCHHHHHHHHHHHCCCe
Confidence 2332111 12321111011121222223345556789999999999999
Q ss_pred EEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCC
Q psy8713 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266 (309)
Q Consensus 187 ~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g 266 (309)
+++|.+.++|+.|++|||. ..|+...++|||+|||||++ ++
T Consensus 158 ~v~i~~~~~f~~Y~~Giy~-------------------------------------~~~~~~~~~HaV~iVGyg~~--~g 198 (236)
T cd02620 158 QAAFTVYEDFLYYKSGVYQ-------------------------------------HTSGKQLGGHAVKIIGWGVE--NG 198 (236)
T ss_pred EEEEEechhhhhcCCcEEe-------------------------------------ecCCCCcCCeEEEEEEEecc--CC
Confidence 9999998899999999998 33555567999999999988 78
Q ss_pred ccEEEEEcCCCCCCCCCceEEEEccCCccccCcceEEE
Q psy8713 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304 (309)
Q Consensus 267 ~~YWiikNSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~ 304 (309)
++|||||||||++|||+|||||+|+.|+|||++.++++
T Consensus 199 ~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~~ 236 (236)
T cd02620 199 VPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVAG 236 (236)
T ss_pred eeEEEEEeCCCCCCCCCcEEEEEccCcccccccceecC
Confidence 99999999999999999999999999999999988753
No 8
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=5.4e-42 Score=311.55 Aligned_cols=180 Identities=27% Similarity=0.505 Sum_probs=130.2
Q ss_pred CCCceecCCCCCCCCCCccccCcc---CCCCCc-----------------------CCCccccCCCCCCCCCCCCCCCCC
Q psy8713 84 LPANFDSRTKWPNCPTIREIRDQG---SCGSCW-----------------------GCRPYEIAPCEHHVNGTRPSCDAS 137 (309)
Q Consensus 84 lP~~~DwR~~~~~cg~vtpVkdQg---~CGSCW-----------------------~cs~~~~~~C~~~~~g~~~~C~~~ 137 (309)
||++||||+++.. ..|+|||||| .||||| ..|+|++.+|.. ...|.++
T Consensus 1 lP~~~Dwr~~~~~-~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-----~~gC~GG 74 (239)
T cd02698 1 LPKSWDWRNVNGV-NYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-----GGSCHGG 74 (239)
T ss_pred CCCCcccccCCCC-cccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-----CCCccCc
Confidence 7999999998221 2899999998 899999 245666677753 1224332
Q ss_pred C------------------CCCccccccccc----------CccccccccccceeeeeeecchHHHHHHHHHhcCCEEEE
Q psy8713 138 K------------------GHTPKCVRECQE----------NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189 (309)
Q Consensus 138 ~------------------~~~~~~~~~c~~----------~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~ 189 (309)
. +|+..-...|.. ......+....+.+..|....++++||++|+++|||+++
T Consensus 75 ~~~~a~~~~~~~Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~~~~~i~~~l~~~GPV~v~ 154 (239)
T cd02698 75 DPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCG 154 (239)
T ss_pred CHHHHHHHHHHcCcCCCCeeCCcCCCCCCcCCCCCCCcccCcccccccccceEEeeeceecCCHHHHHHHHHHcCCEEEE
Confidence 2 333222222321 000001111122333332223578899999999999999
Q ss_pred EecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccE
Q psy8713 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269 (309)
Q Consensus 190 ~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~Y 269 (309)
|.++++|+.|++|||+. .+....++|||+|||||++. ++++|
T Consensus 155 i~~~~~f~~Y~~GIy~~-------------------------------------~~~~~~~~HaV~IVGyG~~~-~g~~Y 196 (239)
T cd02698 155 IMATEALENYTGGVYKE-------------------------------------YVQDPLINHIISVAGWGVDE-NGVEY 196 (239)
T ss_pred EEecccccccCCeEEcc-------------------------------------CCCCCcCCeEEEEEEEEecC-CCCEE
Confidence 99988999999999983 23344579999999999872 38999
Q ss_pred EEEEcCCCCCCCCCceEEEEccC-----CccccCcceEEEeec
Q psy8713 270 WLIANSWNTDWGDNGLFKILRGK-----DECGIESSITAGVPK 307 (309)
Q Consensus 270 WiikNSWG~~WG~~Gy~~i~~g~-----n~cgi~~~~~~~~~~ 307 (309)
||||||||++|||+|||||+|+. |+|+||+.+++++|.
T Consensus 197 WiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~~ 239 (239)
T cd02698 197 WIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEEDCAWADPI 239 (239)
T ss_pred EEEEcCCCcccCcCceEEEEccCCcccccccccccceEEEeeC
Confidence 99999999999999999999999 999999999999983
No 9
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=4.1e-40 Score=330.12 Aligned_cols=207 Identities=28% Similarity=0.529 Sum_probs=139.0
Q ss_pred CCCCCCceecCCCCCCCCCCccccCccCCCCCc---------------C---------------CCccccCCCCCCCCCC
Q psy8713 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------G---------------CRPYEIAPCEHHVNGT 130 (309)
Q Consensus 81 ~~~lP~~~DwR~~~~~cg~vtpVkdQg~CGSCW---------------~---------------cs~~~~~~C~~~~~g~ 130 (309)
...||++||||+.|++|+.++||+|||.||||| . .|+|++.+|...
T Consensus 378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~---- 453 (693)
T PTZ00049 378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFY---- 453 (693)
T ss_pred cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCC----
Confidence 468999999999999999999999999999999 0 355667777532
Q ss_pred CCCCCCC------------------CCCCcccccccccCcc-----------------------------------cccc
Q psy8713 131 RPSCDAS------------------KGHTPKCVRECQENYD-----------------------------------VPYK 157 (309)
Q Consensus 131 ~~~C~~~------------------~~~~~~~~~~c~~~~~-----------------------------------~~~~ 157 (309)
..+|.++ .+|+......|..... ....
T Consensus 454 nqGC~GG~~~~A~kya~~~GI~tEscYPY~a~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 533 (693)
T PTZ00049 454 DQGCNGGFPYLVSKMAKLQGIPLDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSE 533 (693)
T ss_pred CCCcCCCcHHHHHHHHHHCCCCcCCccCCcCCCCCCCCCCCCcccccccccccccccccccccccccccccccccccccc
Confidence 1224333 2344322233321000 0001
Q ss_pred ccccceeeee-ee--------cchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCccc
Q psy8713 158 KDLNFGAKSY-SV--------SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228 (309)
Q Consensus 158 ~~~~~~~~~~-~v--------~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~ 228 (309)
.+..+. +.| .+ +.++++||++|+.+|||+|+|+++++|+.|++|||..+...++..+
T Consensus 534 ~~r~y~-k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C------------- 599 (693)
T PTZ00049 534 PARWYA-KDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRC------------- 599 (693)
T ss_pred ccceee-eeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCccccccccc-------------
Confidence 111111 222 11 2478899999999999999999988999999999985310000000
Q ss_pred CcCCccee-ccccccccCCCccCCceEEEEEeeccCCCC--ccEEEEEcCCCCCCCCCceEEEEccCCccccCcceEEEe
Q psy8713 229 GAEGAFTV-FDDLILYKSGKALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305 (309)
Q Consensus 229 ~~~~~~~~-~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g--~~YWiikNSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~~ 305 (309)
..+. -...++...+....+|||+|||||.+..+| .+|||||||||+.||++|||||.||.|.|||++.+++++
T Consensus 600 ----~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~ 675 (693)
T PTZ00049 600 ----TVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIE 675 (693)
T ss_pred ----CCccccccccccccccccCceEEEEEEeccccCCCcccCEEEEECCCCCCcccCceEEEEcCCCccCCccceeEEe
Confidence 0000 000000022334579999999999763235 489999999999999999999999999999999999999
Q ss_pred ecCC
Q psy8713 306 PKLD 309 (309)
Q Consensus 306 ~~~~ 309 (309)
|++.
T Consensus 676 pd~~ 679 (693)
T PTZ00049 676 PDFS 679 (693)
T ss_pred eecc
Confidence 9973
No 10
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=8.8e-39 Score=283.36 Aligned_cols=168 Identities=29% Similarity=0.632 Sum_probs=128.2
Q ss_pred CCceecCCCCCCCCCCccccCccCCCCCc--------------------CCCccccCCCCCCCCCCCCCCCCCC------
Q psy8713 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEHHVNGTRPSCDASK------ 138 (309)
Q Consensus 85 P~~~DwR~~~~~cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~~~~g~~~~C~~~~------ 138 (309)
|++||||+. +.++||+|||.||||| ..|++.+..|... ....|.++.
T Consensus 1 P~~~d~r~~----~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~---~~~gC~GG~~~~a~~ 73 (210)
T cd02248 1 PESVDWREK----GAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTS---GNNGCNGGNPDNAFE 73 (210)
T ss_pred CCcccCCcC----CCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCC---CCCCCCCCCHHHhHH
Confidence 899999999 7799999999999999 4567777777642 123344432
Q ss_pred ------------CCCcccccccccCccccccccccceeeee-eecc-hHHHHHHHHHhcCCEEEEEecccccccCCCceE
Q psy8713 139 ------------GHTPKCVRECQENYDVPYKKDLNFGAKSY-SVSS-NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204 (309)
Q Consensus 139 ------------~~~~~~~~~c~~~~~~~~~~~~~~~~~~~-~v~~-~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy 204 (309)
+|+......|..... ...+.+..| .+.. ++++||++|+++|||+++|.+.++|+.|++|||
T Consensus 74 ~~~~~Gi~~e~~yPY~~~~~~C~~~~~-----~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy 148 (210)
T cd02248 74 YVKNGGLASESDYPYTGKDGTCKYNSS-----KVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIY 148 (210)
T ss_pred HHHHCCcCccccCCccCCCCCccCCCC-----cccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCce
Confidence 343332334432211 123334333 3443 478999999999999999999999999999999
Q ss_pred eCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCc
Q psy8713 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284 (309)
Q Consensus 205 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~G 284 (309)
..+ .|....++|||+|||||++ .+.+|||||||||+.||++|
T Consensus 149 ~~~------------------------------------~~~~~~~~Hav~iVGy~~~--~~~~ywiv~NSWG~~WG~~G 190 (210)
T cd02248 149 SGP------------------------------------CCSNTNLNHAVLLVGYGTE--NGVDYWIVKNSWGTSWGEKG 190 (210)
T ss_pred eCC------------------------------------CCCCCcCCEEEEEEEEeec--CCceEEEEEcCCCCccccCc
Confidence 843 2324567999999999998 68899999999999999999
Q ss_pred eEEEEccCCccccCcceE
Q psy8713 285 LFKILRGKDECGIESSIT 302 (309)
Q Consensus 285 y~~i~~g~n~cgi~~~~~ 302 (309)
||||+++.|.|||++.+.
T Consensus 191 y~~i~~~~~~cgi~~~~~ 208 (210)
T cd02248 191 YIRIARGSNLCGIASYAS 208 (210)
T ss_pred EEEEEcCCCccCceeeee
Confidence 999999999999997654
No 11
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=6e-38 Score=311.00 Aligned_cols=199 Identities=22% Similarity=0.336 Sum_probs=131.4
Q ss_pred CCCCCCceecCCCCCCCCCCccccCccC---CCCCc--------------------------CCCccccCCCCCC---CC
Q psy8713 81 DEDLPANFDSRTKWPNCPTIREIRDQGS---CGSCW--------------------------GCRPYEIAPCEHH---VN 128 (309)
Q Consensus 81 ~~~lP~~~DwR~~~~~cg~vtpVkdQg~---CGSCW--------------------------~cs~~~~~~C~~~---~~ 128 (309)
..+||++||||++. ++..|+|||||+. ||||| ..|+|++.+|... ..
T Consensus 202 ~~~LP~sfDWR~~g-g~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCd 280 (548)
T PTZ00364 202 GDPPPAAWSWGDVG-GASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCA 280 (548)
T ss_pred ccCCCCccccCcCC-CCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCC
Confidence 46799999999982 2247999999999 99999 1355666677531 11
Q ss_pred C-----------CCCCCCCCCC--CCccccc---ccccCccc-cc-cccccceeeeeeecchHHHHHHHHHhcCCEEEEE
Q psy8713 129 G-----------TRPSCDASKG--HTPKCVR---ECQENYDV-PY-KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAF 190 (309)
Q Consensus 129 g-----------~~~~C~~~~~--~~~~~~~---~c~~~~~~-~~-~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~~ 190 (309)
| ..+.+.+..+ |+..... .|...... .+ ..+..+....+.+..++++||++|+++|||+|+|
T Consensus 281 GG~p~~A~~yi~~~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GPVsVaI 360 (548)
T PTZ00364 281 GGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASV 360 (548)
T ss_pred CCcHHHHHHHHHhCCcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHHHHHHHHHHcCCeEEEE
Confidence 1 1122223334 4432111 24321110 00 0000111122334457889999999999999999
Q ss_pred ecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccc-------cccCCCccCCceEEEEEeeccC
Q psy8713 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI-------LYKSGKALGGHAIRILGWGEDE 263 (309)
Q Consensus 191 ~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~HaV~iVGyg~~~ 263 (309)
+++.+|+.|++|||... .|.+.++.. ...+.....||||+|||||.++
T Consensus 361 da~~df~~YksGiy~gi-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de 415 (548)
T PTZ00364 361 YANSDWYNCDENSTEDV-------------------------RYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE 415 (548)
T ss_pred EechHHHhcCCCCccCe-------------------------eccccccccccccCCcccccccccCCeEEEEEEecccC
Confidence 99989999999998620 000000000 0001123569999999999753
Q ss_pred CCCccEEEEEcCCCC--CCCCCceEEEEccCCccccCcceEEEee
Q psy8713 264 KSKEKYWLIANSWNT--DWGDNGLFKILRGKDECGIESSITAGVP 306 (309)
Q Consensus 264 ~~g~~YWiikNSWG~--~WG~~Gy~~i~~g~n~cgi~~~~~~~~~ 306 (309)
++.+|||||||||+ +|||+|||||+||.|+||||+.++++.|
T Consensus 416 -~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~~v~~~~ 459 (548)
T PTZ00364 416 -NGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVVVMYW 459 (548)
T ss_pred -CCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccceeeeeee
Confidence 67899999999999 9999999999999999999999998766
No 12
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=8.6e-38 Score=277.22 Aligned_cols=177 Identities=33% Similarity=0.604 Sum_probs=123.7
Q ss_pred CCCceecCCCCCCCCCCccccCccCCCCCc---------------------CCCccccCCC--------CCCCCC-----
Q psy8713 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------------GCRPYEIAPC--------EHHVNG----- 129 (309)
Q Consensus 84 lP~~~DwR~~~~~cg~vtpVkdQg~CGSCW---------------------~cs~~~~~~C--------~~~~~g----- 129 (309)
||++||||+.+ +.++||+||+.||||| ..|++.+..| .++...
T Consensus 1 lP~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~ 77 (219)
T PF00112_consen 1 LPKSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKY 77 (219)
T ss_dssp STSSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHH
T ss_pred CCCCEecccCC---CCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCccccccee
Confidence 79999999961 3699999999999999 2233333334 332100
Q ss_pred --C-CCCCCCCCCCCcccc-cccccCccccccccccceeeee-eecc-hHHHHHHHHHhcCCEEEEEeccc-ccccCCCc
Q psy8713 130 --T-RPSCDASKGHTPKCV-RECQENYDVPYKKDLNFGAKSY-SVSS-NEKSIMKEIYEHGPVEGAFTVFD-DLILYKSG 202 (309)
Q Consensus 130 --~-~~~C~~~~~~~~~~~-~~c~~~~~~~~~~~~~~~~~~~-~v~~-~~~~ik~~l~~~GPv~v~~~~~~-~f~~Y~sG 202 (309)
. .+.+....+|+.... ..|....... ..+....+ .+.. ++++||++|+++|||+++|.+.+ +|+.|++|
T Consensus 78 ~~~~~Gi~~e~~~pY~~~~~~~c~~~~~~~----~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~g 153 (219)
T PF00112_consen 78 IKNNNGIVTEEDYPYNGNENPTCKSKKSNS----YYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSG 153 (219)
T ss_dssp HHHHTSBEBTTTS--SSSSSCSSCHSGGGE----EEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESS
T ss_pred ecccCccccccccccccccccccccccccc----ccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccce
Confidence 1 233334344444333 3343221100 01222233 3333 58999999999999999999987 69999999
Q ss_pred eEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCC
Q psy8713 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282 (309)
Q Consensus 203 iy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~ 282 (309)
||.. ..|....++|||+|||||++ .+++|||||||||++||+
T Consensus 154 i~~~------------------------------------~~~~~~~~~Hav~iVGy~~~--~~~~~wiv~NSWG~~WG~ 195 (219)
T PF00112_consen 154 IYDP------------------------------------PDCSNESGGHAVLIVGYDDE--NGKGYWIVKNSWGTDWGD 195 (219)
T ss_dssp EECS------------------------------------TSSSSSSEEEEEEEEEEEEE--TTEEEEEEE-SBTTTSTB
T ss_pred eeec------------------------------------cccccccccccccccccccc--cceeeEeeehhhCCccCC
Confidence 9994 23555678999999999999 799999999999999999
Q ss_pred CceEEEEccCC-ccccCcceEEEe
Q psy8713 283 NGLFKILRGKD-ECGIESSITAGV 305 (309)
Q Consensus 283 ~Gy~~i~~g~n-~cgi~~~~~~~~ 305 (309)
+|||||+|+.+ +|+|++.+++++
T Consensus 196 ~Gy~~i~~~~~~~c~i~~~~~~~~ 219 (219)
T PF00112_consen 196 NGYFRISYDYNNECGIESQAVYPI 219 (219)
T ss_dssp TTEEEEESSSSSGGGTTSSEEEEE
T ss_pred CeEEEEeeCCCCcCccCceeeecC
Confidence 99999999997 999999999874
No 13
>KOG1544|consensus
Probab=100.00 E-value=5.5e-37 Score=279.64 Aligned_cols=239 Identities=33% Similarity=0.643 Sum_probs=169.9
Q ss_pred ccccCCCCCCcccccc--cCCCCcHHHHHHHhCCCCCCCCCCCC-CCcccccCCCCCCCCCceecCCCCCCCCCCccccC
Q psy8713 29 GIVSGGAYGSKQAEKN--SLSNIPRAHLKSWMGVHPDYNLPANR-LPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105 (309)
Q Consensus 29 ~~~~~~~~~s~~~g~N--f~~~~~~e~~~~~lg~~~~~~~~~~~-~~~~~~~~~~~~~lP~~~DwR~~~~~cg~vtpVkd 105 (309)
+-+..++| +|+++.- |+.++.++-||-.||+.++.. +.. ..+.+...+...+||++||.|.||| +++.|+.|
T Consensus 154 E~in~G~Y-gW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~--sv~nMNEi~~~l~p~~~LPE~F~As~KWp--~liH~plD 228 (470)
T KOG1544|consen 154 EAINQGNY-GWQAGNYSAFWGMTLDDGIKYRLGTLRPSS--SVMNMNEIYTVLNPGEVLPEAFEASEKWP--NLIHEPLD 228 (470)
T ss_pred HHHhcCCc-cccccchhhhhcccccccceeeecccCchh--hhhhHHhHhhccCcccccchhhhhhhcCC--ccccCccc
Confidence 44566777 6676654 999999999999999987652 111 2233333444678999999999999 78999999
Q ss_pred ccCCCCCc----------------------CCCccccCCCCCCCCCCCCCCCCCC-------------------------
Q psy8713 106 QGSCGSCW----------------------GCRPYEIAPCEHHVNGTRPSCDASK------------------------- 138 (309)
Q Consensus 106 Qg~CGSCW----------------------~cs~~~~~~C~~~~~g~~~~C~~~~------------------------- 138 (309)
||+|+++| ..|+|.+.+|.... ...|.++.
T Consensus 229 QgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~---q~GC~gG~lDRAWWYlRKrGvVsdhCYP~~~dQ 305 (470)
T KOG1544|consen 229 QGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQ---QQGCRGGRLDRAWWYLRKRGVVSDHCYPFSGDQ 305 (470)
T ss_pred cCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhh---hccCccCcccchheeeecccccccccccccCCC
Confidence 99999999 34566666775431 11233322
Q ss_pred -CCCcccc--------------cccccCccccccccccce-eeeeeecchHHHHHHHHHhcCCEEEEEecccccccCCCc
Q psy8713 139 -GHTPKCV--------------RECQENYDVPYKKDLNFG-AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202 (309)
Q Consensus 139 -~~~~~~~--------------~~c~~~~~~~~~~~~~~~-~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sG 202 (309)
.+...|. +.|..++. +....|. ..-|.|.++|++||++|+++|||.+.|.|+++|+.|++|
T Consensus 306 ~~~~~~C~m~sR~~grgkRqat~~CPn~~~---~Sn~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgG 382 (470)
T KOG1544|consen 306 AGPAPPCMMHSRAMGRGKRQATAHCPNSYV---NSNDIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGG 382 (470)
T ss_pred CCCCCCceeeccccCcccccccCcCCCccc---ccCceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Confidence 1112221 11222111 0001121 234688889999999999999999999999999999999
Q ss_pred eEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCC---CCccEEEEEcCCCCC
Q psy8713 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTD 279 (309)
Q Consensus 203 iy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~---~g~~YWiikNSWG~~ 279 (309)
||.-... . +.. .......+.|+|.|.|||++.. ...+|||..||||+.
T Consensus 383 iY~H~~~------------~--------------~~~---~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~ 433 (470)
T KOG1544|consen 383 IYSHTPV------------S--------------LGR---PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPA 433 (470)
T ss_pred eeecccc------------c--------------cCC---chhhhhcccceEEEeecccccCCCCCeeEEEEeecccccc
Confidence 9982100 0 000 1223346799999999998753 456899999999999
Q ss_pred CCCCceEEEEccCCccccCcceEEEeec
Q psy8713 280 WGDNGLFKILRGKDECGIESSITAGVPK 307 (309)
Q Consensus 280 WG~~Gy~~i~~g~n~cgi~~~~~~~~~~ 307 (309)
|||+|||||.||+|+|.||+.+.+|+..
T Consensus 434 WGE~GYFriLRGvNecdIEsfvIgAWGr 461 (470)
T KOG1544|consen 434 WGERGYFRILRGVNECDIESFVIGAWGR 461 (470)
T ss_pred cccCceEEEeccccchhhhHhhhhhhhc
Confidence 9999999999999999999999888754
No 14
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=5.7e-34 Score=293.33 Aligned_cols=185 Identities=26% Similarity=0.481 Sum_probs=129.6
Q ss_pred CceecCCCCCCCCCCccccCccCCCCCc--------------------CCCccccCCCCCCCCCCCCCCCCC--------
Q psy8713 86 ANFDSRTKWPNCPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEHHVNGTRPSCDAS-------- 137 (309)
Q Consensus 86 ~~~DwR~~~~~cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~~~~g~~~~C~~~-------- 137 (309)
+.+|+.+++++|..+.||||||.||||| ..|++++.+|.... ....|.++
T Consensus 530 ~yCdR~kD~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~--gn~GC~GG~~~~efl~ 607 (1004)
T PTZ00462 530 EFCNRLKDENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGE--HKDRCDEGSNPLEFLQ 607 (1004)
T ss_pred hhccccccCCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhccccc--CCCCCCCCCcHHHHHH
Confidence 6789999999998899999999999999 34555556664310 01223332
Q ss_pred ------------CCCCcc--cccccccCcc-----------cccccc--ccceeeee-eecc---------hHHHHHHHH
Q psy8713 138 ------------KGHTPK--CVRECQENYD-----------VPYKKD--LNFGAKSY-SVSS---------NEKSIMKEI 180 (309)
Q Consensus 138 ------------~~~~~~--~~~~c~~~~~-----------~~~~~~--~~~~~~~~-~v~~---------~~~~ik~~l 180 (309)
.+|+.. ....|..... +.+... .......| .+.. -++.||++|
T Consensus 608 yI~e~GgLptESdYPYt~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI 687 (1004)
T PTZ00462 608 IIEDNGFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEI 687 (1004)
T ss_pred HHHHcCCCcccccCCCccCCCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHH
Confidence 234421 2234532110 000000 01111112 1111 146899999
Q ss_pred HhcCCEEEEEecccccccCC-CceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEe
Q psy8713 181 YEHGPVEGAFTVFDDLILYK-SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259 (309)
Q Consensus 181 ~~~GPv~v~~~~~~~f~~Y~-sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGy 259 (309)
+.+|||+|+|.+. +|+.|. +|||.. ..|+...++|||+||||
T Consensus 688 ~~kGPVaV~IdAs-df~~Y~~sGIyv~------------------------------------~~Cgs~~~nHAVlIVGY 730 (1004)
T PTZ00462 688 MNKGSVIAYIKAE-NVLGYEFNGKKVQ------------------------------------NLCGDDTADHAVNIVGY 730 (1004)
T ss_pred HhcCCEEEEEEee-hHHhhhcCCcccc------------------------------------CCCCCCcCCceEEEEEe
Confidence 9999999999984 788884 898763 45776667999999999
Q ss_pred eccC---CCCccEEEEEcCCCCCCCCCceEEEEc-cCCccccCcceEEEeecCC
Q psy8713 260 GEDE---KSKEKYWLIANSWNTDWGDNGLFKILR-GKDECGIESSITAGVPKLD 309 (309)
Q Consensus 260 g~~~---~~g~~YWiikNSWG~~WG~~Gy~~i~~-g~n~cgi~~~~~~~~~~~~ 309 (309)
|.+. +.+++|||||||||++|||+|||||+| ++++|||++++++++|+||
T Consensus 731 Gt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t~~~fn~d 784 (1004)
T PTZ00462 731 GNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHSVVIFNID 784 (1004)
T ss_pred cccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchheeeeeEeec
Confidence 9742 246799999999999999999999998 7899999999999999987
No 15
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00 E-value=4.8e-33 Score=241.34 Aligned_cols=151 Identities=34% Similarity=0.756 Sum_probs=103.8
Q ss_pred CCCceecCCCCCCCCCCccccCccCCCCCc--------------------CCCccccCCCCCCCCCCCCCCCCCCCCCcc
Q psy8713 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143 (309)
Q Consensus 84 lP~~~DwR~~~~~cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~~~~g~~~~C~~~~~~~~~ 143 (309)
||++||||++ +.++||+||+.||||| +.|++++..|... ....|.++.... .
T Consensus 1 lP~~~D~R~~----~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~---~~~gC~GG~~~~-a 72 (174)
T smart00645 1 LPESFDWRKK----GAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTG---GNNGCNGGLPDN-A 72 (174)
T ss_pred CCCcCccccc----CCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCC---CCCCCCCcCHHH-H
Confidence 6999999999 6799999999999999 4566777777652 122466543210 0
Q ss_pred cccccccCccccccccccceeeeeeecchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcc
Q psy8713 144 CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223 (309)
Q Consensus 144 ~~~~c~~~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~ 223 (309)
..-.....-++.+++.+|.. ++.+.+ .+|+.|++|||+.+
T Consensus 73 -~~~~~~~~Gi~~e~~~PY~~----------------------~~~~~~-~~f~~Y~~Gi~~~~---------------- 112 (174)
T smart00645 73 -FEYIKKNGGLETESCYPYTG----------------------SVAIDA-SDFQFYKSGIYDHP---------------- 112 (174)
T ss_pred -HHHHHHcCCcccccccCccc----------------------EEEEEc-ccccCCcCeEECCC----------------
Confidence 00000000001111111110 444544 46999999999832
Q ss_pred cCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCceEEEEccC-CccccCcceE
Q psy8713 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK-DECGIESSIT 302 (309)
Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~Gy~~i~~g~-n~cgi~~~~~ 302 (309)
.|+...++|+|+|||||.+ .++++|||||||||+.|||+|||||.|+. |.|+|+....
T Consensus 113 --------------------~~~~~~~~Hav~ivGyg~~-~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~~ 171 (174)
T smart00645 113 --------------------GCGSGTLDHAVLIVGYGTE-ENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA 171 (174)
T ss_pred --------------------CCCCCcccEEEEEEEEeec-CCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeeee
Confidence 3555457999999999975 26789999999999999999999999998 9999977664
Q ss_pred E
Q psy8713 303 A 303 (309)
Q Consensus 303 ~ 303 (309)
+
T Consensus 172 ~ 172 (174)
T smart00645 172 S 172 (174)
T ss_pred e
Confidence 3
No 16
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.97 E-value=2.4e-30 Score=229.75 Aligned_cols=169 Identities=27% Similarity=0.444 Sum_probs=111.9
Q ss_pred ceecCCCCCCCCCCccccCccCCCCCc----------------------CCCccccCCCCCCCCC-CCCCCCCCC-----
Q psy8713 87 NFDSRTKWPNCPTIREIRDQGSCGSCW----------------------GCRPYEIAPCEHHVNG-TRPSCDASK----- 138 (309)
Q Consensus 87 ~~DwR~~~~~cg~vtpVkdQg~CGSCW----------------------~cs~~~~~~C~~~~~g-~~~~C~~~~----- 138 (309)
.+|||.. + ++||+|||.||+|| ..|++++..|...... ....|.++.
T Consensus 1 ~~d~r~~----~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~ 75 (223)
T cd02619 1 SVDLRPL----R-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSAL 75 (223)
T ss_pred CCcchhc----C-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHH
Confidence 4799998 6 99999999999999 3455566666432100 013444433
Q ss_pred --------------CCCcccccccccCccccccccccceeeee-eec-chHHHHHHHHHhcCCEEEEEecccccccCCCc
Q psy8713 139 --------------GHTPKCVRECQENYDVPYKKDLNFGAKSY-SVS-SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202 (309)
Q Consensus 139 --------------~~~~~~~~~c~~~~~~~~~~~~~~~~~~~-~v~-~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sG 202 (309)
+|+..-...|..... .......+....| .+. .++++||++|+++|||+++|.+.++|..|++|
T Consensus 76 ~~~~~~~Gi~~e~~~Py~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~ 154 (223)
T cd02619 76 LKLVALKGIPPEEDYPYGAESDGEEPKSE-AALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEG 154 (223)
T ss_pred HHHHHHcCCCccccCCCCCCCCCCCCCCc-cchhhcceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCc
Confidence 222211111111000 0001122333333 233 25789999999999999999999999999999
Q ss_pred eEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCC
Q psy8713 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282 (309)
Q Consensus 203 iy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~ 282 (309)
+|..+.. ....+....++|||+|||||++...+++|||||||||+.||+
T Consensus 155 ~~~~~~~-------------------------------~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~ 203 (223)
T cd02619 155 IIYEEIV-------------------------------YLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGD 203 (223)
T ss_pred ccccccc-------------------------------ccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCcccc
Confidence 8741000 003355667899999999998832378999999999999999
Q ss_pred CceEEEEccC
Q psy8713 283 NGLFKILRGK 292 (309)
Q Consensus 283 ~Gy~~i~~g~ 292 (309)
+||+||+++.
T Consensus 204 ~Gy~~i~~~~ 213 (223)
T cd02619 204 NGYGRISYED 213 (223)
T ss_pred CCEEEEehhh
Confidence 9999999973
No 17
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.86 E-value=7.7e-22 Score=184.23 Aligned_cols=45 Identities=36% Similarity=0.664 Sum_probs=39.2
Q ss_pred ccCCceEEEEEeeccC--------CCCccEEEEEcCCCCCCCCCceEEEEccC
Q psy8713 248 ALGGHAIRILGWGEDE--------KSKEKYWLIANSWNTDWGDNGLFKILRGK 292 (309)
Q Consensus 248 ~~~~HaV~iVGyg~~~--------~~g~~YWiikNSWG~~WG~~Gy~~i~~g~ 292 (309)
...+|||+||||++.. +.+.+.||||||||++||++|||||+...
T Consensus 262 ~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y 314 (372)
T COG4870 262 ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY 314 (372)
T ss_pred ccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence 5679999999999864 24567999999999999999999999853
No 18
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.55 E-value=7.5e-15 Score=143.40 Aligned_cols=96 Identities=24% Similarity=0.255 Sum_probs=64.6
Q ss_pred HHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEE
Q psy8713 178 KEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257 (309)
Q Consensus 178 ~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iV 257 (309)
++|.+.+||.++.++. .|+.|++||+.... ..+-.+. +. .+.+-.+-.+ .+.....+|||+||
T Consensus 303 ~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~--~~~~~~f-------~~------~~~~~KaeRl-~~~es~~tHAM~iv 365 (437)
T cd00585 303 AQLKDGEPVWFGCDVG-KFSDRKSGILDTDL--FDYELLF-------GI------DFGLNKAERL-DYGESLMTHAMVLT 365 (437)
T ss_pred HHHhcCCCEEEEEEcC-hhhccCCccccCcc--cchhhhc-------Cc------cccCCHHHHH-hhcCCcCCeEEEEE
Confidence 5778889999999996 57799999996310 0000000 00 0000011111 23344569999999
Q ss_pred EeeccCCCCc-cEEEEEcCCCCCCCCCceEEEEcc
Q psy8713 258 GWGEDEKSKE-KYWLIANSWNTDWGDNGLFKILRG 291 (309)
Q Consensus 258 Gyg~~~~~g~-~YWiikNSWG~~WG~~Gy~~i~~g 291 (309)
|||.+. +|+ .||+||||||+.||++|||+|+++
T Consensus 366 Gv~~D~-~g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 366 GVDLDE-DGKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred EEEecC-CCCcceEEEEcccCCCCCCCcceehhHH
Confidence 999873 465 699999999999999999999874
No 19
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.64 E-value=1.7e-07 Score=92.05 Aligned_cols=108 Identities=25% Similarity=0.269 Sum_probs=61.5
Q ss_pred eeeeecch--HHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceecccccc
Q psy8713 165 KSYSVSSN--EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242 (309)
Q Consensus 165 ~~~~v~~~--~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 242 (309)
..++++-+ .+.+.+.|...-||..+-+|.+ +..-+.||.+.+ .-..-.+.+.. ....++-+
T Consensus 289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k-~~~~k~Gi~D~~--~~d~~~~fg~~-------~~~~K~~R------- 351 (438)
T PF03051_consen 289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGK-FFDRKNGIMDTD--LYDYDSLFGVD-------FNMSKAER------- 351 (438)
T ss_dssp EEEE--HHHHHHHHHHHHHTT--EEEEEETTT-TEETTTTEE-TT--SB-HHHHHT---------S-S-HHHH-------
T ss_pred eEeccCHHHHHHHHHHHHHcCCcEEEeccCCc-cccccchhhccc--hhhhhhhhccc-------cccCHHHH-------
Confidence 34577642 2344556666779999999975 456688998732 11111111111 11111110
Q ss_pred ccCCCccCCceEEEEEeeccCCCCc-cEEEEEcCCCCCCCCCceEEEEc
Q psy8713 243 YKSGKALGGHAIRILGWGEDEKSKE-KYWLIANSWNTDWGDNGLFKILR 290 (309)
Q Consensus 243 ~~~~~~~~~HaV~iVGyg~~~~~g~-~YWiikNSWG~~WG~~Gy~~i~~ 290 (309)
-.......+|||+|||...+. +|. .+|+|+||||+..|.+|||.|+.
T Consensus 352 l~~~eS~~tHAM~itGv~~D~-~g~p~~wkVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 352 LDYGESTMTHAMVITGVDLDE-DGKPVRWKVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp HHTTSS--EEEEEEEEEEE-T-TSSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred HHhCCCCCceeEEEEEEEecc-CCCeeEEEEEcCCCCCCCCCcEEEECH
Confidence 012334569999999999864 555 69999999999999999999975
No 20
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=96.40 E-value=0.0013 Score=61.94 Aligned_cols=44 Identities=32% Similarity=0.555 Sum_probs=35.9
Q ss_pred CccCCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCceEEEEc
Q psy8713 247 KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290 (309)
Q Consensus 247 ~~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~Gy~~i~~ 290 (309)
.....|||+|.|.+-++....-=|.|.||||.+-|.+|||-++.
T Consensus 358 eSLmTHAMvlTGvd~d~~g~p~rwkVENSWG~d~G~~GyfvaSd 401 (444)
T COG3579 358 ESLMTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVASD 401 (444)
T ss_pred hHHHHHHHHhhccccccCCCceeeEeecccccccCCCceEeehH
Confidence 34568999999999765334457999999999999999997753
No 21
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=95.85 E-value=0.0068 Score=42.63 Aligned_cols=35 Identities=9% Similarity=0.060 Sum_probs=27.7
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHH
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHL 54 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~ 54 (309)
++....+.|.+|| . .+-.+|++|+|.|+|++.||+
T Consensus 24 ~F~~N~~~I~~~N--~-~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 24 IFKENLRRIEEHN--A-NGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHHHHHHH--H-TTSSSEEE-SSTTTTSSHHHH
T ss_pred HHHHHHHHHHHHh--c-CCCCCeEEeCccccCcChhhC
Confidence 4677888999999 3 333789999999999999886
No 22
>KOG1543|consensus
Probab=94.22 E-value=0.033 Score=53.09 Aligned_cols=28 Identities=50% Similarity=0.927 Sum_probs=26.1
Q ss_pred cCCCCCCCcchHHHHHHHHhcccc-CCCC
Q psy8713 9 CGFGCNGGFPGMAWRYWVKSGIVS-GGAY 36 (309)
Q Consensus 9 ~~~~~~~~~~~~a~~~~~~~~~~~-~~~~ 36 (309)
||.||+||++..||+||.++|+++ ..+|
T Consensus 170 ~~~GC~GG~~~~A~~yi~~~G~~t~~~~Y 198 (325)
T KOG1543|consen 170 CGDGCNGGEPKNAFKYIKKNGGVTECENY 198 (325)
T ss_pred CCCCcCCCCHHHHHHHHHHhCCCCCCcCC
Confidence 689999999999999999999999 6776
No 23
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=93.63 E-value=0.034 Score=36.60 Aligned_cols=27 Identities=30% Similarity=0.348 Sum_probs=17.1
Q ss_pred CCcccccccCCCCcHHHHHHHhCCCCCC
Q psy8713 37 GSKQAEKNSLSNIPRAHLKSWMGVHPDY 64 (309)
Q Consensus 37 ~s~~~g~Nf~~~~~~e~~~~~lg~~~~~ 64 (309)
.+|++|.|| ..++.+++++|||+.+..
T Consensus 14 ~tWkAG~NF-~~~~~~~ik~LlGv~~~~ 40 (41)
T PF08127_consen 14 TTWKAGRNF-ENTSIEYIKRLLGVLPDP 40 (41)
T ss_dssp -SEEE-----SSB-HHHHHHCS-B-TTS
T ss_pred CcccCCCCC-CCCCHHHHHHHcCCCCCC
Confidence 599999996 999999999999997653
No 24
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=93.08 E-value=0.31 Score=39.13 Aligned_cols=59 Identities=24% Similarity=0.363 Sum_probs=35.0
Q ss_pred cchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCcc
Q psy8713 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249 (309)
Q Consensus 170 ~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (309)
..+..+|+++|.+..||.+.+...-. ......+. ...
T Consensus 86 ~~~~~~i~~~i~~G~Pvi~~~~~~~~--~~~~~~~~-----------------------------------------~~~ 122 (144)
T PF13529_consen 86 DASFDDIKQEIDAGRPVIVSVNSGWR--PPNGDGYD-----------------------------------------GTY 122 (144)
T ss_dssp TS-HHHHHHHHHTT--EEEEEETTSS----TTEEEE-----------------------------------------E-T
T ss_pred CCcHHHHHHHHHCCCcEEEEEEcccc--cCCCCCcC-----------------------------------------CCc
Confidence 34678899999998899998864211 00112222 223
Q ss_pred CCceEEEEEeeccCCCCccEEEEEcCC
Q psy8713 250 GGHAIRILGWGEDEKSKEKYWLIANSW 276 (309)
Q Consensus 250 ~~HaV~iVGyg~~~~~g~~YWiikNSW 276 (309)
..|.|+|+||.++ . +++|-.+|
T Consensus 123 ~~H~vvi~Gy~~~----~-~~~v~DP~ 144 (144)
T PF13529_consen 123 GGHYVVIIGYDED----G-YVYVNDPW 144 (144)
T ss_dssp TEEEEEEEEE-SS----E--EEEE-TT
T ss_pred CCEEEEEEEEeCC----C-EEEEeCCC
Confidence 5999999999986 2 78888776
No 25
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=90.55 E-value=0.11 Score=35.81 Aligned_cols=34 Identities=6% Similarity=0.026 Sum_probs=25.8
Q ss_pred cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHH
Q psy8713 17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAH 53 (309)
Q Consensus 17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~ 53 (309)
++...+++|..|+.. +. .+|++|+|.|+|++.||
T Consensus 24 ~f~~n~~~i~~~N~~--~~-~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 24 IFKENLKFIEEHNKK--ND-HSYTLGLNQFADLTNEE 57 (57)
T ss_pred HHHHHHHHHHHHHhc--CC-CCeEecCcccccCCCCC
Confidence 456677788888843 23 68999999999988764
No 26
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=87.46 E-value=0.37 Score=49.13 Aligned_cols=38 Identities=29% Similarity=0.621 Sum_probs=31.6
Q ss_pred ccccccc---CCCCCCCcchHHHHHHHHhccccCCCC-CCcc
Q psy8713 3 TQQIRLC---GFGCNGGFPGMAWRYWVKSGIVSGGAY-GSKQ 40 (309)
Q Consensus 3 ~~~~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~-~s~~ 40 (309)
.||+=.| +.||+||.+..|++|+.++||+++..| ..|+
T Consensus 266 ~QqLVDCs~~n~GCdGG~p~~A~~yi~~~GI~tE~dY~~PY~ 307 (548)
T PTZ00364 266 ARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYD 307 (548)
T ss_pred HHHHhcccCCCCCCCCCcHHHHHHHHHhCCcccccccCCCCC
Confidence 3566667 689999999999999999999999987 2454
No 27
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=84.23 E-value=0.65 Score=42.06 Aligned_cols=36 Identities=33% Similarity=0.537 Sum_probs=30.0
Q ss_pred ccccccc--CCCCCCCcchHHHHHHHHhccccCCCCCCc
Q psy8713 3 TQQIRLC--GFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39 (309)
Q Consensus 3 ~~~~~~~--~~~~~~~~~~~a~~~~~~~~~~~~~~~~s~ 39 (309)
.||+=.| +.||+||.+..|++|++++|+.++..| .|
T Consensus 59 ~Q~lldC~~~~gC~GG~~~~a~~~~~~~Gl~~e~~y-PY 96 (239)
T cd02698 59 VQVVIDCAGGGSCHGGDPGGVYEYAHKHGIPDETCN-PY 96 (239)
T ss_pred HHHHHhCCCCCCccCcCHHHHHHHHHHcCcCCCCee-CC
Confidence 3566566 579999999999999999999998886 44
No 28
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=81.65 E-value=0.87 Score=41.16 Aligned_cols=37 Identities=43% Similarity=1.048 Sum_probs=31.2
Q ss_pred ccccccc----CCCCCCCcchHHHHHHHHhccccCCCCCCcc
Q psy8713 3 TQQIRLC----GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40 (309)
Q Consensus 3 ~~~~~~~----~~~~~~~~~~~a~~~~~~~~~~~~~~~~s~~ 40 (309)
.||+=.| ..||+||.+..|++|+.++|+.++..| .|+
T Consensus 55 ~Q~lidC~~~~~~gC~GG~~~~a~~~i~~~G~~~e~~y-PY~ 95 (236)
T cd02620 55 AQDLLSCCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQ-PYT 95 (236)
T ss_pred HHHHHhhcCCCCCCCCCCCHHHHHHHHHhcCCCcCCEe-cCc
Confidence 4666666 679999999999999999999998886 553
No 29
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=80.78 E-value=1 Score=40.72 Aligned_cols=37 Identities=32% Similarity=0.651 Sum_probs=30.7
Q ss_pred cccccc---CCCCCCCcchHHHHHHHHhccccCCCCCCccc
Q psy8713 4 QQIRLC---GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41 (309)
Q Consensus 4 ~~~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~~s~~~ 41 (309)
||+=.| ..||+||.+.+|+.|+.++|++++..| .|+.
T Consensus 61 q~l~dC~~~~~GC~GG~~~~a~~~~~~~Gi~~e~~y-PY~~ 100 (243)
T cd02621 61 QHVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYF-PYTA 100 (243)
T ss_pred HHhhhhcCCCCCCCCCCHHHHHHHHHhcCcCCCcee-CCCC
Confidence 555555 469999999999999999999999887 5644
No 30
>PTZ00049 cathepsin C-like protein; Provisional
Probab=80.21 E-value=1.3 Score=46.13 Aligned_cols=38 Identities=32% Similarity=0.545 Sum_probs=32.0
Q ss_pred ccccccc---CCCCCCCcchHHHHHHHHhccccCCCCCCccc
Q psy8713 3 TQQIRLC---GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41 (309)
Q Consensus 3 ~~~~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~~s~~~ 41 (309)
.||+=.| ..||+||.+..|++|+.++||.++..| .|+.
T Consensus 444 ~QqLLDCs~~nqGC~GG~~~~A~kya~~~GI~tEscY-PY~a 484 (693)
T PTZ00049 444 IQTVLSCSFYDQGCNGGFPYLVSKMAKLQGIPLDKVF-PYTA 484 (693)
T ss_pred HHHhcccCCCCCCcCCCcHHHHHHHHHHCCCCcCCcc-CCcC
Confidence 3667777 369999999999999999999999987 5643
No 31
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=77.83 E-value=6.5 Score=36.59 Aligned_cols=22 Identities=9% Similarity=0.081 Sum_probs=16.2
Q ss_pred hHHHHHHHHHhcCCEEEEEecc
Q psy8713 172 NEKSIMKEIYEHGPVEGAFTVF 193 (309)
Q Consensus 172 ~~~~ik~~l~~~GPv~v~~~~~ 193 (309)
-.+.|++.|....||.+.++.+
T Consensus 77 ~~~~l~~~l~~g~pv~~~~D~~ 98 (317)
T PF14399_consen 77 AWEELKEALDAGRPVIVWVDMY 98 (317)
T ss_pred HHHHHHHHHhCCCceEEEeccc
Confidence 3556777887777999987653
No 32
>KOG1542|consensus
Probab=77.64 E-value=1.9 Score=41.19 Aligned_cols=37 Identities=30% Similarity=0.602 Sum_probs=30.9
Q ss_pred cccCCCCCCCcchHHHHH-HHHhccccCCCCCCcccccc
Q psy8713 7 RLCGFGCNGGFPGMAWRY-WVKSGIVSGGAYGSKQAEKN 44 (309)
Q Consensus 7 ~~~~~~~~~~~~~~a~~~-~~~~~~~~~~~~~s~~~g~N 44 (309)
..|..||+||+..-|-+| +..-||+.+..| +|+.-..
T Consensus 213 D~~d~gC~GGl~~nA~~~~~~~gGL~~E~dY-PY~g~~~ 250 (372)
T KOG1542|consen 213 DSCDNGCNGGLMDNAFKYIKKAGGLEKEKDY-PYTGKKG 250 (372)
T ss_pred cCcCCcCCCCChhHHHHHHHHhCCccccccC-CccccCC
Confidence 378999999999999999 555899999998 7765443
No 33
>PTZ00200 cysteine proteinase; Provisional
Probab=75.56 E-value=1.8 Score=43.18 Aligned_cols=37 Identities=30% Similarity=0.619 Sum_probs=31.2
Q ss_pred cccccccC---CCCCCCcchHHHHHHHHhccccCCCCCCcc
Q psy8713 3 TQQIRLCG---FGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40 (309)
Q Consensus 3 ~~~~~~~~---~~~~~~~~~~a~~~~~~~~~~~~~~~~s~~ 40 (309)
.||+=.|. .||.||.+..|+.|+.++||.++..| .|+
T Consensus 284 eQqLvDC~~~~~GC~GG~~~~A~~yi~~~Gi~~e~~Y-PY~ 323 (448)
T PTZ00200 284 EQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDV-PYL 323 (448)
T ss_pred HHHHhhccCccCCCCCCcHHHHHHHHhhcCccccccC-CCC
Confidence 36666673 69999999999999999999999988 564
No 34
>KOG4128|consensus
Probab=72.74 E-value=0.48 Score=45.03 Aligned_cols=41 Identities=29% Similarity=0.428 Sum_probs=32.3
Q ss_pred cCCceEEEEEeec-cC-CCCccEEEEEcCCCCCCCCCceEEEE
Q psy8713 249 LGGHAIRILGWGE-DE-KSKEKYWLIANSWNTDWGDNGLFKIL 289 (309)
Q Consensus 249 ~~~HaV~iVGyg~-~~-~~g~~YWiikNSWG~~WG~~Gy~~i~ 289 (309)
.-.||++|.|-|. ++ ..+-.=|-|.||||++-|.+|+.++.
T Consensus 370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt 412 (457)
T KOG4128|consen 370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMT 412 (457)
T ss_pred HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhh
Confidence 4589999999882 22 24456799999999999999997664
No 35
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=71.31 E-value=8.1 Score=33.74 Aligned_cols=20 Identities=20% Similarity=0.120 Sum_probs=16.8
Q ss_pred hHHHHHHHHHhcCCEEEEEe
Q psy8713 172 NEKSIMKEIYEHGPVEGAFT 191 (309)
Q Consensus 172 ~~~~ik~~l~~~GPv~v~~~ 191 (309)
+..+|+..|.+..||.+-..
T Consensus 122 sl~~ik~ql~kg~PV~iw~T 141 (195)
T COG4990 122 SLSDIKGQLLKGRPVVIWVT 141 (195)
T ss_pred cHHHHHHHHhcCCcEEEEEe
Confidence 78899999999999987553
No 36
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=70.80 E-value=15 Score=31.90 Aligned_cols=36 Identities=22% Similarity=0.499 Sum_probs=24.8
Q ss_pred CCceEEEEEeeccCCCCccEEEEEcCCCCCCCCCceEEEEcc
Q psy8713 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291 (309)
Q Consensus 250 ~~HaV~iVGyg~~~~~g~~YWiikNSWG~~WG~~Gy~~i~~g 291 (309)
.+||++||||-.-. .|.++.++=|-|- ++++-++..
T Consensus 119 ~gHAlavvGya~~~-~g~~~y~~WNPW~-----~~~~~~sa~ 154 (175)
T PF05543_consen 119 AGHALAVVGYAKPN-NGQKTYYFWNPWW-----NDVMIQSAK 154 (175)
T ss_dssp -EEEEEEEEEEEET-TSEEEEEEE-TT------SS-EEEETT
T ss_pred cceeEEEEeeeecC-CCCeEEEEeCCcc-----CCcEEEecC
Confidence 48999999998642 6788999999984 445655554
No 37
>PTZ00203 cathepsin L protease; Provisional
Probab=70.19 E-value=3 Score=40.17 Aligned_cols=38 Identities=24% Similarity=0.529 Sum_probs=29.5
Q ss_pred ccccccc---CCCCCCCcchHHHHHHHHh---ccccCCCCCCccc
Q psy8713 3 TQQIRLC---GFGCNGGFPGMAWRYWVKS---GIVSGGAYGSKQA 41 (309)
Q Consensus 3 ~~~~~~~---~~~~~~~~~~~a~~~~~~~---~~~~~~~~~s~~~ 41 (309)
.||+=.| ..||+||.+..||+|+.++ ++.++..| .|+.
T Consensus 175 eQqLvdC~~~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~Y-PY~~ 218 (348)
T PTZ00203 175 EQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY-PYVS 218 (348)
T ss_pred HHHHHhccCCCCCCCCCCHHHHHHHHHHhcCCCCCccccC-CCcc
Confidence 3667677 4699999999999999876 46788887 5543
No 38
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=62.59 E-value=5.3 Score=34.28 Aligned_cols=29 Identities=38% Similarity=0.799 Sum_probs=25.3
Q ss_pred CCCCCCcchHHHHHHHHh-ccccCCCCCCcc
Q psy8713 11 FGCNGGFPGMAWRYWVKS-GIVSGGAYGSKQ 40 (309)
Q Consensus 11 ~~~~~~~~~~a~~~~~~~-~~~~~~~~~s~~ 40 (309)
.||.||.+..|++|+.++ |+.++..| .|+
T Consensus 62 ~gC~GG~~~~a~~~~~~~~Gi~~e~~~-PY~ 91 (174)
T smart00645 62 NGCNGGLPDNAFEYIKKNGGLETESCY-PYT 91 (174)
T ss_pred CCCCCcCHHHHHHHHHHcCCccccccc-Ccc
Confidence 499999999999999998 89988876 554
No 39
>PTZ00021 falcipain-2; Provisional
Probab=57.53 E-value=5.9 Score=39.98 Aligned_cols=37 Identities=27% Similarity=0.500 Sum_probs=30.2
Q ss_pred ccccccc---CCCCCCCcchHHHHHHHHh-ccccCCCCCCcc
Q psy8713 3 TQQIRLC---GFGCNGGFPGMAWRYWVKS-GIVSGGAYGSKQ 40 (309)
Q Consensus 3 ~~~~~~~---~~~~~~~~~~~a~~~~~~~-~~~~~~~~~s~~ 40 (309)
.||+=.| ..||+||.+..|++|+.++ ||.++..| .|+
T Consensus 315 eQqLVDCs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~Y-PY~ 355 (489)
T PTZ00021 315 EQELVDCSFKNNGCYGGLIPNAFEDMIELGGLCSEDDY-PYV 355 (489)
T ss_pred HHHHhhhccCCCCCCCcchHhhhhhhhhccccCccccc-Ccc
Confidence 3666667 4699999999999999875 89999887 564
No 40
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=53.56 E-value=11 Score=32.82 Aligned_cols=36 Identities=39% Similarity=0.710 Sum_probs=29.4
Q ss_pred cccccc----CCCCCCCcchHHHHHHHHhccccCCCCCCcc
Q psy8713 4 QQIRLC----GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40 (309)
Q Consensus 4 ~~~~~~----~~~~~~~~~~~a~~~~~~~~~~~~~~~~s~~ 40 (309)
||+-.| ..||.||....|++++.++|++++..| .|.
T Consensus 50 q~l~~c~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y-PY~ 89 (210)
T cd02248 50 QQLVDCSTSGNNGCNGGNPDNAFEYVKNGGLASESDY-PYT 89 (210)
T ss_pred HHHhccCCCCCCCCCCCCHHHhHHHHHHCCcCccccC-Ccc
Confidence 445455 469999999999999999999999887 553
No 41
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=53.29 E-value=53 Score=29.47 Aligned_cols=21 Identities=14% Similarity=0.240 Sum_probs=16.5
Q ss_pred hHHHHHHHHHhcCCEEEEEec
Q psy8713 172 NEKSIMKEIYEHGPVEGAFTV 192 (309)
Q Consensus 172 ~~~~ik~~l~~~GPv~v~~~~ 192 (309)
..++|...|...||+.+-++.
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~ 132 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDA 132 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEcc
Confidence 678899999998877766654
No 42
>KOG1544|consensus
Probab=51.67 E-value=5.1 Score=38.14 Aligned_cols=27 Identities=44% Similarity=0.818 Sum_probs=25.6
Q ss_pred CCCCCCCcchHHHHHHHHhccccCCCC
Q psy8713 10 GFGCNGGFPGMAWRYWVKSGIVSGGAY 36 (309)
Q Consensus 10 ~~~~~~~~~~~a~~~~~~~~~~~~~~~ 36 (309)
++||+||..-.||=||-++|+|.+.=|
T Consensus 273 q~GC~gG~lDRAWWYlRKrGvVsdhCY 299 (470)
T KOG1544|consen 273 QQGCRGGRLDRAWWYLRKRGVVSDHCY 299 (470)
T ss_pred hccCccCcccchheeeecccccccccc
Confidence 589999999999999999999999887
No 43
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=45.52 E-value=64 Score=30.30 Aligned_cols=31 Identities=23% Similarity=0.409 Sum_probs=24.6
Q ss_pred ccCCceEEEEEeeccCCCCccEEEEEcCCCC
Q psy8713 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278 (309)
Q Consensus 248 ~~~~HaV~iVGyg~~~~~g~~YWiikNSWG~ 278 (309)
...+||-.|++.-+-...+.+.-.|||-||.
T Consensus 233 l~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 233 LVKGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred cccCcceEEeEEEEEccCceEEEEecCCccC
Confidence 3469999999998652127899999999994
No 44
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=44.06 E-value=18 Score=31.30 Aligned_cols=27 Identities=41% Similarity=0.728 Sum_probs=25.3
Q ss_pred CCCCCCCcchHHHHHHHH-hccccCCCC
Q psy8713 10 GFGCNGGFPGMAWRYWVK-SGIVSGGAY 36 (309)
Q Consensus 10 ~~~~~~~~~~~a~~~~~~-~~~~~~~~~ 36 (309)
..+|.||.+..|+.++.+ +|+.++..|
T Consensus 63 ~~~c~gg~~~~a~~~~~~~~Gi~~e~~~ 90 (219)
T PF00112_consen 63 NKGCDGGSPFDALKYIKNNNGIVTEEDY 90 (219)
T ss_dssp SSTTBBBEHHHHHHHHHHHTSBEBTTTS
T ss_pred ccccccCcccccceeecccCcccccccc
Confidence 369999999999999999 999999998
No 45
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=36.43 E-value=92 Score=24.70 Aligned_cols=21 Identities=19% Similarity=0.498 Sum_probs=15.6
Q ss_pred CCceEEEEEee-ccCCCCccEEEEEcCC
Q psy8713 250 GGHAIRILGWG-EDEKSKEKYWLIANSW 276 (309)
Q Consensus 250 ~~HaV~iVGyg-~~~~~g~~YWiikNSW 276 (309)
.+|.|+|+||. .+ +.+|.+.|
T Consensus 93 ~gH~vVv~g~~~~~------~~~i~DP~ 114 (141)
T cd02549 93 SGHAMVVIGYDRKG------NVYVNDPG 114 (141)
T ss_pred CCeEEEEEEEcCCC------CEEEECCC
Confidence 48999999998 33 35666665
No 46
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=30.01 E-value=1.1e+02 Score=26.30 Aligned_cols=21 Identities=19% Similarity=0.303 Sum_probs=17.2
Q ss_pred hHHHHHHHHHhcCCEEEEEec
Q psy8713 172 NEKSIMKEIYEHGPVEGAFTV 192 (309)
Q Consensus 172 ~~~~ik~~l~~~GPv~v~~~~ 192 (309)
..+.+...|.++||+-++...
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~ 117 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEA 117 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecC
Confidence 467888999999999998543
No 47
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=29.46 E-value=44 Score=28.82 Aligned_cols=26 Identities=31% Similarity=0.371 Sum_probs=24.0
Q ss_pred CCCCCCcchHHHH-HHHHhccccCCCC
Q psy8713 11 FGCNGGFPGMAWR-YWVKSGIVSGGAY 36 (309)
Q Consensus 11 ~~~~~~~~~~a~~-~~~~~~~~~~~~~ 36 (309)
.||.||.+..++. ++.++|+.++..|
T Consensus 64 ~~c~gG~~~~~~~~~~~~~Gi~~e~~~ 90 (223)
T cd02619 64 GSCDGGGPLSALLKLVALKGIPPEEDY 90 (223)
T ss_pred CCCCCCcHHHHHHHHHHHcCCCccccC
Confidence 6899999999998 8889999999887
No 48
>PF00376 MerR: MerR family regulatory protein; InterPro: IPR000551 The many bacterial transcription regulation proteins which bind DNA through a 'helix-turn-helix' motif can be classified into subfamilies on the basis of sequence similarities. One of these is the MerR subfamily. MerR, which is found in many bacterial species mediates the mercuric-dependent induction of the mercury resistance operon. In the absence of mercury merR represses transcription by binding tightly, as a dimer, to the 'mer' operator region; when mercury is present the dimeric complex binds a single ion and becomes a potent transcriptional activator, while remaining bound to the mer site. Members of the family include the mercuric resistance operon regulatory protein merR; Bacillus subtilis bltR and bmrR; Bacillus glnR; Streptomyces coelicolor hspR; Bradyrhizobium japonicum nolA; Escherichia coli superoxide response regulator soxR; and Streptomyces lividans transcriptional activator tipA [, , , , , ]. Other members include hypothetical proteins from E. coli, B. subtilis and Haemophilus influenzae. Within this family, the HTH motif is situated towards the N terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 3HH0_A 2DG6_A 1R8D_B 1JBG_A 2VZ4_A 2ZHH_A 2ZHG_A 1Q07_A 1Q06_A 1Q05_B ....
Probab=26.90 E-value=47 Score=21.12 Aligned_cols=17 Identities=24% Similarity=0.646 Sum_probs=13.9
Q ss_pred CcchHHHHHHHHhcccc
Q psy8713 16 GFPGMAWRYWVKSGIVS 32 (309)
Q Consensus 16 ~~~~~a~~~~~~~~~~~ 32 (309)
|++..+.|||.+.||++
T Consensus 10 gvs~~tlR~ye~~Gll~ 26 (38)
T PF00376_consen 10 GVSPRTLRYYEREGLLP 26 (38)
T ss_dssp TS-HHHHHHHHHTTSS-
T ss_pred CCCHHHHHHHHHCCCCC
Confidence 67889999999999993
Done!