Query psy1727
Match_columns 240
No_of_seqs 215 out of 1710
Neff 8.4
Searched_HMMs 46136
Date Fri Aug 16 17:40:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1727.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1727hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 4.2E-57 9.2E-62 383.6 17.7 210 10-231 62-366 (372)
2 PTZ00203 cathepsin L protease; 100.0 1.9E-51 4E-56 362.1 19.6 210 7-232 26-336 (348)
3 PTZ00021 falcipain-2; Provisio 100.0 3.3E-50 7.2E-55 364.5 18.8 212 11-235 161-486 (489)
4 PTZ00200 cysteine proteinase; 100.0 2.8E-47 6.1E-52 344.1 19.4 206 12-236 119-444 (448)
5 KOG1543|consensus 100.0 3.7E-44 7.9E-49 313.6 17.7 152 62-228 162-316 (325)
6 cd02621 Peptidase_C1A_Cathepsi 100.0 1.1E-40 2.4E-45 282.2 14.4 164 62-235 62-242 (243)
7 cd02698 Peptidase_C1A_Cathepsi 100.0 1.2E-40 2.7E-45 281.1 14.2 157 62-235 61-238 (239)
8 cd02620 Peptidase_C1A_Cathepsi 100.0 1.2E-39 2.7E-44 274.5 13.4 152 62-230 57-232 (236)
9 cd02248 Peptidase_C1A Peptidas 100.0 6.3E-38 1.4E-42 259.2 15.5 151 62-228 51-204 (210)
10 PTZ00049 cathepsin C-like prot 100.0 4.4E-38 9.4E-43 291.8 15.5 166 62-236 446-677 (693)
11 PTZ00364 dipeptidyl-peptidase 100.0 5.8E-38 1.3E-42 287.8 15.4 164 62-235 268-459 (548)
12 PF00112 Peptidase_C1: Papain 100.0 1.1E-36 2.5E-41 252.4 9.8 159 62-235 54-218 (219)
13 PTZ00462 Serine-repeat antigen 100.0 5.9E-34 1.3E-38 271.2 9.9 190 30-229 526-774 (1004)
14 smart00645 Pept_C1 Papain fami 100.0 1.2E-31 2.6E-36 215.9 10.9 120 62-233 52-172 (174)
15 cd02619 Peptidase_C1 C1 Peptid 100.0 3.3E-31 7.1E-36 220.0 12.7 149 62-221 50-213 (223)
16 KOG1544|consensus 100.0 5.3E-30 1.2E-34 216.2 7.8 162 62-229 264-454 (470)
17 cd00585 Peptidase_C1B Peptidas 99.7 1.1E-17 2.4E-22 151.0 8.9 95 119-222 286-401 (437)
18 COG4870 Cysteine protease [Pos 99.6 2.9E-15 6.3E-20 129.6 7.4 136 75-221 168-314 (372)
19 PF03051 Peptidase_C1_2: Pepti 99.0 1.3E-09 2.7E-14 99.0 9.6 94 121-221 289-401 (438)
20 PF08246 Inhibitor_I29: Cathep 98.8 4.5E-09 9.8E-14 69.1 4.1 36 19-54 1-36 (58)
21 smart00848 Inhibitor_I29 Cathe 98.5 1.2E-07 2.6E-12 61.8 4.0 37 19-55 1-37 (57)
22 COG3579 PepC Aminopeptidase C 97.6 7.3E-05 1.6E-09 64.9 4.0 83 133-221 305-403 (444)
23 PF13529 Peptidase_C39_2: Pept 92.1 0.28 6E-06 36.8 4.5 57 128-203 88-144 (144)
24 PF14399 Transpep_BrtH: NlpC/p 88.9 1.6 3.5E-05 37.9 7.1 47 129-185 78-124 (317)
25 PF09778 Guanylate_cyc_2: Guan 88.2 2.2 4.8E-05 35.3 6.9 58 128-185 112-172 (212)
26 PF05543 Peptidase_C47: Stapho 82.2 8 0.00017 31.0 7.2 65 128-220 90-154 (175)
27 KOG4128|consensus 81.7 0.19 4.2E-06 44.0 -2.2 87 129-219 306-413 (457)
28 PTZ00203 cathepsin L protease; 69.2 4.1 8.9E-05 36.4 2.7 36 195-233 287-322 (348)
29 KOG3404|consensus 68.7 7.7 0.00017 29.2 3.6 44 6-49 34-77 (145)
30 KOG1543|consensus 66.9 7.7 0.00017 34.3 3.9 41 193-236 268-308 (325)
31 cd02698 Peptidase_C1A_Cathepsi 66.4 5 0.00011 33.6 2.6 37 195-233 179-215 (239)
32 PTZ00200 cysteine proteinase; 65.4 5.9 0.00013 36.7 3.0 39 195-234 387-425 (448)
33 cd02621 Peptidase_C1A_Cathepsi 64.9 5.9 0.00013 33.2 2.7 39 195-234 187-225 (243)
34 COG4990 Uncharacterized protei 63.3 16 0.00035 29.4 4.7 47 128-204 122-168 (195)
35 smart00645 Pept_C1 Papain fami 62.4 12 0.00026 29.7 4.0 39 194-234 119-157 (174)
36 cd02620 Peptidase_C1A_Cathepsi 59.6 11 0.00023 31.5 3.4 37 196-235 186-222 (236)
37 KOG1542|consensus 53.8 11 0.00024 33.5 2.5 37 195-234 317-354 (372)
38 cd02549 Peptidase_C39A A sub-f 52.9 45 0.00097 24.7 5.6 34 132-183 70-103 (141)
39 PTZ00364 dipeptidyl-peptidase 52.4 13 0.00027 35.4 2.8 37 195-233 403-441 (548)
40 KOG4702|consensus 49.1 34 0.00074 23.0 3.7 32 17-49 29-60 (77)
41 PTZ00021 falcipain-2; Provisio 42.9 24 0.00053 33.1 3.1 41 194-234 420-467 (489)
42 cd02619 Peptidase_C1 C1 Peptid 42.8 25 0.00055 28.3 3.0 29 206-234 179-210 (223)
43 PTZ00049 cathepsin C-like prot 40.3 27 0.00058 34.1 3.0 37 196-233 620-658 (693)
44 PF12385 Peptidase_C70: Papain 38.0 73 0.0016 25.2 4.6 38 129-185 98-135 (166)
45 cd02248 Peptidase_C1A Peptidas 37.7 26 0.00056 28.2 2.3 37 195-234 159-195 (210)
46 PTZ00462 Serine-repeat antigen 33.4 40 0.00087 34.3 3.1 39 196-234 723-763 (1004)
47 cd00044 CysPc Calpains, domain 32.3 75 0.0016 27.7 4.4 30 172-205 234-263 (315)
48 PF03714 PUD: Bacterial pullan 30.1 13 0.00028 26.8 -0.7 41 196-238 61-103 (103)
49 PF00112 Peptidase_C1: Papain 27.4 32 0.00069 27.6 1.2 36 196-234 167-202 (219)
50 PF04928 PAP_central: Poly(A) 22.0 1.6E+02 0.0034 25.2 4.4 39 10-50 17-55 (254)
No 1
>KOG1542|consensus
Probab=100.00 E-value=4.2e-57 Score=383.62 Aligned_cols=210 Identities=46% Similarity=0.812 Sum_probs=186.5
Q ss_pred CChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhc---CCc------------------------------
Q psy1727 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR---GEG------------------------------ 56 (240)
Q Consensus 10 ~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~---n~g------------------------------ 56 (240)
+..+..+++|..|+.+|+|+|.+.+|..+|+.||++|+..++++ ++|
T Consensus 62 ~~~l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~ 141 (372)
T KOG1542|consen 62 PRGLGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSK 141 (372)
T ss_pred CcccchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhcccccccc
Confidence 33455589999999999999999999999999999999999543 232
Q ss_pred --------------------------------------------ceece-----------------eEecCCCCCCCCCC
Q psy1727 57 --------------------------------------------THLAL-----------------KLVDCDKVDAGCGG 75 (240)
Q Consensus 57 --------------------------------------------~~g~l-----------------qlvDCs~~~~gC~G 75 (240)
+||+| |||||+.++.||+|
T Consensus 142 ~~~~~~~~~~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~G 221 (372)
T KOG1542|consen 142 LPGDAAEAPIEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCNG 221 (372)
T ss_pred CccccccCcCCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCCC
Confidence 24443 99999999999999
Q ss_pred CccchHHHHHHHhcCCCccCCcccccccCCC-cccCCcccceeEEeEeeeechhHHHHHHHHHhCCCeEEEEeccccccc
Q psy1727 76 GLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFY 154 (240)
Q Consensus 76 G~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~-~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~f~~Y 154 (240)
|.+..||+|++ +. +||..|.+|||++..+ .|..++....+.|.+|..++.||++|.++|.++|||+|+|++..+|+|
T Consensus 222 Gl~~nA~~~~~-~~-gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~Y 299 (372)
T KOG1542|consen 222 GLMDNAFKYIK-KA-GGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFY 299 (372)
T ss_pred CChhHHHHHHH-Hh-CCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHh
Confidence 99999999977 66 7999999999999997 999999999999999999999999999999999999999998889999
Q ss_pred cCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECCCCCccccceeE
Q psy1727 155 FGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTK 231 (240)
Q Consensus 155 ~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~ 231 (240)
.+||+.+....|+. ..++|+|||||||... -.++|||||||||+.|||+| ||||.||. +.||++..+
T Consensus 300 rgGV~~P~~~~Cs~--~~~~HaVLlvGyG~~g-----~~~PYWIVKNSWG~~WGE~G--Y~~l~RG~-N~CGi~~mv 366 (372)
T KOG1542|consen 300 RGGVSCPSKYICSP--KLLNHAVLLVGYGSSG-----YEKPYWIVKNSWGTSWGEKG--YYKLCRGS-NACGIADMV 366 (372)
T ss_pred cccccCCCcccCCc--cccCceEEEEeecCCC-----CCCceEEEECCccccccccc--eEEEeccc-cccccccch
Confidence 99999995558996 5599999999999882 27899999999999999999 99999996 567777443
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00 E-value=1.9e-51 Score=362.13 Aligned_cols=210 Identities=37% Similarity=0.657 Sum_probs=174.0
Q ss_pred CCCCChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC-------c-----------------------
Q psy1727 7 PHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE-------G----------------------- 56 (240)
Q Consensus 7 ~~~~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~-------g----------------------- 56 (240)
+..+-+++++.+|++||++|+|.|.+.+|+.+||+||++|+++|+++|. |
T Consensus 26 ~~~~~~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~ 105 (348)
T PTZ00203 26 RAIYVGTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAARYLNGAAYF 105 (348)
T ss_pred hhcccccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEeccccccCCHHHHHHHhcCCCccc
Confidence 3344467778999999999999999988999999999999999987763 1
Q ss_pred -------------------------------------------------ceece-----------------eEecCCCCC
Q psy1727 57 -------------------------------------------------THLAL-----------------KLVDCDKVD 70 (240)
Q Consensus 57 -------------------------------------------------~~g~l-----------------qlvDCs~~~ 70 (240)
++++| |||||+..+
T Consensus 106 ~~~~~~~~~~~~~~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~~ 185 (348)
T PTZ00203 106 AAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVD 185 (348)
T ss_pred ccccccccccccccccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccCCC
Confidence 01111 999999878
Q ss_pred CCCCCCccchHHHHHHHhc-CCCccCCcccccccCCC---cccCCcc-cceeEEeEeeeechhHHHHHHHHHhCCCeEEE
Q psy1727 71 AGCGGGLMSNAFETIISKL-GGGLEGEKDYPYKGSNR---ACHLNKE-EIRVKIQSYVNVSSDETEMAKYLVKNGPMAVA 145 (240)
Q Consensus 71 ~gC~GG~~~~A~~yi~~~~-~~Gi~~e~~yPY~~~~~---~C~~~~~-~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v~ 145 (240)
.||+||++..||+|++ ++ .+|+++|++|||.+.++ .|..... ...+++.+|..++.++++|+.+|+++|||+|+
T Consensus 186 ~GC~GG~~~~a~~yi~-~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~ 264 (348)
T PTZ00203 186 NGCGGGLMLQAFEWVL-RNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIA 264 (348)
T ss_pred CCCCCCCHHHHHHHHH-HhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecCcCHHHHHHHHHhCCCEEEE
Confidence 8999999999999997 42 14689999999998765 6874332 23567888888877888999999999999999
Q ss_pred EeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECCCCCcc
Q psy1727 146 INANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRW 225 (240)
Q Consensus 146 ~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~ 225 (240)
|++..|++|++|||+ .|.. ..++|+|+|||||.+ +|++|||||||||++|||+| ||||+||.| .|
T Consensus 265 i~a~~f~~Y~~GIy~----~c~~--~~~nHaVliVGYG~~------~g~~YWiikNSWG~~WGe~G--Y~ri~rg~n-~C 329 (348)
T PTZ00203 265 VDASSFMSYHSGVLT----SCIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKG--YVRVTMGVN-AC 329 (348)
T ss_pred EEhhhhcCccCceee----ccCC--CCCCeEEEEEEEecC------CCceEEEEEcCCCCCcCcCc--eEEEEcCCC-cc
Confidence 999779999999995 3764 457999999999987 68999999999999999999 999999975 57
Q ss_pred ccceeEE
Q psy1727 226 GEQVTKS 232 (240)
Q Consensus 226 g~~~~~~ 232 (240)
|+.....
T Consensus 330 gi~~~~~ 336 (348)
T PTZ00203 330 LLTGYPV 336 (348)
T ss_pred cccceEE
Confidence 7764443
No 3
>PTZ00021 falcipain-2; Provisional
Probab=100.00 E-value=3.3e-50 Score=364.53 Aligned_cols=212 Identities=36% Similarity=0.668 Sum_probs=175.9
Q ss_pred ChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC--------c-------------------c------
Q psy1727 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE--------G-------------------T------ 57 (240)
Q Consensus 11 ~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~--------g-------------------~------ 57 (240)
.+++...+|++|+++|+|+|.+.+|+..||.||++|+++|+++|. | .
T Consensus 161 ~n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~ 240 (489)
T PTZ00021 161 TNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKS 240 (489)
T ss_pred cChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhcccccccccc
Confidence 356677889999999999999999999999999999999987663 1 0
Q ss_pred -------------------------------------------------------eece-----------------eEec
Q psy1727 58 -------------------------------------------------------HLAL-----------------KLVD 65 (240)
Q Consensus 58 -------------------------------------------------------~g~l-----------------qlvD 65 (240)
+++| ||||
T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVD 320 (489)
T PTZ00021 241 NGKKSPRVINYDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVD 320 (489)
T ss_pred ccccccccccccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhh
Confidence 0000 8999
Q ss_pred CCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccC-CCcccCCcccceeEEeEeeeechhHHHHHHHHHhCCCeEE
Q psy1727 66 CDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAV 144 (240)
Q Consensus 66 Cs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~-~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v 144 (240)
|+..+.||+||++..||+|+. ++ +||++|++|||.+. .+.|........++|.+|..++ +++|+++|+..|||+|
T Consensus 321 Cs~~n~GC~GG~~~~Af~yi~-~~-gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~--~~~lk~al~~~GPVsv 396 (489)
T PTZ00021 321 CSFKNNGCYGGLIPNAFEDMI-EL-GGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIP--EDKFKEAIRFLGPISV 396 (489)
T ss_pred hccCCCCCCCcchHhhhhhhh-hc-cccCcccccCccCCCCCccccccccccceeeeEEEec--HHHHHHHHHhcCCeEE
Confidence 998889999999999999998 66 69999999999987 4789876666678899998886 5689999999999999
Q ss_pred EEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccc----cccCCcCEEEEecccCCCcCCCCcceEEEEC
Q psy1727 145 AINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTK----FTHKIQPYWIIKNSWGPHWGEKTMPFWIIKN 219 (240)
Q Consensus 145 ~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~----~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~ 219 (240)
+|++.. |++|++|||++. |+ ..++|||+|||||++... ....+.+|||||||||+.|||+| ||||+|
T Consensus 397 ~i~a~~~f~~YkgGIy~~~---C~---~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~G--Y~rI~r 468 (489)
T PTZ00021 397 SIAVSDDFAFYKGGIFDGE---CG---EEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKG--FIRIET 468 (489)
T ss_pred EEEeecccccCCCCcCCCC---CC---CccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCe--EEEEEc
Confidence 999964 999999999764 87 458999999999975210 01124689999999999999999 999999
Q ss_pred CCC---CccccceeEEecC
Q psy1727 220 SWG---PRWGEQVTKSIYS 235 (240)
Q Consensus 220 ~~~---~~~g~~~~~~~~~ 235 (240)
+.+ +.||+.. .+.||
T Consensus 469 ~~~g~~n~CGI~t-~a~yP 486 (489)
T PTZ00021 469 DENGLMKTCSLGT-EAYVP 486 (489)
T ss_pred CCCCCCCCCCCcc-cceeE
Confidence 975 6899974 44554
No 4
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00 E-value=2.8e-47 Score=344.07 Aligned_cols=206 Identities=30% Similarity=0.519 Sum_probs=167.0
Q ss_pred hhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC------c---------------------c-e-----
Q psy1727 12 KLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE------G---------------------T-H----- 58 (240)
Q Consensus 12 ~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~------g---------------------~-~----- 58 (240)
+++...+|++|+++|+|.|.+.+|+..||.||++|+++|+++|. | . .
T Consensus 119 e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~ 198 (448)
T PTZ00200 119 EFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTS 198 (448)
T ss_pred hHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCcCCeEEeccccccCCHHHHHHHhccCCCcccccccc
Confidence 34445789999999999999999999999999999999976663 1 0 0
Q ss_pred --------------------------e------c-e--------------------------------------------
Q psy1727 59 --------------------------L------A-L-------------------------------------------- 61 (240)
Q Consensus 59 --------------------------g------~-l-------------------------------------------- 61 (240)
. . +
T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~~ 278 (448)
T PTZ00200 199 HNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDK 278 (448)
T ss_pred cccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHHHHhcCC
Confidence 0 0 1
Q ss_pred -------eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeechhHHHHHH
Q psy1727 62 -------KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAK 134 (240)
Q Consensus 62 -------qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~ 134 (240)
|||||+..+.||+||++..||+|++ ++ ||++|++|||.+..+.|..... ..+.|.+|..++ +.+.+++
T Consensus 279 ~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~-~~--Gi~~e~~YPY~~~~~~C~~~~~-~~~~i~~y~~~~-~~~~l~~ 353 (448)
T PTZ00200 279 SVDLSEQELVNCDTKSQGCSGGYPDTALEYVK-NK--GLSSSSDVPYLAKDGKCVVSST-KKVYIDSYLVAK-GKDVLNK 353 (448)
T ss_pred CeecCHHHHhhccCccCCCCCCcHHHHHHHHh-hc--CccccccCCCCCCCCCCcCCCC-CeeEecceEecC-HHHHHHH
Confidence 8999998789999999999999998 76 9999999999999999986543 345678887654 3444555
Q ss_pred HHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcc
Q psy1727 135 YLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMP 213 (240)
Q Consensus 135 ~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~ 213 (240)
++ .+|||+|+|++.. |++|++|||++. |.. .++|||+|||||.+.. +|.+|||||||||++|||+|
T Consensus 354 ~l-~~GPV~v~i~~~~~f~~Yk~GIy~~~---C~~---~~nHaV~lVGyG~d~~----~g~~YWIIkNSWG~~WGe~G-- 420 (448)
T PTZ00200 354 SL-VISPTVVYIAVSRELLKYKSGVYNGE---CGK---SLNHAVLLVGEGYDEK----TKKRYWIIKNSWGTDWGENG-- 420 (448)
T ss_pred HH-hcCCEEEEeecccccccCCCCccccc---cCC---CCcEEEEEEEecccCC----CCCceEEEEcCCCCCcccCe--
Confidence 54 6899999999975 999999999764 874 4899999999996421 57899999999999999999
Q ss_pred eEEEECCC--CCccccceeEEecCC
Q psy1727 214 FWIIKNSW--GPRWGEQVTKSIYSS 236 (240)
Q Consensus 214 y~~i~~~~--~~~~g~~~~~~~~~~ 236 (240)
||||+|+. .+.||+.. .+.||.
T Consensus 421 Y~ri~r~~~g~n~CGI~~-~~~~P~ 444 (448)
T PTZ00200 421 YMRLERTNEGTDKCGILT-VGLTPV 444 (448)
T ss_pred eEEEEeCCCCCCcCCccc-cceeeE
Confidence 99999974 46799874 445553
No 5
>KOG1543|consensus
Probab=100.00 E-value=3.7e-44 Score=313.64 Aligned_cols=152 Identities=40% Similarity=0.689 Sum_probs=137.9
Q ss_pred eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccC-CcccccccCCCcccCCcccceeEEeEeeeechhHHHHHHHHHhC
Q psy1727 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEG-EKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKN 139 (240)
Q Consensus 62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~-e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~ 139 (240)
|||||+. ++.||+||++..||+|++ ++ |+++ +.+|||.+..+.|..+.....+.+.++..++.++.+|+.+|+.+
T Consensus 162 ~lvdC~~~~~~GC~GG~~~~A~~yi~-~~--G~~t~~~~Ypy~~~~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~ 238 (325)
T KOG1543|consen 162 DLVDCCGECGDGCNGGEPKNAFKYIK-KN--GGVTECENYPYIGKDGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKN 238 (325)
T ss_pred hhhhccCCCCCCcCCCCHHHHHHHHH-Hh--CCCCCCcCCCCcCCCCCccCCCccceeEeeeeeecCcCHHHHHHHHHhc
Confidence 9999998 589999999999999999 87 6666 99999999999999888777888899999998999999999999
Q ss_pred CCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEE
Q psy1727 140 GPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIK 218 (240)
Q Consensus 140 GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~ 218 (240)
|||+|+|++.. |++|++|||.++ .|.. ..++|||+|||||+. ++.+|||||||||+.|||+| ||||.
T Consensus 239 GPv~v~~~a~~~F~~Y~~GVy~~~--~~~~--~~~~Hav~iVGyG~~------~~~~YWivkNSWG~~WGe~G--y~ri~ 306 (325)
T KOG1543|consen 239 GPVSVAIDAYEDFSLYKGGVYAEE--KGDD--KEGDHAVLIVGYGTG------DGVDYWIVKNSWGTDWGEKG--YFRIA 306 (325)
T ss_pred CCeEEEEeehhhhhhccCceEeCC--CCCC--CCCCceEEEEEEcCC------CCceeEEEEcCCCCCcccCc--eEEEe
Confidence 99999999987 999999999988 5663 159999999999993 78999999999999999999 99999
Q ss_pred CCCCCccccc
Q psy1727 219 NSWGPRWGEQ 228 (240)
Q Consensus 219 ~~~~~~~g~~ 228 (240)
|+.+.++++.
T Consensus 307 r~~~~~~I~~ 316 (325)
T KOG1543|consen 307 RGVNKCGIAS 316 (325)
T ss_pred cCCCchhhhc
Confidence 9998777443
No 6
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00 E-value=1.1e-40 Score=282.18 Aligned_cols=164 Identities=31% Similarity=0.620 Sum_probs=129.9
Q ss_pred eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCccccccc-CCCcccCCc-ccceeEEeEeeee-----chhHHHHHH
Q psy1727 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNK-EEIRVKIQSYVNV-----SSDETEMAK 134 (240)
Q Consensus 62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~-~~~~C~~~~-~~~~~~i~~~~~v-----~~~~~~i~~ 134 (240)
||+||+..+.||+||++..|++|++ +. ||++|++|||+. ..+.|.... ....+++..|..+ ..++++|++
T Consensus 62 ~l~dC~~~~~GC~GG~~~~a~~~~~-~~--Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ik~ 138 (243)
T cd02621 62 HVLSCSQYSQGCDGGFPFLVGKFAE-DF--GIVTEDYFPYTADDDRPCKASPSECRRYYFSDYNYVGGCYGCTNEDEMKW 138 (243)
T ss_pred HhhhhcCCCCCCCCCCHHHHHHHHH-hc--CcCCCceeCCCCCCCCCCCCCccccccccccceeEcccccccCCHHHHHH
Confidence 8999998788999999999999998 77 999999999998 678898654 3333444444443 236889999
Q ss_pred HHHhCCCeEEEEeccc-cccccCCcccCCC--cccCCC------CCCCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727 135 YLVKNGPMAVAINANA-MQFYFGGVSHPLK--FLCKGG------MDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP 205 (240)
Q Consensus 135 ~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~--~~c~~~------~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~ 205 (240)
+|+++|||+++|++.. |++|++|||+... ..|... ...++|+|+|||||++.. ++.+|||||||||+
T Consensus 139 ~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~----~g~~YWiirNSWG~ 214 (243)
T cd02621 139 EIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI----KGEKYWIVKNSWGS 214 (243)
T ss_pred HHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC----CCCcEEEEEcCCCC
Confidence 9999999999999975 9999999998751 126531 024799999999998720 27899999999999
Q ss_pred CcCCCCcceEEEECCCCCcccc-ceeEEecC
Q psy1727 206 HWGEKTMPFWIIKNSWGPRWGE-QVTKSIYS 235 (240)
Q Consensus 206 ~WG~~G~~y~~i~~~~~~~~g~-~~~~~~~~ 235 (240)
.|||+| ||||+|+.| .||+ ......+|
T Consensus 215 ~WGe~G--y~~i~~~~~-~cgi~~~~~~~~~ 242 (243)
T cd02621 215 SWGEKG--YFKIRRGTN-ECGIESQAVFAYP 242 (243)
T ss_pred CCCcCC--eEEEecCCc-ccCcccceEeecc
Confidence 999999 999999975 5566 44444444
No 7
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00 E-value=1.2e-40 Score=281.10 Aligned_cols=157 Identities=27% Similarity=0.482 Sum_probs=131.2
Q ss_pred eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCC---------------cccceeEEeEeeeec
Q psy1727 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN---------------KEEIRVKIQSYVNVS 126 (240)
Q Consensus 62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~---------------~~~~~~~i~~~~~v~ 126 (240)
|||||+. +.||+||++..||+|++ ++ |+++|++|||...++.|... +....+++.+|..++
T Consensus 61 ~lldC~~-~~gC~GG~~~~a~~~~~-~~--Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~ 136 (239)
T cd02698 61 VVIDCAG-GGSCHGGDPGGVYEYAH-KH--GIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVS 136 (239)
T ss_pred HHHhCCC-CCCccCcCHHHHHHHHH-Hc--CcCCCCeeCCcCCCCCCcCCCCCCCcccCcccccccccceEEeeeceecC
Confidence 8999998 78999999999999998 77 99999999999877666521 112346677777775
Q ss_pred hhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727 127 SDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP 205 (240)
Q Consensus 127 ~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~ 205 (240)
++++|+++|+++|||+|+|.+.. |+.|++|||+.. .|. ..++|+|+|||||++. ++++|||||||||+
T Consensus 137 -~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~--~~~---~~~~HaV~IVGyG~~~-----~g~~YWiikNSWG~ 205 (239)
T cd02698 137 -GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEY--VQD---PLINHIISVAGWGVDE-----NGVEYWIVRNSWGE 205 (239)
T ss_pred -CHHHHHHHHHHcCCEEEEEEecccccccCCeEEccC--CCC---CcCCeEEEEEEEEecC-----CCCEEEEEEcCCCc
Confidence 57889999999999999999975 999999999876 454 5689999999999872 38899999999999
Q ss_pred CcCCCCcceEEEECCC-----CCccccceeEEecC
Q psy1727 206 HWGEKTMPFWIIKNSW-----GPRWGEQVTKSIYS 235 (240)
Q Consensus 206 ~WG~~G~~y~~i~~~~-----~~~~g~~~~~~~~~ 235 (240)
+|||+| ||||+|+. |++++|...+...|
T Consensus 206 ~WGe~G--y~~i~rg~~~~~~~~~~i~~~~~~~~~ 238 (239)
T cd02698 206 PWGERG--WFRIVTSSYKGARYNLAIEEDCAWADP 238 (239)
T ss_pred ccCcCc--eEEEEccCCcccccccccccceEEEee
Confidence 999999 99999998 88877764444433
No 8
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00 E-value=1.2e-39 Score=274.50 Aligned_cols=152 Identities=30% Similarity=0.536 Sum_probs=125.1
Q ss_pred eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCc------------------ccCCcc----cceeE
Q psy1727 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA------------------CHLNKE----EIRVK 118 (240)
Q Consensus 62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~------------------C~~~~~----~~~~~ 118 (240)
|||||+.. +.||+||++..||+|++ ++ |+++|++|||...+.. |..... ...++
T Consensus 57 ~lidC~~~~~~gC~GG~~~~a~~~i~-~~--G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~ 133 (236)
T cd02620 57 DLLSCCSGCGDGCNGGYPDAAWKYLT-TT--GVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHK 133 (236)
T ss_pred HHHhhcCCCCCCCCCCCHHHHHHHHH-hc--CCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeee
Confidence 99999985 88999999999999998 77 9999999999876532 432211 12234
Q ss_pred EeEeeeechhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEE
Q psy1727 119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYW 197 (240)
Q Consensus 119 i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yW 197 (240)
+..+..+..++++|+++|+++|||+|+|.+.. |++|++|||+.. |.. ..++|||+|||||++ ++++||
T Consensus 134 ~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~---~~~--~~~~HaV~iVGyg~~------~g~~YW 202 (236)
T cd02620 134 GKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHT---SGK--QLGGHAVKIIGWGVE------NGVPYW 202 (236)
T ss_pred ecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeec---CCC--CcCCeEEEEEEEecc------CCeeEE
Confidence 45555565578999999999999999999965 999999999754 543 457999999999987 688999
Q ss_pred EEecccCCCcCCCCcceEEEECCCCCcccccee
Q psy1727 198 IIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVT 230 (240)
Q Consensus 198 ivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~ 230 (240)
|||||||+.|||+| ||||+|+. +.||+...
T Consensus 203 ivrNSWG~~WGe~G--y~ri~~~~-~~cgi~~~ 232 (236)
T cd02620 203 LAANSWGTDWGENG--YFRILRGS-NECGIESE 232 (236)
T ss_pred EEEeCCCCCCCCCc--EEEEEccC-cccccccc
Confidence 99999999999999 99999997 57787743
No 9
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00 E-value=6.3e-38 Score=259.20 Aligned_cols=151 Identities=43% Similarity=0.823 Sum_probs=135.1
Q ss_pred eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeech-hHHHHHHHHHhC
Q psy1727 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKN 139 (240)
Q Consensus 62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~-~~~~i~~~l~~~ 139 (240)
+|+||+.. +.+|.||.+..||+++. +. |+++|++|||......|........+++.+|..++. ++++||++|+++
T Consensus 51 ~l~~c~~~~~~gC~GG~~~~a~~~~~-~~--Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~i~~~~~~~ik~~l~~~ 127 (210)
T cd02248 51 QLVDCSTSGNNGCNGGNPDNAFEYVK-NG--GLASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANY 127 (210)
T ss_pred HHhccCCCCCCCCCCCCHHHhHHHHH-HC--CcCccccCCccCCCCCccCCCCcccEEEeeEEEcCCCcHHHHHHHHhhc
Confidence 89999985 78999999999999998 76 999999999999888998776667889999998876 488999999999
Q ss_pred CCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEE
Q psy1727 140 GPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIK 218 (240)
Q Consensus 140 GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~ 218 (240)
|||++++.+.. |+.|++|||..+ .|.. ..++|||+|||||++ .+.+|||||||||+.||++| ||||+
T Consensus 128 gPV~~~~~~~~~f~~y~~Giy~~~--~~~~--~~~~Hav~iVGy~~~------~~~~ywiv~NSWG~~WG~~G--y~~i~ 195 (210)
T cd02248 128 GPVSVAIDASSSFQFYKGGIYSGP--CCSN--TNLNHAVLLVGYGTE------NGVDYWIVKNSWGTSWGEKG--YIRIA 195 (210)
T ss_pred CCEEEEEecCcccccCCCCceeCC--CCCC--CcCCEEEEEEEEeec------CCceEEEEEcCCCCccccCc--EEEEE
Confidence 99999999975 999999999876 5532 568999999999998 57899999999999999999 99999
Q ss_pred CCCCCccccc
Q psy1727 219 NSWGPRWGEQ 228 (240)
Q Consensus 219 ~~~~~~~g~~ 228 (240)
++. +.||+.
T Consensus 196 ~~~-~~cgi~ 204 (210)
T cd02248 196 RGS-NLCGIA 204 (210)
T ss_pred cCC-CccCce
Confidence 987 578887
No 10
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00 E-value=4.4e-38 Score=291.83 Aligned_cols=166 Identities=23% Similarity=0.469 Sum_probs=128.9
Q ss_pred eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCccc---------------------------
Q psy1727 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEE--------------------------- 114 (240)
Q Consensus 62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~--------------------------- 114 (240)
|||||+..++||+||++..|++|++ +. ||++|.+|||++..+.|......
T Consensus 446 qLLDCs~~nqGC~GG~~~~A~kya~-~~--GI~tEscYPY~a~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 522 (693)
T PTZ00049 446 TVLSCSFYDQGCNGGFPYLVSKMAK-LQ--GIPLDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQSDM 522 (693)
T ss_pred HhcccCCCCCCcCCCcHHHHHHHHH-HC--CCCcCCccCCcCCCCCCCCCCCCccccccccccccccccccccccccccc
Confidence 8999998889999999999999998 77 99999999999888888642111
Q ss_pred ------------ceeEEeEeeeec--------hhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCC----cccCCC
Q psy1727 115 ------------IRVKIQSYVNVS--------SDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLK----FLCKGG 169 (240)
Q Consensus 115 ------------~~~~i~~~~~v~--------~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~----~~c~~~ 169 (240)
..+.+.+|..+. .++++|+++|+++|||+|+|++.. |++|++|||+... ..|..+
T Consensus 523 ~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d 602 (693)
T PTZ00049 523 HADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVD 602 (693)
T ss_pred cccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCc
Confidence 123345555553 258899999999999999999975 9999999998531 137532
Q ss_pred ------------CCCCCccEEEEEEeecccccccCC--cCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727 170 ------------MDNLDHGVLIVGYGVHKTKFTHKI--QPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS 235 (240)
Q Consensus 170 ------------~~~~~Hav~iVGyg~~~~~~~~~g--~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~ 235 (240)
...++|||+|||||.+.. +| .+|||||||||+.||++| ||||+||.|.|.++.....+.|
T Consensus 603 ~~~~~~~~~~~G~e~~NHAVlIVGwG~d~e----nG~~~~YWIVRNSWGt~WGenG--YfKI~RG~N~CGIEs~a~~~~p 676 (693)
T PTZ00049 603 LPKHNGVYNITGWEKVNHAIVLVGWGEEEI----NGKLYKYWIGRNSWGKNWGKEG--YFKIIRGKNFSGIESQSLFIEP 676 (693)
T ss_pred cccccccccccccccCceEEEEEEeccccC----CCcccCEEEEECCCCCCcccCc--eEEEEcCCCccCCccceeEEee
Confidence 124799999999997521 34 489999999999999999 9999999865544454444444
Q ss_pred C
Q psy1727 236 S 236 (240)
Q Consensus 236 ~ 236 (240)
+
T Consensus 677 d 677 (693)
T PTZ00049 677 D 677 (693)
T ss_pred e
Confidence 3
No 11
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00 E-value=5.8e-38 Score=287.77 Aligned_cols=164 Identities=21% Similarity=0.464 Sum_probs=131.0
Q ss_pred eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCccc--ccccCCC---cccCCcccceeEEeE------eeeechhHH
Q psy1727 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY--PYKGSNR---ACHLNKEEIRVKIQS------YVNVSSDET 130 (240)
Q Consensus 62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~y--PY~~~~~---~C~~~~~~~~~~i~~------~~~v~~~~~ 130 (240)
|||||+..+.||+||++..|++|++ ++ ||++|++| ||++..+ .|+.......+.+.+ |..+..+++
T Consensus 268 qLVDCs~~n~GCdGG~p~~A~~yi~-~~--GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~ 344 (548)
T PTZ00364 268 HVLDCSQYGQGCAGGFPEEVGKFAE-TF--GILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPD 344 (548)
T ss_pred HHhcccCCCCCCCCCcHHHHHHHHH-hC--CcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHH
Confidence 8999998889999999999999998 77 99999999 9987665 587654444444444 333344688
Q ss_pred HHHHHHHhCCCeEEEEeccc-cccccCCcccCC------CcccCCC--------CCCCCccEEEEEEeecccccccCCcC
Q psy1727 131 EMAKYLVKNGPMAVAINANA-MQFYFGGVSHPL------KFLCKGG--------MDNLDHGVLIVGYGVHKTKFTHKIQP 195 (240)
Q Consensus 131 ~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~------~~~c~~~--------~~~~~Hav~iVGyg~~~~~~~~~g~~ 195 (240)
+|+++|+++|||+|+|++.. |+.|++|||... ...|... ...++|||+|||||.+. +|.+
T Consensus 345 ~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de-----~G~~ 419 (548)
T PTZ00364 345 EIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE-----NGGD 419 (548)
T ss_pred HHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC-----CCce
Confidence 99999999999999999975 889999998621 0012110 13579999999999753 6889
Q ss_pred EEEEecccCC--CcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727 196 YWIIKNSWGP--HWGEKTMPFWIIKNSWGPRWGEQVTKSIYS 235 (240)
Q Consensus 196 yWivkNSWG~--~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~ 235 (240)
|||||||||+ +|||+| ||||+||.|+|.++...+.++|
T Consensus 420 YWIVKNSWGt~~~WGE~G--YfRI~RG~N~CGIes~~v~~~~ 459 (548)
T PTZ00364 420 YWLVLDPWGSRRSWCDGG--TRKIARGVNAYNIESEVVVMYW 459 (548)
T ss_pred EEEEECCCCCCCCcccCC--eEEEEcCCCcccccceeeeeee
Confidence 9999999999 999999 9999999987777777777776
No 12
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00 E-value=1.1e-36 Score=252.40 Aligned_cols=159 Identities=34% Similarity=0.719 Sum_probs=136.1
Q ss_pred eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccCCcccccccCC-CcccCCcccc-eeEEeEeeeechh-HHHHHHHHH
Q psy1727 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEI-RVKIQSYVNVSSD-ETEMAKYLV 137 (240)
Q Consensus 62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~-~~C~~~~~~~-~~~i~~~~~v~~~-~~~i~~~l~ 137 (240)
+|+||+. .+.+|+||++..|+++++ ++ .|+++|.+|||.... ..|....... .+++..|..+..+ .++|+++|.
T Consensus 54 ~l~~~~~~~~~~c~gg~~~~a~~~~~-~~-~Gi~~e~~~pY~~~~~~~c~~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~ 131 (219)
T PF00112_consen 54 YLIDCSNKYNKGCDGGSPFDALKYIK-NN-NGIVTEEDYPYNGNENPTCKSKKSNSYYVKIKGYGKVKDNDIEDIKKALM 131 (219)
T ss_dssp HHHHHSTGTSSTTBBBEHHHHHHHHH-HH-TSBEBTTTS--SSSSSCSSCHSGGGEEEBEESEEEEEESTCHHHHHHHHH
T ss_pred cccccccccccccccCcccccceeec-cc-CcccccccccccccccccccccccccccccccccccccccchhHHHHHHh
Confidence 7999998 678999999999999999 82 399999999999887 7898765543 4788888888764 899999999
Q ss_pred hCCCeEEEEeccc--cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceE
Q psy1727 138 KNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFW 215 (240)
Q Consensus 138 ~~GPv~v~~~~~~--f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~ 215 (240)
++|||++++.+.. |+.|++|||..+ .|.. ..++|+|+|||||++ .+++|||||||||+.||++| ||
T Consensus 132 ~~gpV~~~~~~~~~~f~~~~~gi~~~~--~~~~--~~~~Hav~iVGy~~~------~~~~~wiv~NSWG~~WG~~G--y~ 199 (219)
T PF00112_consen 132 KYGPVVASIDVSSEDFQNYKSGIYDPP--DCSN--ESGGHAVLIVGYDDE------NGKGYWIVKNSWGTDWGDNG--YF 199 (219)
T ss_dssp HHSSEEEEEEEESHHHHTEESSEECST--SSSS--SSEEEEEEEEEEEEE------TTEEEEEEE-SBTTTSTBTT--EE
T ss_pred hCceeeeeeeccccccccccceeeecc--cccc--ccccccccccccccc------cceeeEeeehhhCCccCCCe--EE
Confidence 9999999999865 999999999987 6775 578999999999998 68999999999999999999 99
Q ss_pred EEECCCCCccccceeEEecC
Q psy1727 216 IIKNSWGPRWGEQVTKSIYS 235 (240)
Q Consensus 216 ~i~~~~~~~~g~~~~~~~~~ 235 (240)
||+|+.++.||+.. .++||
T Consensus 200 ~i~~~~~~~c~i~~-~~~~~ 218 (219)
T PF00112_consen 200 RISYDYNNECGIES-QAVYP 218 (219)
T ss_dssp EEESSSSSGGGTTS-SEEEE
T ss_pred EEeeCCCCcCccCc-eeeec
Confidence 99999988999984 44444
No 13
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00 E-value=5.9e-34 Score=271.24 Aligned_cols=190 Identities=25% Similarity=0.470 Sum_probs=136.7
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCc---------ceece-----------------eEecCCC--CCCCCCCCccch-
Q psy1727 30 YATKEEYHKRLRIFRANLKKIQIRGEG---------THLAL-----------------KLVDCDK--VDAGCGGGLMSN- 80 (240)
Q Consensus 30 Y~~~~E~~~R~~iF~~n~~~i~~~n~g---------~~g~l-----------------qlvDCs~--~~~gC~GG~~~~- 80 (240)
|-+ .|+-.|++-...+...|...++| ++++| ||+||+. .+.||.||+...
T Consensus 526 ~Cn-~~yCdR~kD~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~e 604 (1004)
T PTZ00462 526 FCN-KEFCNRLKDENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLE 604 (1004)
T ss_pred CCC-hhhccccccCCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHH
Confidence 543 57888887655555555555664 11111 9999986 368999998554
Q ss_pred HHHHHHHhcCCCccCCccccccc--CCCcccCCcc------------------cceeEEeEeeeechh---------HHH
Q psy1727 81 AFETIISKLGGGLEGEKDYPYKG--SNRACHLNKE------------------EIRVKIQSYVNVSSD---------ETE 131 (240)
Q Consensus 81 A~~yi~~~~~~Gi~~e~~yPY~~--~~~~C~~~~~------------------~~~~~i~~~~~v~~~---------~~~ 131 (240)
++.|+. ++ +||++|++|||.. ..+.|+.... ...+.+.+|..+... +++
T Consensus 605 fl~yI~-e~-GgLptESdYPYt~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~ 682 (1004)
T PTZ00462 605 FLQIIE-DN-GFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKI 682 (1004)
T ss_pred HHHHHH-Hc-CCCcccccCCCccCCCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHH
Confidence 558887 76 5699999999975 4567874321 012334456555321 468
Q ss_pred HHHHHHhCCCeEEEEeccccccc-cCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCC
Q psy1727 132 MAKYLVKNGPMAVAINANAMQFY-FGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210 (240)
Q Consensus 132 i~~~l~~~GPv~v~~~~~~f~~Y-~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~ 210 (240)
|+++|++.|||+|+|++..|+.| ++|||... .|+. ..++|||+|||||.+.. ....+++|||||||||+.|||+
T Consensus 683 IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv~~--~Cgs--~~~nHAVlIVGYGt~in-~eg~gk~YWIVRNSWGt~WGEn 757 (1004)
T PTZ00462 683 IKDEIMNKGSVIAYIKAENVLGYEFNGKKVQN--LCGD--DTADHAVNIVGYGNYIN-DEDEKKSYWIVRNSWGKYWGDE 757 (1004)
T ss_pred HHHHHHhcCCEEEEEEeehHHhhhcCCccccC--CCCC--CcCCceEEEEEeccccc-ccCCCCceEEEEcCCCCCcCCC
Confidence 99999999999999998777788 58998765 6985 45799999999997520 0013679999999999999999
Q ss_pred CcceEEEECCCCCccccce
Q psy1727 211 TMPFWIIKNSWGPRWGEQV 229 (240)
Q Consensus 211 G~~y~~i~~~~~~~~g~~~ 229 (240)
| ||||+|...+.||+..
T Consensus 758 G--YFKI~r~g~n~CGin~ 774 (1004)
T PTZ00462 758 G--YFKVDMYGPSHCEDNF 774 (1004)
T ss_pred e--EEEEEeCCCCCCccch
Confidence 9 9999985456788753
No 14
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.97 E-value=1.2e-31 Score=215.95 Aligned_cols=120 Identities=47% Similarity=0.870 Sum_probs=100.2
Q ss_pred eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeechhHHHHHHHHHhCC
Q psy1727 62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNG 140 (240)
Q Consensus 62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~G 140 (240)
+|+||+.. +.||+||++..|++|++ ++ .|+++|++|||+.
T Consensus 52 ~l~~C~~~~~~gC~GG~~~~a~~~~~-~~-~Gi~~e~~~PY~~------------------------------------- 92 (174)
T smart00645 52 QLVDCSTGGNNGCNGGLPDNAFEYIK-KN-GGLETESCYPYTG------------------------------------- 92 (174)
T ss_pred HHhhhcCCCCCCCCCcCHHHHHHHHH-Hc-CCcccccccCccc-------------------------------------
Confidence 89999874 67999999999999998 64 5999999999975
Q ss_pred CeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECC
Q psy1727 141 PMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS 220 (240)
Q Consensus 141 Pv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~ 220 (240)
++.+.+..|++|++|||+.. .|.. ..++|+|+|||||.+. ++++|||||||||+.||++| ||||+|+
T Consensus 93 --~~~~~~~~f~~Y~~Gi~~~~--~~~~--~~~~Hav~ivGyg~~~-----~g~~yWii~NSwG~~WG~~G--~~~i~~~ 159 (174)
T smart00645 93 --SVAIDASDFQFYKSGIYDHP--GCGS--GTLDHAVLIVGYGTEE-----NGKDYWIVKNSWGTDWGENG--YFRIARG 159 (174)
T ss_pred --EEEEEcccccCCcCeEECCC--CCCC--CcccEEEEEEEEeecC-----CCeeEEEEECCCCCCcccCe--EEEEEcC
Confidence 45555556999999999875 5764 3479999999999752 57899999999999999999 9999999
Q ss_pred CCCccccceeEEe
Q psy1727 221 WGPRWGEQVTKSI 233 (240)
Q Consensus 221 ~~~~~g~~~~~~~ 233 (240)
..+.||+......
T Consensus 160 ~~~~c~i~~~~~~ 172 (174)
T smart00645 160 KNNECGIEASVAS 172 (174)
T ss_pred CCCccCceeeeee
Confidence 7456777544433
No 15
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.97 E-value=3.3e-31 Score=220.03 Aligned_cols=149 Identities=26% Similarity=0.438 Sum_probs=121.2
Q ss_pred eEecCCCC-----CCCCCCCccchHHH-HHHHhcCCCccCCcccccccCCCcccC----CcccceeEEeEeeeech-hHH
Q psy1727 62 KLVDCDKV-----DAGCGGGLMSNAFE-TIISKLGGGLEGEKDYPYKGSNRACHL----NKEEIRVKIQSYVNVSS-DET 130 (240)
Q Consensus 62 qlvDCs~~-----~~gC~GG~~~~A~~-yi~~~~~~Gi~~e~~yPY~~~~~~C~~----~~~~~~~~i~~~~~v~~-~~~ 130 (240)
+|+||... ..+|.||.+..++. +++ ++ ||++|.+|||......|.. ......+++..|..+.. +++
T Consensus 50 ~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~~-~~--Gi~~e~~~Py~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 126 (223)
T cd02619 50 YLYICANDECLGINGSCDGGGPLSALLKLVA-LK--GIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIE 126 (223)
T ss_pred HHHHhccccccccCCCCCCCcHHHHHHHHHH-Hc--CCCccccCCCCCCCCCCCCCCccchhhcceeecceeEeCchhHH
Confidence 89999873 36999999999998 777 66 9999999999988766653 23345577888888765 478
Q ss_pred HHHHHHHhCCCeEEEEeccc-cccccCCccc---CCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCC
Q psy1727 131 EMAKYLVKNGPMAVAINANA-MQFYFGGVSH---PLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPH 206 (240)
Q Consensus 131 ~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~---~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~ 206 (240)
+||++|.++|||++++.+.. |..|++|++. .....|.. ..++|||+|||||++.. .+++|||||||||+.
T Consensus 127 ~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~Hav~ivGy~~~~~----~~~~~~i~~NSwG~~ 200 (223)
T cd02619 127 DIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDG--DLGGHAVVIVGYDDNYV----EGKGAFIVKNSWGTD 200 (223)
T ss_pred HHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCC--ccCCeEEEEEeecCCCC----CCCCEEEEEeCCCCc
Confidence 99999999999999999876 8899999873 11112333 56899999999998721 278999999999999
Q ss_pred cCCCCcceEEEECCC
Q psy1727 207 WGEKTMPFWIIKNSW 221 (240)
Q Consensus 207 WG~~G~~y~~i~~~~ 221 (240)
||++| |+||+++.
T Consensus 201 wg~~G--y~~i~~~~ 213 (223)
T cd02619 201 WGDNG--YGRISYED 213 (223)
T ss_pred cccCC--EEEEehhh
Confidence 99999 99999985
No 16
>KOG1544|consensus
Probab=99.96 E-value=5.3e-30 Score=216.20 Aligned_cols=162 Identities=27% Similarity=0.521 Sum_probs=128.3
Q ss_pred eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccCCcccccccCC----CcccCC--------------------cccce
Q psy1727 62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN----RACHLN--------------------KEEIR 116 (240)
Q Consensus 62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~----~~C~~~--------------------~~~~~ 116 (240)
||++|.. ...||+||..+.|+=|+. +. |++...+|||.+.+ +.|... ..+..
T Consensus 264 nLlSC~~h~q~GC~gG~lDRAWWYlR-Kr--GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~i 340 (470)
T KOG1544|consen 264 NLLSCDTHQQQGCRGGRLDRAWWYLR-KR--GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDI 340 (470)
T ss_pred HhcchhhhhhccCccCcccchheeee-cc--cccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCce
Confidence 8999998 468999999999999998 77 99999999997533 445321 12234
Q ss_pred eEEeEeeeechhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCC---CCCCCccEEEEEEeecccccccC
Q psy1727 117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGG---MDNLDHGVLIVGYGVHKTKFTHK 192 (240)
Q Consensus 117 ~~i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~---~~~~~Hav~iVGyg~~~~~~~~~ 192 (240)
+++.....|+++|++|+++|+++|||.+.|.+.. |..|++|||.+.+...... ...+.|+|.|.|||++.. ..+.
T Consensus 341 yq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~-~~G~ 419 (470)
T KOG1544|consen 341 YQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL-PDGR 419 (470)
T ss_pred eeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccC-CCCC
Confidence 5566666677789999999999999999998866 8899999998763221110 124789999999998742 1235
Q ss_pred CcCEEEEecccCCCcCCCCcceEEEECCCCCccccce
Q psy1727 193 IQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQV 229 (240)
Q Consensus 193 g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~ 229 (240)
..+|||..||||+.|||+| ||||.||.|++-+|+-
T Consensus 420 ~~KyW~aANSWG~~WGE~G--YFriLRGvNecdIEsf 454 (470)
T KOG1544|consen 420 TLKYWTAANSWGPAWGERG--YFRILRGVNECDIESF 454 (470)
T ss_pred eeEEEEeecccccccccCc--eEEEeccccchhhhHh
Confidence 6789999999999999999 9999999998877763
No 17
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.72 E-value=1.1e-17 Score=150.98 Aligned_cols=95 Identities=19% Similarity=0.383 Sum_probs=73.6
Q ss_pred EeEeeeechh--HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCc------------------ccCCCCCCCCccEE
Q psy1727 119 IQSYVNVSSD--ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF------------------LCKGGMDNLDHGVL 178 (240)
Q Consensus 119 i~~~~~v~~~--~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~------------------~c~~~~~~~~Hav~ 178 (240)
...|.+++.+ ...+.++|..++||.++.++..|..|++||++.... .|.. ...+|||+
T Consensus 286 ~~~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~e--s~~tHAM~ 363 (437)
T cd00585 286 PILYLNVPMDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGE--SLMTHAMV 363 (437)
T ss_pred cceEEecCHHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcC--CcCCeEEE
Confidence 3466677654 233446888889999999998888999999965311 1222 45789999
Q ss_pred EEEEeecccccccCCc-CEEEEecccCCCcCCCCcceEEEECCCC
Q psy1727 179 IVGYGVHKTKFTHKIQ-PYWIIKNSWGPHWGEKTMPFWIIKNSWG 222 (240)
Q Consensus 179 iVGyg~~~~~~~~~g~-~yWivkNSWG~~WG~~G~~y~~i~~~~~ 222 (240)
||||+.+. +|. .||+||||||+.||++| ||+|++++-
T Consensus 364 ivGv~~D~-----~g~p~yw~VkNSWG~~~G~~G--y~~ms~~wf 401 (437)
T cd00585 364 LTGVDLDE-----DGKPVKWKVENSWGEKVGKKG--YFVMSDDWF 401 (437)
T ss_pred EEEEEecC-----CCCcceEEEEcccCCCCCCCc--ceehhHHHH
Confidence 99999863 465 69999999999999999 999998863
No 18
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.58 E-value=2.9e-15 Score=129.65 Aligned_cols=136 Identities=25% Similarity=0.344 Sum_probs=84.1
Q ss_pred CCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeec-----hhHHHHHHHHHhCCCeEEE--Ee
Q psy1727 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVS-----SDETEMAKYLVKNGPMAVA--IN 147 (240)
Q Consensus 75 GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~-----~~~~~i~~~l~~~GPv~v~--~~ 147 (240)
||....+..|+. +. .|-+.+.+-||....-.|..... ...++..-..++ .++..|++++...|-++.. |+
T Consensus 168 ~g~~~m~~a~l~-e~-sgpv~et~d~y~~~s~~~~~~~p-~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id 244 (372)
T COG4870 168 GGNADMSAAYLT-EW-SGPVYETDDPYSENSYFSPTNLP-VTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYID 244 (372)
T ss_pred CCcccccccccc-cc-CCcchhhcCccccccccCCcCCc-hhhccccceecccchhhhcccchHHHHhhhccccceeEEe
Confidence 566665666776 65 78888888888776655543111 111122222222 2355577777777776644 44
Q ss_pred ccccccccCCcccCCCcccCCCCCCCCccEEEEEEeeccc----ccccCCcCEEEEecccCCCcCCCCcceEEEECCC
Q psy1727 148 ANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKT----KFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSW 221 (240)
Q Consensus 148 ~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~----~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~ 221 (240)
+..+....-+.+... +. ...+|||+||||++... .....|...||||||||+.||++| ||+|++..
T Consensus 245 ~~~~~~~~~~~~~~~----s~--~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~G--YfwisY~y 314 (372)
T COG4870 245 ATNSLGICIPYPYVD----SG--ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENG--YFWISYYY 314 (372)
T ss_pred cccccccccCCCCCC----cc--ccccceEEEEeccccccccccccCCCCCceEEEECccccccccCc--eEEEEeee
Confidence 444323222333322 11 46899999999998631 122235669999999999999999 99999864
No 19
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.03 E-value=1.3e-09 Score=99.02 Aligned_cols=94 Identities=21% Similarity=0.428 Sum_probs=64.1
Q ss_pred Eeeeechh--HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccC----------------CCCCCCCccEEEEEE
Q psy1727 121 SYVNVSSD--ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCK----------------GGMDNLDHGVLIVGY 182 (240)
Q Consensus 121 ~~~~v~~~--~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~----------------~~~~~~~Hav~iVGy 182 (240)
.|.++|.+ .+.+.+.|...-||..+-++..+...+.||.+....... ......+|||+|||.
T Consensus 289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv 368 (438)
T PF03051_consen 289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGV 368 (438)
T ss_dssp EEEE--HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEE
T ss_pred eEeccCHHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEE
Confidence 46677765 566777888888999999998755778898875420000 001345899999999
Q ss_pred eecccccccCCc-CEEEEecccCCCcCCCCcceEEEECCC
Q psy1727 183 GVHKTKFTHKIQ-PYWIIKNSWGPHWGEKTMPFWIIKNSW 221 (240)
Q Consensus 183 g~~~~~~~~~g~-~yWivkNSWG~~WG~~G~~y~~i~~~~ 221 (240)
..+. +|. .+|+|+||||+..|.+| ||.|+..+
T Consensus 369 ~~D~-----~g~p~~wkVeNSWG~~~g~kG--y~~msd~w 401 (438)
T PF03051_consen 369 DLDE-----DGKPVRWKVENSWGTDNGDKG--YFYMSDDW 401 (438)
T ss_dssp EE-T-----TSSEEEEEEE-SBTTTSTBTT--EEEEEHHH
T ss_pred Eecc-----CCCeeEEEEEcCCCCCCCCCc--EEEECHHH
Confidence 9863 565 59999999999999999 99998764
No 20
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=98.82 E-value=4.5e-09 Score=69.06 Aligned_cols=36 Identities=39% Similarity=0.684 Sum_probs=31.7
Q ss_pred HHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcC
Q psy1727 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRG 54 (240)
Q Consensus 19 f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n 54 (240)
|++|+++|+|.|.+++|+..|++||.+|+++|+.+|
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N 36 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHN 36 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHh
Confidence 899999999999999999999999999999998877
No 21
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=98.52 E-value=1.2e-07 Score=61.79 Aligned_cols=37 Identities=38% Similarity=0.624 Sum_probs=34.7
Q ss_pred HHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC
Q psy1727 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE 55 (240)
Q Consensus 19 f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~ 55 (240)
|++|+.+|+|.|.+.+|...|+.||.+|++.|+.+|.
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~ 37 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNK 37 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHh
Confidence 6899999999999999999999999999999988774
No 22
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=97.57 E-value=7.3e-05 Score=64.88 Aligned_cols=83 Identities=16% Similarity=0.359 Sum_probs=56.9
Q ss_pred HHHHHhCCCeEEEEeccccccccCCcccCCCcc-------------cCC-C--CCCCCccEEEEEEeecccccccCCcCE
Q psy1727 133 AKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL-------------CKG-G--MDNLDHGVLIVGYGVHKTKFTHKIQPY 196 (240)
Q Consensus 133 ~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~-------------c~~-~--~~~~~Hav~iVGyg~~~~~~~~~g~~y 196 (240)
.+.+..+-+|..+-++..+..-+.||.+..-.. .+. + .....|||+|.|.+.++. ...--
T Consensus 305 ~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~----g~p~r 380 (444)
T COG3579 305 IKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDET----GNPLR 380 (444)
T ss_pred HHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccC----CCcee
Confidence 334445668888888777777788877632100 000 0 022579999999997641 23447
Q ss_pred EEEecccCCCcCCCCcceEEEECCC
Q psy1727 197 WIIKNSWGPHWGEKTMPFWIIKNSW 221 (240)
Q Consensus 197 WivkNSWG~~WG~~G~~y~~i~~~~ 221 (240)
|.|.||||.+=|.+| ||-++..+
T Consensus 381 wkVENSWG~d~G~~G--yfvaSd~w 403 (444)
T COG3579 381 WKVENSWGKDVGKKG--YFVASDAW 403 (444)
T ss_pred eEeecccccccCCCc--eEeehHhH
Confidence 999999999999999 99987654
No 23
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=92.05 E-value=0.28 Score=36.76 Aligned_cols=57 Identities=19% Similarity=0.283 Sum_probs=33.5
Q ss_pred hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEeccc
Q psy1727 128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSW 203 (240)
Q Consensus 128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSW 203 (240)
+.+.|++.|.+..||.+.+....- ...+..+. . ....|.|+|+||..+ + +++|..+|
T Consensus 88 ~~~~i~~~i~~G~Pvi~~~~~~~~-~~~~~~~~------~---~~~~H~vvi~Gy~~~-------~--~~~v~DP~ 144 (144)
T PF13529_consen 88 SFDDIKQEIDAGRPVIVSVNSGWR-PPNGDGYD------G---TYGGHYVVIIGYDED-------G--YVYVNDPW 144 (144)
T ss_dssp -HHHHHHHHHTT--EEEEEETTSS---TTEEEE------E----TTEEEEEEEEE-SS-------E---EEEE-TT
T ss_pred cHHHHHHHHHCCCcEEEEEEcccc-cCCCCCcC------C---CcCCEEEEEEEEeCC-------C--EEEEeCCC
Confidence 578899999998899999974210 00011111 1 347899999999975 2 78887776
No 24
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=88.92 E-value=1.6 Score=37.90 Aligned_cols=47 Identities=21% Similarity=0.274 Sum_probs=32.6
Q ss_pred HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeec
Q psy1727 129 ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVH 185 (240)
Q Consensus 129 ~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~ 185 (240)
.+.|++.|.++.||.+.++...+ .|...-+ .. ....|.|+|+||+++
T Consensus 78 ~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~--~~~~H~i~v~G~d~~ 124 (317)
T PF14399_consen 78 WEELKEALDAGRPVIVWVDMYYL-PYRPNYY-------KK--HHADHYIVVYGYDEE 124 (317)
T ss_pred HHHHHHHHhCCCceEEEeccccC-CCCcccc-------cc--ccCCcEEEEEEEeCC
Confidence 56788888877799999887543 2332211 11 346899999999975
No 25
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=88.20 E-value=2.2 Score=35.28 Aligned_cols=58 Identities=22% Similarity=0.338 Sum_probs=35.5
Q ss_pred hHHHHHHHHHhCCCeEEEEeccccccc--cCCcccCCCcccC-CCCCCCCccEEEEEEeec
Q psy1727 128 DETEMAKYLVKNGPMAVAINANAMQFY--FGGVSHPLKFLCK-GGMDNLDHGVLIVGYGVH 185 (240)
Q Consensus 128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y--~~Gv~~~~~~~c~-~~~~~~~Hav~iVGyg~~ 185 (240)
..++|...|.++||+.+-+++..+.=. +.-........|- ......+|-|+|+||+..
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~ 172 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA 172 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence 478899999999988888877543200 2222211101122 112457899999999975
No 26
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=82.19 E-value=8 Score=30.97 Aligned_cols=65 Identities=20% Similarity=0.291 Sum_probs=40.8
Q ss_pred hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCc
Q psy1727 128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHW 207 (240)
Q Consensus 128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~W 207 (240)
+.+++++.+.++-|+.+..+.-. +. .. ...+||++||||-.-. +|.++.++=|-|
T Consensus 90 s~~eV~~~~~~nk~i~i~~~~v~-----~~---------~~--~~~gHAlavvGya~~~-----~g~~~y~~WNPW---- 144 (175)
T PF05543_consen 90 SFDEVKKLIDNNKGIAILADRVE-----QT---------NG--PHAGHALAVVGYAKPN-----NGQKTYYFWNPW---- 144 (175)
T ss_dssp -HHHHHHHHHTT-EEEEEEEETT-----SC---------TT--B--EEEEEEEEEEEET-----TSEEEEEEE-TT----
T ss_pred CHHHHHHHHHcCCCeEEEecccc-----cC---------CC--CccceeEEEEeeeecC-----CCCeEEEEeCCc----
Confidence 56788888888888777665211 11 11 4578999999997643 578899998988
Q ss_pred CCCCcceEEEECC
Q psy1727 208 GEKTMPFWIIKNS 220 (240)
Q Consensus 208 G~~G~~y~~i~~~ 220 (240)
+++ ++.++..
T Consensus 145 -~~~--~~~~sa~ 154 (175)
T PF05543_consen 145 -WND--VMIQSAK 154 (175)
T ss_dssp --SS---EEEETT
T ss_pred -cCC--cEEEecC
Confidence 355 6666544
No 27
>KOG4128|consensus
Probab=81.72 E-value=0.19 Score=44.05 Aligned_cols=87 Identities=14% Similarity=0.109 Sum_probs=53.6
Q ss_pred HHHHHHHHH----hCCCeEEEEeccccccccCCcccCCCccc--------C--CC-------CCCCCccEEEEEEeeccc
Q psy1727 129 ETEMAKYLV----KNGPMAVAINANAMQFYFGGVSHPLKFLC--------K--GG-------MDNLDHGVLIVGYGVHKT 187 (240)
Q Consensus 129 ~~~i~~~l~----~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c--------~--~~-------~~~~~Hav~iVGyg~~~~ 187 (240)
.+-|++.++ .+-||..+-+...+..-++|+.+-....- . .. ...-.||+++.|.|.-
T Consensus 306 ~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~k-- 383 (457)
T KOG4128|consen 306 MDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLK-- 383 (457)
T ss_pred HHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhcccc--
Confidence 344444443 35577777776666677888765321000 0 00 0123699999888832
Q ss_pred ccccCCcCEEEEecccCCCcCCCCcceEEEEC
Q psy1727 188 KFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKN 219 (240)
Q Consensus 188 ~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~ 219 (240)
+....+-.-|-|.||||++-|.+| +..|..
T Consensus 384 d~~~g~~~~~rVenswgkd~gkkg--~~~mt~ 413 (457)
T KOG4128|consen 384 DPATGGLNEHRVENSWGKDLGKKG--VNKMTA 413 (457)
T ss_pred CcccCCchhhhhhchhhhhccccc--hhhhhH
Confidence 112256667999999999999999 876643
No 28
>PTZ00203 cathepsin L protease; Provisional
Probab=69.20 E-value=4.1 Score=36.40 Aligned_cols=36 Identities=31% Similarity=0.763 Sum_probs=28.6
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEe
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSI 233 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~ 233 (240)
.+-++--+||. ++|.+|++|+.+|+..||+.+++.|
T Consensus 287 nHaVliVGYG~---~~g~~YWiikNSWG~~WGe~GY~ri 322 (348)
T PTZ00203 287 NHGVLLVGYNM---TGEVPYWVIKNSWGEDWGEKGYVRV 322 (348)
T ss_pred CeEEEEEEEec---CCCceEEEEEcCCCCCcCcCceEEE
Confidence 45555556665 3578899999999999999999877
No 29
>KOG3404|consensus
Probab=68.73 E-value=7.7 Score=29.18 Aligned_cols=44 Identities=23% Similarity=0.225 Sum_probs=29.4
Q ss_pred CCCCCChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHH
Q psy1727 6 KPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49 (240)
Q Consensus 6 ~~~~~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~ 49 (240)
..+|..+...|..|.-|+..|.|+----+-+-+|.+|-.+=..+
T Consensus 34 ~~~~~~~~~~E~lwpIfqlhHQrsRYiYdlyykR~~IS~eLY~~ 77 (145)
T KOG3404|consen 34 TEPHEGKRKTESLWPIFQLHHQRSRYIYDLYYKRKAISRELYDY 77 (145)
T ss_pred cCcccCCCcchhhhhHHHHhhhhhhhHHHHHHHHHHHHHHHHHH
Confidence 34454555668999999999999743345666777776554433
No 30
>KOG1543|consensus
Probab=66.91 E-value=7.7 Score=34.32 Aligned_cols=41 Identities=29% Similarity=0.631 Sum_probs=35.2
Q ss_pred CcCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecCC
Q psy1727 193 IQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYSS 236 (240)
Q Consensus 193 g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~~ 236 (240)
....+++--.||+ .+|.||+.++.+|+..||+.++..|...
T Consensus 268 ~~~Hav~iVGyG~---~~~~~YWivkNSWG~~WGe~Gy~ri~r~ 308 (325)
T KOG1543|consen 268 EGDHAVLIVGYGT---GDGVDYWIVKNSWGTDWGEKGYFRIARG 308 (325)
T ss_pred CCCceEEEEEEcC---CCCceeEEEEcCCCCCcccCceEEEecC
Confidence 3678999999999 5666799999999999999999988643
No 31
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=66.40 E-value=5 Score=33.64 Aligned_cols=37 Identities=38% Similarity=0.843 Sum_probs=27.7
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEe
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSI 233 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~ 233 (240)
..-++--.||.. ++|.+|++|+..++..||+.+.+.|
T Consensus 179 ~HaV~IVGyG~~--~~g~~YWiikNSWG~~WGe~Gy~~i 215 (239)
T cd02698 179 NHIISVAGWGVD--ENGVEYWIVRNSWGEPWGERGWFRI 215 (239)
T ss_pred CeEEEEEEEEec--CCCCEEEEEEcCCCcccCcCceEEE
Confidence 345555556652 1278899999999999999998876
No 32
>PTZ00200 cysteine proteinase; Provisional
Probab=65.45 E-value=5.9 Score=36.69 Aligned_cols=39 Identities=31% Similarity=0.588 Sum_probs=29.0
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
..-++--+||.+ -++|.+|+.|+..|+..||+.++..|-
T Consensus 387 nHaV~lVGyG~d-~~~g~~YWIIkNSWG~~WGe~GY~ri~ 425 (448)
T PTZ00200 387 NHAVLLVGEGYD-EKTKKRYWIIKNSWGTDWGENGYMRLE 425 (448)
T ss_pred cEEEEEEEeccc-CCCCCceEEEEcCCCCCcccCeeEEEE
Confidence 455666666531 035777999999999999999998773
No 33
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=64.92 E-value=5.9 Score=33.22 Aligned_cols=39 Identities=36% Similarity=0.725 Sum_probs=29.5
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
.+-++--+||.+= .+|.+|++|+.+++..||+.+++.|-
T Consensus 187 ~HaV~iVGyg~~~-~~g~~YWiirNSWG~~WGe~Gy~~i~ 225 (243)
T cd02621 187 NHAVLLVGWGEDE-IKGEKYWIVKNSWGSSWGEKGYFKIR 225 (243)
T ss_pred CeEEEEEEeeccC-CCCCcEEEEEcCCCCCCCcCCeEEEe
Confidence 4666666777531 02667999999999999999998773
No 34
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=63.32 E-value=16 Score=29.44 Aligned_cols=47 Identities=23% Similarity=0.429 Sum_probs=34.4
Q ss_pred hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccC
Q psy1727 128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWG 204 (240)
Q Consensus 128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG 204 (240)
+..+|+..|.+..||.+-... | .+ ..-|+|+|.||++. |...-++||
T Consensus 122 sl~~ik~ql~kg~PV~iw~T~--~--------~~----------~s~H~v~itgyDk~----------n~yynDpyG 168 (195)
T COG4990 122 SLSDIKGQLLKGRPVVIWVTN--F--------HS----------YSIHSVLITGYDKY----------NIYYNDPYG 168 (195)
T ss_pred cHHHHHHHHhcCCcEEEEEec--c--------cc----------cceeeeEeeccccc----------ceEeccccc
Confidence 478999999999999877653 2 11 13599999999975 555666664
No 35
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=62.36 E-value=12 Score=29.68 Aligned_cols=39 Identities=33% Similarity=0.690 Sum_probs=32.7
Q ss_pred cCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727 194 QPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 194 ~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
..+-++--.||.+ ++|++|++|+.+++..||+.+.+.|.
T Consensus 119 ~~Hav~ivGyg~~--~~g~~yWii~NSwG~~WG~~G~~~i~ 157 (174)
T smart00645 119 LDHAVLIVGYGTE--ENGKDYWIVKNSWGTDWGENGYFRIA 157 (174)
T ss_pred ccEEEEEEEEeec--CCCeeEEEEECCCCCCcccCeEEEEE
Confidence 4678888888875 46888999999999999999988774
No 36
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=59.62 E-value=11 Score=31.53 Aligned_cols=37 Identities=32% Similarity=0.781 Sum_probs=27.9
Q ss_pred EEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727 196 YWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS 235 (240)
Q Consensus 196 yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~ 235 (240)
.=++--.||.. +|++|++|+.+|+..||+.++..|.-
T Consensus 186 HaV~iVGyg~~---~g~~YWivrNSWG~~WGe~Gy~ri~~ 222 (236)
T cd02620 186 HAVKIIGWGVE---NGVPYWLAANSWGTDWGENGYFRILR 222 (236)
T ss_pred eEEEEEEEecc---CCeeEEEEEeCCCCCCCCCcEEEEEc
Confidence 44444455543 67779999999999999999987743
No 37
>KOG1542|consensus
Probab=53.78 E-value=11 Score=33.47 Aligned_cols=37 Identities=32% Similarity=0.797 Sum_probs=28.2
Q ss_pred CEEEEecccCCCcCCC-CcceEEEECCCCCccccceeEEec
Q psy1727 195 PYWIIKNSWGPHWGEK-TMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~-G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
+.=++--++|+. + .+||+.|+.+|++.||+.+++.+.
T Consensus 317 ~HaVLlvGyG~~---g~~~PYWIVKNSWG~~WGE~GY~~l~ 354 (372)
T KOG1542|consen 317 NHAVLLVGYGSS---GYEKPYWIVKNSWGTSWGEKGYYKLC 354 (372)
T ss_pred CceEEEEeecCC---CCCCceEEEECCccccccccceEEEe
Confidence 344555566652 3 588999999999999999988764
No 38
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are
Probab=52.91 E-value=45 Score=24.70 Aligned_cols=34 Identities=24% Similarity=0.503 Sum_probs=24.9
Q ss_pred HHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEe
Q psy1727 132 MAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG 183 (240)
Q Consensus 132 i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg 183 (240)
+++.|....||.+.++.. . . . ...+|.|+|+||.
T Consensus 70 ~~~~l~~~~Pvi~~~~~~--------~---~----~---~~~gH~vVv~g~~ 103 (141)
T cd02549 70 LLRQLAAGHPVIVSVNLG--------V---S----I---TPSGHAMVVIGYD 103 (141)
T ss_pred HHHHHHCCCeEEEEEecC--------c---c----c---CCCCeEEEEEEEc
Confidence 778888888998877641 0 1 1 2368999999998
No 39
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=52.41 E-value=13 Score=35.42 Aligned_cols=37 Identities=24% Similarity=0.737 Sum_probs=28.3
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCC--ccccceeEEe
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGP--RWGEQVTKSI 233 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~--~~g~~~~~~~ 233 (240)
.+-++--+||.+ ++|.+|+.|+.+|+. .||+.++..|
T Consensus 403 nHAVlIVGYG~d--e~G~~YWIVKNSWGt~~~WGE~GYfRI 441 (548)
T PTZ00364 403 NHTVLIIGWGTD--ENGGDYWLVLDPWGSRRSWCDGGTRKI 441 (548)
T ss_pred CeEEEEEEeccc--CCCceEEEEECCCCCCCCcccCCeEEE
Confidence 345555566642 357789999999999 9999999876
No 40
>KOG4702|consensus
Probab=49.06 E-value=34 Score=23.03 Aligned_cols=32 Identities=19% Similarity=0.373 Sum_probs=23.9
Q ss_pred HHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHH
Q psy1727 17 AMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK 49 (240)
Q Consensus 17 ~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~ 49 (240)
.-|++|+.+|.+.-.. .|...|.+-|.+-++.
T Consensus 29 e~Fee~v~~~krel~p-pe~~~~~EE~~~~lRe 60 (77)
T KOG4702|consen 29 EIFEEFVRGYKRELSP-PEATKRKEEYENFLRE 60 (77)
T ss_pred HHHHHHHHhccccCCC-hHHHhhHHHHHHHHHH
Confidence 5799999999998754 4777777777665543
No 41
>PTZ00021 falcipain-2; Provisional
Probab=42.94 E-value=24 Score=33.06 Aligned_cols=41 Identities=32% Similarity=0.461 Sum_probs=29.9
Q ss_pred cCEEEEecccCCCcC-------CCCcceEEEECCCCCccccceeEEec
Q psy1727 194 QPYWIIKNSWGPHWG-------EKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 194 ~~yWivkNSWG~~WG-------~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
...-++--+||..=| ..|.+|+.|+.+|+..||+.+++.|-
T Consensus 420 ~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~ 467 (489)
T PTZ00021 420 PNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIE 467 (489)
T ss_pred cceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEE
Confidence 456777777875311 01245999999999999999998773
No 42
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=42.80 E-value=25 Score=28.27 Aligned_cols=29 Identities=31% Similarity=0.692 Sum_probs=22.7
Q ss_pred CcCCC---CcceEEEECCCCCccccceeEEec
Q psy1727 206 HWGEK---TMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 206 ~WG~~---G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
+||+. |++||+|+.+++..||+.+...+.
T Consensus 179 Gy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~ 210 (223)
T cd02619 179 GYDDNYVEGKGAFIVKNSWGTDWGDNGYGRIS 210 (223)
T ss_pred eecCCCCCCCCEEEEEeCCCCccccCCEEEEe
Confidence 45543 344999999999999999887764
No 43
>PTZ00049 cathepsin C-like protein; Provisional
Probab=40.31 E-value=27 Score=34.13 Aligned_cols=37 Identities=32% Similarity=0.717 Sum_probs=26.4
Q ss_pred EEEEecccCCCcCCCCc--ceEEEECCCCCccccceeEEe
Q psy1727 196 YWIIKNSWGPHWGEKTM--PFWIIKNSWGPRWGEQVTKSI 233 (240)
Q Consensus 196 yWivkNSWG~~WG~~G~--~y~~i~~~~~~~~g~~~~~~~ 233 (240)
.-++--+||+. .++|. +|+.|+.+|+..||+.+++.|
T Consensus 620 HAVlIVGwG~d-~enG~~~~YWIVRNSWGt~WGenGYfKI 658 (693)
T PTZ00049 620 HAIVLVGWGEE-EINGKLYKYWIGRNSWGKNWGKEGYFKI 658 (693)
T ss_pred eEEEEEEeccc-cCCCcccCEEEEECCCCCCcccCceEEE
Confidence 34444455542 12464 699999999999999999877
No 44
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=37.97 E-value=73 Score=25.21 Aligned_cols=38 Identities=26% Similarity=0.357 Sum_probs=27.7
Q ss_pred HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeec
Q psy1727 129 ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVH 185 (240)
Q Consensus 129 ~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~ 185 (240)
.+.+...|.++||+-+++.... . .-..|+++|.|-..+
T Consensus 98 ~e~~~~LL~~yGPLwv~~~~P~----------------~---~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 98 AEGLANLLREYGPLWVAWEAPG----------------D---SWVAHASVITGIDGD 135 (166)
T ss_pred HHHHHHHHHHcCCeEEEecCCC----------------C---cceeeEEEEEeecCC
Confidence 6889999999999988854311 1 224688888888765
No 45
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=37.65 E-value=26 Score=28.18 Aligned_cols=37 Identities=35% Similarity=0.759 Sum_probs=27.8
Q ss_pred CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
.+-++--.||...| .+ ||.|+.+|+..||+.+...|-
T Consensus 159 ~Hav~iVGy~~~~~-~~--ywiv~NSWG~~WG~~Gy~~i~ 195 (210)
T cd02248 159 NHAVLLVGYGTENG-VD--YWIVKNSWGTSWGEKGYIRIA 195 (210)
T ss_pred CEEEEEEEEeecCC-ce--EEEEEcCCCCccccCcEEEEE
Confidence 45666666665443 35 999999999999999888763
No 46
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=33.38 E-value=40 Score=34.34 Aligned_cols=39 Identities=23% Similarity=0.602 Sum_probs=28.3
Q ss_pred EEEEecccCCCcC--CCCcceEEEECCCCCccccceeEEec
Q psy1727 196 YWIIKNSWGPHWG--EKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 196 yWivkNSWG~~WG--~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
+-++--+||..=. .+|++|+.|+.+|+..||+.+++.|-
T Consensus 723 HAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~ 763 (1004)
T PTZ00462 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763 (1004)
T ss_pred ceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEE
Confidence 4555555664321 12678999999999999999998873
No 47
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=32.26 E-value=75 Score=27.72 Aligned_cols=30 Identities=13% Similarity=0.222 Sum_probs=23.5
Q ss_pred CCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP 205 (240)
Q Consensus 172 ~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~ 205 (240)
..+||=.|+++-... ..+...-.+||.||.
T Consensus 234 ~~~HaY~Vl~~~~~~----~~~~~lv~lrNPWg~ 263 (315)
T cd00044 234 VKGHAYSVLDVREVQ----EEGLRLLRLRNPWGV 263 (315)
T ss_pred ccCcceEEeEEEEEc----cCceEEEEecCCccC
Confidence 468999999998651 027888999999984
No 48
>PF03714 PUD: Bacterial pullanase-associated domain; InterPro: IPR005323 Domain is found in pullanase - carbohydrate de-branching - proteins. It is found both to the N or the C-termini of of the alpha-amylase active site region. This domain contains several conserved aromatic residues that are suggestive of a carbohydrate binding function.; GO: 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 2J43_A 2J44_A 2YA1_A 2FHC_A 2FHB_A 2FHF_A 2J73_B 2J72_A 2J71_A.
Probab=30.14 E-value=13 Score=26.85 Aligned_cols=41 Identities=17% Similarity=0.240 Sum_probs=24.9
Q ss_pred EEEEecccCCCcCCCCcceEEEE--CCCCCccccceeEEecCCCC
Q psy1727 196 YWIIKNSWGPHWGEKTMPFWIIK--NSWGPRWGEQVTKSIYSSAP 238 (240)
Q Consensus 196 yWivkNSWG~~WG~~G~~y~~i~--~~~~~~~g~~~~~~~~~~~~ 238 (240)
-+||++++.++-+.+= ++.+. .+..+-+..++...+|+++|
T Consensus 61 gfIv~~~~~kd~~~D~--~i~~~~~~~~~evwi~~Gd~~vy~~~p 103 (103)
T PF03714_consen 61 GFIVRKGDWKDQGGDR--FIDLTSPQGNTEVWIVSGDETVYYSPP 103 (103)
T ss_dssp EEEEEETTEECSSSEE--EEETTTTTTCEEEEEETTTSSEBSSTT
T ss_pred EEEEEcCCCCCCCCCE--EEEeccCCCcEEEEEECCCccEEecCC
Confidence 4889998666522222 34441 23333577777788888887
No 49
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=27.39 E-value=32 Score=27.61 Aligned_cols=36 Identities=28% Similarity=0.613 Sum_probs=25.1
Q ss_pred EEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727 196 YWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY 234 (240)
Q Consensus 196 yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~ 234 (240)
+=++--+||..= |.+||.|+.+++..||+.+...+.
T Consensus 167 Hav~iVGy~~~~---~~~~wiv~NSWG~~WG~~Gy~~i~ 202 (219)
T PF00112_consen 167 HAVLIVGYDDEN---GKGYWIVKNSWGTDWGDNGYFRIS 202 (219)
T ss_dssp EEEEEEEEEEET---TEEEEEEE-SBTTTSTBTTEEEEE
T ss_pred cccccccccccc---ceeeEeeehhhCCccCCCeEEEEe
Confidence 445555555432 344999999999999999988764
No 50
>PF04928 PAP_central: Poly(A) polymerase central domain; InterPro: IPR007012 In eukaryotes, polyadenylation of pre-mRNA plays an essential role in the initiation step of protein synthesis, as well as in the export and stability of mRNAs. Poly(A) polymerase, the enzyme at the heart of the polyadenylation machinery, is a template-independent RNA polymerase which specifically incorporates ATP at the 3' end of mRNA. The crystal structure of bovine poly(A) polymerase bound to an ATP analog at 2.5 A resolutio has been determined []. The structure revealed expected and unexpected similarities to other proteins. As expected, the catalytic domain of poly(A) polymerase shares substantial structural homology with other nucleotidyl transferases such as DNA polymerase beta and kanamycin transferase. The central domain of Poly(A) polymerase shares structural similarity with the allosteric activity domain of ribonucleotide reductase R1, which comprises a four-helix bundle and a three-stranded mixed beta-sheet. Even though the two enzymes bind ATP, the ATP-recognition motifs are different.; GO: 0004652 polynucleotide adenylyltransferase activity, 0006351 transcription, DNA-dependent; PDB: 1Q79_A 1Q78_A 1F5A_A 2O1P_A 2HHP_A 3C66_B 1FA0_A 2Q66_A.
Probab=21.98 E-value=1.6e+02 Score=25.15 Aligned_cols=39 Identities=28% Similarity=0.503 Sum_probs=27.1
Q ss_pred CChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHH
Q psy1727 10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50 (240)
Q Consensus 10 ~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i 50 (240)
..++....+..+|.+.++ .|.+++|..+|.+++.+ |+.|
T Consensus 17 ~~Dl~~s~~L~~~l~~~~-~~es~ee~~~R~~vl~~-L~~i 55 (254)
T PF04928_consen 17 EKDLKRSASLEEFLKDYG-LFESEEEEQKREEVLRK-LQQI 55 (254)
T ss_dssp HHHHHHHHHHHHHHHHCT--S--HHHHHHHHHHHHH-HHHH
T ss_pred hhhHHhHHHHHHHHHHcC-CCCChHHHhHHHHHHHH-HHHH
Confidence 334556688999999997 66899999999888755 4444
Done!