Query         psy282
Match_columns 233
No_of_seqs    301 out of 2087
Neff          8.0 
Searched_HMMs 46136
Date          Fri Aug 16 17:45:40 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy282.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/282hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0   9E-34 1.9E-38  243.1  10.8  130    1-133   235-366 (372)
  2 KOG1542|consensus              100.0 8.7E-32 1.9E-36  230.9   3.1  126  101-228   229-366 (372)
  3 PTZ00203 cathepsin L protease; 100.0 9.5E-30 2.1E-34  225.4  13.8  127    2-133   207-335 (348)
  4 cd02621 Peptidase_C1A_Cathepsi 100.0 2.6E-29 5.7E-34  213.3  13.5  130    1-134    88-238 (243)
  5 cd02248 Peptidase_C1A Peptidas 100.0   1E-28 2.2E-33  204.5  14.0  126    1-132    78-206 (210)
  6 cd02620 Peptidase_C1A_Cathepsi 100.0 8.4E-29 1.8E-33  209.3  13.3  129    1-133    84-233 (236)
  7 cd02698 Peptidase_C1A_Cathepsi 100.0 1.5E-28 3.3E-33  208.1  13.6  125    1-132    86-232 (239)
  8 KOG1543|consensus              100.0 1.9E-28 4.2E-33  215.5  13.2  126    1-132   189-317 (325)
  9 PTZ00021 falcipain-2; Provisio  99.9 2.4E-27 5.1E-32  216.7  12.8  123    2-133   345-483 (489)
 10 KOG1543|consensus               99.9 1.5E-28 3.3E-33  216.2   4.7  119  102-225   185-315 (325)
 11 PTZ00049 cathepsin C-like prot  99.9 1.8E-26 3.8E-31  215.9  13.2  130    1-134   472-672 (693)
 12 PTZ00200 cysteine proteinase;   99.9 1.8E-26   4E-31  209.8  12.9  123    1-133   312-440 (448)
 13 PTZ00203 cathepsin L protease;  99.9 5.4E-27 1.2E-31  207.8   6.8  118  102-223   199-330 (348)
 14 cd02621 Peptidase_C1A_Cathepsi  99.9 1.2E-26 2.6E-31  196.9   8.2   89  135-223   131-232 (243)
 15 PTZ00364 dipeptidyl-peptidase   99.9   1E-25 2.3E-30  208.1  14.3  134    1-134   294-455 (548)
 16 PF00112 Peptidase_C1:  Papain   99.9 4.2E-26   9E-31  189.2   9.8  127    1-133    82-215 (219)
 17 cd02698 Peptidase_C1A_Cathepsi  99.9 2.5E-26 5.3E-31  194.6   6.6   81  136-219   137-219 (239)
 18 cd02620 Peptidase_C1A_Cathepsi  99.9 2.5E-26 5.5E-31  194.2   6.3   84  136-222   143-227 (236)
 19 PTZ00021 falcipain-2; Provisio  99.9 4.3E-26 9.3E-31  208.4   7.9  122  102-231   339-480 (489)
 20 cd02248 Peptidase_C1A Peptidas  99.9 7.5E-25 1.6E-29  181.3   7.7   84  137-222   117-201 (210)
 21 PTZ00200 cysteine proteinase;   99.9   4E-24 8.8E-29  194.5   7.5  103  111-219   314-428 (448)
 22 PTZ00462 Serine-repeat antigen  99.9   2E-23 4.4E-28  200.6  12.4  126    2-133   614-776 (1004)
 23 cd02619 Peptidase_C1 C1 Peptid  99.9   6E-23 1.3E-27  170.5  12.4  119    1-123    82-212 (223)
 24 PTZ00364 dipeptidyl-peptidase   99.9 6.4E-24 1.4E-28  196.3   5.9   89  136-224   342-450 (548)
 25 PTZ00049 cathepsin C-like prot  99.9   2E-23 4.3E-28  195.5   6.8   89  136-224   556-667 (693)
 26 PF00112 Peptidase_C1:  Papain   99.9 2.8E-23 6.1E-28  172.2   5.2   83  136-220   122-206 (219)
 27 PTZ00462 Serine-repeat antigen  99.9 1.4E-22   3E-27  194.9   6.4   80  138-219   681-766 (1004)
 28 smart00645 Pept_C1 Papain fami  99.8 3.8E-21 8.2E-26  155.5   8.7   70  151-222    93-164 (174)
 29 cd02619 Peptidase_C1 C1 Peptid  99.8 1.4E-21   3E-26  162.3   6.1   86  136-221   124-215 (223)
 30 smart00645 Pept_C1 Papain fami  99.8 3.4E-20 7.3E-25  149.9   9.3   74   56-131    93-168 (174)
 31 KOG1544|consensus               99.8 4.1E-20 8.8E-25  157.7   8.8  134    1-135   291-457 (470)
 32 KOG1544|consensus               99.8 7.8E-20 1.7E-24  156.0   3.6   89  136-224   352-451 (470)
 33 cd00585 Peptidase_C1B Peptidas  99.6 1.5E-16 3.2E-21  144.5   5.9   77  142-218   303-399 (437)
 34 COG4870 Cysteine protease [Pos  99.5 2.8E-14   6E-19  124.4   5.2   78  140-219   227-314 (372)
 35 cd00585 Peptidase_C1B Peptidas  99.4 8.8E-13 1.9E-17  119.9   9.3   90   32-123   286-399 (437)
 36 COG4870 Cysteine protease [Pos  99.1 2.2E-10 4.7E-15  100.3   7.9   80   41-124   223-314 (372)
 37 PF03051 Peptidase_C1_2:  Pepti  98.8 4.8E-09   1E-13   95.7   4.3   81  138-218   300-400 (438)
 38 PF03051 Peptidase_C1_2:  Pepti  98.0 2.2E-05 4.8E-10   71.9   8.7   87   36-122   290-399 (438)
 39 COG3579 PepC Aminopeptidase C   97.2 9.4E-05   2E-09   64.7  -0.0   74  144-217   308-401 (444)
 40 COG3579 PepC Aminopeptidase C   93.8    0.05 1.1E-06   48.0   2.9   80   41-120   296-399 (444)
 41 PF13529 Peptidase_C39_2:  Pept  89.7    0.26 5.6E-06   37.0   2.3   56  137-203    89-144 (144)
 42 PF13529 Peptidase_C39_2:  Pept  88.2     1.9 4.1E-05   32.1   6.3   55   41-108    88-144 (144)
 43 PF09778 Guanylate_cyc_2:  Guan  78.1     8.8 0.00019   32.0   6.5   64   41-106   112-180 (212)
 44 KOG4128|consensus               72.9    0.35 7.6E-06   42.7  -3.1   40  178-217   370-413 (457)
 45 PF14399 Transpep_BrtH:  NlpC/p  71.7      12 0.00025   32.5   6.1   57   41-106    77-133 (317)
 46 PF14399 Transpep_BrtH:  NlpC/p  66.9     4.7  0.0001   35.0   2.6   56  137-201    78-133 (317)
 47 PF09778 Guanylate_cyc_2:  Guan  62.4      13 0.00027   31.1   4.1   64  136-201   112-180 (212)
 48 PF05543 Peptidase_C47:  Stapho  61.3      25 0.00055   28.3   5.5   54   41-108    90-144 (175)
 49 PF12385 Peptidase_C70:  Papain  59.8      27 0.00058   27.8   5.3   39   41-96     97-135 (166)
 50 cd00044 CysPc Calpains, domain  49.9      26 0.00057   30.7   4.3   27  179-205   235-263 (315)
 51 PF05543 Peptidase_C47:  Stapho  48.3      27 0.00058   28.2   3.7   36  178-218   118-154 (175)
 52 COG4990 Uncharacterized protei  40.9      19 0.00041   29.3   1.8   47  136-204   122-168 (195)
 53 COG4990 Uncharacterized protei  29.2 1.7E+02  0.0036   23.9   5.4   37   41-95    122-158 (195)
 54 KOG4128|consensus               20.2      16 0.00034   32.6  -2.0   79   41-119   305-410 (457)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=9e-34  Score=243.08  Aligned_cols=130  Identities=42%  Similarity=0.790  Sum_probs=122.5

Q ss_pred             CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCC
Q psy282            1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETC   79 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c   79 (233)
                      .||+.|++|||++..++  .|..++....+.|++ |..++ ||++|.+.|.++|||+|+|++..+|+|.+||+.+....|
T Consensus       235 gGL~~E~dYPY~g~~~~--~C~~~~~~~~v~I~~-f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~C  311 (372)
T KOG1542|consen  235 GGLEKEKDYPYTGKKGN--QCHFDKSKIVVSIKD-FSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYIC  311 (372)
T ss_pred             CCccccccCCccccCCC--ccccchhhceEEEec-cEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccC
Confidence            48999999999998875  899999999999999 99999 999999999999999999998899999999999854479


Q ss_pred             CCCCCCcEEEEEeeeccC-CccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282           80 SPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH  133 (233)
Q Consensus        80 ~~~~~~H~v~ivG~g~~~-~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~  133 (233)
                      ++..+||+|+|||||... ..||||||||||+.|||+||+|+.|+.|.||++..+
T Consensus       312 s~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mv  366 (372)
T KOG1542|consen  312 SPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMV  366 (372)
T ss_pred             CccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccccch
Confidence            998899999999999998 999999999999999999999999999999998764


No 2  
>KOG1542|consensus
Probab=99.97  E-value=8.7e-32  Score=230.87  Aligned_cols=126  Identities=32%  Similarity=0.620  Sum_probs=108.9

Q ss_pred             EEEEecCCCCCCCCCcceeeeccccccCccccc-----------ccCccceeeeeecccceEEEEeccccccccCcceec
Q psy282          101 YWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH-----------FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRK  169 (233)
Q Consensus       101 ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~-----------~~~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~  169 (233)
                      -|+++.  |..-.|+.|||.....+.|.+....           ..+|++|.+.|.++|||+|+|++..+|+|.+||+.+
T Consensus       229 ~~~~~~--gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P  306 (372)
T KOG1542|consen  229 KYIKKA--GGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGGVSCP  306 (372)
T ss_pred             HHHHHh--CCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhcccccCC
Confidence            355665  5667799999988876789875421           247899999999999999999999999999999987


Q ss_pred             CCCCCCCCCCCEEEEEEEeeecC-CcCEEEEEcCCCCCCCCCcEEEEEccCCCCccCCCC
Q psy282          170 NDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTHDIP  228 (233)
Q Consensus       170 ~~~~~~~~~~~Hav~iVGy~~~~-~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~~~~  228 (233)
                      ....|++..++|||||||||..+ .+||||||||||++|||+||||+.||.|.|.++.++
T Consensus       307 ~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mv  366 (372)
T KOG1542|consen  307 SKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMV  366 (372)
T ss_pred             CcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccccch
Confidence            67789987899999999999988 899999999999999999999999999988665554


No 3  
>PTZ00203 cathepsin L protease; Provisional
Probab=99.96  E-value=9.5e-30  Score=225.42  Aligned_cols=127  Identities=28%  Similarity=0.588  Sum_probs=109.7

Q ss_pred             CCCCCCCCCCCCCCCCccccccCCC-ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCC
Q psy282            2 GLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETC   79 (233)
Q Consensus         2 Gi~~e~~yPY~~~~~~~~~C~~~~~-~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c   79 (233)
                      |+++|++|||++.++..+.|+.... ...+++++ |..++ ++++|+++|+++|||+|+|++++|++|++|||..    |
T Consensus       207 gi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~-~~~i~~~e~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy~~----c  281 (348)
T PTZ00203        207 TVFTEKSYPYVSGNGDVPECSNSSELAPGARIDG-YVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----C  281 (348)
T ss_pred             CCCccccCCCccCCCCCCcCCCCcccccceEecc-eeecCcCHHHHHHHHHhCCCEEEEEEhhhhcCccCceeec----c
Confidence            4789999999987764346874333 24567888 88887 8999999999999999999998999999999963    7


Q ss_pred             CCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282           80 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH  133 (233)
Q Consensus        80 ~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~  133 (233)
                      +...+||+|+|||||.+++.+|||||||||+.|||+||+|+.++.+.|++..++
T Consensus       282 ~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cgi~~~~  335 (348)
T PTZ00203        282 IGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYP  335 (348)
T ss_pred             CCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCcccccceE
Confidence            666689999999999998999999999999999999999999988999987654


No 4  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.96  E-value=2.6e-29  Score=213.27  Aligned_cols=130  Identities=29%  Similarity=0.585  Sum_probs=107.1

Q ss_pred             CCCCCCCCCCCCC-CCCCccccccCC-CceeEEeCCceEEe------CCHHHHHHHHHhcCCEEEEEecC-CCCCcCCce
Q psy282            1 MGLESEKDYPYKN-ANGEKFKCAYDK-SKVKLFTGKDFLHF------NGSETMKKILYKYGPLSVLLNSD-LIHDYNGTP   71 (233)
Q Consensus         1 ~Gi~~e~~yPY~~-~~~~~~~C~~~~-~~~~~~i~~~~~~v------~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi   71 (233)
                      +||++|++|||+. ...   .|+... ...++++.+ |..+      +++++||++|+++|||+++|++. +|++|++||
T Consensus        88 ~Gi~~e~~yPY~~~~~~---~C~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GI  163 (243)
T cd02621          88 FGIVTEDYFPYTADDDR---PCKASPSECRRYYFSD-YNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGV  163 (243)
T ss_pred             cCcCCCceeCCCCCCCC---CCCCCccccccccccc-eeEcccccccCCHHHHHHHHHHcCCEEEEEEecccccccCCeE
Confidence            4899999999998 566   898655 334455555 4443      48999999999999999999995 999999999


Q ss_pred             eeCCC--CCCCC--------CCCCcEEEEEeeeccC--CccEEEEecCCCCCCCCCcceeeeccccccCcccccc
Q psy282           72 IRKND--ETCSP--------YDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLHF  134 (233)
Q Consensus        72 ~~~~~--~~c~~--------~~~~H~v~ivG~g~~~--~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~~  134 (233)
                      |....  ..|+.        ..++|+|+|||||.+.  +.+|||||||||+.|||+||+|++++.+.|++.+.+.
T Consensus       164 y~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~  238 (243)
T cd02621         164 YHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAV  238 (243)
T ss_pred             ECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCcccceE
Confidence            98752  01543        2579999999999886  8999999999999999999999999889999987643


No 5  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.96  E-value=1e-28  Score=204.49  Aligned_cols=126  Identities=40%  Similarity=0.780  Sum_probs=114.0

Q ss_pred             CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCC
Q psy282            1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDE   77 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~   77 (233)
                      +|+++|++|||.....   .|+.......+++.+ |..++  ++++||++|+++|||+++|.+. +|+.|++|||..+  
T Consensus        78 ~Gi~~e~~yPY~~~~~---~C~~~~~~~~~~i~~-~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~--  151 (210)
T cd02248          78 GGLASESDYPYTGKDG---TCKYNSSKVGAKITG-YSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGP--  151 (210)
T ss_pred             CCcCccccCCccCCCC---CccCCCCcccEEEee-EEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCC--
Confidence            5899999999998777   898777678899999 99887  5899999999999999999995 9999999999987  


Q ss_pred             CCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCcccc
Q psy282           78 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFL  132 (233)
Q Consensus        78 ~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~  132 (233)
                      .|....++|+|+|||||.+.+.+|||+|||||+.||++||+|+.++++.|++...
T Consensus       152 ~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~~cgi~~~  206 (210)
T cd02248         152 CCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASY  206 (210)
T ss_pred             CCCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCCccCceee
Confidence            4556678999999999998899999999999999999999999998889999754


No 6  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.96  E-value=8.4e-29  Score=209.34  Aligned_cols=129  Identities=25%  Similarity=0.435  Sum_probs=106.2

Q ss_pred             CCCCCCCCCCCCCCCCCc---------------cccccCCC----ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEe
Q psy282            1 MGLESEKDYPYKNANGEK---------------FKCAYDKS----KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLN   60 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~---------------~~C~~~~~----~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~   60 (233)
                      +|+++|++|||.+.+...               +.|.....    ...+++.. +..+. ++++||++|+++|||+++|+
T Consensus        84 ~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~-~~~~~~~~~~ik~~l~~~GPv~v~i~  162 (236)
T cd02620          84 TGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKS-AYSVPSDETDIMKEIMTNGPVQAAFT  162 (236)
T ss_pred             cCCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecc-eeeeCCHHHHHHHHHHHCCCeEEEEE
Confidence            589999999998765410               12543221    22345666 66665 78999999999999999999


Q ss_pred             c-CCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282           61 S-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH  133 (233)
Q Consensus        61 ~-~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~  133 (233)
                      + ++|+.|++|||...   |+...++|||+|||||.+++.+|||||||||+.|||+||+|+.++.+.|++.+..
T Consensus       163 ~~~~f~~Y~~Giy~~~---~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~  233 (236)
T cd02620         163 VYEDFLYYKSGVYQHT---SGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEV  233 (236)
T ss_pred             echhhhhcCCcEEeec---CCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccce
Confidence            9 59999999999765   6666789999999999999999999999999999999999999988999998754


No 7  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.96  E-value=1.5e-28  Score=208.12  Aligned_cols=125  Identities=24%  Similarity=0.505  Sum_probs=105.5

Q ss_pred             CCCCCCCCCCCCCCCCCcccccc---------------CCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CC
Q psy282            1 MGLESEKDYPYKNANGEKFKCAY---------------DKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LI   64 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~---------------~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f   64 (233)
                      +|+++|++|||++.+.   .|+.               .+....+++++ |..++++++||++|.++|||+|+|++. +|
T Consensus        86 ~Gl~~e~~yPY~~~~~---~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~-~~~~~~~~~i~~~l~~~GPV~v~i~~~~~f  161 (239)
T cd02698          86 HGIPDETCNPYQAKDG---ECNPFNRCGTCNPFGECFAIKNYTLYFVSD-YGSVSGRDKMMAEIYARGPISCGIMATEAL  161 (239)
T ss_pred             cCcCCCCeeCCcCCCC---CCcCCCCCCCcccCcccccccccceEEeee-ceecCCHHHHHHHHHHcCCEEEEEEecccc
Confidence            4899999999998766   5542               11123567888 888888999999999999999999995 89


Q ss_pred             CCcCCceeeCCCCCCCCCCCCcEEEEEeeeccC-CccEEEEecCCCCCCCCCcceeeeccc-----cccCcccc
Q psy282           65 HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIERGN-----NACGKDFL  132 (233)
Q Consensus        65 ~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~-~~~ywivknswg~~wge~gy~~~~~~~-----~~Cgi~~~  132 (233)
                      +.|++|||+..  .| ...++|+|+|||||+++ +.+|||||||||+.|||+||+|+.++.     +.|+|++.
T Consensus       162 ~~Y~~GIy~~~--~~-~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~  232 (239)
T cd02698         162 ENYTGGVYKEY--VQ-DPLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEED  232 (239)
T ss_pred             cccCCeEEccC--CC-CCcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCcccccccccccc
Confidence            99999999876  34 45689999999999886 899999999999999999999999987     77887643


No 8  
>KOG1543|consensus
Probab=99.96  E-value=1.9e-28  Score=215.47  Aligned_cols=126  Identities=32%  Similarity=0.609  Sum_probs=112.8

Q ss_pred             CCCCC-CCCCCCCCCCCCccccccCCCceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCC
Q psy282            1 MGLES-EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDE   77 (233)
Q Consensus         1 ~Gi~~-e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~   77 (233)
                      +|+++ |++|||.+..+   .|+.+.....+.+.+ +..++ +|++|+++|+.+|||+|+|++. +|+.|++|||.++  
T Consensus       189 ~G~~t~~~~Ypy~~~~~---~C~~~~~~~~~~~~~-~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~--  262 (325)
T KOG1543|consen  189 NGGVTECENYPYIGKDG---TCKSNKKDKTVTIKG-FYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEE--  262 (325)
T ss_pred             hCCCCCCcCCCCcCCCC---CccCCCccceeEeee-eeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCC--
Confidence            46677 99999999999   999888777788888 88888 9999999999999999999995 9999999999998  


Q ss_pred             CCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCcccc
Q psy282           78 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFL  132 (233)
Q Consensus        78 ~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~  132 (233)
                      .|....++|+|+|||||++++.+|||++||||+.|||+||+|+.|+.+.|+++..
T Consensus       263 ~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~  317 (325)
T KOG1543|consen  263 KGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIASE  317 (325)
T ss_pred             CCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhhcc
Confidence            3433368999999999996678999999999999999999999999999986553


No 9  
>PTZ00021 falcipain-2; Provisional
Probab=99.95  E-value=2.4e-27  Score=216.67  Aligned_cols=123  Identities=31%  Similarity=0.592  Sum_probs=106.0

Q ss_pred             CCCCCCCCCCCCCC-CCccccccCCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCCCC
Q psy282            2 GLESEKDYPYKNAN-GEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC   79 (233)
Q Consensus         2 Gi~~e~~yPY~~~~-~~~~~C~~~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~~c   79 (233)
                      ||++|++|||++.. +   .|+.......++|.+ |..++ +++|+++|+.+|||+|+|++. +|++|++|||...   |
T Consensus       345 Gl~tE~~YPY~~~~~~---~C~~~~~~~~~~i~~-y~~i~-~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~---C  416 (489)
T PTZ00021        345 GLCSEDDYPYVSDTPE---LCNIDRCKEKYKIKS-YVSIP-EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE---C  416 (489)
T ss_pred             ccCcccccCccCCCCC---ccccccccccceeee-EEEec-HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCC---C
Confidence            89999999999874 5   798665556788999 98886 678999999999999999995 9999999999764   8


Q ss_pred             CCCCCCcEEEEEeeeccCC----------ccEEEEecCCCCCCCCCcceeeeccc----cccCccccc
Q psy282           80 SPYDLGHAVLLVGYGKQDD----------IPYWLVRNSWGPIGPDEGFFKIERGN----NACGKDFLH  133 (233)
Q Consensus        80 ~~~~~~H~v~ivG~g~~~~----------~~ywivknswg~~wge~gy~~~~~~~----~~Cgi~~~~  133 (233)
                      +. .+||||+|||||.+++          .+|||||||||+.|||+||+|+.++.    +.|++...+
T Consensus       417 ~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a  483 (489)
T PTZ00021        417 GE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEA  483 (489)
T ss_pred             CC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccc
Confidence            65 4799999999997642          47999999999999999999999875    589997643


No 10 
>KOG1543|consensus
Probab=99.95  E-value=1.5e-28  Score=216.17  Aligned_cols=119  Identities=28%  Similarity=0.554  Sum_probs=95.3

Q ss_pred             EEEecCCCCCCCCCcceeeeccccccCcccc-----------cccCccceeeeeecccceEEEEec-cccccccCcceec
Q psy282          102 WLVRNSWGPIGPDEGFFKIERGNNACGKDFL-----------HFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRK  169 (233)
Q Consensus       102 wivknswg~~wge~gy~~~~~~~~~Cgi~~~-----------~~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~  169 (233)
                      |+.+|  |..-+++.|++... ++.|.....           ...++.+|++++..+|||+++|++ ..|++|++|||. 
T Consensus       185 yi~~~--G~~t~~~~Ypy~~~-~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~-  260 (325)
T KOG1543|consen  185 YIKKN--GGVTECENYPYIGK-DGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYA-  260 (325)
T ss_pred             HHHHh--CCCCCCcCCCCcCC-CCCccCCCccceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEe-
Confidence            55666  44445888987765 356765441           123588999999999999999999 589999999997 


Q ss_pred             CCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEEEccCCCCccC
Q psy282          170 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTH  225 (233)
Q Consensus       170 ~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~  225 (233)
                       ...|....++|||+|||||+.++.+|||||||||+.|||+|||||.|+.+.+.++
T Consensus       261 -~~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~  315 (325)
T KOG1543|consen  261 -EEKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIA  315 (325)
T ss_pred             -CCCCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhh
Confidence             3344432589999999999966789999999999999999999999999888543


No 11 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.94  E-value=1.8e-26  Score=215.86  Aligned_cols=130  Identities=29%  Similarity=0.635  Sum_probs=103.5

Q ss_pred             CCCCCCCCCCCCCCCCCccccccCCC---------------------------------------ceeEEeCCceEEe--
Q psy282            1 MGLESEKDYPYKNANGEKFKCAYDKS---------------------------------------KVKLFTGKDFLHF--   39 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~---------------------------------------~~~~~i~~~~~~v--   39 (233)
                      +||++|+||||++..+   .|+....                                       ..++++++ |..+  
T Consensus       472 ~GI~tEscYPY~a~~g---~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~y~k~-y~yI~g  547 (693)
T PTZ00049        472 QGIPLDKVFPYTATEQ---TCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPARWYAKD-YNYIGG  547 (693)
T ss_pred             CCCCcCCccCCcCCCC---CCCCCCCCccccccccccccccccccccccccccccccccccccccccceeeee-eEEecc
Confidence            4899999999998877   6754211                                       11234555 5544  


Q ss_pred             -------CCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCC----CCCCCC--------------CCCcEEEEEee
Q psy282           40 -------NGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKND----ETCSPY--------------DLGHAVLLVGY   93 (233)
Q Consensus        40 -------~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~----~~c~~~--------------~~~H~v~ivG~   93 (233)
                             .++++|+++|+++|||+|+|++. +|++|++|||..+.    ..|+..              .+||||+||||
T Consensus       548 ~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGw  627 (693)
T PTZ00049        548 CYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGW  627 (693)
T ss_pred             cccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEe
Confidence                   26899999999999999999995 99999999998531    036422              46999999999


Q ss_pred             ecc--CC--ccEEEEecCCCCCCCCCcceeeeccccccCcccccc
Q psy282           94 GKQ--DD--IPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLHF  134 (233)
Q Consensus        94 g~~--~~--~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~~  134 (233)
                      |.+  ++  .+|||+|||||+.||++||+|+.|+.+.|+|++.+.
T Consensus       628 G~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~  672 (693)
T PTZ00049        628 GEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSL  672 (693)
T ss_pred             ccccCCCcccCEEEEECCCCCCcccCceEEEEcCCCccCCcccee
Confidence            985  45  389999999999999999999999999999987654


No 12 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.94  E-value=1.8e-26  Score=209.84  Aligned_cols=123  Identities=31%  Similarity=0.631  Sum_probs=102.9

Q ss_pred             CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCCCC
Q psy282            1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC   79 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~~c   79 (233)
                      +||++|++|||++..+   .|+.... ..++|.+ |..+++.+.|+++|. +|||+|+|++. +|+.|++|||..+   |
T Consensus       312 ~Gi~~e~~YPY~~~~~---~C~~~~~-~~~~i~~-y~~~~~~~~l~~~l~-~GPV~v~i~~~~~f~~Yk~GIy~~~---C  382 (448)
T PTZ00200        312 KGLSSSSDVPYLAKDG---KCVVSST-KKVYIDS-YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNGE---C  382 (448)
T ss_pred             cCccccccCCCCCCCC---CCcCCCC-CeeEecc-eEecCHHHHHHHHHh-cCCEEEEeecccccccCCCCccccc---c
Confidence            5999999999999988   8986543 4567888 887766566666664 69999999995 9999999999764   8


Q ss_pred             CCCCCCcEEEEEeeec--cCCccEEEEecCCCCCCCCCcceeeecc---ccccCccccc
Q psy282           80 SPYDLGHAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIERG---NNACGKDFLH  133 (233)
Q Consensus        80 ~~~~~~H~v~ivG~g~--~~~~~ywivknswg~~wge~gy~~~~~~---~~~Cgi~~~~  133 (233)
                      +.. +||||+|||||.  +++.+|||||||||+.|||+||+|+.++   .+.|++.+.+
T Consensus       383 ~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~  440 (448)
T PTZ00200        383 GKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVG  440 (448)
T ss_pred             CCC-CcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccc
Confidence            654 899999999996  4678999999999999999999999985   4789997653


No 13 
>PTZ00203 cathepsin L protease; Provisional
Probab=99.93  E-value=5.4e-27  Score=207.84  Aligned_cols=118  Identities=25%  Similarity=0.590  Sum_probs=90.6

Q ss_pred             EEEecCCCCCCCCCcceeeeccc--cccCccc----------c-c-ccCccceeeeeecccceEEEEeccccccccCcce
Q psy282          102 WLVRNSWGPIGPDEGFFKIERGN--NACGKDF----------L-H-FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPI  167 (233)
Q Consensus       102 wivknswg~~wge~gy~~~~~~~--~~Cgi~~----------~-~-~~~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~  167 (233)
                      |++++.-|..-.|+.|||.....  ..|....          + . ..+++.|+++|.+.|||+++|++..|++|++|||
T Consensus       199 yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy  278 (348)
T PTZ00203        199 WVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL  278 (348)
T ss_pred             HHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecCcCHHHHHHHHHhCCCEEEEEEhhhhcCccCcee
Confidence            55544334444577888865431  1464211          0 0 1245678888988999999999988999999998


Q ss_pred             ecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEEEccCCCCc
Q psy282          168 RKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIEHTLRSHL  223 (233)
Q Consensus       168 ~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~  223 (233)
                      .    .|....+||||+|||||++++++|||||||||++|||+|||||+|+.+.|.
T Consensus       279 ~----~c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cg  330 (348)
T PTZ00203        279 T----SCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACL  330 (348)
T ss_pred             e----ccCCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCccc
Confidence            5    386656899999999999889999999999999999999999999987653


No 14 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.93  E-value=1.2e-26  Score=196.91  Aligned_cols=89  Identities=33%  Similarity=0.617  Sum_probs=74.9

Q ss_pred             cCccceeeeeecccceEEEEec-cccccccCcceecCC--CCCCC--------CCCCEEEEEEEeeecC--CcCEEEEEc
Q psy282          135 NGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKND--ETCSP--------YDLGHAVLLVGYGKQD--DIPYWLVRN  201 (233)
Q Consensus       135 ~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~--~~~~~--------~~~~Hav~iVGy~~~~--~~~ywivkN  201 (233)
                      .++++|+++|.++|||+++|++ +.|++|++|||....  ..|+.        ..++|||+|||||++.  +++||||||
T Consensus       131 ~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirN  210 (243)
T cd02621         131 TNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKN  210 (243)
T ss_pred             CCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEc
Confidence            3567899999999999999998 589999999986321  12643        2479999999999876  899999999


Q ss_pred             CCCCCCCCCcEEEEEccCCCCc
Q psy282          202 SWGPIGPDEGFFKIEHTLRSHL  223 (233)
Q Consensus       202 SWg~~WG~~Gy~~i~~~~~~~~  223 (233)
                      |||++|||+|||||+|+.+.+-
T Consensus       211 SWG~~WGe~Gy~~i~~~~~~cg  232 (243)
T cd02621         211 SWGSSWGEKGYFKIRRGTNECG  232 (243)
T ss_pred             CCCCCCCcCCeEEEecCCcccC
Confidence            9999999999999999876553


No 15 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.93  E-value=1e-25  Score=208.13  Aligned_cols=134  Identities=22%  Similarity=0.443  Sum_probs=105.0

Q ss_pred             CCCCCCCCC--CCCCCCCCccccccCCCceeEEeCCc-----eEEeC-CHHHHHHHHHhcCCEEEEEecC-CCCCcCCce
Q psy282            1 MGLESEKDY--PYKNANGEKFKCAYDKSKVKLFTGKD-----FLHFN-GSETMKKILYKYGPLSVLLNSD-LIHDYNGTP   71 (233)
Q Consensus         1 ~Gi~~e~~y--PY~~~~~~~~~C~~~~~~~~~~i~~~-----~~~v~-~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi   71 (233)
                      +||++|++|  ||++.++....|+......+++++++     |..+. ++++|+++|+++|||+|+|++. +|+.|++||
T Consensus       294 ~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGi  373 (548)
T PTZ00364        294 FGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENS  373 (548)
T ss_pred             CCcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCC
Confidence            589999999  99877663335876554444444430     44344 7899999999999999999995 899999999


Q ss_pred             eeCC-----C-CCC---C-------CCCCCcEEEEEeeec-cCCccEEEEecCCCC--CCCCCcceeeeccccccCcccc
Q psy282           72 IRKN-----D-ETC---S-------PYDLGHAVLLVGYGK-QDDIPYWLVRNSWGP--IGPDEGFFKIERGNNACGKDFL  132 (233)
Q Consensus        72 ~~~~-----~-~~c---~-------~~~~~H~v~ivG~g~-~~~~~ywivknswg~--~wge~gy~~~~~~~~~Cgi~~~  132 (233)
                      |...     . +.|   +       ...+||||+|||||. +++.+|||+|||||+  .|||+||+|+.|+.+.|++++.
T Consensus       374 y~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~  453 (548)
T PTZ00364        374 TEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESE  453 (548)
T ss_pred             ccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccce
Confidence            8631     0 011   1       235799999999997 478899999999999  9999999999999999999876


Q ss_pred             cc
Q psy282          133 HF  134 (233)
Q Consensus       133 ~~  134 (233)
                      +.
T Consensus       454 ~v  455 (548)
T PTZ00364        454 VV  455 (548)
T ss_pred             ee
Confidence            53


No 16 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.93  E-value=4.2e-26  Score=189.19  Aligned_cols=127  Identities=33%  Similarity=0.630  Sum_probs=110.1

Q ss_pred             CCCCCCCCCCCCCCC-CCccccccCCCce-eEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC--CCCCcCCceeeC
Q psy282            1 MGLESEKDYPYKNAN-GEKFKCAYDKSKV-KLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRK   74 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~-~~~~~C~~~~~~~-~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~--~f~~y~~Gi~~~   74 (233)
                      +|+++|++|||.... .   .|....... ..++.. |..+.  ++++||++|.++|||++.|.+.  +|+.|++|||..
T Consensus        82 ~Gi~~e~~~pY~~~~~~---~c~~~~~~~~~~~i~~-~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~  157 (219)
T PF00112_consen   82 NGIVTEEDYPYNGNENP---TCKSKKSNSYYVKIKG-YGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDP  157 (219)
T ss_dssp             TSBEBTTTS--SSSSSC---SSCHSGGGEEEBEESE-EEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECS
T ss_pred             Ccccccccccccccccc---cccccccccccccccc-cccccccchhHHHHHHhhCceeeeeeeccccccccccceeeec
Confidence            599999999999876 5   798765544 478888 88887  5999999999999999999994  599999999998


Q ss_pred             CCCCCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeecccc-ccCccccc
Q psy282           75 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNN-ACGKDFLH  133 (233)
Q Consensus        75 ~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~-~Cgi~~~~  133 (233)
                      +  .|....++|+|+|||||.+.+..|||+|||||+.||++||+|+.++.+ .|++++.+
T Consensus       158 ~--~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~~~~  215 (219)
T PF00112_consen  158 P--DCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIESQA  215 (219)
T ss_dssp             T--SSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTTSSE
T ss_pred             c--ccccccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccCcee
Confidence            7  587788999999999999999999999999999999999999999876 99998654


No 17 
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.93  E-value=2.5e-26  Score=194.63  Aligned_cols=81  Identities=28%  Similarity=0.649  Sum_probs=71.8

Q ss_pred             CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCCEEEEEEEeeecC-CcCEEEEEcCCCCCCCCCcEE
Q psy282          136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFF  213 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~-~~~ywivkNSWg~~WG~~Gy~  213 (233)
                      +++.|+++|.++|||+++|++ +.|++|++|||.  ...|. ..++|||+|||||+++ +++|||||||||++|||+|||
T Consensus       137 ~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~--~~~~~-~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~  213 (239)
T cd02698         137 GRDKMMAEIYARGPISCGIMATEALENYTGGVYK--EYVQD-PLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWF  213 (239)
T ss_pred             CHHHHHHHHHHcCCEEEEEEecccccccCCeEEc--cCCCC-CcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceE
Confidence            467799999899999999999 479999999985  33443 4679999999999876 899999999999999999999


Q ss_pred             EEEccC
Q psy282          214 KIEHTL  219 (233)
Q Consensus       214 ~i~~~~  219 (233)
                      ||+|+.
T Consensus       214 ~i~rg~  219 (239)
T cd02698         214 RIVTSS  219 (239)
T ss_pred             EEEccC
Confidence            999998


No 18 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.93  E-value=2.5e-26  Score=194.22  Aligned_cols=84  Identities=26%  Similarity=0.556  Sum_probs=74.1

Q ss_pred             CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEE
Q psy282          136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK  214 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~  214 (233)
                      ++++|+.+|.++|||+++|++ +.|++|++|||.   ..|....++|||+|||||++++++|||||||||+.|||+||||
T Consensus       143 ~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~---~~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~r  219 (236)
T cd02620         143 DETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQ---HTSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFR  219 (236)
T ss_pred             HHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEe---ecCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEE
Confidence            467899999999999999999 589999999985   2355556799999999999889999999999999999999999


Q ss_pred             EEccCCCC
Q psy282          215 IEHTLRSH  222 (233)
Q Consensus       215 i~~~~~~~  222 (233)
                      |+|+.+.+
T Consensus       220 i~~~~~~c  227 (236)
T cd02620         220 ILRGSNEC  227 (236)
T ss_pred             EEccCccc
Confidence            99987544


No 19 
>PTZ00021 falcipain-2; Provisional
Probab=99.93  E-value=4.3e-26  Score=208.42  Aligned_cols=122  Identities=24%  Similarity=0.466  Sum_probs=90.6

Q ss_pred             EEEecCCCCCCCCCcceeeeccccccCccc---------ccccCccceeeeeecccceEEEEec-cccccccCcceecCC
Q psy282          102 WLVRNSWGPIGPDEGFFKIERGNNACGKDF---------LHFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKND  171 (233)
Q Consensus       102 wivknswg~~wge~gy~~~~~~~~~Cgi~~---------~~~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~  171 (233)
                      |++++  +....|+.|||.......|....         +....++.++++|...|||+++|++ +.|++|++|||.   
T Consensus       339 yi~~~--gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~---  413 (489)
T PTZ00021        339 DMIEL--GGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFD---  413 (489)
T ss_pred             hhhhc--cccCcccccCccCCCCCccccccccccceeeeEEEecHHHHHHHHHhcCCeEEEEEeecccccCCCCcCC---
Confidence            55554  34556888998765335664321         1111246788899889999999999 589999999984   


Q ss_pred             CCCCCCCCCEEEEEEEeeecC----------CcCEEEEEcCCCCCCCCCcEEEEEccCCCCccCCCCCCC
Q psy282          172 ETCSPYDLGHAVLLVGYGKQD----------DIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTHDIPGVP  231 (233)
Q Consensus       172 ~~~~~~~~~Hav~iVGy~~~~----------~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~~~~~~~  231 (233)
                      ..|+. .++|||+|||||+++          +.+|||||||||++|||+|||||+|+.+..  .++|||.
T Consensus       414 ~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~--~n~CGI~  480 (489)
T PTZ00021        414 GECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDENGL--MKTCSLG  480 (489)
T ss_pred             CCCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCCCCC--CCCCCCc
Confidence            46875 579999999999763          257999999999999999999999986421  3455553


No 20 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.91  E-value=7.5e-25  Score=181.32  Aligned_cols=84  Identities=38%  Similarity=0.791  Sum_probs=74.4

Q ss_pred             ccceeeeeecccceEEEEecc-ccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEE
Q psy282          137 SETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI  215 (233)
Q Consensus       137 e~~i~~~l~~~GPv~v~i~~~-~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i  215 (233)
                      .++||++|.++|||+++|.+. .|++|++|||..  ..|....++|||+|||||++.+.+|||||||||+.||++|||||
T Consensus       117 ~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~--~~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i  194 (210)
T cd02248         117 EEALKAALANYGPVSVAIDASSSFQFYKGGIYSG--PCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRI  194 (210)
T ss_pred             HHHHHHHHhhcCCEEEEEecCcccccCCCCceeC--CCCCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEE
Confidence            578999999999999999984 899999999863  34545568999999999998889999999999999999999999


Q ss_pred             EccCCCC
Q psy282          216 EHTLRSH  222 (233)
Q Consensus       216 ~~~~~~~  222 (233)
                      +++.+.|
T Consensus       195 ~~~~~~c  201 (210)
T cd02248         195 ARGSNLC  201 (210)
T ss_pred             EcCCCcc
Confidence            9988654


No 21 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.90  E-value=4e-24  Score=194.55  Aligned_cols=103  Identities=25%  Similarity=0.489  Sum_probs=77.7

Q ss_pred             CCCCCcceeeeccccccCccc--------cc-ccCccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCC
Q psy282          111 IGPDEGFFKIERGNNACGKDF--------LH-FNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLG  180 (233)
Q Consensus       111 ~wge~gy~~~~~~~~~Cgi~~--------~~-~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~  180 (233)
                      .-.|+.|||... +..|....        +. ..+.+.+++++ ..|||+++|++ ..|++|++|||.   +.|+. .++
T Consensus       314 i~~e~~YPY~~~-~~~C~~~~~~~~~i~~y~~~~~~~~l~~~l-~~GPV~v~i~~~~~f~~Yk~GIy~---~~C~~-~~n  387 (448)
T PTZ00200        314 LSSSSDVPYLAK-DGKCVVSSTKKVYIDSYLVAKGKDVLNKSL-VISPTVVYIAVSRELLKYKSGVYN---GECGK-SLN  387 (448)
T ss_pred             ccccccCCCCCC-CCCCcCCCCCeeEecceEecCHHHHHHHHH-hcCCEEEEeecccccccCCCCccc---cccCC-CCc
Confidence            345777887654 35564321        11 11234455555 57999999999 489999999985   35876 489


Q ss_pred             EEEEEEEeee--cCCcCEEEEEcCCCCCCCCCcEEEEEccC
Q psy282          181 HAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIEHTL  219 (233)
Q Consensus       181 Hav~iVGy~~--~~~~~ywivkNSWg~~WG~~Gy~~i~~~~  219 (233)
                      |||+|||||.  +++.+|||||||||++|||+|||||+|+.
T Consensus       388 HaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~  428 (448)
T PTZ00200        388 HAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTN  428 (448)
T ss_pred             EEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCC
Confidence            9999999984  46789999999999999999999999974


No 22 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.90  E-value=2e-23  Score=200.65  Aligned_cols=126  Identities=27%  Similarity=0.591  Sum_probs=99.6

Q ss_pred             CCCCCCCCCCCC--CCCCccccccCCC------------------ceeEEeCCceEEeCC----------HHHHHHHHHh
Q psy282            2 GLESEKDYPYKN--ANGEKFKCAYDKS------------------KVKLFTGKDFLHFNG----------SETMKKILYK   51 (233)
Q Consensus         2 Gi~~e~~yPY~~--~~~~~~~C~~~~~------------------~~~~~i~~~~~~v~~----------~~~i~~~l~~   51 (233)
                      ||++|++|||.+  ..+   .|+....                  ...+.+++ |..+.+          +++||++|++
T Consensus       614 gLptESdYPYt~k~~~g---~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kg-Y~~~~s~~~~~n~d~~i~~IK~eI~~  689 (1004)
T PTZ00462        614 FLPADSNYLYNYTKVGE---DCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKA-YRAYESEHFHDKMDAFIKIIKDEIMN  689 (1004)
T ss_pred             CCcccccCCCccCCCCC---CCCCCcccccccccccccccccccccceeeccc-eEEecccccccchhhHHHHHHHHHHh
Confidence            588999999985  444   7874321                  01234456 665541          4689999999


Q ss_pred             cCCEEEEEecCCCCCcC-CceeeCCCCCCCCCCCCcEEEEEeeecc-----CCccEEEEecCCCCCCCCCcceeeec-cc
Q psy282           52 YGPLSVLLNSDLIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ-----DDIPYWLVRNSWGPIGPDEGFFKIER-GN  124 (233)
Q Consensus        52 ~GPv~v~i~~~~f~~y~-~Gi~~~~~~~c~~~~~~H~v~ivG~g~~-----~~~~ywivknswg~~wge~gy~~~~~-~~  124 (233)
                      +|||+|+|++.+|+.|. +|||...  .|+....+|||+|||||.+     .+.+|||||||||+.||++||+|+.| +.
T Consensus       690 kGPVaV~IdAsdf~~Y~~sGIyv~~--~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~  767 (1004)
T PTZ00462        690 KGSVIAYIKAENVLGYEFNGKKVQN--LCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGP  767 (1004)
T ss_pred             cCCEEEEEEeehHHhhhcCCccccC--CCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCC
Confidence            99999999998788884 8998766  5877678999999999974     25799999999999999999999987 57


Q ss_pred             cccCccccc
Q psy282          125 NACGKDFLH  133 (233)
Q Consensus       125 ~~Cgi~~~~  133 (233)
                      +.|++....
T Consensus       768 n~CGin~i~  776 (1004)
T PTZ00462        768 SHCEDNFIH  776 (1004)
T ss_pred             CCCccchhe
Confidence            999987654


No 23 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.89  E-value=6e-23  Score=170.50  Aligned_cols=119  Identities=24%  Similarity=0.282  Sum_probs=100.2

Q ss_pred             CCCCCCCCCCCCCCCCCcccccc----CCCceeEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceee
Q psy282            1 MGLESEKDYPYKNANGEKFKCAY----DKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIR   73 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~~~~~C~~----~~~~~~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~   73 (233)
                      +|+++|++|||.....   .|..    ......+++.+ |..+.  ++++||++|.++|||+++|.+. .|..|++|++.
T Consensus        82 ~Gi~~e~~~Py~~~~~---~~~~~~~~~~~~~~~~~~~-y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~  157 (223)
T cd02619          82 KGIPPEEDYPYGAESD---GEEPKSEAALNAAKVKLKD-YRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIY  157 (223)
T ss_pred             cCCCccccCCCCCCCC---CCCCCCccchhhcceeecc-eeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCcccc
Confidence            5899999999998877   5542    23346688889 88887  6899999999999999999995 99999999973


Q ss_pred             ---CCCCCCCCCCCCcEEEEEeeeccC--CccEEEEecCCCCCCCCCcceeeecc
Q psy282           74 ---KNDETCSPYDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERG  123 (233)
Q Consensus        74 ---~~~~~c~~~~~~H~v~ivG~g~~~--~~~ywivknswg~~wge~gy~~~~~~  123 (233)
                         .....|....++|||+|||||.+.  +.+|||+|||||+.||++||.|+.++
T Consensus       158 ~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~  212 (223)
T cd02619         158 EEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYE  212 (223)
T ss_pred             ccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehh
Confidence               111145667789999999999886  89999999999999999999999885


No 24 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.89  E-value=6.4e-24  Score=196.30  Aligned_cols=89  Identities=21%  Similarity=0.408  Sum_probs=72.2

Q ss_pred             CccceeeeeecccceEEEEec-cccccccCccee-----cCC-CCCC----------CCCCCEEEEEEEeee-cCCcCEE
Q psy282          136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIR-----KND-ETCS----------PYDLGHAVLLVGYGK-QDDIPYW  197 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~-----~~~-~~~~----------~~~~~Hav~iVGy~~-~~~~~yw  197 (233)
                      ++++|+.+|.++|||+++|++ +.|++|++|++.     +.+ ..|.          ...+||||+|||||+ +++++||
T Consensus       342 ~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YW  421 (548)
T PTZ00364        342 DPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYW  421 (548)
T ss_pred             cHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEE
Confidence            456799999999999999999 478899998863     111 1121          135799999999997 5688999


Q ss_pred             EEEcCCCC--CCCCCcEEEEEccCCCCcc
Q psy282          198 LVRNSWGP--IGPDEGFFKIEHTLRSHLT  224 (233)
Q Consensus       198 ivkNSWg~--~WG~~Gy~~i~~~~~~~~~  224 (233)
                      |||||||+  +|||+|||||+|+.+++-+
T Consensus       422 IVKNSWGt~~~WGE~GYfRI~RG~N~CGI  450 (548)
T PTZ00364        422 LVLDPWGSRRSWCDGGTRKIARGVNAYNI  450 (548)
T ss_pred             EEECCCCCCCCcccCCeEEEEcCCCcccc
Confidence            99999999  9999999999999876644


No 25 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.88  E-value=2e-23  Score=195.46  Aligned_cols=89  Identities=29%  Similarity=0.567  Sum_probs=73.3

Q ss_pred             CccceeeeeecccceEEEEecc-ccccccCcceecCC----CCCCCC--------------CCCEEEEEEEeeec--CC-
Q psy282          136 GSETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKND----ETCSPY--------------DLGHAVLLVGYGKQ--DD-  193 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~~-~~~~y~~gi~~~~~----~~~~~~--------------~~~Hav~iVGy~~~--~~-  193 (233)
                      ++..|+.+|..+|||+++|++. .|++|++|||...+    ..|...              .++|||+|||||++  ++ 
T Consensus       556 ~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~  635 (693)
T PTZ00049        556 GEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGK  635 (693)
T ss_pred             CHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccCCCc
Confidence            5678999999999999999994 79999999996321    147432              36999999999975  45 


Q ss_pred             -cCEEEEEcCCCCCCCCCcEEEEEccCCCCcc
Q psy282          194 -IPYWLVRNSWGPIGPDEGFFKIEHTLRSHLT  224 (233)
Q Consensus       194 -~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~  224 (233)
                       .+|||||||||+.|||+|||||+|+.+.+.+
T Consensus       636 ~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGI  667 (693)
T PTZ00049        636 LYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGI  667 (693)
T ss_pred             ccCEEEEECCCCCCcccCceEEEEcCCCccCC
Confidence             3799999999999999999999999876533


No 26 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.88  E-value=2.8e-23  Score=172.18  Aligned_cols=83  Identities=36%  Similarity=0.683  Sum_probs=74.2

Q ss_pred             CccceeeeeecccceEEEEec-c-ccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEE
Q psy282          136 GSETMKKILYKYGPLSVGLNS-H-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFF  213 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~-~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~  213 (233)
                      +.++|+++|.++|||+++|.+ + .|+.|++||+.  ...|.....+|||+|||||++.+++|||||||||+.||++|||
T Consensus       122 ~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~--~~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~  199 (219)
T PF00112_consen  122 DIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYD--PPDCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYF  199 (219)
T ss_dssp             CHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEEC--STSSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEE
T ss_pred             chhHHHHHHhhCceeeeeeeccccccccccceeee--ccccccccccccccccccccccceeeEeeehhhCCccCCCeEE
Confidence            467899999999999999998 4 49999999986  3367766789999999999999999999999999999999999


Q ss_pred             EEEccCC
Q psy282          214 KIEHTLR  220 (233)
Q Consensus       214 ~i~~~~~  220 (233)
                      ||+|+.+
T Consensus       200 ~i~~~~~  206 (219)
T PF00112_consen  200 RISYDYN  206 (219)
T ss_dssp             EEESSSS
T ss_pred             EEeeCCC
Confidence            9999987


No 27 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.86  E-value=1.4e-22  Score=194.93  Aligned_cols=80  Identities=30%  Similarity=0.549  Sum_probs=67.9

Q ss_pred             cceeeeeecccceEEEEeccccccc-cCcceecCCCCCCCCCCCEEEEEEEeeec-----CCcCEEEEEcCCCCCCCCCc
Q psy282          138 ETMKKILYKYGPLSVGLNSHLIHFY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQ-----DDIPYWLVRNSWGPIGPDEG  211 (233)
Q Consensus       138 ~~i~~~l~~~GPv~v~i~~~~~~~y-~~gi~~~~~~~~~~~~~~Hav~iVGy~~~-----~~~~ywivkNSWg~~WG~~G  211 (233)
                      ..|+++|...|||+|+|++..|++| .+|++.  ...|+...++|||+|||||++     ++++|||||||||+.|||+|
T Consensus       681 ~~IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv--~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnG  758 (1004)
T PTZ00462        681 KIIKDEIMNKGSVIAYIKAENVLGYEFNGKKV--QNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEG  758 (1004)
T ss_pred             HHHHHHHHhcCCEEEEEEeehHHhhhcCCccc--cCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCe
Confidence            4688889899999999999667777 488865  346986678999999999974     25789999999999999999


Q ss_pred             EEEEEccC
Q psy282          212 FFKIEHTL  219 (233)
Q Consensus       212 y~~i~~~~  219 (233)
                      ||||+|..
T Consensus       759 YFKI~r~g  766 (1004)
T PTZ00462        759 YFKVDMYG  766 (1004)
T ss_pred             EEEEEeCC
Confidence            99999953


No 28 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.84  E-value=3.8e-21  Score=155.46  Aligned_cols=70  Identities=34%  Similarity=0.719  Sum_probs=58.9

Q ss_pred             EEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeec-CCcCEEEEEcCCCCCCCCCcEEEEEccC-CCC
Q psy282          151 SVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DDIPYWLVRNSWGPIGPDEGFFKIEHTL-RSH  222 (233)
Q Consensus       151 ~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~-~~~~ywivkNSWg~~WG~~Gy~~i~~~~-~~~  222 (233)
                      ++.+.+..|++|++|||.  ...|.....+|+|+|||||.+ ++++|||||||||+.|||+|||||.|+. +.+
T Consensus        93 ~~~~~~~~f~~Y~~Gi~~--~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c  164 (174)
T smart00645       93 SVAIDASDFQFYKSGIYD--HPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNEC  164 (174)
T ss_pred             EEEEEcccccCCcCeEEC--CCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCcc
Confidence            555666669999999885  235765557999999999987 8899999999999999999999999987 443


No 29 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.84  E-value=1.4e-21  Score=162.31  Aligned_cols=86  Identities=27%  Similarity=0.279  Sum_probs=72.7

Q ss_pred             CccceeeeeecccceEEEEec-cccccccCccee---cCCCCCCCCCCCEEEEEEEeeecC--CcCEEEEEcCCCCCCCC
Q psy282          136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIR---KNDETCSPYDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPD  209 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~---~~~~~~~~~~~~Hav~iVGy~~~~--~~~ywivkNSWg~~WG~  209 (233)
                      +++.|+++|.+.|||+++|.+ +.|.+|+++++.   .....|.....+|||+|||||++.  +++|||||||||+.||+
T Consensus       124 ~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~  203 (223)
T cd02619         124 NIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGD  203 (223)
T ss_pred             hHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCcccc
Confidence            357899999999999999998 578889988762   123345556789999999999876  88999999999999999


Q ss_pred             CcEEEEEccCCC
Q psy282          210 EGFFKIEHTLRS  221 (233)
Q Consensus       210 ~Gy~~i~~~~~~  221 (233)
                      +|||||.++...
T Consensus       204 ~Gy~~i~~~~~~  215 (223)
T cd02619         204 NGYGRISYEDVY  215 (223)
T ss_pred             CCEEEEehhhhh
Confidence            999999998643


No 30 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.82  E-value=3.4e-20  Score=149.93  Aligned_cols=74  Identities=38%  Similarity=0.784  Sum_probs=64.8

Q ss_pred             EEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeecc-CCccEEEEecCCCCCCCCCcceeeeccc-cccCccc
Q psy282           56 SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGKDF  131 (233)
Q Consensus        56 ~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~-~~~~ywivknswg~~wge~gy~~~~~~~-~~Cgi~~  131 (233)
                      ++.+.+.+|+.|++|||+.+  .|+....+|+|+|||||.+ ++.+|||++||||+.||++||+|+.++. +.|+++.
T Consensus        93 ~~~~~~~~f~~Y~~Gi~~~~--~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~  168 (174)
T smart00645       93 SVAIDASDFQFYKSGIYDHP--GCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEA  168 (174)
T ss_pred             EEEEEcccccCCcCeEECCC--CCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCcee
Confidence            56666668999999999886  4766668999999999987 8899999999999999999999999986 8899854


No 31 
>KOG1544|consensus
Probab=99.82  E-value=4.1e-20  Score=157.73  Aligned_cols=134  Identities=22%  Similarity=0.441  Sum_probs=103.0

Q ss_pred             CCCCCCCCCCCCCCCC-Ccccc------------------ccCCC--ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEE
Q psy282            1 MGLESEKDYPYKNANG-EKFKC------------------AYDKS--KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVL   58 (233)
Q Consensus         1 ~Gi~~e~~yPY~~~~~-~~~~C------------------~~~~~--~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~   58 (233)
                      .|++...||||.+.+. ..+.|                  .....  +..+++.. =+.|+ +|++|+++|+++|||.+.
T Consensus       291 rGvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~iyq~tP-PYrVSSnE~eImkElM~NGPVQA~  369 (470)
T KOG1544|consen  291 RGVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDIYQVTP-PYRVSSNEKEIMKELMENGPVQAL  369 (470)
T ss_pred             ccccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCceeeecC-CeeccCCHHHHHHHHHhCCChhhh
Confidence            4899999999985433 22234                  32211  13344444 33344 999999999999999999


Q ss_pred             Eec-CCCCCcCCceeeCCCCCC-----CCCCCCcEEEEEeeeccC-----CccEEEEecCCCCCCCCCcceeeecccccc
Q psy282           59 LNS-DLIHDYNGTPIRKNDETC-----SPYDLGHAVLLVGYGKQD-----DIPYWLVRNSWGPIGPDEGFFKIERGNNAC  127 (233)
Q Consensus        59 i~~-~~f~~y~~Gi~~~~~~~c-----~~~~~~H~v~ivG~g~~~-----~~~ywivknswg~~wge~gy~~~~~~~~~C  127 (233)
                      |.+ ++|..|++|||.+.+..-     ....+.|+|.|.|||++.     ..+||+.-||||+.|||+||+++.++.|.|
T Consensus       370 m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNec  449 (470)
T KOG1544|consen  370 MEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNEC  449 (470)
T ss_pred             hhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccch
Confidence            999 599999999998864211     123578999999999874     367999999999999999999999999999


Q ss_pred             Cccccccc
Q psy282          128 GKDFLHFN  135 (233)
Q Consensus       128 gi~~~~~~  135 (233)
                      .|++..+.
T Consensus       450 dIEsfvIg  457 (470)
T KOG1544|consen  450 DIESFVIG  457 (470)
T ss_pred             hhhHhhhh
Confidence            99887543


No 32 
>KOG1544|consensus
Probab=99.78  E-value=7.8e-20  Score=156.01  Aligned_cols=89  Identities=22%  Similarity=0.472  Sum_probs=75.4

Q ss_pred             CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCC-----CCCCEEEEEEEeeecC-----CcCEEEEEcCCC
Q psy282          136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSP-----YDLGHAVLLVGYGKQD-----DIPYWLVRNSWG  204 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~-----~~~~Hav~iVGy~~~~-----~~~ywivkNSWg  204 (233)
                      +|++|++.|+++|||.+.|.+ .+|++|++|||.+.+.+...     ..+.|+|.|.|||++.     -++||+..||||
T Consensus       352 nE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG  431 (470)
T KOG1544|consen  352 NEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWG  431 (470)
T ss_pred             CHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccc
Confidence            577899999999999999999 58999999999865444332     2579999999999863     267999999999


Q ss_pred             CCCCCCcEEEEEccCCCCcc
Q psy282          205 PIGPDEGFFKIEHTLRSHLT  224 (233)
Q Consensus       205 ~~WG~~Gy~~i~~~~~~~~~  224 (233)
                      +.|||+|||||.|+.|++-+
T Consensus       432 ~~WGE~GYFriLRGvNecdI  451 (470)
T KOG1544|consen  432 PAWGERGYFRILRGVNECDI  451 (470)
T ss_pred             cccccCceEEEeccccchhh
Confidence            99999999999999987733


No 33 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.64  E-value=1.5e-16  Score=144.45  Aligned_cols=77  Identities=19%  Similarity=0.272  Sum_probs=63.5

Q ss_pred             eeeecccceEEEEeccccccccCcceecCCC------------------CCCCCCCCEEEEEEEeeecC-Cc-CEEEEEc
Q psy282          142 KILYKYGPLSVGLNSHLIHFYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYGKQD-DI-PYWLVRN  201 (233)
Q Consensus       142 ~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~------------------~~~~~~~~Hav~iVGy~~~~-~~-~ywivkN  201 (233)
                      ++|..++||.++.++..++.|++||+.....                  .|.++..+|||+||||+.++ |. .||+|||
T Consensus       303 ~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g~p~yw~VkN  382 (437)
T cd00585         303 AQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDEDGKPVKWKVEN  382 (437)
T ss_pred             HHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCCCCcceEEEEc
Confidence            6777889999999998788999999842110                  14445679999999999754 66 5999999


Q ss_pred             CCCCCCCCCcEEEEEcc
Q psy282          202 SWGPIGPDEGFFKIEHT  218 (233)
Q Consensus       202 SWg~~WG~~Gy~~i~~~  218 (233)
                      |||+.||++|||+|+++
T Consensus       383 SWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         383 SWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             ccCCCCCCCcceehhHH
Confidence            99999999999999985


No 34 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.48  E-value=2.8e-14  Score=124.40  Aligned_cols=78  Identities=26%  Similarity=0.359  Sum_probs=50.2

Q ss_pred             eeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeec----------CCcCEEEEEcCCCCCCCC
Q psy282          140 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ----------DDIPYWLVRNSWGPIGPD  209 (233)
Q Consensus       140 i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~----------~~~~ywivkNSWg~~WG~  209 (233)
                      +++++...|-++..|..+...++.-.+-.+  ..-+....+|||+||||++.          .+...||||||||+.||+
T Consensus       227 i~~~~~~yg~~s~~~~id~~~~~~~~~~~~--~~~s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~  304 (372)
T COG4870         227 IKAMFGFYGAVSSSMYIDATNSLGICIPYP--YVDSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGE  304 (372)
T ss_pred             hHHHHhhhccccceeEEecccccccccCCC--CCCccccccceEEEEeccccccccccccCCCCCceEEEECcccccccc
Confidence            555555666665444442222222111111  11111467999999999974          235599999999999999


Q ss_pred             CcEEEEEccC
Q psy282          210 EGFFKIEHTL  219 (233)
Q Consensus       210 ~Gy~~i~~~~  219 (233)
                      +|||||...-
T Consensus       305 ~GYfwisY~y  314 (372)
T COG4870         305 NGYFWISYYY  314 (372)
T ss_pred             CceEEEEeee
Confidence            9999999854


No 35 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.40  E-value=8.8e-13  Score=119.92  Aligned_cols=90  Identities=19%  Similarity=0.268  Sum_probs=70.4

Q ss_pred             eCCceEEeCCHHHHH----HHHHhcCCEEEEEecCCCCCcCCceeeCCCC------------------CCCCCCCCcEEE
Q psy282           32 TGKDFLHFNGSETMK----KILYKYGPLSVLLNSDLIHDYNGTPIRKNDE------------------TCSPYDLGHAVL   89 (233)
Q Consensus        32 i~~~~~~v~~~~~i~----~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~------------------~c~~~~~~H~v~   89 (233)
                      ... |..++ .+.|+    ++|...+||.++.++..|+.|++||+.....                  .|.....+|||+
T Consensus       286 ~~~-y~Nvp-~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~  363 (437)
T cd00585         286 PIL-YLNVP-MDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMV  363 (437)
T ss_pred             cce-EEecC-HHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEE
Confidence            345 66665 44444    6778888999999998888999999954310                  144456799999


Q ss_pred             EEeeeccC-Cc-cEEEEecCCCCCCCCCcceeeecc
Q psy282           90 LVGYGKQD-DI-PYWLVRNSWGPIGPDEGFFKIERG  123 (233)
Q Consensus        90 ivG~g~~~-~~-~ywivknswg~~wge~gy~~~~~~  123 (233)
                      |||++.+. +. .||+|+||||..||++||+++..+
T Consensus       364 ivGv~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         364 LTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             EEEEEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence            99999854 76 599999999999999999988653


No 36 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.11  E-value=2.2e-10  Score=100.28  Aligned_cols=80  Identities=25%  Similarity=0.316  Sum_probs=57.7

Q ss_pred             CHHHHHHHHHhcCCEEEEEec--CCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccC----------CccEEEEecCC
Q psy282           41 GSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----------DIPYWLVRNSW  108 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~--~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~----------~~~ywivknsw  108 (233)
                      +...|++++...|-++..|..  ..+....-+.+...    +....+|||+||||+..-          +...|||||||
T Consensus       223 dnG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~----s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSW  298 (372)
T COG4870         223 DNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVD----SGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSW  298 (372)
T ss_pred             cccchHHHHhhhccccceeEEecccccccccCCCCCC----ccccccceEEEEeccccccccccccCCCCCceEEEECcc
Confidence            555688888888877755554  34444344444443    236789999999998642          23479999999


Q ss_pred             CCCCCCCcceeeeccc
Q psy282          109 GPIGPDEGFFKIERGN  124 (233)
Q Consensus       109 g~~wge~gy~~~~~~~  124 (233)
                      |+.||++||+++...+
T Consensus       299 Gt~wG~~GYfwisY~y  314 (372)
T COG4870         299 GTNWGENGYFWISYYY  314 (372)
T ss_pred             ccccccCceEEEEeee
Confidence            9999999999987653


No 37 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.78  E-value=4.8e-09  Score=95.68  Aligned_cols=81  Identities=21%  Similarity=0.315  Sum_probs=55.8

Q ss_pred             cceeeeeecccceEEEEeccccccccCcceecCCC----------C--------CCCCCCCEEEEEEEeee-cCCc-CEE
Q psy282          138 ETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDE----------T--------CSPYDLGHAVLLVGYGK-QDDI-PYW  197 (233)
Q Consensus       138 ~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~----------~--------~~~~~~~Hav~iVGy~~-~~~~-~yw  197 (233)
                      ..+.++|..+-||-.|-++..+...+.|+....-.          .        ...+..+|||+|||.+. ++|. .+|
T Consensus       300 ~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~p~~w  379 (438)
T PF03051_consen  300 DAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDEDGKPVRW  379 (438)
T ss_dssp             HHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TTSSEEEE
T ss_pred             HHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccCCCeeEE
Confidence            35666777788999999997555666776521100          0        01235699999999997 5666 499


Q ss_pred             EEEcCCCCCCCCCcEEEEEcc
Q psy282          198 LVRNSWGPIGPDEGFFKIEHT  218 (233)
Q Consensus       198 ivkNSWg~~WG~~Gy~~i~~~  218 (233)
                      +|+||||+..|.+|||.|+..
T Consensus       380 kVeNSWG~~~g~kGy~~msd~  400 (438)
T PF03051_consen  380 KVENSWGTDNGDKGYFYMSDD  400 (438)
T ss_dssp             EEE-SBTTTSTBTTEEEEEHH
T ss_pred             EEEcCCCCCCCCCcEEEECHH
Confidence            999999999999999999863


No 38 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.03  E-value=2.2e-05  Score=71.89  Aligned_cols=87  Identities=22%  Similarity=0.361  Sum_probs=58.6

Q ss_pred             eEEeC-C--HHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCC------------------CCCCCCCCcEEEEEeee
Q psy282           36 FLHFN-G--SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYG   94 (233)
Q Consensus        36 ~~~v~-~--~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~------------------~c~~~~~~H~v~ivG~g   94 (233)
                      |.+++ +  .+.+.++|...-||..+-++..+...+.||.....-                  .-.....+||++|+|..
T Consensus       290 ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~  369 (438)
T PF03051_consen  290 YLNVPIDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVD  369 (438)
T ss_dssp             EEE--HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEE
T ss_pred             EeccCHHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEE
Confidence            56666 1  245566777767999999997666778888854310                  00113569999999999


Q ss_pred             c-cCCcc-EEEEecCCCCCCCCCcceeeec
Q psy282           95 K-QDDIP-YWLVRNSWGPIGPDEGFFKIER  122 (233)
Q Consensus        95 ~-~~~~~-ywivknswg~~wge~gy~~~~~  122 (233)
                      . +++.| +|.|+||||...|.+||+.+..
T Consensus       370 ~D~~g~p~~wkVeNSWG~~~g~kGy~~msd  399 (438)
T PF03051_consen  370 LDEDGKPVRWKVENSWGTDNGDKGYFYMSD  399 (438)
T ss_dssp             E-TTSSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred             eccCCCeeEEEEEcCCCCCCCCCcEEEECH
Confidence            7 56765 8999999999999999987653


No 39 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=97.16  E-value=9.4e-05  Score=64.66  Aligned_cols=74  Identities=18%  Similarity=0.225  Sum_probs=49.5

Q ss_pred             eecccceEEEEeccccccccCcceecCCC------------------CCCCCCCCEEEEEEEeee-cCCc-CEEEEEcCC
Q psy282          144 LYKYGPLSVGLNSHLIHFYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYGK-QDDI-PYWLVRNSW  203 (233)
Q Consensus       144 l~~~GPv~v~i~~~~~~~y~~gi~~~~~~------------------~~~~~~~~Hav~iVGy~~-~~~~-~ywivkNSW  203 (233)
                      +...-+|=+|-++..++-.+.||+.-...                  ..+.+.+.|||+|.|.+. ++|. --|.|.|||
T Consensus       308 ~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~p~rwkVENSW  387 (444)
T COG3579         308 MQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGNPLRWKVENSW  387 (444)
T ss_pred             HhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCCceeeEeeccc
Confidence            33455676776665555555555421100                  011234689999999985 4443 389999999


Q ss_pred             CCCCCCCcEEEEEc
Q psy282          204 GPIGPDEGFFKIEH  217 (233)
Q Consensus       204 g~~WG~~Gy~~i~~  217 (233)
                      |..=|.+|||-++.
T Consensus       388 G~d~G~~GyfvaSd  401 (444)
T COG3579         388 GKDVGKKGYFVASD  401 (444)
T ss_pred             ccccCCCceEeehH
Confidence            99999999998865


No 40 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=93.84  E-value=0.05  Score=48.01  Aligned_cols=80  Identities=21%  Similarity=0.323  Sum_probs=54.7

Q ss_pred             CHHHHHHH----HHhcCCEEEEEecCCCCCcCCceeeCCC-------------C-----CCCCCCCCcEEEEEeeecc-C
Q psy282           41 GSETMKKI----LYKYGPLSVLLNSDLIHDYNGTPIRKND-------------E-----TCSPYDLGHAVLLVGYGKQ-D   97 (233)
Q Consensus        41 ~~~~i~~~----l~~~GPv~v~i~~~~f~~y~~Gi~~~~~-------------~-----~c~~~~~~H~v~ivG~g~~-~   97 (233)
                      +.+.+|++    +...-||-.+-++..|+.-+.||.....             +     +.+.....||++|.|...+ +
T Consensus       296 ~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~  375 (444)
T COG3579         296 DMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDET  375 (444)
T ss_pred             cHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccC
Confidence            44445443    3444588888777788888888764321             0     1122345799999999864 4


Q ss_pred             Ccc-EEEEecCCCCCCCCCcceee
Q psy282           98 DIP-YWLVRNSWGPIGPDEGFFKI  120 (233)
Q Consensus        98 ~~~-ywivknswg~~wge~gy~~~  120 (233)
                      +.| -|-|+||||..-|.+||+-.
T Consensus       376 g~p~rwkVENSWG~d~G~~Gyfva  399 (444)
T COG3579         376 GNPLRWKVENSWGKDVGKKGYFVA  399 (444)
T ss_pred             CCceeeEeecccccccCCCceEee
Confidence            444 59999999999999999754


No 41 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=89.70  E-value=0.26  Score=36.98  Aligned_cols=56  Identities=27%  Similarity=0.417  Sum_probs=32.1

Q ss_pred             ccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCC
Q psy282          137 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW  203 (233)
Q Consensus       137 e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSW  203 (233)
                      ...+++.|.+..||.+.+....-.. .+..+.       .....|.|+|+||+++.   +++|..+|
T Consensus        89 ~~~i~~~i~~G~Pvi~~~~~~~~~~-~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~  144 (144)
T PF13529_consen   89 FDDIKQEIDAGRPVIVSVNSGWRPP-NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW  144 (144)
T ss_dssp             HHHHHHHHHTT--EEEEEETTSS---TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred             HHHHHHHHHCCCcEEEEEEcccccC-CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence            4678888988889999987421111 111111       12468999999999643   78887776


No 42 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=88.23  E-value=1.9  Score=32.10  Aligned_cols=55  Identities=27%  Similarity=0.439  Sum_probs=32.1

Q ss_pred             CHHHHHHHHHhcCCEEEEEecC--CCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEecCC
Q psy282           41 GSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW  108 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~--~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknsw  108 (233)
                      +.+.|+++|....||.+.+...  ..   ....+.       .....|.|+|+||..+.   ++++...|
T Consensus        88 ~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~  144 (144)
T PF13529_consen   88 SFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW  144 (144)
T ss_dssp             -HHHHHHHHHTT--EEEEEETTSS-----TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred             cHHHHHHHHHCCCcEEEEEEcccccC---CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence            7899999999988999999742  12   111122       23468999999997543   55554443


No 43 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=78.11  E-value=8.8  Score=31.96  Aligned_cols=64  Identities=25%  Similarity=0.331  Sum_probs=39.4

Q ss_pred             CHHHHHHHHHhcCCEEEEEecCCCC--CcCCceee--CCCCCC-CCCCCCcEEEEEeeeccCCccEEEEec
Q psy282           41 GSETMKKILYKYGPLSVLLNSDLIH--DYNGTPIR--KNDETC-SPYDLGHAVLLVGYGKQDDIPYWLVRN  106 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~~f~--~y~~Gi~~--~~~~~c-~~~~~~H~v~ivG~g~~~~~~ywivkn  106 (233)
                      +.++|...|...||+.+-+++..+.  .-+.-...  .+.+-| +.....|-|+|+||....+.  .+.+|
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~~~--~~yrd  180 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAATKE--FEYRD  180 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCCCe--EEEeC
Confidence            6899999999989888888875333  11222221  111111 23567999999999754433  44444


No 44 
>KOG4128|consensus
Probab=72.89  E-value=0.35  Score=42.74  Aligned_cols=40  Identities=30%  Similarity=0.386  Sum_probs=32.4

Q ss_pred             CCCEEEEEEEeee----cCCcCEEEEEcCCCCCCCCCcEEEEEc
Q psy282          178 DLGHAVLLVGYGK----QDDIPYWLVRNSWGPIGPDEGFFKIEH  217 (233)
Q Consensus       178 ~~~Hav~iVGy~~----~~~~~ywivkNSWg~~WG~~Gy~~i~~  217 (233)
                      ...||+++.+-+.    +++-.-|-|.||||+.-|.+|+..|..
T Consensus       370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt~  413 (457)
T KOG4128|consen  370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMTA  413 (457)
T ss_pred             HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhhH
Confidence            4689999999883    233457999999999999999977754


No 45 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=71.66  E-value=12  Score=32.53  Aligned_cols=57  Identities=16%  Similarity=0.280  Sum_probs=35.9

Q ss_pred             CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEec
Q psy282           41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRN  106 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivkn  106 (233)
                      -.+.|+++|....||.+.++.-.+ .|....+       ......|.++|+||..++ ..+.++..
T Consensus        77 ~~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~~~~~H~i~v~G~d~~~-~~~~v~D~  133 (317)
T PF14399_consen   77 AWEELKEALDAGRPVIVWVDMYYL-PYRPNYY-------KKHHADHYIVVYGYDEEE-DVFYVSDP  133 (317)
T ss_pred             HHHHHHHHHhCCCceEEEeccccC-CCCcccc-------ccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence            346888888886699999776322 2222212       123458999999997653 34555554


No 46 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=66.92  E-value=4.7  Score=35.01  Aligned_cols=56  Identities=14%  Similarity=0.254  Sum_probs=35.2

Q ss_pred             ccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEc
Q psy282          137 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRN  201 (233)
Q Consensus       137 e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkN  201 (233)
                      .+.+++.|.++.||.+.++.-.+.++..  +      .......|.|+|+||++++ ..+.++-.
T Consensus        78 ~~~l~~~l~~g~pv~~~~D~~~lpy~~~--~------~~~~~~~H~i~v~G~d~~~-~~~~v~D~  133 (317)
T PF14399_consen   78 WEELKEALDAGRPVIVWVDMYYLPYRPN--Y------YKKHHADHYIVVYGYDEEE-DVFYVSDP  133 (317)
T ss_pred             HHHHHHHHhCCCceEEEeccccCCCCcc--c------cccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence            3568888877779999977633322221  1      1222458999999999652 34556544


No 47 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=62.40  E-value=13  Score=31.07  Aligned_cols=64  Identities=23%  Similarity=0.376  Sum_probs=37.5

Q ss_pred             CccceeeeeecccceEEEEeccccc--cccCcceecCCCCC--C-CCCCCEEEEEEEeeecCCcCEEEEEc
Q psy282          136 GSETMKKILYKYGPLSVGLNSHLIH--FYNGTPIRKNDETC--S-PYDLGHAVLLVGYGKQDDIPYWLVRN  201 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~~~~~--~y~~gi~~~~~~~~--~-~~~~~Hav~iVGy~~~~~~~ywivkN  201 (233)
                      +.+++.+.|..++|+.+-+++..+.  .=+..........|  . +.-.+|-|+|+||+...  +-+++||
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~--~~~~yrd  180 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAAT--KEFEYRD  180 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCC--CeEEEeC
Confidence            4567888888889988888875433  11221111111122  1 22358999999999653  2355554


No 48 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=61.30  E-value=25  Score=28.32  Aligned_cols=54  Identities=22%  Similarity=0.482  Sum_probs=34.4

Q ss_pred             CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeec-cCCccEEEEecCC
Q psy282           41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK-QDDIPYWLVRNSW  108 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~-~~~~~ywivknsw  108 (233)
                      +-+++++.+.++.|+.+..+.          . ..   -+.....||++||||-. .++.+++++=|-|
T Consensus        90 s~~eV~~~~~~nk~i~i~~~~----------v-~~---~~~~~~gHAlavvGya~~~~g~~~y~~WNPW  144 (175)
T PF05543_consen   90 SFDEVKKLIDNNKGIAILADR----------V-EQ---TNGPHAGHALAVVGYAKPNNGQKTYYFWNPW  144 (175)
T ss_dssp             -HHHHHHHHHTT-EEEEEEEE----------T-TS---CTTB--EEEEEEEEEEEETTSEEEEEEE-TT
T ss_pred             CHHHHHHHHHcCCCeEEEecc----------c-cc---CCCCccceeEEEEeeeecCCCCeEEEEeCCc
Confidence            788889988888888776643          1 11   12334689999999976 4557777776755


No 49 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=59.81  E-value=27  Score=27.79  Aligned_cols=39  Identities=26%  Similarity=0.428  Sum_probs=29.1

Q ss_pred             CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeecc
Q psy282           41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ   96 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~   96 (233)
                      +.+.+++.|.++||+.++.++             +    +.....|+++|.|-..+
T Consensus        97 t~e~~~~LL~~yGPLwv~~~~-------------P----~~~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen   97 TAEGLANLLREYGPLWVAWEA-------------P----GDSWVAHASVITGIDGD  135 (166)
T ss_pred             CHHHHHHHHHHcCCeEEEecC-------------C----CCcceeeEEEEEeecCC
Confidence            678999999999999998643             2    23445688888876543


No 50 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=49.93  E-value=26  Score=30.69  Aligned_cols=27  Identities=26%  Similarity=0.503  Sum_probs=23.6

Q ss_pred             CCEEEEEEEeeecC--CcCEEEEEcCCCC
Q psy282          179 LGHAVLLVGYGKQD--DIPYWLVRNSWGP  205 (233)
Q Consensus       179 ~~Hav~iVGy~~~~--~~~ywivkNSWg~  205 (233)
                      .+||=.|++.-+.+  +...-.+||-||.
T Consensus       235 ~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~  263 (315)
T cd00044         235 KGHAYSVLDVREVQEEGLRLLRLRNPWGV  263 (315)
T ss_pred             cCcceEEeEEEEEccCceEEEEecCCccC
Confidence            48999999998766  7889999999995


No 51 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=48.28  E-value=27  Score=28.19  Aligned_cols=36  Identities=25%  Similarity=0.471  Sum_probs=24.0

Q ss_pred             CCCEEEEEEEeee-cCCcCEEEEEcCCCCCCCCCcEEEEEcc
Q psy282          178 DLGHAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKIEHT  218 (233)
Q Consensus       178 ~~~Hav~iVGy~~-~~~~~ywivkNSWg~~WG~~Gy~~i~~~  218 (233)
                      ..+||+++|||-. .++.++.++=|=|     +++++-+...
T Consensus       118 ~~gHAlavvGya~~~~g~~~y~~WNPW-----~~~~~~~sa~  154 (175)
T PF05543_consen  118 HAGHALAVVGYAKPNNGQKTYYFWNPW-----WNDVMIQSAK  154 (175)
T ss_dssp             --EEEEEEEEEEEETTSEEEEEEE-TT------SS-EEEETT
T ss_pred             ccceeEEEEeeeecCCCCeEEEEeCCc-----cCCcEEEecC
Confidence            5689999999987 5568899998877     4455555544


No 52 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.93  E-value=19  Score=29.27  Aligned_cols=47  Identities=26%  Similarity=0.366  Sum_probs=32.3

Q ss_pred             CccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCC
Q psy282          136 GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWG  204 (233)
Q Consensus       136 ~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg  204 (233)
                      +..+|+..|.+..||.+=...  |+-                .--|+|+|+||++.    +...-+.||
T Consensus       122 sl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyDk~----n~yynDpyG  168 (195)
T COG4990         122 SLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYDKY----NIYYNDPYG  168 (195)
T ss_pred             cHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeeccccc----ceEeccccc
Confidence            456788889888898754332  221                12599999999976    566666664


No 53 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.21  E-value=1.7e+02  Score=23.94  Aligned_cols=37  Identities=30%  Similarity=0.456  Sum_probs=28.1

Q ss_pred             CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeec
Q psy282           41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK   95 (233)
Q Consensus        41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~   95 (233)
                      +..+|+..|.+..||.+-...  |..                ..-|+++|.||+.
T Consensus       122 sl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyDk  158 (195)
T COG4990         122 SLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYDK  158 (195)
T ss_pred             cHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeecccc
Confidence            889999999998898875543  322                2369999999953


No 54 
>KOG4128|consensus
Probab=20.25  E-value=16  Score=32.64  Aligned_cols=79  Identities=19%  Similarity=0.181  Sum_probs=49.0

Q ss_pred             CHHHHHHHHHh----cCCEEEEEecCCCCCcCCceeeCCC--------------C-----CCCCCCCCcEEEEEeeecc-
Q psy282           41 GSETMKKILYK----YGPLSVLLNSDLIHDYNGTPIRKND--------------E-----TCSPYDLGHAVLLVGYGKQ-   96 (233)
Q Consensus        41 ~~~~i~~~l~~----~GPv~v~i~~~~f~~y~~Gi~~~~~--------------~-----~c~~~~~~H~v~ivG~g~~-   96 (233)
                      +.+.|++.+.+    ..||-.+-+...|..-++|+.+-.-              .     +.......||+++.+-|.. 
T Consensus       305 ~~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~kd  384 (457)
T KOG4128|consen  305 SMDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLKD  384 (457)
T ss_pred             CHHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhccccC
Confidence            56677766654    3466666555455555666542210              0     0111245799998888742 


Q ss_pred             ---CCccEEEEecCCCCCCCCCccee
Q psy282           97 ---DDIPYWLVRNSWGPIGPDEGFFK  119 (233)
Q Consensus        97 ---~~~~ywivknswg~~wge~gy~~  119 (233)
                         .+-.-|-|+||||..-|.+|+..
T Consensus       385 ~~~g~~~~~rVenswgkd~gkkg~~~  410 (457)
T KOG4128|consen  385 PATGGLNEHRVENSWGKDLGKKGVNK  410 (457)
T ss_pred             cccCCchhhhhhchhhhhccccchhh
Confidence               23446999999999999999843


Done!