Query         psy15346
Match_columns 280
No_of_seqs    200 out of 1326
Neff          6.1 
Searched_HMMs 46136
Date          Fri Aug 16 18:34:33 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15346.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15346hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd02620 Peptidase_C1A_Cathepsi 100.0   3E-38 6.6E-43  284.9  17.4  168    1-240    68-235 (236)
  2 cd02698 Peptidase_C1A_Cathepsi 100.0 1.2E-37 2.6E-42  281.4  17.6  162    1-244    70-239 (239)
  3 cd02621 Peptidase_C1A_Cathepsi 100.0 2.3E-37 4.9E-42  279.8  15.8  165    1-243    72-242 (243)
  4 PTZ00049 cathepsin C-like prot 100.0 1.5E-36 3.2E-41  305.9  18.0  189    1-249   456-682 (693)
  5 KOG1543|consensus              100.0 1.3E-35 2.7E-40  280.1  15.3  151    1-244   173-324 (325)
  6 PTZ00364 dipeptidyl-peptidase  100.0 2.9E-35 6.2E-40  292.6  17.3  189    1-258   278-474 (548)
  7 cd02248 Peptidase_C1A Peptidas 100.0   8E-35 1.7E-39  254.9  16.1  147    1-240    62-209 (210)
  8 KOG1542|consensus              100.0 5.4E-35 1.2E-39  272.4  12.7  148    1-242   218-370 (372)
  9 PTZ00203 cathepsin L protease; 100.0 2.8E-34   6E-39  273.2  17.5  151    1-243   187-340 (348)
 10 PF00112 Peptidase_C1:  Papain  100.0   4E-33 8.8E-38  243.6  13.6  150    1-242    65-219 (219)
 11 PTZ00200 cysteine proteinase;  100.0 5.1E-33 1.1E-37  271.9  15.5  145    1-243   296-445 (448)
 12 PTZ00021 falcipain-2; Provisio 100.0 1.9E-32 4.1E-37  269.5  15.3  147    1-243   327-488 (489)
 13 KOG1544|consensus              100.0 4.5E-33 9.8E-38  257.7   8.6  179    1-245   275-462 (470)
 14 PTZ00462 Serine-repeat antigen  99.9 6.3E-27 1.4E-31  242.5  16.8  103   91-251   680-789 (1004)
 15 cd02619 Peptidase_C1 C1 Peptid  99.9 9.2E-27   2E-31  203.6  14.9  141    1-229    65-213 (223)
 16 smart00645 Pept_C1 Papain fami  99.9 1.5E-26 3.2E-31  200.0  11.2   52  188-239   118-171 (174)
 17 cd00585 Peptidase_C1B Peptidas  99.5 9.2E-14   2E-18  136.1   8.8   41  188-228   357-399 (437)
 18 COG4870 Cysteine protease [Pos  99.4 6.9E-13 1.5E-17  125.7   7.1   42  188-229   263-314 (372)
 19 PF03051 Peptidase_C1_2:  Pepti  98.3 3.4E-06 7.4E-11   83.2   9.9   41  188-228   358-400 (438)
 20 PTZ00203 cathepsin L protease;  95.7  0.0051 1.1E-07   59.2   2.0   29  157-185   288-316 (348)
 21 cd02698 Peptidase_C1A_Cathepsi  95.7   0.005 1.1E-07   55.7   1.7   28  157-184   180-208 (239)
 22 cd02621 Peptidase_C1A_Cathepsi  95.5  0.0081 1.8E-07   54.3   2.1   30  156-185   187-218 (243)
 23 KOG1543|consensus               95.4  0.0083 1.8E-07   57.2   2.2   29  157-185   271-299 (325)
 24 smart00645 Pept_C1 Papain fami  95.3  0.0085 1.9E-07   51.7   1.5   45  140-184   104-149 (174)
 25 cd02620 Peptidase_C1A_Cathepsi  94.9   0.015 3.2E-07   52.6   2.0   28  157-184   186-213 (236)
 26 COG3579 PepC Aminopeptidase C   94.8  0.0084 1.8E-07   57.4   0.2   40  188-227   360-401 (444)
 27 KOG1542|consensus               94.5   0.023   5E-07   54.5   2.4   29  157-185   318-347 (372)
 28 cd02248 Peptidase_C1A Peptidas  94.4   0.018   4E-07   50.0   1.5   28  157-184   160-187 (210)
 29 PTZ00200 cysteine proteinase;   93.6   0.041 8.9E-07   54.8   2.3   28  157-184   388-417 (448)
 30 PTZ00364 dipeptidyl-peptidase   93.4   0.043 9.2E-07   55.9   2.0   31  156-186   403-436 (548)
 31 KOG1544|consensus               92.9   0.068 1.5E-06   51.1   2.3   84   78-186   346-437 (470)
 32 PTZ00049 cathepsin C-like prot  92.5   0.073 1.6E-06   55.4   2.2   30  156-185   619-652 (693)
 33 PF13529 Peptidase_C39_2:  Pept  92.3    0.49 1.1E-05   37.5   6.4   24   87-110    85-108 (144)
 34 PTZ00021 falcipain-2; Provisio  92.2   0.084 1.8E-06   53.1   2.2   28  157-184   422-459 (489)
 35 PF00112 Peptidase_C1:  Papain   92.0   0.056 1.2E-06   46.8   0.7   29  157-185   167-195 (219)
 36 cd02619 Peptidase_C1 C1 Peptid  91.7    0.13 2.7E-06   44.6   2.5   29  157-185   173-203 (223)
 37 PTZ00462 Serine-repeat antigen  90.5    0.15 3.3E-06   55.0   2.2   31  157-187   723-758 (1004)
 38 PF05543 Peptidase_C47:  Stapho  66.4      12 0.00026   32.9   4.9   26  188-213   118-144 (175)
 39 KOG4128|consensus               64.4    0.83 1.8E-05   44.0  -2.8   54  188-245   370-427 (457)
 40 PF14399 Transpep_BrtH:  NlpC/p  64.1      16 0.00035   33.6   5.7   23   89-111    76-98  (317)
 41 cd00585 Peptidase_C1B Peptidas  57.0     7.1 0.00015   39.0   2.1   73   84-182   289-387 (437)
 42 COG4990 Uncharacterized protei  52.6      32  0.0007   30.5   5.2   21  190-214   148-168 (195)
 43 PF09778 Guanylate_cyc_2:  Guan  44.0      87  0.0019   28.4   6.8   21   90-110   112-132 (212)
 44 cd00044 CysPc Calpains, domain  37.8      43 0.00093   31.4   4.0   28  188-215   234-263 (315)
 45 PF12385 Peptidase_C70:  Papain  34.7      71  0.0015   27.8   4.5   23   90-112    97-119 (166)
 46 cd02549 Peptidase_C39A A sub-f  33.1 1.4E+02  0.0029   23.6   5.8   22  189-213    93-114 (141)
 47 PF01357 Pollen_allerg_1:  Poll  31.0      78  0.0017   23.9   3.8   43  170-218    10-52  (82)

No 1  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=3e-38  Score=284.89  Aligned_cols=168  Identities=39%  Similarity=0.788  Sum_probs=130.8

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++..||+||+++|+++|.+|       ||+...|..+...  ...|..    ++.|...|.. .....+....++
T Consensus        68 gC~GG~~~~a~~~i~~~G~~~e~~y-------PY~~~~~~~~~~~--~~~~~~----~~~~~~~C~~-~~~~~~~~~~~~  133 (236)
T cd02620          68 GCNGGYPDAAWKYLTTTGVVTGGCQ-------PYTIPPCGHHPEG--PPPCCG----TPYCTPKCQD-GCEKTYEEDKHK  133 (236)
T ss_pred             CCCCCCHHHHHHHHHhcCCCcCCEe-------cCcCCCCccCCCC--CCCCCC----CCCCCCCCCc-CCccccceeeee
Confidence            7999999999999999999997666       9996544331111  122322    1233344541 111123445566


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK  160 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~  160 (280)
                      +..++.+..++++||++|+++|||+++|.++++|+.                       |++|||+.....         
T Consensus       134 ~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~-----------------------Y~~Giy~~~~~~---------  181 (236)
T cd02620         134 GKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-----------------------YKSGVYQHTSGK---------  181 (236)
T ss_pred             ecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhh-----------------------cCCcEEeecCCC---------
Confidence            777787766789999999999999999999888999                       999999865322         


Q ss_pred             eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeee
Q psy15346        161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG  240 (280)
Q Consensus       161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~  240 (280)
                      .                          .++|||+|||||++++++|||||||||++||++|||||+||.|.|||+++++.
T Consensus       182 ~--------------------------~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~  235 (236)
T cd02620         182 Q--------------------------LGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA  235 (236)
T ss_pred             C--------------------------cCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence            1                          56899999999999999999999999999999999999999999999998875


No 2  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=1.2e-37  Score=281.44  Aligned_cols=162  Identities=22%  Similarity=0.457  Sum_probs=129.2

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCccc--ccccCCCCCcccccce
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCH--TRCTNDNYGRGFFQDK   78 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~--~~c~~~~~~~~~~~~~   78 (280)
                      ||+||++..||+|++++|++++.+|       ||..          ....|+... ...+|.  ..|....     ....
T Consensus        70 gC~GG~~~~a~~~~~~~Gl~~e~~y-------PY~~----------~~~~C~~~~-~~~~c~~~~~c~~~~-----~~~~  126 (239)
T cd02698          70 SCHGGDPGGVYEYAHKHGIPDETCN-------PYQA----------KDGECNPFN-RCGTCNPFGECFAIK-----NYTL  126 (239)
T ss_pred             CccCcCHHHHHHHHHHcCcCCCCee-------CCcC----------CCCCCcCCC-CCCCcccCccccccc-----ccce
Confidence            7999999999999999999996666       9983          344564321 111221  1222100     1234


Q ss_pred             eeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhe
Q psy15346         79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT  158 (280)
Q Consensus        79 ~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~  158 (280)
                      ++++++..+. ++++||++|+++|||+++|.++.+|+.                       |++|||++..+.       
T Consensus       127 ~~i~~~~~~~-~~~~i~~~l~~~GPV~v~i~~~~~f~~-----------------------Y~~GIy~~~~~~-------  175 (239)
T cd02698         127 YFVSDYGSVS-GRDKMMAEIYARGPISCGIMATEALEN-----------------------YTGGVYKEYVQD-------  175 (239)
T ss_pred             EEeeeceecC-CHHHHHHHHHHcCCEEEEEEecccccc-----------------------cCCeEEccCCCC-------
Confidence            5677776674 578999999999999999999988999                       999999876544       


Q ss_pred             eeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-CccEEEEEcCCCCCCCCCceEEEEccC-----Ccc
Q psy15346        159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR-----NEA  232 (280)
Q Consensus       159 ~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-g~~YWiirNSWG~~WG~~Gy~kI~rg~-----n~c  232 (280)
                        .                          .++|||+|||||+++ +++|||||||||++||++|||||+||.     |+|
T Consensus       176 --~--------------------------~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~  227 (239)
T cd02698         176 --P--------------------------LINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNL  227 (239)
T ss_pred             --C--------------------------cCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCccccccc
Confidence              1                          569999999999886 999999999999999999999999999     999


Q ss_pred             cccceeeeEeec
Q psy15346        233 IIESLVNGALPK  244 (280)
Q Consensus       233 gIe~~~~~~~p~  244 (280)
                      |||+.+++++|.
T Consensus       228 ~i~~~~~~~~~~  239 (239)
T cd02698         228 AIEEDCAWADPI  239 (239)
T ss_pred             ccccceEEEeeC
Confidence            999999999983


No 3  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=2.3e-37  Score=279.78  Aligned_cols=165  Identities=32%  Similarity=0.544  Sum_probs=124.7

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++.+|++|++++||+++..|       ||+.         .....|....       ..|.     +.+..+..+
T Consensus        72 GC~GG~~~~a~~~~~~~Gi~~e~~y-------PY~~---------~~~~~C~~~~-------~~~~-----~~~~~~~~~  123 (243)
T cd02621          72 GCDGGFPFLVGKFAEDFGIVTEDYF-------PYTA---------DDDRPCKASP-------SECR-----RYYFSDYNY  123 (243)
T ss_pred             CCCCCCHHHHHHHHHhcCcCCCcee-------CCCC---------CCCCCCCCCc-------cccc-----cccccceeE
Confidence            7999999999999999999996666       9983         1345565421       0011     111112223


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCC----chhhhhh
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA----SAEIVAY  156 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~----~~~~~~~  156 (280)
                      +.+++.. .++++||++|+++|||+++|.++++|+.                       |++|||+...    |..    
T Consensus       124 i~~~~~~-~~~~~ik~~i~~~GPv~v~~~~~~~F~~-----------------------Y~~GIy~~~~~~~~C~~----  175 (243)
T cd02621         124 VGGCYGC-TNEDEMKWEIYRNGPIVVAFEVYSDFDF-----------------------YKEGVYHHTDNDEVSDG----  175 (243)
T ss_pred             ccccccc-CCHHHHHHHHHHcCCEEEEEEecccccc-----------------------cCCeEECcCCccccccc----
Confidence            3333333 4789999999999999999999988999                       9999998763    320    


Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC--CccEEEEEcCCCCCCCCCceEEEEccCCcccc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGRNEAII  234 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~--g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgI  234 (280)
                       ..+.                     ......++|||+|||||+++  +++|||||||||++||++|||||+||.|.|||
T Consensus       176 -~~~~---------------------~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi  233 (243)
T cd02621         176 -DNDN---------------------FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGI  233 (243)
T ss_pred             -cccc---------------------ccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCc
Confidence             0000                     00011569999999999986  89999999999999999999999999999999


Q ss_pred             cceeeeEee
Q psy15346        235 ESLVNGALP  243 (280)
Q Consensus       235 e~~~~~~~p  243 (280)
                      ++++++++|
T Consensus       234 ~~~~~~~~~  242 (243)
T cd02621         234 ESQAVFAYP  242 (243)
T ss_pred             ccceEeecc
Confidence            999999988


No 4  
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=1.5e-36  Score=305.89  Aligned_cols=189  Identities=25%  Similarity=0.421  Sum_probs=134.0

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCC-------------------Ccc
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-------------------PKC   61 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~-------------------p~c   61 (280)
                      ||+||++..|++|++++||+++..|       ||+.          ..+.|+......                   +.|
T Consensus       456 GC~GG~~~~A~kya~~~GI~tEscY-------PY~a----------~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~  518 (693)
T PTZ00049        456 GCNGGFPYLVSKMAKLQGIPLDKVF-------PYTA----------TEQTCPYQVDQSANSMNGSANLRQINAVFFSSET  518 (693)
T ss_pred             CcCCCcHHHHHHHHHHCCCCcCCcc-------CCcC----------CCCCCCCCCCCccccccccccccccccccccccc
Confidence            7999999999999999999996665       9983          234454321100                   112


Q ss_pred             cccccC-------CCCCcccccceeeeEEEEEcC--chHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCccccccc
Q psy15346         62 HTRCTN-------DNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY  132 (280)
Q Consensus        62 ~~~c~~-------~~~~~~~~~~~~~i~~~y~~~--~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~  132 (280)
                      ...|..       ..+.++|..+..++.++|.+.  .++++||++|+++|||+++|.++.+|+.                
T Consensus       519 ~~~~~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~----------------  582 (693)
T PTZ00049        519 QSDMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYD----------------  582 (693)
T ss_pred             cccccccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhc----------------
Confidence            222211       122334444444455555542  3689999999999999999999888998                


Q ss_pred             ccccccccccceeecCC------chhhhhhheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeecc--CCc
Q psy15346        133 LYSDIFSYKSGVYAVSA------SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGR  204 (280)
Q Consensus       133 ~~~~~~~Y~~GVy~~~~------~~~~~~~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e--~g~  204 (280)
                             |++|||+...      |.       ...         |    .+.+.....++..++|||+|||||++  ++.
T Consensus       583 -------YksGVY~~~~~~h~~~C~-------~d~---------~----~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~  635 (693)
T PTZ00049        583 -------YADGVYYVEDFPHARRCT-------VDL---------P----KHNGVYNITGWEKVNHAIVLVGWGEEEINGK  635 (693)
T ss_pred             -------CCCccccCcccccccccC-------Ccc---------c----cccccccccccccCceEEEEEEeccccCCCc
Confidence                   9999998632      32       000         0    00000000112257999999999985  463


Q ss_pred             --cEEEEEcCCCCCCCCCceEEEEccCCcccccceeeeEeeccCCCC
Q psy15346        205 --PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGV  249 (280)
Q Consensus       205 --~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~~~p~~~~~~  249 (280)
                        +|||||||||++||++|||||+||.|.||||+++++++|+++||.
T Consensus       636 ~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~pd~~rg~  682 (693)
T PTZ00049        636 LYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEPDFSRGA  682 (693)
T ss_pred             ccCEEEEECCCCCCcccCceEEEEcCCCccCCccceeEEeeeccccH
Confidence              899999999999999999999999999999999999999999985


No 5  
>KOG1543|consensus
Probab=100.00  E-value=1.3e-35  Score=280.14  Aligned_cols=151  Identities=28%  Similarity=0.501  Sum_probs=130.6

Q ss_pred             CCCCCchHHHHHHHHhcCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVT-GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY   79 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~t-e~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~   79 (280)
                      ||+||++..||+|++++|+++ +.+|       ||.          +.+..|..+..                   .+.+
T Consensus       173 GC~GG~~~~A~~yi~~~G~~t~~~~Y-------py~----------~~~~~C~~~~~-------------------~~~~  216 (325)
T KOG1543|consen  173 GCNGGEPKNAFKYIKKNGGVTECENY-------PYI----------GKDGTCKSNKK-------------------DKTV  216 (325)
T ss_pred             CcCCCCHHHHHHHHHHhCCCCCCcCC-------CCc----------CCCCCccCCCc-------------------ccee
Confidence            799999999999999999998 8888       888          34446665421                   2355


Q ss_pred             eeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346         80 RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV  159 (280)
Q Consensus        80 ~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~  159 (280)
                      .+.+++.++.++.+||.+|+.+|||.++|.++.+|+.                       |++|||.+..+.        
T Consensus       217 ~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~-----------------------Y~~GVy~~~~~~--------  265 (325)
T KOG1543|consen  217 TIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSL-----------------------YKGGVYAEEKGD--------  265 (325)
T ss_pred             EeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhh-----------------------ccCceEeCCCCC--------
Confidence            6778888887899999999999999999999999999                       999999998776        


Q ss_pred             eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceee
Q psy15346        160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN  239 (280)
Q Consensus       160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~  239 (280)
                       .                        . .++|||+|||||+.++.+|||||||||++||++|||||+|++|.|+|++.+.
T Consensus       266 -~------------------------~-~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~  319 (325)
T KOG1543|consen  266 -D------------------------K-EGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIASEAS  319 (325)
T ss_pred             -C------------------------C-CCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhhcccc
Confidence             1                        0 3699999999999667899999999999999999999999999999999988


Q ss_pred             eEeec
Q psy15346        240 GALPK  244 (280)
Q Consensus       240 ~~~p~  244 (280)
                      ++.|+
T Consensus       320 ~~p~~  324 (325)
T KOG1543|consen  320 YGPIK  324 (325)
T ss_pred             cCCCC
Confidence            86554


No 6  
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=2.9e-35  Score=292.57  Aligned_cols=189  Identities=29%  Similarity=0.345  Sum_probs=137.1

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++..|++|++++||++|++|     |.||+..       ++..+.|+..          +   ...+.+..+..+
T Consensus       278 GCdGG~p~~A~~yi~~~GI~tE~dY-----~~PY~~~-------dg~~~~Ck~~----------~---~~~~y~~~~~~~  332 (548)
T PTZ00364        278 GCAGGFPEEVGKFAETFGILTTDSY-----YIPYDSG-------DGVERACKTR----------R---PSRRYYFTNYGP  332 (548)
T ss_pred             CCCCCcHHHHHHHHHhCCccccccc-----CCCCCCC-------CCCCCCCCCC----------c---ccceeeeeeeEE
Confidence            7999999999999999999997766     6699831       1222235432          1   112223334456


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecC-----Cchhhhh
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS-----ASAEIVA  155 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~-----~~~~~~~  155 (280)
                      +.++|.+..++++||++|+++|||+++|+++.+|+.                       |++|||.+.     .... ..
T Consensus       333 I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~-----------------------YksGiy~gi~~~~~~~~~-~~  388 (548)
T PTZ00364        333 LGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYN-----------------------CDENSTEDVRYVSLDDYS-TA  388 (548)
T ss_pred             ecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHh-----------------------cCCCCccCeecccccccc-cc
Confidence            667777666788999999999999999999989999                       999888632     1000 00


Q ss_pred             hheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeec-cCCccEEEEEcCCCC--CCCCCceEEEEccCCcc
Q psy15346        156 YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGE--QFGDKGTIKILRGRNEA  232 (280)
Q Consensus       156 ~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~-e~g~~YWiirNSWG~--~WG~~Gy~kI~rg~n~c  232 (280)
                      .++.        ....+            ....++|||+|||||+ +++++|||||||||+  +|||+|||||+||.|+|
T Consensus       389 ~~~~--------~~~~~------------~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~C  448 (548)
T PTZ00364        389 SADR--------PLRHY------------FASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAY  448 (548)
T ss_pred             ccCC--------ccccc------------ccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcc
Confidence            0000        00000            0014699999999997 478999999999999  99999999999999999


Q ss_pred             cccceeeeEeeccCCCCccCcccccc
Q psy15346        233 IIESLVNGALPKDNYGVEFGEESGER  258 (280)
Q Consensus       233 gIe~~~~~~~p~~~~~~~~~~~~~~~  258 (280)
                      |||+.++.+.|.....+...++.-.+
T Consensus       449 GIes~~v~~~~~~~~~~~~~~~~~~~  474 (548)
T PTZ00364        449 NIESEVVVMYWAPYPDVLHPEEYFLV  474 (548)
T ss_pred             cccceeeeeeeecCCCccCCCceEEE
Confidence            99999999999776666666655444


No 7  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=8e-35  Score=254.94  Aligned_cols=147  Identities=26%  Similarity=0.483  Sum_probs=123.7

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      +|+||.+..||++++++|++++++|       ||.          .....|+...          .         ...++
T Consensus        62 gC~GG~~~~a~~~~~~~Gi~~e~~y-------PY~----------~~~~~C~~~~----------~---------~~~~~  105 (210)
T cd02248          62 GCNGGNPDNAFEYVKNGGLASESDY-------PYT----------GKDGTCKYNS----------S---------KVGAK  105 (210)
T ss_pred             CCCCCCHHHhHHHHHHCCcCccccC-------Ccc----------CCCCCccCCC----------C---------cccEE
Confidence            6999999999999999999997777       998          2334554421          0         23566


Q ss_pred             eEEEEEcCc-hHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346         81 FKRYYWVND-EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV  159 (280)
Q Consensus        81 i~~~y~~~~-~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~  159 (280)
                      +.+++.+.. +.++||++|+++|||+++|.++++|..                       |++|||.+..+..       
T Consensus       106 i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~-----------------------y~~Giy~~~~~~~-------  155 (210)
T cd02248         106 ITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQF-----------------------YKGGIYSGPCCSN-------  155 (210)
T ss_pred             EeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCccccc-----------------------CCCCceeCCCCCC-------
Confidence            778777753 488999999999999999999989999                       9999999876520       


Q ss_pred             eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceee
Q psy15346        160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN  239 (280)
Q Consensus       160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~  239 (280)
                       .                          .++|||+|||||++.+.+|||||||||++||++|||||.|+.|.|||++.+.
T Consensus       156 -~--------------------------~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~~cgi~~~~~  208 (210)
T cd02248         156 -T--------------------------NLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYAS  208 (210)
T ss_pred             -C--------------------------cCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCCccCceeeee
Confidence             1                          6799999999999989999999999999999999999999999999998765


Q ss_pred             e
Q psy15346        240 G  240 (280)
Q Consensus       240 ~  240 (280)
                      +
T Consensus       209 ~  209 (210)
T cd02248         209 Y  209 (210)
T ss_pred             c
Confidence            3


No 8  
>KOG1542|consensus
Probab=100.00  E-value=5.4e-35  Score=272.44  Aligned_cols=148  Identities=22%  Similarity=0.454  Sum_probs=126.9

Q ss_pred             CCCCCchHHHHHHHH-hcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CccCCCCCCCcccccccCCCCCcccccce
Q psy15346          1 VCSSGISSSTWVWVH-KRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP-ECKTLATPQPKCHTRCTNDNYGRGFFQDK   78 (280)
Q Consensus         1 gC~GG~~~~A~~yi~-~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~-~C~~~~~~~p~c~~~c~~~~~~~~~~~~~   78 (280)
                      ||+||.+..||+|++ ..||..|.+|       ||+          ++.. .|..+..                   ...
T Consensus       218 gC~GGl~~nA~~~~~~~gGL~~E~dY-------PY~----------g~~~~~C~~~~~-------------------~~~  261 (372)
T KOG1542|consen  218 GCNGGLMDNAFKYIKKAGGLEKEKDY-------PYT----------GKKGNQCHFDKS-------------------KIV  261 (372)
T ss_pred             cCCCCChhHHHHHHHHhCCccccccC-------Ccc----------ccCCCccccchh-------------------hce
Confidence            799999999999954 5689999999       999          4444 7766421                   235


Q ss_pred             eeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecC--Cchhhhhh
Q psy15346         79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS--ASAEIVAY  156 (280)
Q Consensus        79 ~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~--~~~~~~~~  156 (280)
                      .+|.+++-++.++++|.+.|.++|||+|+|.+ ..++.                       |++||+.+.  .|+     
T Consensus       262 v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~-----------------------YrgGV~~P~~~~Cs-----  312 (372)
T KOG1542|consen  262 VSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQF-----------------------YRGGVSCPSKYICS-----  312 (372)
T ss_pred             EEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHH-----------------------hcccccCCCcccCC-----
Confidence            67889999989999999999999999999996 44666                       999999983  354     


Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-CccEEEEEcCCCCCCCCCceEEEEccCCccccc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE  235 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe  235 (280)
                        . .                          .++|+|+|||||.+. ..||||||||||++||++||+|+.||.|.|||+
T Consensus       313 --~-~--------------------------~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~  363 (372)
T KOG1542|consen  313 --P-K--------------------------LLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIA  363 (372)
T ss_pred             --c-c--------------------------ccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccc
Confidence              1 1                          479999999999998 899999999999999999999999999999999


Q ss_pred             ceeeeEe
Q psy15346        236 SLVNGAL  242 (280)
Q Consensus       236 ~~~~~~~  242 (280)
                      +.+.+++
T Consensus       364 ~mvss~~  370 (372)
T KOG1542|consen  364 DMVSSAA  370 (372)
T ss_pred             cchhhhh
Confidence            9988765


No 9  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=2.8e-34  Score=273.23  Aligned_cols=151  Identities=23%  Similarity=0.421  Sum_probs=119.2

Q ss_pred             CCCCCchHHHHHHHHhc---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccc
Q psy15346          1 VCSSGISSSTWVWVHKR---GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD   77 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~---Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~   77 (280)
                      ||+||++..||+|++++   ||++|.+|       ||+..       ++..+.|...          +.   .     ..
T Consensus       187 GC~GG~~~~a~~yi~~~~~ggi~~e~~Y-------PY~~~-------~~~~~~C~~~----------~~---~-----~~  234 (348)
T PTZ00203        187 GCGGGLMLQAFEWVLRNMNGTVFTEKSY-------PYVSG-------NGDVPECSNS----------SE---L-----AP  234 (348)
T ss_pred             CCCCCCHHHHHHHHHHhcCCCCCccccC-------CCccC-------CCCCCcCCCC----------cc---c-----cc
Confidence            79999999999999864   58898888       99831       1112234321          10   0     01


Q ss_pred             eeeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhh
Q psy15346         78 KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYA  157 (280)
Q Consensus        78 ~~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~  157 (280)
                      .+++.++..+..++++||++|+++|||+++|.+. +|+.                       |++|||++  |..     
T Consensus       235 ~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~-----------------------Y~~GIy~~--c~~-----  283 (348)
T PTZ00203        235 GARIDGYVSMESSERVMAAWLAKNGPISIAVDAS-SFMS-----------------------YHSGVLTS--CIG-----  283 (348)
T ss_pred             ceEecceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcC-----------------------ccCceeec--cCC-----
Confidence            2345666666667888999999999999999984 7988                       99999974  330     


Q ss_pred             eeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccce
Q psy15346        158 TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL  237 (280)
Q Consensus       158 ~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~  237 (280)
                         .                          ..+|||+|||||++++++|||||||||++||++|||||+||.|.|||+++
T Consensus       284 ---~--------------------------~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cgi~~~  334 (348)
T PTZ00203        284 ---E--------------------------QLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGY  334 (348)
T ss_pred             ---C--------------------------CCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCcccccce
Confidence               1                          35999999999999999999999999999999999999999999999998


Q ss_pred             eeeEee
Q psy15346        238 VNGALP  243 (280)
Q Consensus       238 ~~~~~p  243 (280)
                      ++.+..
T Consensus       335 ~~~~~~  340 (348)
T PTZ00203        335 PVSVHV  340 (348)
T ss_pred             EEEEec
Confidence            887744


No 10 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=4e-33  Score=243.56  Aligned_cols=150  Identities=31%  Similarity=0.559  Sum_probs=121.3

Q ss_pred             CCCCCchHHHHHHHHh-cCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCccCCCCCCCcccccccCCCCCcccccce
Q psy15346          1 VCSSGISSSTWVWVHK-RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDK   78 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~-~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~-~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~   78 (280)
                      +|+||++..||+++++ +||+++..|       ||..          .. +.|....         +.         ...
T Consensus        65 ~c~gg~~~~a~~~~~~~~Gi~~e~~~-------pY~~----------~~~~~c~~~~---------~~---------~~~  109 (219)
T PF00112_consen   65 GCDGGSPFDALKYIKNNNGIVTEEDY-------PYNG----------NENPTCKSKK---------SN---------SYY  109 (219)
T ss_dssp             TTBBBEHHHHHHHHHHHTSBEBTTTS---------SS----------SSSCSSCHSG---------GG---------EEE
T ss_pred             ccccCcccccceeecccCcccccccc-------cccc----------cccccccccc---------cc---------ccc
Confidence            6999999999999999 999997777       9992          22 4554421         00         012


Q ss_pred             eeeEEEEEcCc-hHHHHHHHHHhCCcEEEEEEeCc-cccccccCccCCCcccccccccccccccccceeecCCchhhhhh
Q psy15346         79 YRFKRYYWVND-EVADIQQEIMKNGPVVANMYLYS-DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY  156 (280)
Q Consensus        79 ~~i~~~y~~~~-~~~~Ik~~I~~~GPV~v~~~v~~-~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~  156 (280)
                      +++..+..+.. ++++||++|+++|||+++|.+.. +|..                       |++|||....+..    
T Consensus       110 ~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~-----------------------~~~gi~~~~~~~~----  162 (219)
T PF00112_consen  110 VKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQN-----------------------YKSGIYDPPDCSN----  162 (219)
T ss_dssp             BEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHT-----------------------EESSEECSTSSSS----
T ss_pred             ccccccccccccchhHHHHHHhhCceeeeeeecccccccc-----------------------ccceeeecccccc----
Confidence            45556666543 58999999999999999999988 5988                       9999999875440    


Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCC-ccccc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-EAIIE  235 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n-~cgIe  235 (280)
                                                    ..++|||+|||||++.+.+|||||||||++||++|||||.|+.| +||||
T Consensus       163 ------------------------------~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~  212 (219)
T PF00112_consen  163 ------------------------------ESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIE  212 (219)
T ss_dssp             ------------------------------SSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTT
T ss_pred             ------------------------------ccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccC
Confidence                                          16799999999999999999999999999999999999999997 99999


Q ss_pred             ceeeeEe
Q psy15346        236 SLVNGAL  242 (280)
Q Consensus       236 ~~~~~~~  242 (280)
                      +++++++
T Consensus       213 ~~~~~~~  219 (219)
T PF00112_consen  213 SQAVYPI  219 (219)
T ss_dssp             SSEEEEE
T ss_pred             ceeeecC
Confidence            9999875


No 11 
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=5.1e-33  Score=271.86  Aligned_cols=145  Identities=21%  Similarity=0.423  Sum_probs=114.6

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++..||+|++++||+++++|       ||+          +..+.|....                    ...++
T Consensus       296 GC~GG~~~~A~~yi~~~Gi~~e~~Y-------PY~----------~~~~~C~~~~--------------------~~~~~  338 (448)
T PTZ00200        296 GCSGGYPDTALEYVKNKGLSSSSDV-------PYL----------AKDGKCVVSS--------------------TKKVY  338 (448)
T ss_pred             CCCCCcHHHHHHHHhhcCccccccC-------CCC----------CCCCCCcCCC--------------------CCeeE
Confidence            7999999999999999999997777       998          4455675421                    01223


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK  160 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~  160 (280)
                      +.++..+. . .+++++++.+|||+++|.++.+|+.                       |++|||++. |.         
T Consensus       339 i~~y~~~~-~-~~~l~~~l~~GPV~v~i~~~~~f~~-----------------------Yk~GIy~~~-C~---------  383 (448)
T PTZ00200        339 IDSYLVAK-G-KDVLNKSLVISPTVVYIAVSRELLK-----------------------YKSGVYNGE-CG---------  383 (448)
T ss_pred             ecceEecC-H-HHHHHHHHhcCCEEEEeeccccccc-----------------------CCCCccccc-cC---------
Confidence            44444343 3 3455566678999999999888999                       999999864 33         


Q ss_pred             eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeecc--CCccEEEEEcCCCCCCCCCceEEEEcc---CCccccc
Q psy15346        161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRG---RNEAIIE  235 (280)
Q Consensus       161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e--~g~~YWiirNSWG~~WG~~Gy~kI~rg---~n~cgIe  235 (280)
                      .                          .++|||+|||||.+  ++.+|||||||||++||++|||||+|+   .|.|||+
T Consensus       384 ~--------------------------~~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~  437 (448)
T PTZ00200        384 K--------------------------SLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGIL  437 (448)
T ss_pred             C--------------------------CCcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCcc
Confidence            1                          35999999999953  688999999999999999999999995   5899999


Q ss_pred             ceeeeEee
Q psy15346        236 SLVNGALP  243 (280)
Q Consensus       236 ~~~~~~~p  243 (280)
                      +.+.+++.
T Consensus       438 ~~~~~P~~  445 (448)
T PTZ00200        438 TVGLTPVF  445 (448)
T ss_pred             ccceeeEE
Confidence            98887653


No 12 
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=1.9e-32  Score=269.55  Aligned_cols=147  Identities=23%  Similarity=0.419  Sum_probs=118.2

Q ss_pred             CCCCCchHHHHHHHHhc-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346          1 VCSSGISSSTWVWVHKR-GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY   79 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~-Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~   79 (280)
                      ||+||++..||+|++++ ||++|++|       ||+.         ...+.|...     .    |.          ..+
T Consensus       327 GC~GG~~~~Af~yi~~~gGl~tE~~Y-------PY~~---------~~~~~C~~~-----~----~~----------~~~  371 (489)
T PTZ00021        327 GCYGGLIPNAFEDMIELGGLCSEDDY-------PYVS---------DTPELCNID-----R----CK----------EKY  371 (489)
T ss_pred             CCCCcchHhhhhhhhhccccCccccc-------CccC---------CCCCccccc-----c----cc----------ccc
Confidence            79999999999999766 89998888       9983         112456432     1    21          134


Q ss_pred             eeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhhee
Q psy15346         80 RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV  159 (280)
Q Consensus        80 ~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~  159 (280)
                      .+.++..++  ..+|+++|+.+|||+|+|.+..+|+.                       |++|||++. |.        
T Consensus       372 ~i~~y~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~-----------------------YkgGIy~~~-C~--------  417 (489)
T PTZ00021        372 KIKSYVSIP--EDKFKEAIRFLGPISVSIAVSDDFAF-----------------------YKGGIFDGE-CG--------  417 (489)
T ss_pred             eeeeEEEec--HHHHHHHHHhcCCeEEEEEeeccccc-----------------------CCCCcCCCC-CC--------
Confidence            566776664  57899999999999999999888999                       999999864 43        


Q ss_pred             eeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCC----------ccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346        160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENG----------RPYWTIVSTFGEQFGDKGTIKILRGR  229 (280)
Q Consensus       160 ~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g----------~~YWiirNSWG~~WG~~Gy~kI~rg~  229 (280)
                       .                          .++|||+|||||++++          .+|||||||||++||++|||||+|+.
T Consensus       418 -~--------------------------~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~  470 (489)
T PTZ00021        418 -E--------------------------EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDE  470 (489)
T ss_pred             -C--------------------------ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCC
Confidence             1                          4599999999997642          57999999999999999999999996


Q ss_pred             ----CcccccceeeeEee
Q psy15346        230 ----NEAIIESLVNGALP  243 (280)
Q Consensus       230 ----n~cgIe~~~~~~~p  243 (280)
                          |.|||..++.+++.
T Consensus       471 ~g~~n~CGI~t~a~yP~~  488 (489)
T PTZ00021        471 NGLMKTCSLGTEAYVPLI  488 (489)
T ss_pred             CCCCCCCCCcccceeEec
Confidence                58999998887653


No 13 
>KOG1544|consensus
Probab=99.98  E-value=4.5e-33  Score=257.69  Aligned_cols=179  Identities=34%  Similarity=0.625  Sum_probs=142.2

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCC----CCCcccccccCCCCCccccc
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT----PQPKCHTRCTNDNYGRGFFQ   76 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~----~~p~c~~~c~~~~~~~~~~~   76 (280)
                      ||+||+.+.||-||.+.|+|.       +.|+||..      ...+..+.|...+.    ......+.|++. +.  -..
T Consensus       275 GC~gG~lDRAWWYlRKrGvVs-------dhCYP~~~------dQ~~~~~~C~m~sR~~grgkRqat~~CPn~-~~--~Sn  338 (470)
T KOG1544|consen  275 GCRGGRLDRAWWYLRKRGVVS-------DHCYPFSG------DQAGPAPPCMMHSRAMGRGKRQATAHCPNS-YV--NSN  338 (470)
T ss_pred             cCccCcccchheeeecccccc-------cccccccC------CCCCCCCCceeeccccCcccccccCcCCCc-cc--ccC
Confidence            799999999999999999999       78889985      22345667766443    112223446532 11  124


Q ss_pred             ceeeeEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhh
Q psy15346         77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY  156 (280)
Q Consensus        77 ~~~~i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~  156 (280)
                      +.|+.+..|.+++++++||++||++|||.+.|.|.+||+.                       |++|||.|....     
T Consensus       339 ~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~-----------------------YkgGiY~H~~~~-----  390 (470)
T KOG1544|consen  339 DIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFL-----------------------YKGGIYSHTPVS-----  390 (470)
T ss_pred             ceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhh-----------------------hccceeeccccc-----
Confidence            6788899999999999999999999999999999999999                       999999997643     


Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC-----CccEEEEEcCCCCCCCCCceEEEEccCCc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN-----GRPYWTIVSTFGEQFGDKGTIKILRGRNE  231 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~-----g~~YWiirNSWG~~WG~~Gy~kI~rg~n~  231 (280)
                          .       +           .+......+.|+|.|.|||++.     ..+|||..||||+.||++|||||.||+|+
T Consensus       391 ----~-------~-----------~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNe  448 (470)
T KOG1544|consen  391 ----L-------G-----------RPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNE  448 (470)
T ss_pred             ----c-------C-----------CchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccc
Confidence                0       0           0111123678999999999973     36799999999999999999999999999


Q ss_pred             ccccceeeeEeecc
Q psy15346        232 AIIESLVNGALPKD  245 (280)
Q Consensus       232 cgIe~~~~~~~p~~  245 (280)
                      |.||+.+++|+-.+
T Consensus       449 cdIEsfvIgAWGr~  462 (470)
T KOG1544|consen  449 CDIESFVIGAWGRV  462 (470)
T ss_pred             hhhhHhhhhhhhcc
Confidence            99999999887644


No 14 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.95  E-value=6.3e-27  Score=242.50  Aligned_cols=103  Identities=20%  Similarity=0.385  Sum_probs=86.8

Q ss_pred             HHHHHHHHHhCCcEEEEEEeCccccccccCccCCCccccccccccccccc-ccceeecCCchhhhhhheeeeeccCcCCC
Q psy15346         91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEENG  169 (280)
Q Consensus        91 ~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y-~~GVy~~~~~~~~~~~~~~~~~gwg~~~~  169 (280)
                      ++.||++|+.+|||+|+|.+. +|+.                       | .+|||....|..        .        
T Consensus       680 i~~IK~eI~~kGPVaV~IdAs-df~~-----------------------Y~~sGIyv~~~Cgs--------~--------  719 (1004)
T PTZ00462        680 IKIIKDEIMNKGSVIAYIKAE-NVLG-----------------------YEFNGKKVQNLCGD--------D--------  719 (1004)
T ss_pred             HHHHHHHHHhcCCEEEEEEee-hHHh-----------------------hhcCCccccCCCCC--------C--------
Confidence            468999999999999999985 6888                       7 489876654540        1        


Q ss_pred             CCceeeeeeeecccCccccCCeEEEEEEeecc-----CCccEEEEEcCCCCCCCCCceEEEEc-cCCcccccceeeeEee
Q psy15346        170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEE-----NGRPYWTIVSTFGEQFGDKGTIKILR-GRNEAIIESLVNGALP  243 (280)
Q Consensus       170 ~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e-----~g~~YWiirNSWG~~WG~~Gy~kI~r-g~n~cgIe~~~~~~~p  243 (280)
                                        ..+|||+|||||.+     .+++|||||||||+.||++|||||+| |.|.|||.....++++
T Consensus       720 ------------------~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t~~~f  781 (1004)
T PTZ00462        720 ------------------TADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHSVVIF  781 (1004)
T ss_pred             ------------------cCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchheeeeeE
Confidence                              45899999999974     25799999999999999999999998 7899999999999999


Q ss_pred             ccCCCCcc
Q psy15346        244 KDNYGVEF  251 (280)
Q Consensus       244 ~~~~~~~~  251 (280)
                      ++.-++.-
T Consensus       782 n~d~~~~~  789 (1004)
T PTZ00462        782 NIDLPKNK  789 (1004)
T ss_pred             eecccccc
Confidence            88766653


No 15 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.94  E-value=9.2e-27  Score=203.61  Aligned_cols=141  Identities=21%  Similarity=0.335  Sum_probs=106.1

Q ss_pred             CCCCCchHHHHH-HHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCccccccee
Q psy15346          1 VCSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKY   79 (280)
Q Consensus         1 gC~GG~~~~A~~-yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~   79 (280)
                      +|.||.+..|+. +++++||+++..|       ||..          ....|...          |....     ....+
T Consensus        65 ~c~gG~~~~~~~~~~~~~Gi~~e~~~-------Py~~----------~~~~~~~~----------~~~~~-----~~~~~  112 (223)
T cd02619          65 SCDGGGPLSALLKLVALKGIPPEEDY-------PYGA----------ESDGEEPK----------SEAAL-----NAAKV  112 (223)
T ss_pred             CCCCCcHHHHHHHHHHHcCCCccccC-------CCCC----------CCCCCCCC----------Cccch-----hhcce
Confidence            699999999998 9999999997777       9983          22223221          00000     11234


Q ss_pred             eeEEEEEcC-chHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeec----CCchhhh
Q psy15346         80 RFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV----SASAEIV  154 (280)
Q Consensus        80 ~i~~~y~~~-~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~----~~~~~~~  154 (280)
                      ++..+..+. .++++||++|+++|||+++|.+..+|+.                       |++|+|..    ....   
T Consensus       113 ~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~---  166 (223)
T cd02619         113 KLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDR-----------------------LKEGIIYEEIVYLLYE---  166 (223)
T ss_pred             eecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhc-----------------------ccCccccccccccccC---
Confidence            455665554 3578999999999999999999988988                       88888631    1111   


Q ss_pred             hhheeeeeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccC--CccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346        155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGR  229 (280)
Q Consensus       155 ~~~~~~~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~--g~~YWiirNSWG~~WG~~Gy~kI~rg~  229 (280)
                                                    ....++|||+|||||++.  +++|||||||||+.||++||+||.++.
T Consensus       167 ------------------------------~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         167 ------------------------------DGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             ------------------------------CCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence                                          011679999999999987  889999999999999999999999974


No 16 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.94  E-value=1.5e-26  Score=199.99  Aligned_cols=52  Identities=37%  Similarity=0.797  Sum_probs=48.0

Q ss_pred             cCCeEEEEEEeecc-CCccEEEEEcCCCCCCCCCceEEEEccC-Ccccccceee
Q psy15346        188 VAYATVKLIGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGR-NEAIIESLVN  239 (280)
Q Consensus       188 ~~~HaV~IVGwG~e-~g~~YWiirNSWG~~WG~~Gy~kI~rg~-n~cgIe~~~~  239 (280)
                      .++|+|+|||||++ ++++|||||||||+.||++|||||+|+. |+||||....
T Consensus       118 ~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~~  171 (174)
T smart00645      118 TLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA  171 (174)
T ss_pred             cccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeeee
Confidence            35999999999987 8999999999999999999999999998 9999987654


No 17 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.48  E-value=9.2e-14  Score=136.07  Aligned_cols=41  Identities=22%  Similarity=0.492  Sum_probs=37.0

Q ss_pred             cCCeEEEEEEeeccC-Cc-cEEEEEcCCCCCCCCCceEEEEcc
Q psy15346        188 VAYATVKLIGWGEEN-GR-PYWTIVSTFGEQFGDKGTIKILRG  228 (280)
Q Consensus       188 ~~~HaV~IVGwG~e~-g~-~YWiirNSWG~~WG~~Gy~kI~rg  228 (280)
                      ..+|||+||||+.+. |. .||+|+||||+.||++||++|.+.
T Consensus       357 ~~tHAM~ivGv~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         357 LMTHAMVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             cCCeEEEEEEEEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence            468999999999864 76 599999999999999999999986


No 18 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.38  E-value=6.9e-13  Score=125.66  Aligned_cols=42  Identities=19%  Similarity=0.311  Sum_probs=36.9

Q ss_pred             cCCeEEEEEEeeccC----------CccEEEEEcCCCCCCCCCceEEEEccC
Q psy15346        188 VAYATVKLIGWGEEN----------GRPYWTIVSTFGEQFGDKGTIKILRGR  229 (280)
Q Consensus       188 ~~~HaV~IVGwG~e~----------g~~YWiirNSWG~~WG~~Gy~kI~rg~  229 (280)
                      ..+|||+||||++..          +...||||||||++||++|||||....
T Consensus       263 ~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y  314 (372)
T COG4870         263 NWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY  314 (372)
T ss_pred             cccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence            469999999999851          345999999999999999999999875


No 19 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.30  E-value=3.4e-06  Score=83.17  Aligned_cols=41  Identities=24%  Similarity=0.534  Sum_probs=33.9

Q ss_pred             cCCeEEEEEEeec-cCCcc-EEEEEcCCCCCCCCCceEEEEcc
Q psy15346        188 VAYATVKLIGWGE-ENGRP-YWTIVSTFGEQFGDKGTIKILRG  228 (280)
Q Consensus       188 ~~~HaV~IVGwG~-e~g~~-YWiirNSWG~~WG~~Gy~kI~rg  228 (280)
                      ..+|||+|+|... ++|.+ +|+|+||||++.|.+|||.|...
T Consensus       358 ~~tHAM~itGv~~D~~g~p~~wkVeNSWG~~~g~kGy~~msd~  400 (438)
T PF03051_consen  358 TMTHAMVITGVDLDEDGKPVRWKVENSWGTDNGDKGYFYMSDD  400 (438)
T ss_dssp             --EEEEEEEEEEE-TTSSEEEEEEE-SBTTTSTBTTEEEEEHH
T ss_pred             CCceeEEEEEEEeccCCCeeEEEEEcCCCCCCCCCcEEEECHH
Confidence            5689999999997 46664 99999999999999999999853


No 20 
>PTZ00203 cathepsin L protease; Provisional
Probab=95.75  E-value=0.0051  Score=59.20  Aligned_cols=29  Identities=24%  Similarity=0.500  Sum_probs=25.0

Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASA  185 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~  185 (280)
                      |+|.++|||++++.+||++.|||...|..
T Consensus       288 HaVliVGYG~~~g~~YWiikNSWG~~WGe  316 (348)
T PTZ00203        288 HGVLLVGYNMTGEVPYWVIKNSWGEDWGE  316 (348)
T ss_pred             eEEEEEEEecCCCceEEEEEcCCCCCcCc
Confidence            37889999999999999999999775553


No 21 
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=95.70  E-value=0.005  Score=55.71  Aligned_cols=28  Identities=21%  Similarity=0.713  Sum_probs=24.3

Q ss_pred             heeeeeccCcCC-CCCceeeeeeeecccC
Q psy15346        157 ATVKIVGWGEEN-GRPYWTIVRVYAVSAS  184 (280)
Q Consensus       157 ~~~~~~gwg~~~-~~~~w~~~~~~~~~~~  184 (280)
                      |+|.++|||+++ +++||++.|||...|.
T Consensus       180 HaV~IVGyG~~~~g~~YWiikNSWG~~WG  208 (239)
T cd02698         180 HIISVAGWGVDENGVEYWIVRNSWGEPWG  208 (239)
T ss_pred             eEEEEEEEEecCCCCEEEEEEcCCCcccC
Confidence            388899999886 8999999999977655


No 22 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=95.46  E-value=0.0081  Score=54.32  Aligned_cols=30  Identities=33%  Similarity=0.796  Sum_probs=25.5

Q ss_pred             hheeeeeccCcCC--CCCceeeeeeeecccCc
Q psy15346        156 YATVKIVGWGEEN--GRPYWTIVRVYAVSASA  185 (280)
Q Consensus       156 ~~~~~~~gwg~~~--~~~~w~~~~~~~~~~~~  185 (280)
                      .|+|.++|||.++  +.+||+++|||...|..
T Consensus       187 ~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe  218 (243)
T cd02621         187 NHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGE  218 (243)
T ss_pred             CeEEEEEEeeccCCCCCcEEEEEcCCCCCCCc
Confidence            4689999999886  88999999999776653


No 23 
>KOG1543|consensus
Probab=95.44  E-value=0.0083  Score=57.20  Aligned_cols=29  Identities=28%  Similarity=0.633  Sum_probs=24.5

Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASA  185 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~  185 (280)
                      |+|+++|||+.++.+||+++|||...|..
T Consensus       271 Hav~iVGyG~~~~~~YWivkNSWG~~WGe  299 (325)
T KOG1543|consen  271 HAVLIVGYGTGDGVDYWIVKNSWGTDWGE  299 (325)
T ss_pred             ceEEEEEEcCCCCceeEEEEcCCCCCccc
Confidence            48889999996668999999999776653


No 24 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=95.26  E-value=0.0085  Score=51.68  Aligned_cols=45  Identities=33%  Similarity=0.647  Sum_probs=36.6

Q ss_pred             cccceeecCCchhhhhhheeeeeccCcC-CCCCceeeeeeeecccC
Q psy15346        140 YKSGVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVRVYAVSAS  184 (280)
Q Consensus       140 Y~~GVy~~~~~~~~~~~~~~~~~gwg~~-~~~~~w~~~~~~~~~~~  184 (280)
                      |++|||++..+......|.|.++|||++ ++++||+++|||...|.
T Consensus       104 Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG  149 (174)
T smart00645      104 YKSGIYDHPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWG  149 (174)
T ss_pred             CcCeEECCCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcc
Confidence            9999998864433335799999999987 88999999999966444


No 25 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=94.88  E-value=0.015  Score=52.57  Aligned_cols=28  Identities=43%  Similarity=0.949  Sum_probs=24.5

Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccC
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSAS  184 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~  184 (280)
                      |+|.++|||.+++.+||++.|||...|.
T Consensus       186 HaV~iVGyg~~~g~~YWivrNSWG~~WG  213 (236)
T cd02620         186 HAVKIIGWGVENGVPYWLAANSWGTDWG  213 (236)
T ss_pred             eEEEEEEEeccCCeeEEEEEeCCCCCCC
Confidence            3788999999999999999999977554


No 26 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=94.80  E-value=0.0084  Score=57.42  Aligned_cols=40  Identities=20%  Similarity=0.416  Sum_probs=33.8

Q ss_pred             cCCeEEEEEEeecc-CC-ccEEEEEcCCCCCCCCCceEEEEc
Q psy15346        188 VAYATVKLIGWGEE-NG-RPYWTIVSTFGEQFGDKGTIKILR  227 (280)
Q Consensus       188 ~~~HaV~IVGwG~e-~g-~~YWiirNSWG~~WG~~Gy~kI~r  227 (280)
                      ...|||+|.|...+ +| .-=|.|.||||.+=|.+|||-+.-
T Consensus       360 LmTHAMvlTGvd~d~~g~p~rwkVENSWG~d~G~~GyfvaSd  401 (444)
T COG3579         360 LMTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVASD  401 (444)
T ss_pred             HHHHHHHhhccccccCCCceeeEeecccccccCCCceEeehH
Confidence            56899999999865 43 347999999999999999998753


No 27 
>KOG1542|consensus
Probab=94.52  E-value=0.023  Score=54.50  Aligned_cols=29  Identities=28%  Similarity=0.697  Sum_probs=25.6

Q ss_pred             heeeeeccCcCC-CCCceeeeeeeecccCc
Q psy15346        157 ATVKIVGWGEEN-GRPYWTIVRVYAVSASA  185 (280)
Q Consensus       157 ~~~~~~gwg~~~-~~~~w~~~~~~~~~~~~  185 (280)
                      |+|.++|+|..+ +.|||+++|||...|..
T Consensus       318 HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE  347 (372)
T KOG1542|consen  318 HAVLLVGYGSSGYEKPYWIVKNSWGTSWGE  347 (372)
T ss_pred             ceEEEEeecCCCCCCceEEEECCccccccc
Confidence            488899999998 99999999999887764


No 28 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=94.44  E-value=0.018  Score=50.05  Aligned_cols=28  Identities=36%  Similarity=0.732  Sum_probs=24.7

Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccC
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSAS  184 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~  184 (280)
                      |+|.++|||.+.+++||++.|||...|.
T Consensus       160 Hav~iVGy~~~~~~~ywiv~NSWG~~WG  187 (210)
T cd02248         160 HAVLLVGYGTENGVDYWIVKNSWGTSWG  187 (210)
T ss_pred             EEEEEEEEeecCCceEEEEEcCCCCccc
Confidence            3888999999989999999999977655


No 29 
>PTZ00200 cysteine proteinase; Provisional
Probab=93.61  E-value=0.041  Score=54.77  Aligned_cols=28  Identities=25%  Similarity=0.487  Sum_probs=23.1

Q ss_pred             heeeeeccCc--CCCCCceeeeeeeecccC
Q psy15346        157 ATVKIVGWGE--ENGRPYWTIVRVYAVSAS  184 (280)
Q Consensus       157 ~~~~~~gwg~--~~~~~~w~~~~~~~~~~~  184 (280)
                      |+|.++|||.  +++.+||+++|||...|.
T Consensus       388 HaV~lVGyG~d~~~g~~YWIIkNSWG~~WG  417 (448)
T PTZ00200        388 HAVLLVGEGYDEKTKKRYWIIKNSWGTDWG  417 (448)
T ss_pred             EEEEEEEecccCCCCCceEEEEcCCCCCcc
Confidence            3888999995  367899999999977554


No 30 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=93.40  E-value=0.043  Score=55.92  Aligned_cols=31  Identities=35%  Similarity=0.775  Sum_probs=26.5

Q ss_pred             hheeeeeccCcC-CCCCceeeeeeeec--ccCcc
Q psy15346        156 YATVKIVGWGEE-NGRPYWTIVRVYAV--SASAE  186 (280)
Q Consensus       156 ~~~~~~~gwg~~-~~~~~w~~~~~~~~--~~~~~  186 (280)
                      .|+|.++|||++ ++.+||+++|||..  .|...
T Consensus       403 nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~  436 (548)
T PTZ00364        403 NHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDG  436 (548)
T ss_pred             CeEEEEEEecccCCCceEEEEECCCCCCCCcccC
Confidence            469999999974 78899999999988  77653


No 31 
>KOG1544|consensus
Probab=92.86  E-value=0.068  Score=51.13  Aligned_cols=84  Identities=26%  Similarity=0.503  Sum_probs=48.7

Q ss_pred             eeeeEEEEEcCchHHHHHHH--HHhCCcEEEEEEeCcc-ccccccCccCCCcccccccccccccccccceeecCCchhhh
Q psy15346         78 KYRFKRYYWVNDEVADIQQE--IMKNGPVVANMYLYSD-IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV  154 (280)
Q Consensus        78 ~~~i~~~y~~~~~~~~Ik~~--I~~~GPV~v~~~v~~~-f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~  154 (280)
                      -|++++  +-...+.+||++  +..---|-..|..|.. .+.|.       |+-+          =++.-|....     
T Consensus       346 PYrVSS--nE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~-------~~~~----------~~~e~yr~~g-----  401 (470)
T KOG1544|consen  346 PYRVSS--NEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHT-------PVSL----------GRPERYRRHG-----  401 (470)
T ss_pred             CeeccC--CHHHHHHHHHhCCChhhhhhhhhhhhhhccceeecc-------cccc----------CCchhhhhcc-----
Confidence            455554  223457788776  3333335566766654 55531       1100          1122233333     


Q ss_pred             hhheeeeeccCcC---CC--CCceeeeeeeecccCcc
Q psy15346        155 AYATVKIVGWGEE---NG--RPYWTIVRVYAVSASAE  186 (280)
Q Consensus       155 ~~~~~~~~gwg~~---~~--~~~w~~~~~~~~~~~~~  186 (280)
                       .|+||+.|||++   .+  .+||+.+|||...|...
T Consensus       402 -tHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~  437 (470)
T KOG1544|consen  402 -THSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER  437 (470)
T ss_pred             -cceEEEeecccccCCCCCeeEEEEeecccccccccC
Confidence             359999999987   22  48999999999888754


No 32 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=92.52  E-value=0.073  Score=55.45  Aligned_cols=30  Identities=33%  Similarity=0.750  Sum_probs=24.1

Q ss_pred             hheeeeeccCcC--CCC--CceeeeeeeecccCc
Q psy15346        156 YATVKIVGWGEE--NGR--PYWTIVRVYAVSASA  185 (280)
Q Consensus       156 ~~~~~~~gwg~~--~~~--~~w~~~~~~~~~~~~  185 (280)
                      .|+|.++|||++  ++.  +||++.|||...|..
T Consensus       619 NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGe  652 (693)
T PTZ00049        619 NHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGK  652 (693)
T ss_pred             ceEEEEEEeccccCCCcccCEEEEECCCCCCccc
Confidence            469999999985  453  899999999776653


No 33 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=92.26  E-value=0.49  Score=37.51  Aligned_cols=24  Identities=29%  Similarity=0.426  Sum_probs=17.7

Q ss_pred             cCchHHHHHHHHHhCCcEEEEEEe
Q psy15346         87 VNDEVADIQQEIMKNGPVVANMYL  110 (280)
Q Consensus        87 ~~~~~~~Ik~~I~~~GPV~v~~~v  110 (280)
                      ...+..+|+++|.+..||++.+..
T Consensus        85 ~~~~~~~i~~~i~~G~Pvi~~~~~  108 (144)
T PF13529_consen   85 SDASFDDIKQEIDAGRPVIVSVNS  108 (144)
T ss_dssp             TTS-HHHHHHHHHTT--EEEEEET
T ss_pred             cCCcHHHHHHHHHCCCcEEEEEEc
Confidence            346789999999999999999874


No 34 
>PTZ00021 falcipain-2; Provisional
Probab=92.23  E-value=0.084  Score=53.11  Aligned_cols=28  Identities=32%  Similarity=0.503  Sum_probs=22.3

Q ss_pred             heeeeeccCcCC----------CCCceeeeeeeecccC
Q psy15346        157 ATVKIVGWGEEN----------GRPYWTIVRVYAVSAS  184 (280)
Q Consensus       157 ~~~~~~gwg~~~----------~~~~w~~~~~~~~~~~  184 (280)
                      |+|.+||||.++          +.+||++.|||...|.
T Consensus       422 HAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WG  459 (489)
T PTZ00021        422 HAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWG  459 (489)
T ss_pred             eEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcc
Confidence            388899999763          2589999999977554


No 35 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=92.03  E-value=0.056  Score=46.78  Aligned_cols=29  Identities=31%  Similarity=0.699  Sum_probs=24.6

Q ss_pred             heeeeeccCcCCCCCceeeeeeeecccCc
Q psy15346        157 ATVKIVGWGEENGRPYWTIVRVYAVSASA  185 (280)
Q Consensus       157 ~~~~~~gwg~~~~~~~w~~~~~~~~~~~~  185 (280)
                      |++.++||+.+.++.||+++|||...|..
T Consensus       167 Hav~iVGy~~~~~~~~wiv~NSWG~~WG~  195 (219)
T PF00112_consen  167 HAVLIVGYDDENGKGYWIVKNSWGTDWGD  195 (219)
T ss_dssp             EEEEEEEEEEETTEEEEEEE-SBTTTSTB
T ss_pred             ccccccccccccceeeEeeehhhCCccCC
Confidence            37889999999999999999999887664


No 36 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=91.65  E-value=0.13  Score=44.61  Aligned_cols=29  Identities=17%  Similarity=0.377  Sum_probs=25.2

Q ss_pred             heeeeeccCcCC--CCCceeeeeeeecccCc
Q psy15346        157 ATVKIVGWGEEN--GRPYWTIVRVYAVSASA  185 (280)
Q Consensus       157 ~~~~~~gwg~~~--~~~~w~~~~~~~~~~~~  185 (280)
                      |+|.++||+.+.  +.+||++.|||...|..
T Consensus       173 Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~  203 (223)
T cd02619         173 HAVVIVGYDDNYVEGKGAFIVKNSWGTDWGD  203 (223)
T ss_pred             eEEEEEeecCCCCCCCCEEEEEeCCCCcccc
Confidence            588899999887  89999999999876654


No 37 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=90.52  E-value=0.15  Score=54.95  Aligned_cols=31  Identities=26%  Similarity=0.519  Sum_probs=25.2

Q ss_pred             heeeeeccCcC-----CCCCceeeeeeeecccCccc
Q psy15346        157 ATVKIVGWGEE-----NGRPYWTIVRVYAVSASAEI  187 (280)
Q Consensus       157 ~~~~~~gwg~~-----~~~~~w~~~~~~~~~~~~~~  187 (280)
                      |+|.++|||.+     .+.+||++.|||...|..++
T Consensus       723 HAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnG  758 (1004)
T PTZ00462        723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEG  758 (1004)
T ss_pred             ceEEEEEecccccccCCCCceEEEEcCCCCCcCCCe
Confidence            48889999974     25799999999999886544


No 38 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=66.43  E-value=12  Score=32.93  Aligned_cols=26  Identities=15%  Similarity=0.391  Sum_probs=20.2

Q ss_pred             cCCeEEEEEEeec-cCCccEEEEEcCC
Q psy15346        188 VAYATVKLIGWGE-ENGRPYWTIVSTF  213 (280)
Q Consensus       188 ~~~HaV~IVGwG~-e~g~~YWiirNSW  213 (280)
                      ..+|||+||||-. .+|.++.++=|-|
T Consensus       118 ~~gHAlavvGya~~~~g~~~y~~WNPW  144 (175)
T PF05543_consen  118 HAGHALAVVGYAKPNNGQKTYYFWNPW  144 (175)
T ss_dssp             --EEEEEEEEEEEETTSEEEEEEE-TT
T ss_pred             ccceeEEEEeeeecCCCCeEEEEeCCc
Confidence            5789999999976 4578899999999


No 39 
>KOG4128|consensus
Probab=64.43  E-value=0.83  Score=44.05  Aligned_cols=54  Identities=15%  Similarity=0.247  Sum_probs=39.4

Q ss_pred             cCCeEEEEEEeec-c---CCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeeeEeecc
Q psy15346        188 VAYATVKLIGWGE-E---NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD  245 (280)
Q Consensus       188 ~~~HaV~IVGwG~-e---~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~~~p~~  245 (280)
                      .-.|||++.+-|. +   .+..=|.|.||||.+-|.+|+.+|..-    -.+.++.-.+.+.
T Consensus       370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt~e----wf~EY~feiVVd~  427 (457)
T KOG4128|consen  370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMTAE----WFREYAFEIVVDE  427 (457)
T ss_pred             HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhhHH----HHHhhheeEEeec
Confidence            4579999999983 2   455679999999999999999766532    2455555555544


No 40 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=64.06  E-value=16  Score=33.63  Aligned_cols=23  Identities=13%  Similarity=0.401  Sum_probs=17.6

Q ss_pred             chHHHHHHHHHhCCcEEEEEEeC
Q psy15346         89 DEVADIQQEIMKNGPVVANMYLY  111 (280)
Q Consensus        89 ~~~~~Ik~~I~~~GPV~v~~~v~  111 (280)
                      ...+.|++.|.++.||++.++.+
T Consensus        76 ~~~~~l~~~l~~g~pv~~~~D~~   98 (317)
T PF14399_consen   76 EAWEELKEALDAGRPVIVWVDMY   98 (317)
T ss_pred             HHHHHHHHHHhCCCceEEEeccc
Confidence            34557888888888999998764


No 41 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=56.98  E-value=7.1  Score=38.96  Aligned_cols=73  Identities=15%  Similarity=0.258  Sum_probs=51.2

Q ss_pred             EEEcCchHHHHH----HHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCC----------
Q psy15346         84 YYWVNDEVADIQ----QEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA----------  149 (280)
Q Consensus        84 ~y~~~~~~~~Ik----~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~----------  149 (280)
                      ++++  .+++|+    +.|..++||.++.++. .|+.                       |++||+....          
T Consensus       289 y~Nv--p~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~-----------------------~k~GI~d~~~~~~~~~f~~~  342 (437)
T cd00585         289 YLNV--PMDVLKKAAIAQLKDGEPVWFGCDVG-KFSD-----------------------RKSGILDTDLFDYELLFGID  342 (437)
T ss_pred             EEec--CHHHHHHHHHHHHhcCCCEEEEEEcC-hhhc-----------------------cCCccccCcccchhhhcCcc
Confidence            4455  344555    5678899999999996 4666                       7888875421          


Q ss_pred             ----------chhhhhhheeeeeccCcC-CCC-Cceeeeeeeecc
Q psy15346        150 ----------SAEIVAYATVKIVGWGEE-NGR-PYWTIVRVYAVS  182 (280)
Q Consensus       150 ----------~~~~~~~~~~~~~gwg~~-~~~-~~w~~~~~~~~~  182 (280)
                                +.+....|++.++|++.+ +++ .||++.|||...
T Consensus       343 ~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g~p~yw~VkNSWG~~  387 (437)
T cd00585         343 FGLNKAERLDYGESLMTHAMVLTGVDLDEDGKPVKWKVENSWGEK  387 (437)
T ss_pred             ccCCHHHHHhhcCCcCCeEEEEEEEEecCCCCcceEEEEcccCCC
Confidence                      112234578999999986 476 599999999653


No 42 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.59  E-value=32  Score=30.54  Aligned_cols=21  Identities=14%  Similarity=0.273  Sum_probs=15.3

Q ss_pred             CeEEEEEEeeccCCccEEEEEcCCC
Q psy15346        190 YATVKLIGWGEENGRPYWTIVSTFG  214 (280)
Q Consensus       190 ~HaV~IVGwG~e~g~~YWiirNSWG  214 (280)
                      -|+|+|+||++.    |...-++||
T Consensus       148 ~H~v~itgyDk~----n~yynDpyG  168 (195)
T COG4990         148 IHSVLITGYDKY----NIYYNDPYG  168 (195)
T ss_pred             eeeeEeeccccc----ceEeccccc
Confidence            599999999764    555566663


No 43 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=44.02  E-value=87  Score=28.37  Aligned_cols=21  Identities=14%  Similarity=0.382  Sum_probs=17.6

Q ss_pred             hHHHHHHHHHhCCcEEEEEEe
Q psy15346         90 EVADIQQEIMKNGPVVANMYL  110 (280)
Q Consensus        90 ~~~~Ik~~I~~~GPV~v~~~v  110 (280)
                      ++++|..+|..+||+++-++.
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~  132 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDA  132 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEcc
Confidence            688999999999988777765


No 44 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=37.78  E-value=43  Score=31.40  Aligned_cols=28  Identities=14%  Similarity=0.169  Sum_probs=24.0

Q ss_pred             cCCeEEEEEEeeccC--CccEEEEEcCCCC
Q psy15346        188 VAYATVKLIGWGEEN--GRPYWTIVSTFGE  215 (280)
Q Consensus       188 ~~~HaV~IVGwG~e~--g~~YWiirNSWG~  215 (280)
                      ..+||=.|++.-+-+  +...-.+||.||.
T Consensus       234 ~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~  263 (315)
T cd00044         234 VKGHAYSVLDVREVQEEGLRLLRLRNPWGV  263 (315)
T ss_pred             ccCcceEEeEEEEEccCceEEEEecCCccC
Confidence            568999999998765  7889999999993


No 45 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=34.65  E-value=71  Score=27.82  Aligned_cols=23  Identities=9%  Similarity=0.105  Sum_probs=16.5

Q ss_pred             hHHHHHHHHHhCCcEEEEEEeCc
Q psy15346         90 EVADIQQEIMKNGPVVANMYLYS  112 (280)
Q Consensus        90 ~~~~Ik~~I~~~GPV~v~~~v~~  112 (280)
                      +.+.+.+.|.++||+-++.....
T Consensus        97 t~e~~~~LL~~yGPLwv~~~~P~  119 (166)
T PF12385_consen   97 TAEGLANLLREYGPLWVAWEAPG  119 (166)
T ss_pred             CHHHHHHHHHHcCCeEEEecCCC
Confidence            45677777888899888865543


No 46 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=33.08  E-value=1.4e+02  Score=23.61  Aligned_cols=22  Identities=9%  Similarity=0.083  Sum_probs=15.5

Q ss_pred             CCeEEEEEEeeccCCccEEEEEcCC
Q psy15346        189 AYATVKLIGWGEENGRPYWTIVSTF  213 (280)
Q Consensus       189 ~~HaV~IVGwG~e~g~~YWiirNSW  213 (280)
                      .+|.|+|+||..   ....+|.+.|
T Consensus        93 ~gH~vVv~g~~~---~~~~~i~DP~  114 (141)
T cd02549          93 SGHAMVVIGYDR---KGNVYVNDPG  114 (141)
T ss_pred             CCeEEEEEEEcC---CCCEEEECCC
Confidence            489999999971   1235667765


No 47 
>PF01357 Pollen_allerg_1:  Pollen allergen;  InterPro: IPR007117 Expansins are unusual proteins that mediate cell wall extension in plants []. They are believed to act as a sort of chemical grease, allowing polymers to slide past one another by disrupting non-covalent hydrogen bonds that hold many wall polymers to one another. This process is not degradative and hence does not weaken the wall, which could otherwise rupture under internal pressure during growth. Sequence comparisons indicate at least four distinct expansin cDNAs in rice and at least six in Arabidopsis. The proteins are highly conserved in size and sequence (75-95% amino acid sequence similarity between any pairwise comparison), and phylogenetic trees indicate that this multigene family formed before the evolutionary divergence of monocotyledons and dicotyledons []. Sequence and motif analyses show no similarities to known functional domains that might account for expansin action on wall extension. It is thought that several highly-conserved tryptophans may function in expansin binding to cellulose, or other glycans. The high conservation of the family indicates that the mechanism by which expansins promote wall extensin tolerates little variation in protein structure.  Grass pollens, such as pollen from timothy grass, represent a major cause of type I allergy []. Interestingly, expansins share a high degree of sequence similarity with the Lol p I family of allergens. This entry represents the C-terminal domain.; PDB: 2VXQ_A 1WHP_A 1BMW_A 1WHO_A 2HCZ_X 2JNZ_A 3FT9_A 3FT1_C 1N10_B.
Probab=31.02  E-value=78  Score=23.91  Aligned_cols=43  Identities=16%  Similarity=0.300  Sum_probs=23.1

Q ss_pred             CCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCC
Q psy15346        170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG  218 (280)
Q Consensus       170 ~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG  218 (280)
                      +|||+.+-...+      .+.+.|.-|=--..+....--++++||..|=
T Consensus        10 ~~~~l~v~v~n~------gG~gdi~~Vevk~~~s~~W~~m~r~wGa~W~   52 (82)
T PF01357_consen   10 NPYYLAVLVKNV------GGDGDIKAVEVKQSGSGNWIPMKRSWGAVWQ   52 (82)
T ss_dssp             BTTEEEEEEEEC------CTTS-EEEEEEEETTSSS-EE-EEECTTEEE
T ss_pred             CCcEEEEEEEEc------CCCccEEEEEEEeCCCCCceEeecCcCceEE
Confidence            588888877766      3344444332221222234467889998884


Done!