Query         psy1911
Match_columns 342
No_of_seqs    430 out of 2698
Neff          7.0 
Searched_HMMs 46136
Date          Fri Aug 16 19:27:05 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1911.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1911hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd02620 Peptidase_C1A_Cathepsi 100.0 5.1E-29 1.1E-33  230.6  11.1  139  188-340     1-140 (236)
  2 KOG1542|consensus              100.0 9.5E-29 2.1E-33  232.9   7.3   95  182-289   152-247 (372)
  3 PTZ00049 cathepsin C-like prot  99.9 2.6E-27 5.5E-32  243.1  12.6  103  184-293   378-490 (693)
  4 cd02621 Peptidase_C1A_Cathepsi  99.9 3.5E-27 7.5E-32  219.1   9.3   96  187-289     1-100 (243)
  5 KOG1543|consensus               99.9   4E-27 8.8E-32  227.2   9.6   92  183-278   105-198 (325)
  6 cd02698 Peptidase_C1A_Cathepsi  99.9 3.4E-26 7.4E-31  212.1  10.7  105   63-167   127-238 (239)
  7 cd02620 Peptidase_C1A_Cathepsi  99.9 7.2E-26 1.6E-30  209.6  10.8  102   62-163   131-234 (236)
  8 KOG1542|consensus               99.9 1.1E-25 2.3E-30  212.3   8.6  112   52-165   252-369 (372)
  9 PTZ00203 cathepsin L protease;  99.9 2.1E-25 4.5E-30  217.0   9.3   92  185-289   124-218 (348)
 10 cd02621 Peptidase_C1A_Cathepsi  99.9 3.7E-25   8E-30  205.5  10.1   98   70-167   129-242 (243)
 11 KOG1543|consensus               99.9 6.2E-25 1.3E-29  212.0  10.9  112   52-164   206-320 (325)
 12 PTZ00364 dipeptidyl-peptidase   99.9 4.4E-25 9.6E-30  223.9   9.9  100  184-289   202-308 (548)
 13 PTZ00203 cathepsin L protease;  99.9 1.3E-24 2.8E-29  211.4  11.3  100   62-163   235-336 (348)
 14 cd02698 Peptidase_C1A_Cathepsi  99.9 9.7E-25 2.1E-29  202.4   9.7   94  187-289     1-98  (239)
 15 PTZ00200 cysteine proteinase;   99.9 6.2E-25 1.3E-29  219.4   8.5   91  186-289   233-324 (448)
 16 cd02248 Peptidase_C1A Peptidas  99.9 1.9E-24   4E-29  195.3  10.8  103   60-162   101-207 (210)
 17 PTZ00021 falcipain-2; Provisio  99.9 1.4E-24   3E-29  218.0   8.0   90  187-289   266-356 (489)
 18 smart00645 Pept_C1 Papain fami  99.9 3.2E-24 6.9E-29  189.8   8.1   86  187-278     1-88  (174)
 19 PTZ00021 falcipain-2; Provisio  99.9   1E-23 2.2E-28  211.7  10.6  104   62-166   370-487 (489)
 20 PTZ00200 cysteine proteinase;   99.9 1.5E-23 3.3E-28  209.4  10.9  104   62-166   336-444 (448)
 21 PF00112 Peptidase_C1:  Papain   99.9 2.9E-23 6.3E-28  187.5   8.8  104   62-165   109-218 (219)
 22 cd02248 Peptidase_C1A Peptidas  99.9 6.4E-23 1.4E-27  185.3   8.6   89  188-289     1-90  (210)
 23 PTZ00049 cathepsin C-like prot  99.9   1E-22 2.2E-27  209.3  10.8   91   77-167   561-676 (693)
 24 PF00112 Peptidase_C1:  Papain   99.9   2E-22 4.4E-27  182.0   8.1   92  187-289     1-94  (219)
 25 PTZ00364 dipeptidyl-peptidase   99.9 8.6E-22 1.9E-26  200.0   9.8  101   65-165   333-457 (548)
 26 KOG1544|consensus               99.8 9.4E-21   2E-25  177.1   6.9  108   63-170   341-463 (470)
 27 KOG1544|consensus               99.8 2.8E-21 6.1E-26  180.6   0.4  100  181-289   203-303 (470)
 28 cd02619 Peptidase_C1 C1 Peptid  99.8 1.4E-19   3E-24  163.7   9.7   93   61-153   110-213 (223)
 29 PTZ00462 Serine-repeat antigen  99.8   1E-19 2.3E-24  192.2  10.1   85   77-162   684-776 (1004)
 30 cd02619 Peptidase_C1 C1 Peptid  99.8 3.6E-19 7.7E-24  161.0   8.8   89  190-290     1-95  (223)
 31 smart00645 Pept_C1 Papain fami  99.8 1.3E-18 2.9E-23  153.9   7.1   75   87-162    93-170 (174)
 32 PTZ00462 Serine-repeat antigen  99.7 3.3E-17 7.2E-22  173.4   8.8   84  197-289   538-625 (1004)
 33 cd00585 Peptidase_C1B Peptidas  99.5 8.4E-15 1.8E-19  146.0   8.4   85   67-152   288-399 (437)
 34 COG4870 Cysteine protease [Pos  99.4   8E-13 1.7E-17  126.6   8.2   99   53-153   197-314 (372)
 35 PF03051 Peptidase_C1_2:  Pepti  98.8 1.5E-08 3.3E-13  101.4   8.9   84   67-151   289-399 (438)
 36 COG4870 Cysteine protease [Pos  98.5 2.1E-08 4.6E-13   96.5  -0.3   86  185-278    97-190 (372)
 37 cd00585 Peptidase_C1B Peptidas  97.1  0.0011 2.3E-08   67.0   6.5   75  204-279    55-157 (437)
 38 PF03051 Peptidase_C1_2:  Pepti  96.9 0.00086 1.9E-08   67.6   4.0   74  204-278    56-157 (438)
 39 COG3579 PepC Aminopeptidase C   96.8 0.00093   2E-08   64.2   2.9   70   81-151   308-401 (444)
 40 PF13529 Peptidase_C39_2:  Pept  84.0     1.5 3.2E-05   35.8   4.1   52   77-137    92-144 (144)
 41 COG3579 PepC Aminopeptidase C   82.5     1.9 4.2E-05   42.0   4.6   74  204-278    58-159 (444)
 42 PF14399 Transpep_BrtH:  NlpC/p  80.8     3.5 7.5E-05   39.2   5.8   52   77-135    81-133 (317)
 43 PF10500 SR-25:  Nuclear RNA-sp  79.8     2.3 5.1E-05   38.9   3.9    6   82-87    141-146 (225)
 44 PF10500 SR-25:  Nuclear RNA-sp  66.8     7.5 0.00016   35.7   3.9   11   77-87    151-161 (225)
 45 PF05543 Peptidase_C47:  Stapho  65.8      24 0.00052   31.3   6.8   37  111-152   117-154 (175)
 46 KOG4364|consensus               65.8      11 0.00023   39.9   5.2   14   78-91    427-440 (811)
 47 KOG4128|consensus               62.3     1.2 2.5E-05   43.3  -2.2   39  112-150   370-412 (457)
 48 COG4990 Uncharacterized protei  52.9      24 0.00052   31.6   4.5   50   69-138   116-168 (195)
 49 KOG4364|consensus               50.1      28 0.00061   37.0   5.1   10  114-123   463-472 (811)
 50 cd02549 Peptidase_C39A A sub-f  48.7      49  0.0011   26.9   5.7   22  113-137    93-114 (141)
 51 PF01754 zf-A20:  A20-like zinc  47.2      14  0.0003   22.1   1.5   19  198-216     6-24  (25)
 52 KOG3925|consensus               42.5      40 0.00087   33.2   4.7   14  254-267   306-319 (371)
 53 PF09778 Guanylate_cyc_2:  Guan  34.8 1.3E+02  0.0028   27.7   6.4   14  113-126   160-173 (212)
 54 smart00259 ZnF_A20 A20-like zi  32.9      24 0.00051   21.3   0.9   19  198-216     7-25  (26)
 55 PF15246 NCKAP5:  Nck-associate  31.2      50  0.0011   31.9   3.3   36   48-83    125-160 (318)
 56 KOG4128|consensus               31.1      21 0.00045   35.0   0.7   76  203-279    62-167 (457)
 57 cd00044 CysPc Calpains, domain  25.0      75  0.0016   30.5   3.4   27  113-139   235-263 (315)
 58 PF12385 Peptidase_C70:  Papain  24.6 1.6E+02  0.0034   25.9   4.9   34   77-125   102-135 (166)
 59 KOG1144|consensus               23.4 1.4E+02  0.0031   32.7   5.2   10   77-86    298-307 (1064)
 60 PF12569 NARP1:  NMDA receptor-  22.4      78  0.0017   32.9   3.2    9   77-85    480-488 (517)
 61 KOG3925|consensus               21.4 1.4E+02  0.0031   29.5   4.5    8   85-92     97-104 (371)

No 1  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.96  E-value=5.1e-29  Score=230.58  Aligned_cols=139  Identities=53%  Similarity=0.998  Sum_probs=107.6

Q ss_pred             CcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHHH
Q psy1911         188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWRF  266 (342)
Q Consensus       188 P~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~y  266 (342)
                      |++||||++|++|..|+||+||+.||||||||++++||++++|++++...+.||+|+||||+.. ..||+||++..||+|
T Consensus         1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~   80 (236)
T cd02620           1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY   80 (236)
T ss_pred             CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence            8999999999988888999999999999999999999999999988555789999999999876 559999999999999


Q ss_pred             hhhcccccCCCCCCCCCcccCCCCCCccCCCCCCccCCCCCccccccccccccccccccccccccccccceeEe
Q psy1911         267 WGHNGVVTGGDYNSQEGCQPYTLAPCEHHVQGPLQNCTLLGKLKTPECKQNCYNPSYESTYRFDLKKGKKAHMV  340 (342)
Q Consensus       267 ~~~~Gi~~e~~Y~~~~~C~PY~~~~c~~~~~~~~~~C~~~~~~~~p~C~~~C~~~~~~~~y~~d~~~~~~~y~~  340 (342)
                      |+++||++|.+|       ||....|..+.... .+|..     ++.|+..|. ......|..+.++....|.+
T Consensus        81 i~~~G~~~e~~y-------PY~~~~~~~~~~~~-~~~~~-----~~~~~~~C~-~~~~~~~~~~~~~~~~~~~~  140 (236)
T cd02620          81 LTTTGVVTGGCQ-------PYTIPPCGHHPEGP-PPCCG-----TPYCTPKCQ-DGCEKTYEEDKHKGKSAYSV  140 (236)
T ss_pred             HHhcCCCcCCEe-------cCcCCCCccCCCCC-CCCCC-----CCCCCCCCC-cCCccccceeeeeecceeee
Confidence            999999996655       99876665433222 34542     335555665 33222355555555544543


No 2  
>KOG1542|consensus
Probab=99.95  E-value=9.5e-29  Score=232.90  Aligned_cols=95  Identities=35%  Similarity=0.653  Sum_probs=88.0

Q ss_pred             cCCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchh
Q psy1911         182 QNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQ  261 (342)
Q Consensus       182 ~~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~  261 (342)
                      .....||++||||++    +.||||+|||.||||||||+++++|....|+++  ..+.||+|+||||+....||+||.+.
T Consensus       152 ~~~~~lP~~fDWR~k----gaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g--~LvsLSEQeLvDCD~~d~gC~GGl~~  225 (372)
T KOG1542|consen  152 EPGESLPESFDWRDK----GAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATG--KLVSLSEQELVDCDSCDNGCNGGLMD  225 (372)
T ss_pred             CCCCCCCcccchhcc----CCccccccCCcCcchhhhhhhhhhhhHHHhhcC--cccccchhhhhcccCcCCcCCCCChh
Confidence            356789999999999    999999999999999999999999999999999  68999999999998777799999999


Q ss_pred             hHHHHh-hhcccccCCCCCCCCCcccCCC
Q psy1911         262 LAWRFW-GHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       262 ~a~~y~-~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      .||+|+ ...||.+|.+|       ||++
T Consensus       226 nA~~~~~~~gGL~~E~dY-------PY~g  247 (372)
T KOG1542|consen  226 NAFKYIKKAGGLEKEKDY-------PYTG  247 (372)
T ss_pred             HHHHHHHHhCCccccccC-------Cccc
Confidence            999995 55699999999       8876


No 3  
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.94  E-value=2.6e-27  Score=243.06  Aligned_cols=103  Identities=29%  Similarity=0.536  Sum_probs=91.2

Q ss_pred             CCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCc--------cccccCHHHHHhhCCCCCCC
Q psy1911         184 AKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGY--------FTGQISAQHIVACTPNCWGC  255 (342)
Q Consensus       184 ~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~--------~~~~LS~Q~lldC~~~~~GC  255 (342)
                      ..+||++||||+.|++|+.|+||+|||.||||||||++++||+|++|++++.        +...||+|+||||+..++||
T Consensus       378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC  457 (693)
T PTZ00049        378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGC  457 (693)
T ss_pred             cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCc
Confidence            5689999999999999999999999999999999999999999999997531        23479999999999877899


Q ss_pred             CCCchhhHHHHhhhcccccCCCCCCCCCcccCCC--CCCc
Q psy1911         256 NGGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL--APCE  293 (342)
Q Consensus       256 ~GG~~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~--~~c~  293 (342)
                      +||++..|++|+.++||+++..|       ||++  +.|.
T Consensus       458 ~GG~~~~A~kya~~~GI~tEscY-------PY~a~~g~C~  490 (693)
T PTZ00049        458 NGGFPYLVSKMAKLQGIPLDKVF-------PYTATEQTCP  490 (693)
T ss_pred             CCCcHHHHHHHHHHCCCCcCCcc-------CCcCCCCCCC
Confidence            99999999999999999996555       8875  4564


No 4  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.94  E-value=3.5e-27  Score=219.10  Aligned_cols=96  Identities=34%  Similarity=0.628  Sum_probs=87.8

Q ss_pred             CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCC----ccccccCHHHHHhhCCCCCCCCCCchhh
Q psy1911         187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNG----YFTGQISAQHIVACTPNCWGCNGGWPQL  262 (342)
Q Consensus       187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~----~~~~~LS~Q~lldC~~~~~GC~GG~~~~  262 (342)
                      ||++||||+.|++|..|+||+||+.||||||||++++||++++|+++.    ...+.||+|+||||+....||+||++..
T Consensus         1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~   80 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL   80 (243)
T ss_pred             CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence            799999999998889999999999999999999999999999999875    2368899999999987667999999999


Q ss_pred             HHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911         263 AWRFWGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       263 a~~y~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      |++|+.++||+++.+|       ||..
T Consensus        81 a~~~~~~~Gi~~e~~y-------PY~~  100 (243)
T cd02621          81 VGKFAEDFGIVTEDYF-------PYTA  100 (243)
T ss_pred             HHHHHHhcCcCCCcee-------CCCC
Confidence            9999999999996665       8875


No 5  
>KOG1543|consensus
Probab=99.94  E-value=4e-27  Score=227.19  Aligned_cols=92  Identities=39%  Similarity=0.743  Sum_probs=84.3

Q ss_pred             CCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCC-CCCCCchh
Q psy1911         183 NAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGWPQ  261 (342)
Q Consensus       183 ~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~-GC~GG~~~  261 (342)
                      ...+||++||||++|   +.+.|||||+.||||||||++++||+|++|++++ .++.||+||||||+..++ ||+||++.
T Consensus       105 ~~~~~p~s~DwR~~~---~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~-~l~sLSeq~lvdC~~~~~~GC~GG~~~  180 (325)
T KOG1543|consen  105 DGDDLPDSFDWRDKG---AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGG-KLLSLSEQDLVDCCGECGDGCNGGEPK  180 (325)
T ss_pred             chhhCCCCccccccC---CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCC-ccCccChhhhhhccCCCCCCcCCCCHH
Confidence            356899999999997   5677899999999999999999999999999997 788999999999998744 99999999


Q ss_pred             hHHHHhhhccccc-CCCC
Q psy1911         262 LAWRFWGHNGVVT-GGDY  278 (342)
Q Consensus       262 ~a~~y~~~~Gi~~-e~~Y  278 (342)
                      .||+|+.++|+++ +.+|
T Consensus       181 ~A~~yi~~~G~~t~~~~Y  198 (325)
T KOG1543|consen  181 NAFKYIKKNGGVTECENY  198 (325)
T ss_pred             HHHHHHHHhCCCCCCcCC
Confidence            9999999999998 6666


No 6  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.93  E-value=3.4e-26  Score=212.09  Aligned_cols=105  Identities=29%  Similarity=0.602  Sum_probs=92.9

Q ss_pred             eeeeeeEEcCchHH-HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccC-CccEEEEecCCCCC
Q psy1911          63 HYFKKAHMVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN-DIPYWLVANSWNDH  140 (342)
Q Consensus        63 ~~i~~~~~v~~~~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~-g~~ywiikNSWG~~  140 (342)
                      +++.+|..++..+. +++|+++|||+++|.++++|+.|++|||...++...++|+|+|||||++. +.+|||||||||+.
T Consensus       127 ~~i~~~~~~~~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~  206 (239)
T cd02698         127 YFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDENGVEYWIVRNSWGEP  206 (239)
T ss_pred             EEeeeceecCCHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecCCCCEEEEEEcCCCcc
Confidence            56778888876555 77899999999999999899999999998776666679999999999876 89999999999999


Q ss_pred             CCCCCceeeecCC-----cchhhccccccccc
Q psy1911         141 WGDHGTFKILRGE-----NEADIEMGFNNRVE  167 (342)
Q Consensus       141 WG~~Gy~~i~~~~-----~~~gi~~~~~~~v~  167 (342)
                      ||++|||||.|+.     |++||++.+.++.+
T Consensus       207 WGe~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~  238 (239)
T cd02698         207 WGERGWFRIVTSSYKGARYNLAIEEDCAWADP  238 (239)
T ss_pred             cCcCceEEEEccCCcccccccccccceEEEee
Confidence            9999999999999     99999988877653


No 7  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.93  E-value=7.2e-26  Score=209.58  Aligned_cols=102  Identities=54%  Similarity=1.018  Sum_probs=89.1

Q ss_pred             ceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCCC
Q psy1911          62 SHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND  139 (342)
Q Consensus        62 ~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG~  139 (342)
                      .+++..++.+..++.  +.+|+++|||+++|.++++|+.|.+|||...++...++|||+|||||++++.+|||||||||+
T Consensus       131 ~~~~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~  210 (236)
T cd02620         131 KHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVENGVPYWLAANSWGT  210 (236)
T ss_pred             eeeecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCC
Confidence            455667777765433  678999999999999988999999999987665556799999999999888999999999999


Q ss_pred             CCCCCCceeeecCCcchhhccccc
Q psy1911         140 HWGDHGTFKILRGENEADIEMGFN  163 (342)
Q Consensus       140 ~WG~~Gy~~i~~~~~~~gi~~~~~  163 (342)
                      .||++|||||.|+.|.|||++.+.
T Consensus       211 ~WGe~Gy~ri~~~~~~cgi~~~~~  234 (236)
T cd02620         211 DWGENGYFRILRGSNECGIESEVV  234 (236)
T ss_pred             CCCCCcEEEEEccCccccccccee
Confidence            999999999999999999987764


No 8  
>KOG1542|consensus
Probab=99.92  E-value=1.1e-25  Score=212.31  Aligned_cols=112  Identities=25%  Similarity=0.471  Sum_probs=98.3

Q ss_pred             ccCCCCCcccceeeeeeEEcCchHH--HHHHHHcCCeEEEEeecccccccccccccc---CCCCCccceEEEEeeecccC
Q psy1911          52 RLYLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQH---NFGDSIGLHAVRVLGWGVEN  126 (342)
Q Consensus        52 ~~~~p~s~~~~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~---~~~~~~~~Hav~ivGyg~~~  126 (342)
                      .|.... .....+|++|+.++.+++  ++.|+++|||+|+|++ ..+|+|++||+.+   .|....++|+|+|||||..+
T Consensus       252 ~C~~~~-~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g  329 (372)
T KOG1542|consen  252 QCHFDK-SKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG  329 (372)
T ss_pred             ccccch-hhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCCcccCCccccCceEEEEeecCCC
Confidence            666665 446799999999999887  5679999999999996 4699999999987   46666689999999999988


Q ss_pred             -CccEEEEecCCCCCCCCCCceeeecCCcchhhccccccc
Q psy1911         127 -DIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNR  165 (342)
Q Consensus       127 -g~~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~  165 (342)
                       ..+|||||||||+.||++||+++.||.|.|||...+..+
T Consensus       330 ~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~  369 (372)
T KOG1542|consen  330 YEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSA  369 (372)
T ss_pred             CCCceEEEECCccccccccceEEEeccccccccccchhhh
Confidence             899999999999999999999999999999998776544


No 9  
>PTZ00203 cathepsin L protease; Provisional
Probab=99.92  E-value=2.1e-25  Score=216.97  Aligned_cols=92  Identities=33%  Similarity=0.589  Sum_probs=83.3

Q ss_pred             CCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHH
Q psy1911         185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAW  264 (342)
Q Consensus       185 ~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~  264 (342)
                      .+||++||||++    +.|+||+|||.||||||||++++||++++|+++  ..+.||+|+||||+..+.||+||++..||
T Consensus       124 ~~lP~~~DWR~~----g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~--~~~~LSeQqLvdC~~~~~GC~GG~~~~a~  197 (348)
T PTZ00203        124 SAVPDAVDWREK----GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGH--KLVRLSEQQLVSCDHVDNGCGGGLMLQAF  197 (348)
T ss_pred             ccCCCCCcCCcC----CCCCCccccCCCccHHHHhhHHHHHHHHHHhcC--CCccCCHHHHHhccCCCCCCCCCCHHHHH
Confidence            368999999998    889999999999999999999999999999988  46789999999998776799999999999


Q ss_pred             HHhhhc---ccccCCCCCCCCCcccCCC
Q psy1911         265 RFWGHN---GVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       265 ~y~~~~---Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      +|+.++   ||++|.+|       ||..
T Consensus       198 ~yi~~~~~ggi~~e~~Y-------PY~~  218 (348)
T PTZ00203        198 EWVLRNMNGTVFTEKSY-------PYVS  218 (348)
T ss_pred             HHHHHhcCCCCCccccC-------CCcc
Confidence            999764   57888888       8864


No 10 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.92  E-value=3.7e-25  Score=205.52  Aligned_cols=98  Identities=41%  Similarity=0.778  Sum_probs=82.7

Q ss_pred             EcCchHH-HHHHHHcCCeEEEEeeccccccccccccccC-----CCC--------CccceEEEEeeecccC--CccEEEE
Q psy1911          70 MVPRCNA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-----FGD--------SIGLHAVRVLGWGVEN--DIPYWLV  133 (342)
Q Consensus        70 ~v~~~~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~-----~~~--------~~~~Hav~ivGyg~~~--g~~ywii  133 (342)
                      .+...+. +++|+++|||+++|++.++|++|++|||...     |..        ..++|+|+|||||++.  +.+||||
T Consensus       129 ~~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWii  208 (243)
T cd02621         129 GCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIV  208 (243)
T ss_pred             ccCCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEE
Confidence            3334444 6789999999999999989999999999765     321        2468999999999876  8899999


Q ss_pred             ecCCCCCCCCCCceeeecCCcchhhccccccccc
Q psy1911         134 ANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE  167 (342)
Q Consensus       134 kNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~  167 (342)
                      |||||+.||++|||||.|+.|.|||+..+..+++
T Consensus       209 rNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~  242 (243)
T cd02621         209 KNSWGSSWGEKGYFKIRRGTNECGIESQAVFAYP  242 (243)
T ss_pred             EcCCCCCCCcCCeEEEecCCcccCcccceEeecc
Confidence            9999999999999999999999999888766543


No 11 
>KOG1543|consensus
Probab=99.92  E-value=6.2e-25  Score=212.00  Aligned_cols=112  Identities=32%  Similarity=0.617  Sum_probs=98.5

Q ss_pred             ccCCCCCcccceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCC-ccceEEEEeeecccCCc
Q psy1911          52 RLYLPTSIPLSHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDS-IGLHAVRVLGWGVENDI  128 (342)
Q Consensus        52 ~~~~p~s~~~~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~-~~~Hav~ivGyg~~~g~  128 (342)
                      .|-.... ...+.+.+++.++.++.  +.+|+.+|||+|+|++.++|+.|++|||.++++.. .++|+|+|||||..++.
T Consensus       206 ~C~~~~~-~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~~~~~  284 (325)
T KOG1543|consen  206 TCKSNKK-DKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGTGDGV  284 (325)
T ss_pred             CccCCCc-cceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcCCCCc
Confidence            4444443 56788999999999976  78899999999999999999999999999988776 48999999999996568


Q ss_pred             cEEEEecCCCCCCCCCCceeeecCCcchhhcccccc
Q psy1911         129 PYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNN  164 (342)
Q Consensus       129 ~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~  164 (342)
                      +||||+||||+.||++|||||.|+.+.|+|++...+
T Consensus       285 ~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~~  320 (325)
T KOG1543|consen  285 DYWIVKNSWGTDWGEKGYFRIARGVNKCGIASEASY  320 (325)
T ss_pred             eeEEEEcCCCCCcccCceEEEecCCCchhhhccccc
Confidence            999999999999999999999999999998776655


No 12 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.92  E-value=4.4e-25  Score=223.93  Aligned_cols=100  Identities=25%  Similarity=0.424  Sum_probs=88.7

Q ss_pred             CCCCCcccccCccCCCCCCCcccccccc---cchHHHHHhhHHHHHHHHHHhCCc----cccccCHHHHHhhCCCCCCCC
Q psy1911         184 AKGLPRNFDAREKWPECPSLRHIADQSN---CGSCWAVSVANAISDRLCIASNGY----FTGQISAQHIVACTPNCWGCN  256 (342)
Q Consensus       184 ~~~lP~sfD~R~~~~~c~~i~~v~dQg~---CGsCwAfa~~~~le~r~~i~~~~~----~~~~LS~Q~lldC~~~~~GC~  256 (342)
                      ..+||++||||++ .++++|+|||||+.   ||||||||++++||+|++|++++.    ..+.||+|+||||+..+.||+
T Consensus       202 ~~~LP~sfDWR~~-gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCd  280 (548)
T PTZ00364        202 GDPPPAAWSWGDV-GGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCA  280 (548)
T ss_pred             ccCCCCccccCcC-CCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCC
Confidence            4689999999998 34457999999999   999999999999999999998643    357899999999987767999


Q ss_pred             CCchhhHHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911         257 GGWPQLAWRFWGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       257 GG~~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      ||++..|++|++++||++|.+|     |.||..
T Consensus       281 GG~p~~A~~yi~~~GI~tE~dY-----~~PY~~  308 (548)
T PTZ00364        281 GGFPEEVGKFAETFGILTTDSY-----YIPYDS  308 (548)
T ss_pred             CCcHHHHHHHHHhCCccccccc-----CCCCCC
Confidence            9999999999999999998887     668864


No 13 
>PTZ00203 cathepsin L protease; Provisional
Probab=99.91  E-value=1.3e-24  Score=211.42  Aligned_cols=100  Identities=21%  Similarity=0.456  Sum_probs=85.4

Q ss_pred             ceeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCCC
Q psy1911          62 SHYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWND  139 (342)
Q Consensus        62 ~~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG~  139 (342)
                      ...+.+|..|+.++.  +.+|+++|||+|+|++. +|++|++|||.. |....++|+|+|||||.++|.+|||||||||+
T Consensus       235 ~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~  312 (348)
T PTZ00203        235 GARIDGYVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGE  312 (348)
T ss_pred             ceEecceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecCCCceEEEEEcCCCC
Confidence            456778888876544  56788899999999985 899999999974 54445699999999999888999999999999


Q ss_pred             CCCCCCceeeecCCcchhhccccc
Q psy1911         140 HWGDHGTFKILRGENEADIEMGFN  163 (342)
Q Consensus       140 ~WG~~Gy~~i~~~~~~~gi~~~~~  163 (342)
                      .||++|||||.++.|.|||...+.
T Consensus       313 ~WGe~GY~ri~rg~n~Cgi~~~~~  336 (348)
T PTZ00203        313 DWGEKGYVRVTMGVNACLLTGYPV  336 (348)
T ss_pred             CcCcCceEEEEcCCCcccccceEE
Confidence            999999999999999999975443


No 14 
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.91  E-value=9.7e-25  Score=202.38  Aligned_cols=94  Identities=29%  Similarity=0.545  Sum_probs=83.1

Q ss_pred             CCcccccCccCCCCCCCccccccc---ccchHHHHHhhHHHHHHHHHHhCCcc-ccccCHHHHHhhCCCCCCCCCCchhh
Q psy1911         187 LPRNFDAREKWPECPSLRHIADQS---NCGSCWAVSVANAISDRLCIASNGYF-TGQISAQHIVACTPNCWGCNGGWPQL  262 (342)
Q Consensus       187 lP~sfD~R~~~~~c~~i~~v~dQg---~CGsCwAfa~~~~le~r~~i~~~~~~-~~~LS~Q~lldC~~~~~GC~GG~~~~  262 (342)
                      ||++||||+.+.. ..|+||+||+   .||||||||++++||+|++|++++.. .+.||+|+||||+. ..||+||++..
T Consensus         1 lP~~~Dwr~~~~~-~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~~~~   78 (239)
T cd02698           1 LPKSWDWRNVNGV-NYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGDPGG   78 (239)
T ss_pred             CCCCcccccCCCC-cccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcCHHH
Confidence            7999999998432 4899999998   89999999999999999999987543 57899999999986 56999999999


Q ss_pred             HHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911         263 AWRFWGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       263 a~~y~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      ||+|++++||++|.+|       ||..
T Consensus        79 a~~~~~~~Gl~~e~~y-------PY~~   98 (239)
T cd02698          79 VYEYAHKHGIPDETCN-------PYQA   98 (239)
T ss_pred             HHHHHHHcCcCCCCee-------CCcC
Confidence            9999999999996666       8875


No 15 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.91  E-value=6.2e-25  Score=219.36  Aligned_cols=91  Identities=30%  Similarity=0.611  Sum_probs=83.8

Q ss_pred             CCCcccccCccCCCCCCCccccccc-ccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHH
Q psy1911         186 GLPRNFDAREKWPECPSLRHIADQS-NCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAW  264 (342)
Q Consensus       186 ~lP~sfD~R~~~~~c~~i~~v~dQg-~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~  264 (342)
                      .+|++||||+.    +.|+||+||| .||||||||++++||++++|.++  ..+.||+|+||||+..+.||+||++..||
T Consensus       233 ~~P~~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~--~~~~LSeQqLvDC~~~~~GC~GG~~~~A~  306 (448)
T PTZ00200        233 ITGEGLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRD--KSVDLSEQELVNCDTKSQGCSGGYPDTAL  306 (448)
T ss_pred             cCCCCccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHHHhcC--CCeecCHHHHhhccCccCCCCCCcHHHHH
Confidence            46999999998    8899999999 99999999999999999999876  46889999999998777799999999999


Q ss_pred             HHhhhcccccCCCCCCCCCcccCCC
Q psy1911         265 RFWGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       265 ~y~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      +|++++||+++.+|       ||++
T Consensus       307 ~yi~~~Gi~~e~~Y-------PY~~  324 (448)
T PTZ00200        307 EYVKNKGLSSSSDV-------PYLA  324 (448)
T ss_pred             HHHhhcCccccccC-------CCCC
Confidence            99999999997777       8875


No 16 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.91  E-value=1.9e-24  Score=195.29  Aligned_cols=103  Identities=30%  Similarity=0.557  Sum_probs=91.1

Q ss_pred             ccceeeeeeEEcCch--HH-HHHHHHcCCeEEEEeeccccccccccccccCCC-CCccceEEEEeeecccCCccEEEEec
Q psy1911          60 PLSHYFKKAHMVPRC--NA-MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG-DSIGLHAVRVLGWGVENDIPYWLVAN  135 (342)
Q Consensus        60 ~~~~~i~~~~~v~~~--~~-~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~-~~~~~Hav~ivGyg~~~g~~ywiikN  135 (342)
                      ....++.+|+.|+..  +. +++|+++|||+++|.+.++|+.|.+|||..+.+ ...++|+|+|||||++.+.+||||+|
T Consensus       101 ~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~~~~~ywiv~N  180 (210)
T cd02248         101 KVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTENGVDYWIVKN  180 (210)
T ss_pred             cccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeecCCceEEEEEc
Confidence            456888999999874  33 788999999999999999999999999987665 45679999999999988899999999


Q ss_pred             CCCCCCCCCCceeeecCCcchhhcccc
Q psy1911         136 SWNDHWGDHGTFKILRGENEADIEMGF  162 (342)
Q Consensus       136 SWG~~WG~~Gy~~i~~~~~~~gi~~~~  162 (342)
                      |||+.||++|||||+++.+.|||....
T Consensus       181 SWG~~WG~~Gy~~i~~~~~~cgi~~~~  207 (210)
T cd02248         181 SWGTSWGEKGYIRIARGSNLCGIASYA  207 (210)
T ss_pred             CCCCccccCcEEEEEcCCCccCceeee
Confidence            999999999999999999999997554


No 17 
>PTZ00021 falcipain-2; Provisional
Probab=99.90  E-value=1.4e-24  Score=218.00  Aligned_cols=90  Identities=31%  Similarity=0.537  Sum_probs=82.8

Q ss_pred             CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCCCCCCCchhhHHHH
Q psy1911         187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCWGCNGGWPQLAWRF  266 (342)
Q Consensus       187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~GC~GG~~~~a~~y  266 (342)
                      +|++||||+.    +.|+||+|||.||||||||++++||++++|+++  ..+.||+|+||||+..+.||+||++..||+|
T Consensus       266 ~P~s~DWR~~----g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g--~~v~LSeQqLVDCs~~n~GC~GG~~~~Af~y  339 (489)
T PTZ00021        266 DHAKYDWRLH----NGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKN--ELVSLSEQELVDCSFKNNGCYGGLIPNAFED  339 (489)
T ss_pred             CccccccccC----CCCCCcccccccccHHHHHHHHHHHHHHHHHcC--CCcccCHHHHhhhccCCCCCCCcchHhhhhh
Confidence            5999999998    889999999999999999999999999999988  4688999999999877779999999999999


Q ss_pred             hhhc-ccccCCCCCCCCCcccCCC
Q psy1911         267 WGHN-GVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       267 ~~~~-Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      +.++ ||++|.+|       ||..
T Consensus       340 i~~~gGl~tE~~Y-------PY~~  356 (489)
T PTZ00021        340 MIELGGLCSEDDY-------PYVS  356 (489)
T ss_pred             hhhccccCccccc-------CccC
Confidence            9776 89998888       8864


No 18 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.90  E-value=3.2e-24  Score=189.81  Aligned_cols=86  Identities=40%  Similarity=0.790  Sum_probs=78.2

Q ss_pred             CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHH
Q psy1911         187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWR  265 (342)
Q Consensus       187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~  265 (342)
                      ||++||||+.    +.++||+||+.||+|||||++++||++++|+++.  .+.||+|+|++|... ..||+||++..|++
T Consensus         1 lP~~~D~R~~----~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~--~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~   74 (174)
T smart00645        1 LPESFDWRKK----GAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGK--LVSLSEQQLVDCSTGGNNGCNGGLPDNAFE   74 (174)
T ss_pred             CCCcCccccc----CCCCccccCcccchHHHHHHHHHHHHHHHHhcCC--ccccCHHHHhhhcCCCCCCCCCcCHHHHHH
Confidence            7999999998    3689999999999999999999999999999884  678999999999875 44999999999999


Q ss_pred             Hhhhc-ccccCCCC
Q psy1911         266 FWGHN-GVVTGGDY  278 (342)
Q Consensus       266 y~~~~-Gi~~e~~Y  278 (342)
                      |+.++ ||++|.+|
T Consensus        75 ~~~~~~Gi~~e~~~   88 (174)
T smart00645       75 YIKKNGGLETESCY   88 (174)
T ss_pred             HHHHcCCccccccc
Confidence            99998 99996555


No 19 
>PTZ00021 falcipain-2; Provisional
Probab=99.90  E-value=1e-23  Score=211.70  Aligned_cols=104  Identities=26%  Similarity=0.499  Sum_probs=88.2

Q ss_pred             ceeeeeeEEcCchHHHHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccC----------CccEE
Q psy1911          62 SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVEN----------DIPYW  131 (342)
Q Consensus        62 ~~~i~~~~~v~~~~~~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~----------g~~yw  131 (342)
                      .+.+.+|..|+..+.+++|+.+|||+|+|+++.+|++|.+|||...|.. .++|||+|||||.++          +.+||
T Consensus       370 ~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YW  448 (489)
T PTZ00021        370 KYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYY  448 (489)
T ss_pred             cceeeeEEEecHHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCCCCC-ccceEEEEEEecCcCCcccccccCCCCCEE
Confidence            4678889999865557788889999999999889999999999876654 469999999999753          24799


Q ss_pred             EEecCCCCCCCCCCceeeecCC----cchhhcccccccc
Q psy1911         132 LVANSWNDHWGDHGTFKILRGE----NEADIEMGFNNRV  166 (342)
Q Consensus       132 iikNSWG~~WG~~Gy~~i~~~~----~~~gi~~~~~~~v  166 (342)
                      |||||||+.||++|||||.++.    |.|||...+.+++
T Consensus       449 IVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP~  487 (489)
T PTZ00021        449 IIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVPL  487 (489)
T ss_pred             EEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccceeEe
Confidence            9999999999999999999986    5899987766553


No 20 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.89  E-value=1.5e-23  Score=209.40  Aligned_cols=104  Identities=28%  Similarity=0.510  Sum_probs=87.8

Q ss_pred             ceeeeeeEEcCchHHHHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecc--cCCccEEEEecCCCC
Q psy1911          62 SHYFKKAHMVPRCNAMRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGV--ENDIPYWLVANSWND  139 (342)
Q Consensus        62 ~~~i~~~~~v~~~~~~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~--~~g~~ywiikNSWG~  139 (342)
                      ..+|.+|..++..+.+++++.+|||+|+|+++.+|++|++|||..+|... ++|||+|||||.  ++|.+||||+||||+
T Consensus       336 ~~~i~~y~~~~~~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~  414 (448)
T PTZ00200        336 KVYIDSYLVAKGKDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGT  414 (448)
T ss_pred             eeEecceEecCHHHHHHHHHhcCCEEEEeecccccccCCCCccccccCCC-CcEEEEEEEecccCCCCCceEEEEcCCCC
Confidence            35677787776555577777899999999998899999999998776554 799999999984  467899999999999


Q ss_pred             CCCCCCceeeecC---Ccchhhcccccccc
Q psy1911         140 HWGDHGTFKILRG---ENEADIEMGFNNRV  166 (342)
Q Consensus       140 ~WG~~Gy~~i~~~---~~~~gi~~~~~~~v  166 (342)
                      .||++|||||.|+   .|.|||...+.+++
T Consensus       415 ~WGe~GY~ri~r~~~g~n~CGI~~~~~~P~  444 (448)
T PTZ00200        415 DWGENGYMRLERTNEGTDKCGILTVGLTPV  444 (448)
T ss_pred             CcccCeeEEEEeCCCCCCcCCccccceeeE
Confidence            9999999999985   48999987766554


No 21 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.89  E-value=2.9e-23  Score=187.53  Aligned_cols=104  Identities=36%  Similarity=0.575  Sum_probs=90.6

Q ss_pred             ceeeeeeEEcCch--HH-HHHHHHcCCeEEEEeecc-ccccccccccccCC-CCCccceEEEEeeecccCCccEEEEecC
Q psy1911          62 SHYFKKAHMVPRC--NA-MRQIYEHGPLVAIFSVYA-DFLQYKSGVYQHNF-GDSIGLHAVRVLGWGVENDIPYWLVANS  136 (342)
Q Consensus        62 ~~~i~~~~~v~~~--~~-~~~i~~~GPV~v~i~v~~-~f~~y~~Gi~~~~~-~~~~~~Hav~ivGyg~~~g~~ywiikNS  136 (342)
                      ...+..+..+...  +. +++|+++|||+++|.+++ +|+.|.+|||..+. ....++|+|+|||||++.+.+||||+||
T Consensus       109 ~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NS  188 (219)
T PF00112_consen  109 YVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDENGKGYWIVKNS  188 (219)
T ss_dssp             EBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEETTEEEEEEE-S
T ss_pred             cccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeeccccccccccccccccccccccceeeEeeehh
Confidence            3677888888875  23 788999999999999998 69999999998863 3457899999999999999999999999


Q ss_pred             CCCCCCCCCceeeecCCc-chhhccccccc
Q psy1911         137 WNDHWGDHGTFKILRGEN-EADIEMGFNNR  165 (342)
Q Consensus       137 WG~~WG~~Gy~~i~~~~~-~~gi~~~~~~~  165 (342)
                      ||+.||++||+||+++.+ .|||+..+.++
T Consensus       189 WG~~WG~~Gy~~i~~~~~~~c~i~~~~~~~  218 (219)
T PF00112_consen  189 WGTDWGDNGYFRISYDYNNECGIESQAVYP  218 (219)
T ss_dssp             BTTTSTBTTEEEEESSSSSGGGTTSSEEEE
T ss_pred             hCCccCCCeEEEEeeCCCCcCccCceeeec
Confidence            999999999999999987 99999888765


No 22 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.88  E-value=6.4e-23  Score=185.25  Aligned_cols=89  Identities=37%  Similarity=0.722  Sum_probs=81.1

Q ss_pred             CcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC-CCCCCCCchhhHHHH
Q psy1911         188 PRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN-CWGCNGGWPQLAWRF  266 (342)
Q Consensus       188 P~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~-~~GC~GG~~~~a~~y  266 (342)
                      |++||||+.    +.++||+||+.||||||||++++||++++|+++  ..+.||+|+|++|... ..||.||++..|++|
T Consensus         1 P~~~d~r~~----~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~   74 (210)
T cd02248           1 PESVDWREK----GAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEY   74 (210)
T ss_pred             CCcccCCcC----CCCCCCccCCCCcchHHhHHHHHHHHHHHHHcC--CCcccCHHHHhccCCCCCCCCCCCCHHHhHHH
Confidence            889999998    558999999999999999999999999999987  5678999999999876 569999999999999


Q ss_pred             hhhcccccCCCCCCCCCcccCCC
Q psy1911         267 WGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       267 ~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      +.++||+++++|       ||..
T Consensus        75 ~~~~Gi~~e~~y-------PY~~   90 (210)
T cd02248          75 VKNGGLASESDY-------PYTG   90 (210)
T ss_pred             HHHCCcCccccC-------CccC
Confidence            999999997776       8864


No 23 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.88  E-value=1e-22  Score=209.26  Aligned_cols=91  Identities=41%  Similarity=0.747  Sum_probs=78.6

Q ss_pred             HHHHHHcCCeEEEEeeccccccccccccccC-------CCC--------------CccceEEEEeeeccc--CCc--cEE
Q psy1911          77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN-------FGD--------------SIGLHAVRVLGWGVE--NDI--PYW  131 (342)
Q Consensus        77 ~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~-------~~~--------------~~~~Hav~ivGyg~~--~g~--~yw  131 (342)
                      |++|+.+|||+|+|+++++|++|++|||...       |..              ...+|||+|||||.+  +|.  +||
T Consensus       561 m~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~~~~YW  640 (693)
T PTZ00049        561 MNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLYKYW  640 (693)
T ss_pred             HHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccCCCcccCEE
Confidence            7899999999999999889999999999742       321              136899999999975  353  799


Q ss_pred             EEecCCCCCCCCCCceeeecCCcchhhccccccccc
Q psy1911         132 LVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVE  167 (342)
Q Consensus       132 iikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~  167 (342)
                      |||||||+.||++|||||.|+.|.|||+..+.++.+
T Consensus       641 IVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~p  676 (693)
T PTZ00049        641 IGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEP  676 (693)
T ss_pred             EEECCCCCCcccCceEEEEcCCCccCCccceeEEee
Confidence            999999999999999999999999999988877653


No 24 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.87  E-value=2e-22  Score=182.04  Aligned_cols=92  Identities=40%  Similarity=0.740  Sum_probs=77.0

Q ss_pred             CCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCC-CCCCCCCCchhhHHH
Q psy1911         187 LPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NCWGCNGGWPQLAWR  265 (342)
Q Consensus       187 lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~-~~~GC~GG~~~~a~~  265 (342)
                      ||++||||+.+   +.++||+||+.||+|||||++++||++++++.+ ...+.||+|+|++|+. ...+|+||++..|+.
T Consensus         1 lP~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~-~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~   76 (219)
T PF00112_consen    1 LPKSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNN-GKNVDLSEQYLIDCSNKYNKGCDGGSPFDALK   76 (219)
T ss_dssp             STSSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHT-SSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHH
T ss_pred             CCCCEecccCC---CCcCccccCCcccccccchhccceecccccccc-ccccccccccccccccccccccccCcccccce
Confidence            89999999962   359999999999999999999999999999985 3678999999999987 345999999999999


Q ss_pred             Hhhh-cccccCCCCCCCCCcccCCC
Q psy1911         266 FWGH-NGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       266 y~~~-~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      |+.+ +||+++.+|       ||..
T Consensus        77 ~~~~~~Gi~~e~~~-------pY~~   94 (219)
T PF00112_consen   77 YIKNNNGIVTEEDY-------PYNG   94 (219)
T ss_dssp             HHHHHTSBEBTTTS---------SS
T ss_pred             eecccCcccccccc-------cccc
Confidence            9999 999997777       8873


No 25 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.86  E-value=8.6e-22  Score=200.04  Aligned_cols=101  Identities=31%  Similarity=0.483  Sum_probs=82.7

Q ss_pred             eeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccC---------CC----------CCccceEEEEeeec
Q psy1911          65 FKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHN---------FG----------DSIGLHAVRVLGWG  123 (342)
Q Consensus        65 i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~---------~~----------~~~~~Hav~ivGyg  123 (342)
                      +.+|+.+..++.  +.+|+.+|||+|+|+++.+|++|.+|||...         ++          ....+|+|+|||||
T Consensus       333 I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG  412 (548)
T PTZ00364        333 LGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWG  412 (548)
T ss_pred             ecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEec
Confidence            444555554333  7789999999999999989999999998521         11          13469999999999


Q ss_pred             c-cCCccEEEEecCCCC--CCCCCCceeeecCCcchhhccccccc
Q psy1911         124 V-ENDIPYWLVANSWND--HWGDHGTFKILRGENEADIEMGFNNR  165 (342)
Q Consensus       124 ~-~~g~~ywiikNSWG~--~WG~~Gy~~i~~~~~~~gi~~~~~~~  165 (342)
                      . ++|.+|||||||||+  .||++|||||.|+.|.|||+..+...
T Consensus       413 ~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~~v~~  457 (548)
T PTZ00364        413 TDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVVVM  457 (548)
T ss_pred             ccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccceeeee
Confidence            6 467899999999999  99999999999999999998887654


No 26 
>KOG1544|consensus
Probab=99.82  E-value=9.4e-21  Score=177.06  Aligned_cols=108  Identities=37%  Similarity=0.686  Sum_probs=93.5

Q ss_pred             eeeeeeEEcCchHH--HHHHHHcCCeEEEEeeccccccccccccccCCC--------CCccceEEEEeeecccCC-----
Q psy1911          63 HYFKKAHMVPRCNA--MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFG--------DSIGLHAVRVLGWGVEND-----  127 (342)
Q Consensus        63 ~~i~~~~~v~~~~~--~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~--------~~~~~Hav~ivGyg~~~g-----  127 (342)
                      +....-|+|+.+++  |++||++|||.+.|.|.++|+.|.+|||.+...        ...+.|+|.|.|||.+.+     
T Consensus       341 yq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~  420 (470)
T KOG1544|consen  341 YQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRT  420 (470)
T ss_pred             eeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCe
Confidence            55667889999887  899999999999999999999999999976432        235789999999997632     


Q ss_pred             ccEEEEecCCCCCCCCCCceeeecCCcchhhccccccccccCC
Q psy1911         128 IPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNNRVEANS  170 (342)
Q Consensus       128 ~~ywiikNSWG~~WG~~Gy~~i~~~~~~~gi~~~~~~~v~~~~  170 (342)
                      ..|||..||||+.||++|||+|.||.|+|.|+..+..++....
T Consensus       421 ~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgAWGr~~  463 (470)
T KOG1544|consen  421 LKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGAWGRVG  463 (470)
T ss_pred             eEEEEeecccccccccCceEEEeccccchhhhHhhhhhhhccc
Confidence            5899999999999999999999999999999998887765433


No 27 
>KOG1544|consensus
Probab=99.81  E-value=2.8e-21  Score=180.57  Aligned_cols=100  Identities=37%  Similarity=0.722  Sum_probs=92.8

Q ss_pred             ccCCCCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCCC-CCCCCc
Q psy1911         181 CQNAKGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNCW-GCNGGW  259 (342)
Q Consensus       181 ~~~~~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~~-GC~GG~  259 (342)
                      .+...+||+.||+|++||  +.|.++.||++|++.|||+++++.+||++|++.+..+..||+|+||+|+...+ ||+||.
T Consensus       203 l~p~~~LPE~F~As~KWp--~liH~plDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~  280 (470)
T KOG1544|consen  203 LNPGEVLPEAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGR  280 (470)
T ss_pred             cCcccccchhhhhhhcCC--ccccCccccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCc
Confidence            355689999999999999  57999999999999999999999999999999999999999999999987666 999999


Q ss_pred             hhhHHHHhhhcccccCCCCCCCCCcccCCC
Q psy1911         260 PQLAWRFWGHNGVVTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       260 ~~~a~~y~~~~Gi~~e~~Y~~~~~C~PY~~  289 (342)
                      ...||=||++.|||+       +.|.||..
T Consensus       281 lDRAWWYlRKrGvVs-------dhCYP~~~  303 (470)
T KOG1544|consen  281 LDRAWWYLRKRGVVS-------DHCYPFSG  303 (470)
T ss_pred             ccchheeeecccccc-------cccccccC
Confidence            999999999999998       77779964


No 28 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.80  E-value=1.4e-19  Score=163.70  Aligned_cols=93  Identities=29%  Similarity=0.467  Sum_probs=79.0

Q ss_pred             cceeeeeeEEcCchHH---HHHHHHcCCeEEEEeeccccccccccccc------cCCCCCccceEEEEeeecccC--Ccc
Q psy1911          61 LSHYFKKAHMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQ------HNFGDSIGLHAVRVLGWGVEN--DIP  129 (342)
Q Consensus        61 ~~~~i~~~~~v~~~~~---~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~------~~~~~~~~~Hav~ivGyg~~~--g~~  129 (342)
                      ...++..|..+...+.   +++|++.|||+++|.++++|+.|.+|++.      ..++...++|||+|||||++.  +.+
T Consensus       110 ~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~  189 (223)
T cd02619         110 AKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKG  189 (223)
T ss_pred             cceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCCCCCC
Confidence            3477888888887532   77899999999999999999999999862      223445679999999999886  789


Q ss_pred             EEEEecCCCCCCCCCCceeeecCC
Q psy1911         130 YWLVANSWNDHWGDHGTFKILRGE  153 (342)
Q Consensus       130 ywiikNSWG~~WG~~Gy~~i~~~~  153 (342)
                      ||||+||||+.||++||+||.++.
T Consensus       190 ~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         190 AFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             EEEEEeCCCCccccCCEEEEehhh
Confidence            999999999999999999999864


No 29 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.80  E-value=1e-19  Score=192.21  Aligned_cols=85  Identities=31%  Similarity=0.618  Sum_probs=71.1

Q ss_pred             HHHHHHcCCeEEEEeecccccccc-cccccc-CCCCCccceEEEEeeeccc-----CCccEEEEecCCCCCCCCCCceee
Q psy1911          77 MRQIYEHGPLVAIFSVYADFLQYK-SGVYQH-NFGDSIGLHAVRVLGWGVE-----NDIPYWLVANSWNDHWGDHGTFKI  149 (342)
Q Consensus        77 ~~~i~~~GPV~v~i~v~~~f~~y~-~Gi~~~-~~~~~~~~Hav~ivGyg~~-----~g~~ywiikNSWG~~WG~~Gy~~i  149 (342)
                      +++|+.+|||+|+|++. +|+.|. +|||.. .|+...++|||+|||||..     ++.+|||||||||+.||++|||||
T Consensus       684 K~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI  762 (1004)
T PTZ00462        684 KDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKV  762 (1004)
T ss_pred             HHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEE
Confidence            77899999999999985 688885 898654 4655567999999999964     256999999999999999999999


Q ss_pred             ec-CCcchhhcccc
Q psy1911         150 LR-GENEADIEMGF  162 (342)
Q Consensus       150 ~~-~~~~~gi~~~~  162 (342)
                      .| +.+.||+....
T Consensus       763 ~r~g~n~CGin~i~  776 (1004)
T PTZ00462        763 DMYGPSHCEDNFIH  776 (1004)
T ss_pred             EeCCCCCCccchhe
Confidence            98 67999985543


No 30 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.78  E-value=3.6e-19  Score=161.03  Aligned_cols=89  Identities=28%  Similarity=0.449  Sum_probs=78.3

Q ss_pred             ccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCCC-----CCCCCCchhhHH
Q psy1911         190 NFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPNC-----WGCNGGWPQLAW  264 (342)
Q Consensus       190 sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~~-----~GC~GG~~~~a~  264 (342)
                      .||||+.    + ++||+||+.||+|||||++++||++++++.+....+.||+|+|++|....     .||.||.+..++
T Consensus         1 ~~d~r~~----~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~   75 (223)
T cd02619           1 SVDLRPL----R-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSAL   75 (223)
T ss_pred             CCcchhc----C-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHH
Confidence            4899998    4 89999999999999999999999999998764456899999999998653     489999999999


Q ss_pred             H-HhhhcccccCCCCCCCCCcccCCCC
Q psy1911         265 R-FWGHNGVVTGGDYNSQEGCQPYTLA  290 (342)
Q Consensus       265 ~-y~~~~Gi~~e~~Y~~~~~C~PY~~~  290 (342)
                      . ++..+||+++.+|       ||...
T Consensus        76 ~~~~~~~Gi~~e~~~-------Py~~~   95 (223)
T cd02619          76 LKLVALKGIPPEEDY-------PYGAE   95 (223)
T ss_pred             HHHHHHcCCCccccC-------CCCCC
Confidence            8 8899999997777       88753


No 31 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.75  E-value=1.3e-18  Score=153.87  Aligned_cols=75  Identities=44%  Similarity=0.866  Sum_probs=62.6

Q ss_pred             EEEEeeccccccccccccccC-CCCCccceEEEEeeeccc-CCccEEEEecCCCCCCCCCCceeeecCC-cchhhcccc
Q psy1911          87 VAIFSVYADFLQYKSGVYQHN-FGDSIGLHAVRVLGWGVE-NDIPYWLVANSWNDHWGDHGTFKILRGE-NEADIEMGF  162 (342)
Q Consensus        87 ~v~i~v~~~f~~y~~Gi~~~~-~~~~~~~Hav~ivGyg~~-~g~~ywiikNSWG~~WG~~Gy~~i~~~~-~~~gi~~~~  162 (342)
                      ++.+.+. +|++|.+|||..+ |.....+|+|+|||||.+ ++.+||||+||||+.||++|||||.++. +.|+++...
T Consensus        93 ~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~  170 (174)
T smart00645       93 SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASV  170 (174)
T ss_pred             EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeee
Confidence            4444444 6999999999874 544447999999999986 7889999999999999999999999987 899985443


No 32 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.69  E-value=3.3e-17  Score=173.41  Aligned_cols=84  Identities=25%  Similarity=0.409  Sum_probs=71.8

Q ss_pred             CCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCCC--CCCCCCCc-hhhHHHHhhhcc-c
Q psy1911         197 WPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTPN--CWGCNGGW-PQLAWRFWGHNG-V  272 (342)
Q Consensus       197 ~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~~--~~GC~GG~-~~~a~~y~~~~G-i  272 (342)
                      +++|..+.||+|||.||||||||++++||++++|+++  ..+.||+|+||||+..  ..||.||+ +..++.|++++| |
T Consensus       538 ~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg--~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgL  615 (1004)
T PTZ00462        538 ENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGY--EPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFL  615 (1004)
T ss_pred             CCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcC--CCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCC
Confidence            5899888999999999999999999999999999977  4678999999999864  35999997 555669998885 7


Q ss_pred             ccCCCCCCCCCcccCCC
Q psy1911         273 VTGGDYNSQEGCQPYTL  289 (342)
Q Consensus       273 ~~e~~Y~~~~~C~PY~~  289 (342)
                      +++.+|       ||..
T Consensus       616 ptESdY-------PYt~  625 (1004)
T PTZ00462        616 PADSNY-------LYNY  625 (1004)
T ss_pred             cccccC-------CCcc
Confidence            776666       8864


No 33 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.55  E-value=8.4e-15  Score=146.00  Aligned_cols=85  Identities=26%  Similarity=0.390  Sum_probs=70.3

Q ss_pred             eeEEcCchHHH----HHHHHcCCeEEEEeeccccccccccccccC---------------------CCCCccceEEEEee
Q psy1911          67 KAHMVPRCNAM----RQIYEHGPLVAIFSVYADFLQYKSGVYQHN---------------------FGDSIGLHAVRVLG  121 (342)
Q Consensus        67 ~~~~v~~~~~~----~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~---------------------~~~~~~~Hav~ivG  121 (342)
                      .|++|+.++.+    ++|...+||.++++|. .|+.|.+||+...                     ++....+|||+|||
T Consensus       288 ~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivG  366 (437)
T cd00585         288 LYLNVPMDVLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTG  366 (437)
T ss_pred             eEEecCHHHHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEE
Confidence            68999888773    4667777999999996 4789999999543                     23445689999999


Q ss_pred             ecccC-Cc-cEEEEecCCCCCCCCCCceeeecC
Q psy1911         122 WGVEN-DI-PYWLVANSWNDHWGDHGTFKILRG  152 (342)
Q Consensus       122 yg~~~-g~-~ywiikNSWG~~WG~~Gy~~i~~~  152 (342)
                      ||.+. |. .||+|+||||+.||++||++|+.+
T Consensus       367 v~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         367 VDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             EEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence            99754 65 699999999999999999999974


No 34 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.39  E-value=8e-13  Score=126.63  Aligned_cols=99  Identities=19%  Similarity=0.297  Sum_probs=70.1

Q ss_pred             cCCCCCcccceeeeeeEEcCchHH-------HHHHHHcCCeEEE--EeeccccccccccccccCCCCCccceEEEEeeec
Q psy1911          53 LYLPTSIPLSHYFKKAHMVPRCNA-------MRQIYEHGPLVAI--FSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWG  123 (342)
Q Consensus        53 ~~~p~s~~~~~~i~~~~~v~~~~~-------~~~i~~~GPV~v~--i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg  123 (342)
                      ...|+..++..+.+....++.+..       .+++...|-++..  |+++..+. ..-+.|..... ...+|||+||||+
T Consensus       197 ~~~~~~~p~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~-~~~~~~~~~s~-~~~gHAv~iVGyD  274 (372)
T COG4870         197 YFSPTNLPVTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLG-ICIPYPYVDSG-ENWGHAVLIVGYD  274 (372)
T ss_pred             ccCCcCCchhhccccceecccchhhhcccchHHHHhhhccccceeEEecccccc-cccCCCCCCcc-ccccceEEEEecc
Confidence            345566677777777777776654       4667778866543  44444333 33344433333 4569999999999


Q ss_pred             cc----------CCccEEEEecCCCCCCCCCCceeeecCC
Q psy1911         124 VE----------NDIPYWLVANSWNDHWGDHGTFKILRGE  153 (342)
Q Consensus       124 ~~----------~g~~ywiikNSWG~~WG~~Gy~~i~~~~  153 (342)
                      +.          .|.+.||||||||+.||++|||||++..
T Consensus       275 Ds~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y  314 (372)
T COG4870         275 DSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY  314 (372)
T ss_pred             ccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence            75          3567999999999999999999999864


No 35 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.80  E-value=1.5e-08  Score=101.45  Aligned_cols=84  Identities=27%  Similarity=0.425  Sum_probs=59.6

Q ss_pred             eeEEcCchHH---HHHHHHcC-CeEEEEeeccccccccccccccCC---------------------CCCccceEEEEee
Q psy1911          67 KAHMVPRCNA---MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNF---------------------GDSIGLHAVRVLG  121 (342)
Q Consensus        67 ~~~~v~~~~~---~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~---------------------~~~~~~Hav~ivG  121 (342)
                      .|++||.++.   +.+.+..| ||..+-+|.. +.....||.....                     .....+|||+|+|
T Consensus       289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k-~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itG  367 (438)
T PF03051_consen  289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGK-FFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITG  367 (438)
T ss_dssp             EEEE--HHHHHHHHHHHHHTT--EEEEEETTT-TEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEE
T ss_pred             eEeccCHHHHHHHHHHHHHcCCcEEEeccCCc-cccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEE
Confidence            5899999887   34455566 9999999976 4455677763321                     1244689999999


Q ss_pred             ecc-cCCc-cEEEEecCCCCCCCCCCceeeec
Q psy1911         122 WGV-ENDI-PYWLVANSWNDHWGDHGTFKILR  151 (342)
Q Consensus       122 yg~-~~g~-~ywiikNSWG~~WG~~Gy~~i~~  151 (342)
                      .+. ++|. .+|.|+||||+..|.+||+.++.
T Consensus       368 v~~D~~g~p~~wkVeNSWG~~~g~kGy~~msd  399 (438)
T PF03051_consen  368 VDLDEDGKPVRWKVENSWGTDNGDKGYFYMSD  399 (438)
T ss_dssp             EEE-TTSSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred             EEeccCCCeeEEEEEcCCCCCCCCCcEEEECH
Confidence            996 5565 59999999999999999999985


No 36 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.49  E-value=2.1e-08  Score=96.52  Aligned_cols=86  Identities=24%  Similarity=0.387  Sum_probs=62.1

Q ss_pred             CCCCcccccCccCCCCCCCcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHhhCC-CC-CCC-----CC
Q psy1911         185 KGLPRNFDAREKWPECPSLRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVACTP-NC-WGC-----NG  257 (342)
Q Consensus       185 ~~lP~sfD~R~~~~~c~~i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lldC~~-~~-~GC-----~G  257 (342)
                      ..||.-||||+.    +.|+||+||+.||+||||+++++||+.+.....    ..+|+-.++.=.. .. .+|     +|
T Consensus        97 ~s~~~~fd~r~~----g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~~----w~~s~~nm~~ll~~~ye~~fd~~~~d~  168 (372)
T COG4870          97 ASLPSYFDRRDE----GKVSPVKDQGSGGSCWAFATTRSLESYLNPESA----WDFSENNMKNLLGVPYEKGFDYTSNDG  168 (372)
T ss_pred             ccchhheeeecc----CCcccccccCcccceEeeeehhhhhheeccccc----ccccccchhhhcCCCccccCCCccccC
Confidence            458999999999    889999999999999999999999988876542    3445444333111 11 144     37


Q ss_pred             CchhhHHHHhhhc-ccccCCCC
Q psy1911         258 GWPQLAWRFWGHN-GVVTGGDY  278 (342)
Q Consensus       258 G~~~~a~~y~~~~-Gi~~e~~Y  278 (342)
                      |....+..|+.+. |.+.+.+-
T Consensus       169 g~~~m~~a~l~e~sgpv~et~d  190 (372)
T COG4870         169 GNADMSAAYLTEWSGPVYETDD  190 (372)
T ss_pred             CccccccccccccCCcchhhcC
Confidence            7777777787665 77765554


No 37 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=97.08  E-value=0.0011  Score=66.95  Aligned_cols=75  Identities=15%  Similarity=0.189  Sum_probs=59.8

Q ss_pred             cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHh----------------hCC------------CCCCC
Q psy1911         204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA----------------CTP------------NCWGC  255 (342)
Q Consensus       204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lld----------------C~~------------~~~GC  255 (342)
                      .+|.||+.-|-||.||++.+|+..+...-+. ..+.||+.+|+-                ...            ...-.
T Consensus        55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~-~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~  133 (437)
T cd00585          55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNL-KEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQN  133 (437)
T ss_pred             CCcccCCCCchhHHHHCHHHHHHHHHHHcCC-CCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcC
Confidence            5899999999999999999999988875554 478899855544                320            12257


Q ss_pred             CCCchhhHHHHhhhcccccCCCCC
Q psy1911         256 NGGWPQLAWRFWGHNGVVTGGDYN  279 (342)
Q Consensus       256 ~GG~~~~a~~y~~~~Gi~~e~~Y~  279 (342)
                      +||.-..+...+.+.|||+...||
T Consensus       134 DGGqw~m~~~li~KYGvVPk~~~p  157 (437)
T cd00585         134 DGGQWDMLVNLIEKYGLVPKSVMP  157 (437)
T ss_pred             CCCchHHHHHHHHHcCCCcccccC
Confidence            899999999999999999977663


No 38 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=96.89  E-value=0.00086  Score=67.60  Aligned_cols=74  Identities=16%  Similarity=0.223  Sum_probs=46.3

Q ss_pred             cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHH----------------hhCCC------------CCCC
Q psy1911         204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIV----------------ACTPN------------CWGC  255 (342)
Q Consensus       204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~ll----------------dC~~~------------~~GC  255 (342)
                      .+|.||..-|-||.||++.+|+..+....+. ....||+-+|.                +....            ..-.
T Consensus        56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l-~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~  134 (438)
T PF03051_consen   56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNL-KDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVS  134 (438)
T ss_dssp             -S--B--BSSTHHHHHHHHHHHHHHHHHCT--SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-
T ss_pred             CCCCCCCCCCCcchhhchHHHHHHHHHHcCC-CceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCC
Confidence            5899999999999999999999988877653 46889986654                33210            1246


Q ss_pred             CCCchhhHHHHhhhcccccCCCC
Q psy1911         256 NGGWPQLAWRFWGHNGVVTGGDY  278 (342)
Q Consensus       256 ~GG~~~~a~~y~~~~Gi~~e~~Y  278 (342)
                      +||.-..+..-+.++|||+...|
T Consensus       135 DGGqw~~~~nli~KYGvVPk~~m  157 (438)
T PF03051_consen  135 DGGQWDMVVNLIKKYGVVPKSVM  157 (438)
T ss_dssp             S-B-HHHHHHHHHHH---BGGGS
T ss_pred             CCCchHHHHHHHHHcCcCcHhhC
Confidence            89999999999999999997777


No 39 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=96.76  E-value=0.00093  Score=64.21  Aligned_cols=70  Identities=24%  Similarity=0.425  Sum_probs=49.0

Q ss_pred             HHcC-CeEEEEeeccccccccccccccC---------------------CCCCccceEEEEeeeccc-CCc-cEEEEecC
Q psy1911          81 YEHG-PLVAIFSVYADFLQYKSGVYQHN---------------------FGDSIGLHAVRVLGWGVE-NDI-PYWLVANS  136 (342)
Q Consensus        81 ~~~G-PV~v~i~v~~~f~~y~~Gi~~~~---------------------~~~~~~~Hav~ivGyg~~-~g~-~ywiikNS  136 (342)
                      ++.| ||-.+-+|.. +..-+.||+...                     .+.....|||+|.|.+.+ +|. --|-|.||
T Consensus       308 ~qagetVwFG~dvgq-~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~p~rwkVENS  386 (444)
T COG3579         308 MQAGETVWFGCDVGQ-LSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGNPLRWKVENS  386 (444)
T ss_pred             HhcCCcEEeecCchh-hcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCCceeeEeecc
Confidence            4455 7877777643 555566655211                     012346899999999954 333 56999999


Q ss_pred             CCCCCCCCCceeeec
Q psy1911         137 WNDHWGDHGTFKILR  151 (342)
Q Consensus       137 WG~~WG~~Gy~~i~~  151 (342)
                      ||..-|.+|||-++-
T Consensus       387 WG~d~G~~GyfvaSd  401 (444)
T COG3579         387 WGKDVGKKGYFVASD  401 (444)
T ss_pred             cccccCCCceEeehH
Confidence            999999999998773


No 40 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=84.03  E-value=1.5  Score=35.76  Aligned_cols=52  Identities=21%  Similarity=0.431  Sum_probs=26.5

Q ss_pred             HHHHHHcC-CeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCC
Q psy1911          77 MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSW  137 (342)
Q Consensus        77 ~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSW  137 (342)
                      +...++.| ||.+.+...-.-  .....+.    .....|.|+|+||+.+.   +++|..+|
T Consensus        92 i~~~i~~G~Pvi~~~~~~~~~--~~~~~~~----~~~~~H~vvi~Gy~~~~---~~~v~DP~  144 (144)
T PF13529_consen   92 IKQEIDAGRPVIVSVNSGWRP--PNGDGYD----GTYGGHYVVIIGYDEDG---YVYVNDPW  144 (144)
T ss_dssp             HHHHHHTT--EEEEEETTSS----TTEEEE----E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred             HHHHHHCCCcEEEEEEccccc--CCCCCcC----CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence            44444445 999988632100  0111111    12358999999998753   67777766


No 41 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=82.53  E-value=1.9  Score=41.99  Aligned_cols=74  Identities=19%  Similarity=0.331  Sum_probs=50.2

Q ss_pred             cccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHH----------------HHHhhCC------------CCCCC
Q psy1911         204 RHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQ----------------HIVACTP------------NCWGC  255 (342)
Q Consensus       204 ~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q----------------~lldC~~------------~~~GC  255 (342)
                      .+|.||..-|-||.|||+..+-..+.-.-+ ...+.||..                |||.-..            ...--
T Consensus        58 d~vtNQk~SGRCWmFAAlNtfRhk~~~el~-le~fElSQaytfFwDKlEKaN~FleqIi~tadq~ldsRlv~~LL~~Pqq  136 (444)
T COG3579          58 DKVTNQKQSGRCWMFAALNTFRHKLISELK-LEDFELSQAYTFFWDKLEKANWFLEQIIETADQELDSRLVSFLLATPQQ  136 (444)
T ss_pred             CccccccccceehHHHHHHHHHHHHHHhcC-cceeehhhHHHHHHHHHHHhhHHHHHHHhhcccchHHHHHHHHHcCccc
Confidence            478999999999999999988554443322 245667752                3333111            01135


Q ss_pred             CCCchhhHHHHhhhcccccCCCC
Q psy1911         256 NGGWPQLAWRFWGHNGVVTGGDY  278 (342)
Q Consensus       256 ~GG~~~~a~~y~~~~Gi~~e~~Y  278 (342)
                      +||--.....-+.+.|||+-+.|
T Consensus       137 DGGQwdM~v~l~eKYGvVpK~~y  159 (444)
T COG3579         137 DGGQWDMFVSLFEKYGVVPKSVY  159 (444)
T ss_pred             cCchHHHHHHHHHHhCCCchhhc
Confidence            78888888888899999986666


No 42 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=80.78  E-value=3.5  Score=39.25  Aligned_cols=52  Identities=19%  Similarity=0.520  Sum_probs=34.0

Q ss_pred             HHHHHHcC-CeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEec
Q psy1911          77 MRQIYEHG-PLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVAN  135 (342)
Q Consensus        77 ~~~i~~~G-PV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikN  135 (342)
                      +++.+..| ||.+.++++  ++.|...-|    ......|.|+|+||++++ ..|.++.+
T Consensus        81 l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~~-~~~~v~D~  133 (317)
T PF14399_consen   81 LKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEEE-DVFYVSDP  133 (317)
T ss_pred             HHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence            66677777 999998764  444443322    122358999999999653 34666644


No 43 
>PF10500 SR-25:  Nuclear RNA-splicing-associated protein;  InterPro: IPR019532  SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissue types. At the N terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing []. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1; also see IPR019093 from INTERPRO) signalling pathway [].
Probab=79.83  E-value=2.3  Score=38.91  Aligned_cols=6  Identities=50%  Similarity=1.210  Sum_probs=3.3

Q ss_pred             HcCCeE
Q psy1911          82 EHGPLV   87 (342)
Q Consensus        82 ~~GPV~   87 (342)
                      +.|||.
T Consensus       141 ~~~Pv~  146 (225)
T PF10500_consen  141 EDGPVL  146 (225)
T ss_pred             ccCCcc
Confidence            456754


No 44 
>PF10500 SR-25:  Nuclear RNA-splicing-associated protein;  InterPro: IPR019532  SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissue types. At the N terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing []. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1; also see IPR019093 from INTERPRO) signalling pathway [].
Probab=66.82  E-value=7.5  Score=35.69  Aligned_cols=11  Identities=18%  Similarity=0.359  Sum_probs=6.2

Q ss_pred             HHHHHHcCCeE
Q psy1911          77 MRQIYEHGPLV   87 (342)
Q Consensus        77 ~~~i~~~GPV~   87 (342)
                      +..|-..+|++
T Consensus       151 ksr~~am~PmT  161 (225)
T PF10500_consen  151 KSRIQAMAPMT  161 (225)
T ss_pred             hhhhhhcCCCC
Confidence            44455667763


No 45 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=65.85  E-value=24  Score=31.30  Aligned_cols=37  Identities=19%  Similarity=0.267  Sum_probs=23.0

Q ss_pred             CccceEEEEeeecc-cCCccEEEEecCCCCCCCCCCceeeecC
Q psy1911         111 SIGLHAVRVLGWGV-ENDIPYWLVANSWNDHWGDHGTFKILRG  152 (342)
Q Consensus       111 ~~~~Hav~ivGyg~-~~g~~ywiikNSWG~~WG~~Gy~~i~~~  152 (342)
                      ...+||++||||-. .+|.++.+|=|-|-     ++++-++..
T Consensus       117 ~~~gHAlavvGya~~~~g~~~y~~WNPW~-----~~~~~~sa~  154 (175)
T PF05543_consen  117 PHAGHALAVVGYAKPNNGQKTYYFWNPWW-----NDVMIQSAK  154 (175)
T ss_dssp             B--EEEEEEEEEEEETTSEEEEEEE-TT------SS-EEEETT
T ss_pred             CccceeEEEEeeeecCCCCeEEEEeCCcc-----CCcEEEecC
Confidence            34689999999986 45678888867665     345555543


No 46 
>KOG4364|consensus
Probab=65.84  E-value=11  Score=39.94  Aligned_cols=14  Identities=14%  Similarity=0.266  Sum_probs=6.8

Q ss_pred             HHHHHcCCeEEEEe
Q psy1911          78 RQIYEHGPLVAIFS   91 (342)
Q Consensus        78 ~~i~~~GPV~v~i~   91 (342)
                      ..|...+|+...-+
T Consensus       427 d~Lf~r~pls~~~d  440 (811)
T KOG4364|consen  427 DELFPRLPLSTNSD  440 (811)
T ss_pred             hhhcccCCCccccc
Confidence            34555556554433


No 47 
>KOG4128|consensus
Probab=62.27  E-value=1.2  Score=43.35  Aligned_cols=39  Identities=26%  Similarity=0.419  Sum_probs=30.8

Q ss_pred             ccceEEEEeeecc----cCCccEEEEecCCCCCCCCCCceeee
Q psy1911         112 IGLHAVRVLGWGV----ENDIPYWLVANSWNDHWGDHGTFKIL  150 (342)
Q Consensus       112 ~~~Hav~ivGyg~----~~g~~ywiikNSWG~~WG~~Gy~~i~  150 (342)
                      ...||++|.|-|.    +++-.-|-|.||||.+-|..|+..+.
T Consensus       370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt  412 (457)
T KOG4128|consen  370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMT  412 (457)
T ss_pred             HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhh
Confidence            3589999999882    23445699999999999999995544


No 48 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.94  E-value=24  Score=31.56  Aligned_cols=50  Identities=14%  Similarity=0.209  Sum_probs=30.6

Q ss_pred             EEcCchHH---HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeecccCCccEEEEecCCC
Q psy1911          69 HMVPRCNA---MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVENDIPYWLVANSWN  138 (342)
Q Consensus        69 ~~v~~~~~---~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~~g~~ywiikNSWG  138 (342)
                      .+++..+.   +..|.+..||.+-.-.   |-.             -.-|+|+|+||++.    ++..-+.||
T Consensus       116 ~d~tGksl~~ik~ql~kg~PV~iw~T~---~~~-------------~s~H~v~itgyDk~----n~yynDpyG  168 (195)
T COG4990         116 VDLTGKSLSDIKGQLLKGRPVVIWVTN---FHS-------------YSIHSVLITGYDKY----NIYYNDPYG  168 (195)
T ss_pred             ccCcCCcHHHHHHHHhcCCcEEEEEec---ccc-------------cceeeeEeeccccc----ceEeccccc
Confidence            34444443   4455555598776533   222             12599999999976    566667775


No 49 
>KOG4364|consensus
Probab=50.09  E-value=28  Score=36.98  Aligned_cols=10  Identities=10%  Similarity=-0.144  Sum_probs=5.4

Q ss_pred             ceEEEEeeec
Q psy1911         114 LHAVRVLGWG  123 (342)
Q Consensus       114 ~Hav~ivGyg  123 (342)
                      +-++++|--+
T Consensus       463 ~sa~kpve~e  472 (811)
T KOG4364|consen  463 GSACKPVESE  472 (811)
T ss_pred             cccceeeccC
Confidence            4555565554


No 50 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=48.65  E-value=49  Score=26.94  Aligned_cols=22  Identities=23%  Similarity=0.249  Sum_probs=14.8

Q ss_pred             cceEEEEeeecccCCccEEEEecCC
Q psy1911         113 GLHAVRVLGWGVENDIPYWLVANSW  137 (342)
Q Consensus       113 ~~Hav~ivGyg~~~g~~ywiikNSW  137 (342)
                      .+|.|+|+||+.+   ...+|.+.|
T Consensus        93 ~gH~vVv~g~~~~---~~~~i~DP~  114 (141)
T cd02549          93 SGHAMVVIGYDRK---GNVYVNDPG  114 (141)
T ss_pred             CCeEEEEEEEcCC---CCEEEECCC
Confidence            4799999999811   234555655


No 51 
>PF01754 zf-A20:  A20-like zinc finger;  InterPro: IPR002653 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the zinc finger domain found in A20. A20 is an inhibitor of cell death that inhibits NF-kappaB activation via the tumour necrosis factor receptor associated factor pathway []. The zinc finger domains appear to mediate self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding, 0008270 zinc ion binding; PDB: 2FIF_F 2FID_B 2C7N_C 2C7M_A 2L00_A 2KZY_A 2EQG_A 2EQE_A 3OJ3_J 3OJ4_C ....
Probab=47.16  E-value=14  Score=22.14  Aligned_cols=19  Identities=21%  Similarity=0.522  Sum_probs=15.5

Q ss_pred             CCCCCCcccccccccchHH
Q psy1911         198 PECPSLRHIADQSNCGSCW  216 (342)
Q Consensus       198 ~~c~~i~~v~dQg~CGsCw  216 (342)
                      ++|++.+.+..++.|.-||
T Consensus         6 ~gCgf~Gs~~~~~~Cs~C~   24 (25)
T PF01754_consen    6 NGCGFYGSPATNGLCSKCY   24 (25)
T ss_dssp             TTSSSB-BGGGTTS-HHHH
T ss_pred             CCCCCcccccccCcchhhc
Confidence            6899999999999999997


No 52 
>KOG3925|consensus
Probab=42.51  E-value=40  Score=33.20  Aligned_cols=14  Identities=29%  Similarity=0.214  Sum_probs=9.8

Q ss_pred             CCCCCchhhHHHHh
Q psy1911         254 GCNGGWPQLAWRFW  267 (342)
Q Consensus       254 GC~GG~~~~a~~y~  267 (342)
                      =|.||+...++.+-
T Consensus       306 lvfg~l~~l~~a~e  319 (371)
T KOG3925|consen  306 LVFGGLAGLVRAFE  319 (371)
T ss_pred             EEeechhhHHHHHH
Confidence            47788877766663


No 53 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=34.85  E-value=1.3e+02  Score=27.67  Aligned_cols=14  Identities=21%  Similarity=0.275  Sum_probs=11.9

Q ss_pred             cceEEEEeeecccC
Q psy1911         113 GLHAVRVLGWGVEN  126 (342)
Q Consensus       113 ~~Hav~ivGyg~~~  126 (342)
                      .+|=|+|+||+...
T Consensus       160 ~GHYVVlcGyd~~~  173 (212)
T PF09778_consen  160 QGHYVVLCGYDAAT  173 (212)
T ss_pred             cEEEEEEEeecCCC
Confidence            48999999998754


No 54 
>smart00259 ZnF_A20 A20-like zinc fingers. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappaB activation.
Probab=32.85  E-value=24  Score=21.32  Aligned_cols=19  Identities=26%  Similarity=0.739  Sum_probs=17.0

Q ss_pred             CCCCCCcccccccccchHH
Q psy1911         198 PECPSLRHIADQSNCGSCW  216 (342)
Q Consensus       198 ~~c~~i~~v~dQg~CGsCw  216 (342)
                      ++|++.+.+..|+.|.-||
T Consensus         7 ~~CgF~G~~~t~~~CskCy   25 (26)
T smart00259        7 PGCGFFGNPATEGLCSKCF   25 (26)
T ss_pred             CCCCCcCChhhcccCHhhc
Confidence            5799999999999999887


No 55 
>PF15246 NCKAP5:  Nck-associated protein 5, Peripheral clock protein
Probab=31.16  E-value=50  Score=31.95  Aligned_cols=36  Identities=8%  Similarity=0.061  Sum_probs=18.1

Q ss_pred             hcccccCCCCCcccceeeeeeEEcCchHHHHHHHHc
Q psy1911          48 KKKKRLYLPTSIPLSHYFKKAHMVPRCNAMRQIYEH   83 (342)
Q Consensus        48 ~~kk~~~~p~s~~~~~~i~~~~~v~~~~~~~~i~~~   83 (342)
                      ++-..|+..........-..|..+..+..|++|+++
T Consensus       125 ~~~~~~~~qm~~~~~q~~~~~~~~~kD~FM~eLLnR  160 (318)
T PF15246_consen  125 KRPQDCYNQMGQEQRQLPSMYQCSGKDTFMQELLNR  160 (318)
T ss_pred             cCCCccccccccccccCcCcccccchhHHHHHHHHh
Confidence            344566666533232222223334444448888875


No 56 
>KOG4128|consensus
Probab=31.11  E-value=21  Score=35.02  Aligned_cols=76  Identities=20%  Similarity=0.215  Sum_probs=50.1

Q ss_pred             CcccccccccchHHHHHhhHHHHHHHHHHhCCccccccCHHHHHh--------------------hCCCCC---------
Q psy1911         203 LRHIADQSNCGSCWAVSVANAISDRLCIASNGYFTGQISAQHIVA--------------------CTPNCW---------  253 (342)
Q Consensus       203 i~~v~dQg~CGsCwAfa~~~~le~r~~i~~~~~~~~~LS~Q~lld--------------------C~~~~~---------  253 (342)
                      -.||.+|..-|-||.|+++..|---+...-+- ....||..+|+-                    |-...+         
T Consensus        62 ~~pvtnqkssGrcWift~ln~lrl~~~~kLnl-~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~n  140 (457)
T KOG4128|consen   62 RQPVTNQKSSGRCWIFTGLNLLRLEMDRKLNL-PEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKN  140 (457)
T ss_pred             CcccccCcCCCceEEEechhHHHHHHHhcCCc-chhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhC
Confidence            37899999999999999998874333322221 245677655432                    111111         


Q ss_pred             -CCCCCchhhHHHHhhhcccccCCCCC
Q psy1911         254 -GCNGGWPQLAWRFWGHNGVVTGGDYN  279 (342)
Q Consensus       254 -GC~GG~~~~a~~y~~~~Gi~~e~~Y~  279 (342)
                       -=+||.-..-..-+++.|+++-.-|+
T Consensus       141 P~~DGGqw~MfvNlVkKYGviPKkcy~  167 (457)
T KOG4128|consen  141 PVPDGGQWQMFVNLVKKYGVIPKKCYL  167 (457)
T ss_pred             CCCCCchHHHHHHHHHHhCCCcHHhcc
Confidence             24688877777888888998866663


No 57 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=24.97  E-value=75  Score=30.48  Aligned_cols=27  Identities=22%  Similarity=0.231  Sum_probs=23.2

Q ss_pred             cceEEEEeeecccC--CccEEEEecCCCC
Q psy1911         113 GLHAVRVLGWGVEN--DIPYWLVANSWND  139 (342)
Q Consensus       113 ~~Hav~ivGyg~~~--g~~ywiikNSWG~  139 (342)
                      .+|+-.|++...-+  +.....++|-||.
T Consensus       235 ~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~  263 (315)
T cd00044         235 KGHAYSVLDVREVQEEGLRLLRLRNPWGV  263 (315)
T ss_pred             cCcceEEeEEEEEccCceEEEEecCCccC
Confidence            58999999998655  7788999999995


No 58 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=24.61  E-value=1.6e+02  Score=25.89  Aligned_cols=34  Identities=29%  Similarity=0.364  Sum_probs=23.8

Q ss_pred             HHHHHHcCCeEEEEeeccccccccccccccCCCCCccceEEEEeeeccc
Q psy1911          77 MRQIYEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE  125 (342)
Q Consensus        77 ~~~i~~~GPV~v~i~v~~~f~~y~~Gi~~~~~~~~~~~Hav~ivGyg~~  125 (342)
                      ...|.++|||-+++..              + .+....|+++|+|...+
T Consensus       102 ~~LL~~yGPLwv~~~~--------------P-~~~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen  102 ANLLREYGPLWVAWEA--------------P-GDSWVAHASVITGIDGD  135 (166)
T ss_pred             HHHHHHcCCeEEEecC--------------C-CCcceeeEEEEEeecCC
Confidence            4568889999998543              2 22223699999998755


No 59 
>KOG1144|consensus
Probab=23.41  E-value=1.4e+02  Score=32.71  Aligned_cols=10  Identities=20%  Similarity=0.381  Sum_probs=6.3

Q ss_pred             HHHHHHcCCe
Q psy1911          77 MRQIYEHGPL   86 (342)
Q Consensus        77 ~~~i~~~GPV   86 (342)
                      +++|+..||+
T Consensus       298 l~~ll~sg~~  307 (1064)
T KOG1144|consen  298 LKQLLASGGG  307 (1064)
T ss_pred             HHHHHhcCCC
Confidence            5667777743


No 60 
>PF12569 NARP1:  NMDA receptor-regulated protein 1 ;  InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=22.42  E-value=78  Score=32.94  Aligned_cols=9  Identities=22%  Similarity=0.663  Sum_probs=4.8

Q ss_pred             HHHHHHcCC
Q psy1911          77 MRQIYEHGP   85 (342)
Q Consensus        77 ~~~i~~~GP   85 (342)
                      ++-|.+.+|
T Consensus       480 l~pL~~~a~  488 (517)
T PF12569_consen  480 LKPLLELAP  488 (517)
T ss_pred             HHHHHHhCc
Confidence            344555555


No 61 
>KOG3925|consensus
Probab=21.44  E-value=1.4e+02  Score=29.46  Aligned_cols=8  Identities=25%  Similarity=0.505  Sum_probs=3.5

Q ss_pred             CeEEEEee
Q psy1911          85 PLVAIFSV   92 (342)
Q Consensus        85 PV~v~i~v   92 (342)
                      +++..|.|
T Consensus        97 R~aslf~V  104 (371)
T KOG3925|consen   97 RAASLFRV  104 (371)
T ss_pred             hhheeecc
Confidence            34444444


Done!