Query         psy1664
Match_columns 524
No_of_seqs    409 out of 2951
Neff          8.1 
Searched_HMMs 46136
Date          Fri Aug 16 18:13:04 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1664.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1664hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0 5.7E-70 1.2E-74  527.3  21.5  294    4-350    66-366 (372)
  2 PTZ00203 cathepsin L protease; 100.0 4.6E-66   1E-70  525.8  27.0  298    1-347    30-332 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0 6.9E-64 1.5E-68  523.5  25.1  307    4-353   164-486 (489)
  4 PTZ00200 cysteine proteinase;  100.0 7.3E-63 1.6E-67  514.7  25.0  298    4-353   121-443 (448)
  5 KOG1543|consensus              100.0 4.3E-61 9.2E-66  486.1  24.5  285   13-348    30-316 (325)
  6 cd02620 Peptidase_C1A_Cathepsi 100.0 1.8E-54   4E-59  422.4  23.8  235   98-492     1-235 (236)
  7 PTZ00049 cathepsin C-like prot 100.0 2.9E-51 6.3E-56  436.7  24.9  271   94-500   378-681 (693)
  8 cd02621 Peptidase_C1A_Cathepsi 100.0 2.4E-51 5.2E-56  402.6  21.9  218   97-349     1-236 (243)
  9 cd02698 Peptidase_C1A_Cathepsi 100.0 3.4E-51 7.4E-56  400.1  21.6  214   97-340     1-220 (239)
 10 PTZ00364 dipeptidyl-peptidase  100.0 3.3E-49 7.2E-54  417.6  22.8  231   94-496   202-460 (548)
 11 cd02248 Peptidase_C1A Peptidas 100.0 1.1E-48 2.3E-53  375.8  21.1  204   98-349     1-206 (210)
 12 KOG1544|consensus              100.0 1.8E-48 3.9E-53  371.6   7.9  302   39-498   152-463 (470)
 13 PF00112 Peptidase_C1:  Papain  100.0 2.8E-46   6E-51  360.7  16.7  210   97-350     1-215 (219)
 14 smart00645 Pept_C1 Papain fami 100.0 1.9E-43 4.1E-48  328.7  16.7  164   97-346     1-166 (174)
 15 cd02619 Peptidase_C1 C1 Peptid 100.0 4.3E-41 9.4E-46  325.2  18.9  202  100-340     1-214 (223)
 16 PTZ00462 Serine-repeat antigen 100.0 4.8E-40   1E-44  359.8  21.4  223  107-348   538-774 (1004)
 17 cd02698 Peptidase_C1A_Cathepsi  99.9   3E-27 6.5E-32  231.2  14.1   99  392-496   136-239 (239)
 18 KOG1543|consensus               99.9 2.1E-27 4.6E-32  240.6  13.1  111  379-496   213-324 (325)
 19 cd02621 Peptidase_C1A_Cathepsi  99.9 9.3E-27   2E-31  228.5  13.2  101  391-496   130-243 (243)
 20 KOG1542|consensus               99.9   1E-26 2.2E-31  226.4   9.7  105  382-493   262-369 (372)
 21 PTZ00203 cathepsin L protease;  99.9 5.7E-26 1.2E-30  231.8  12.0   99  386-493   240-338 (348)
 22 COG4870 Cysteine protease [Pos  99.9 5.7E-26 1.2E-30  223.8   5.9  205   95-340    97-315 (372)
 23 cd02248 Peptidase_C1A Peptidas  99.9   1E-24 2.2E-29  209.3  14.2  101  384-491   106-208 (210)
 24 cd02620 Peptidase_C1A_Cathepsi  99.9 3.7E-25 8.1E-30  216.0  11.2   94  247-349   139-232 (236)
 25 PTZ00021 falcipain-2; Provisio  99.9 7.1E-24 1.5E-28  222.7  12.0  107  386-497   375-488 (489)
 26 PTZ00200 cysteine proteinase;   99.9 1.6E-23 3.5E-28  219.3  12.9   95  395-497   348-445 (448)
 27 PF00112 Peptidase_C1:  Papain   99.9 2.8E-23   6E-28  200.2  11.2   94  393-493   122-218 (219)
 28 PTZ00462 Serine-repeat antigen  99.9 3.5E-23 7.6E-28  227.8  11.2  125  394-523   680-812 (1004)
 29 PTZ00049 cathepsin C-like prot  99.9 7.8E-22 1.7E-26  211.6   9.3   94  250-350   555-671 (693)
 30 PTZ00364 dipeptidyl-peptidase   99.8 3.1E-21 6.7E-26  205.1   9.5   95  248-350   339-454 (548)
 31 cd00585 Peptidase_C1B Peptidas  99.8 3.4E-20 7.5E-25  193.0  16.3  213  114-337    55-398 (437)
 32 KOG1544|consensus               99.8 3.6E-20 7.7E-25  178.0   6.8   98  246-345   347-452 (470)
 33 smart00645 Pept_C1 Papain fami  99.8 1.3E-19 2.9E-24  168.5   8.7   75  408-490    93-170 (174)
 34 cd02619 Peptidase_C1 C1 Peptid  99.8   4E-18 8.6E-23  164.6  12.7   84  393-481   124-213 (223)
 35 cd00585 Peptidase_C1B Peptidas  99.5 9.6E-14 2.1E-18  145.0   8.5   86  386-480   288-399 (437)
 36 PF03051 Peptidase_C1_2:  Pepti  99.5 8.5E-13 1.8E-17  137.9  14.4   76  114-190    56-158 (438)
 37 COG4870 Cysteine protease [Pos  99.3 1.8E-12 3.9E-17  128.8   8.2  124  394-524   224-351 (372)
 38 PF08246 Inhibitor_I29:  Cathep  99.3 1.5E-12 3.2E-17   98.3   2.4   58    9-67      1-58  (58)
 39 smart00848 Inhibitor_I29 Cathe  99.0   1E-10 2.3E-15   87.8   0.7   57    9-66      1-57  (57)
 40 PF03051 Peptidase_C1_2:  Pepti  98.4 9.3E-07   2E-11   92.9   9.1   87  386-479   289-399 (438)
 41 COG3579 PepC Aminopeptidase C   98.3 3.9E-06 8.4E-11   82.6  10.3   80  251-336   296-400 (444)
 42 PF08127 Propeptide_C1:  Peptid  97.2 0.00019 4.1E-09   49.5   1.7   36   38-76      4-39  (41)
 43 PF05543 Peptidase_C47:  Stapho  96.2   0.041 8.8E-07   50.2   9.8  120  118-324    18-145 (175)
 44 KOG4128|consensus               95.4   0.023   5E-07   56.4   5.0   75  114-189    63-166 (457)
 45 COG3579 PepC Aminopeptidase C   95.3   0.014   3E-07   58.2   3.2   72  401-478   308-400 (444)
 46 PF13529 Peptidase_C39_2:  Pept  94.6    0.44 9.5E-06   41.6  10.9   57  250-323    87-144 (144)
 47 PF13529 Peptidase_C39_2:  Pept  82.5     5.7 0.00012   34.3   7.5   60  390-465    85-144 (144)
 48 PF14399 Transpep_BrtH:  NlpC/p  76.2     6.1 0.00013   40.0   6.3   47  252-304    78-124 (317)
 49 PF05543 Peptidase_C47:  Stapho  75.5       7 0.00015   35.9   5.7   56  392-465    89-144 (175)
 50 PF09778 Guanylate_cyc_2:  Guan  74.7     8.8 0.00019   36.6   6.4   54  251-304   112-172 (212)
 51 COG4990 Uncharacterized protei  70.1     8.9 0.00019   35.4   5.0   39  250-304   121-159 (195)
 52 PF12385 Peptidase_C70:  Papain  66.4      57  0.0012   29.6   9.2   38  252-304    98-135 (166)
 53 PF14399 Transpep_BrtH:  NlpC/p  65.9      13 0.00029   37.4   6.2   47  395-447    79-125 (317)
 54 KOG4128|consensus               55.8     1.8 3.9E-05   43.5  -2.2   42  434-478   371-412 (457)
 55 PF09778 Guanylate_cyc_2:  Guan  50.4      56  0.0012   31.2   6.9   53  393-447   112-173 (212)
 56 cd02549 Peptidase_C39A A sub-f  43.6      43 0.00094   28.9   4.9   34  255-302    70-103 (141)
 57 COG4990 Uncharacterized protei  33.7      98  0.0021   28.7   5.4   40  392-447   121-160 (195)
 58 PF12385 Peptidase_C70:  Papain  31.5 1.1E+02  0.0023   27.9   5.2   39  394-447    98-136 (166)
 59 cd02549 Peptidase_C39A A sub-f  29.9 1.9E+02  0.0042   24.7   6.8   34  397-444    70-103 (141)
 60 PF01640 Peptidase_C10:  Peptid  29.5 1.5E+02  0.0032   27.8   6.2   52  253-334   141-192 (192)
 61 cd00044 CysPc Calpains, domain  29.4      68  0.0015   32.6   4.2   29  292-325   235-263 (315)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=5.7e-70  Score=527.35  Aligned_cols=294  Identities=26%  Similarity=0.413  Sum_probs=244.5

Q ss_pred             cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCC--CCCCC
Q psy1664           4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPD--SKLPQ   81 (524)
Q Consensus         4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~--~~~~~   81 (524)
                      ...+.|..|+.+|.|+|.+.+|...|+.+|++|+..++++++... .+-..|+|+|||||.|||+++++..+.  .+.+.
T Consensus        66 ~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~-gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~  144 (372)
T KOG1542|consen   66 GLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDP-GSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG  144 (372)
T ss_pred             chHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCc-cccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence            347899999999999999999999999999999999999987652 477889999999999999997654332  21111


Q ss_pred             CCCCcccccCCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCC
Q psy1664          82 NRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD  161 (524)
Q Consensus        82 ~~~~~~~~~~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~  161 (524)
                      .   .......+...||++||||++    |+||||||||+||||||||+++++|.+++|+++  ++++||||+||||+ .
T Consensus       145 ~---~~~~~~~~~~~lP~~fDWR~k----gaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g--~LvsLSEQeLvDCD-~  214 (372)
T KOG1542|consen  145 D---AAEAPIEPGESLPESFDWRDK----GAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATG--KLVSLSEQELVDCD-S  214 (372)
T ss_pred             c---cccCcCCCCCCCCcccchhcc----CCccccccCCcCcchhhhhhhhhhhhHHHhhcC--cccccchhhhhccc-C
Confidence            1   111112445689999999999    999999999999999999999999999999986  68999999999995 5


Q ss_pred             CCCCCCCCChHHHHHHHHH-hCCccCCccCCCCCccccccCcccccCCCCC-CCCCCCCCCccccccccCCCcccccccc
Q psy1664         162 CGNGCQGGFHGKAWKYWVT-TGIVSGGTYASKQGCRPYEIPCERYMNGSHS-SCQDNEPNTPECIRKCQPGYDVSYEDDL  239 (524)
Q Consensus       162 ~~~gC~GG~~~~a~~~~~~-~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~-~C~~~~~~~~~~~~~~~~~~~~~~~~~~  239 (524)
                      .++||+||.+..||+|+++ .|+..|.+|       ||++.        .+ .|...+.....                 
T Consensus       215 ~d~gC~GGl~~nA~~~~~~~gGL~~E~dY-------PY~g~--------~~~~C~~~~~~~~v-----------------  262 (372)
T KOG1542|consen  215 CDNGCNGGLMDNAFKYIKKAGGLEKEKDY-------PYTGK--------KGNQCHFDKSKIVV-----------------  262 (372)
T ss_pred             cCCcCCCCChhHHHHHHHHhCCccccccC-------Ccccc--------CCCccccchhhceE-----------------
Confidence            6889999999999999655 489999999       99987        44 78765533221                 


Q ss_pred             ccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEc--CCCCCC-CCcEEEEEEeccCCCCCCCccceeE
Q psy1664         240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH--VAGGPL-GEHAIRIIGWGQEPLGEGTSSVVKY  316 (524)
Q Consensus       240 ~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~--~~~~~~-~~HaV~iVGyg~~~~~~g~~~g~~Y  316 (524)
                       +......++.|+++|.+.|+++|||+|+|++ ..+|+|++||..+  ..|++. ++|+|+|||||...      -.++|
T Consensus       263 -~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g------~~~PY  334 (372)
T KOG1542|consen  263 -SIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG------YEKPY  334 (372)
T ss_pred             -EEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCCcccCCccccCceEEEEeecCCC------CCCce
Confidence             1111245677999999999999999999995 6799999999987  457664 99999999999862      25899


Q ss_pred             EEEeCCCCCcccccCccccccccCccCCcCcCCc
Q psy1664         317 WLVANSFNTNWGENGLFRIGCRPYEIPCERYMNG  350 (524)
Q Consensus       317 WivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~  350 (524)
                      ||||||||++|||+||+||.|+.+  .||+....
T Consensus       335 WIVKNSWG~~WGE~GY~~l~RG~N--~CGi~~mv  366 (372)
T KOG1542|consen  335 WIVKNSWGTSWGEKGYYKLCRGSN--ACGIADMV  366 (372)
T ss_pred             EEEECCccccccccceEEEecccc--ccccccch
Confidence            999999999999999999999977  68886544


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=4.6e-66  Score=525.81  Aligned_cols=298  Identities=21%  Similarity=0.424  Sum_probs=231.8

Q ss_pred             CCCcHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHh-CCCCCCCC
Q psy1664           1 MGKSTADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRM-GVHPDSKL   79 (524)
Q Consensus         1 ~~~st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~-~~~~~~~~   79 (524)
                      |+++..+.|++|+++|+|+|.+.+|...|+.+|.+|+++|++||++.  .+|++++|+|+|||.|||++++ +.......
T Consensus        30 ~~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~--~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~  107 (348)
T PTZ00203         30 VGTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARN--PHARFGITKFFDLSEAEFAARYLNGAAYFAA  107 (348)
T ss_pred             cccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccC--CCeEEeccccccCCHHHHHHHhcCCCccccc
Confidence            46678899999999999999998888899999999999999999875  6899999999999999999754 22111100


Q ss_pred             CCCCCCc-ccccCCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhh
Q psy1664          80 PQNRLPL-LVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSC  158 (524)
Q Consensus        80 ~~~~~~~-~~~~~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC  158 (524)
                      +...... .........+||++||||++    |+|+||||||.||||||||++++||++++|+++  ..+.||+|||+||
T Consensus       108 ~~~~~~~~~~~~~~~~~~lP~~~DWR~~----g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~--~~~~LSeQqLvdC  181 (348)
T PTZ00203        108 AKQHAGQHYRKARADLSAVPDAVDWREK----GAVTPVKNQGACGSCWAFSAVGNIESQWAVAGH--KLVRLSEQQLVSC  181 (348)
T ss_pred             ccccccccccccccccccCCCCCcCCcC----CCCCCccccCCCccHHHHhhHHHHHHHHHHhcC--CCccCCHHHHHhc
Confidence            0000000 00011122369999999998    899999999999999999999999999999975  4689999999999


Q ss_pred             cCCCCCCCCCCChHHHHHHHHHh---CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCcccc
Q psy1664         159 CKDCGNGCQGGFHGKAWKYWVTT---GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY  235 (524)
Q Consensus       159 ~~~~~~gC~GG~~~~a~~~~~~~---Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~  235 (524)
                      +. .+.||+||++..||+|++++   |+++|++|       ||...     .+....|.......        ...    
T Consensus       182 ~~-~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~Y-------PY~~~-----~~~~~~C~~~~~~~--------~~~----  236 (348)
T PTZ00203        182 DH-VDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY-------PYVSG-----NGDVPECSNSSELA--------PGA----  236 (348)
T ss_pred             cC-CCCCCCCCCHHHHHHHHHHhcCCCCCccccC-------CCccC-----CCCCCcCCCCcccc--------cce----
Confidence            75 36799999999999999865   47888888       99865     11112454211000        000    


Q ss_pred             ccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCcccee
Q psy1664         236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVK  315 (524)
Q Consensus       236 ~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~  315 (524)
                       ....    ...++.++++|+.+|+++|||+|+|++. +|++|++|||+. +....++|||+|||||.+       ++++
T Consensus       237 -~i~~----~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~-------~g~~  302 (348)
T PTZ00203        237 -RIDG----YVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMT-------GEVP  302 (348)
T ss_pred             -Eecc----eeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecC-------CCce
Confidence             1111    1334557888999999999999999985 899999999985 333457999999999976       4689


Q ss_pred             EEEEeCCCCCcccccCccccccccCccCCcCc
Q psy1664         316 YWLVANSFNTNWGENGLFRIGCRPYEIPCERY  347 (524)
Q Consensus       316 YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~  347 (524)
                      |||||||||++|||+|||||+++..  .|++.
T Consensus       303 YWiikNSWG~~WGe~GY~ri~rg~n--~Cgi~  332 (348)
T PTZ00203        303 YWVIKNSWGEDWGEKGYVRVTMGVN--ACLLT  332 (348)
T ss_pred             EEEEEcCCCCCcCcCceEEEEcCCC--ccccc
Confidence            9999999999999999999998754  57764


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=6.9e-64  Score=523.54  Aligned_cols=307  Identities=21%  Similarity=0.423  Sum_probs=235.9

Q ss_pred             cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCCCCCCC--
Q psy1664           4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQ--   81 (524)
Q Consensus         4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~~~~~~--   81 (524)
                      -++.+|++|+++|+|+|.+.+|...|+.+|.+|+++|++||++.+ .+|++++|+|+|||.|||++++...+......  
T Consensus       164 e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~-~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~~  242 (489)
T PTZ00021        164 ENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKEN-VLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSNG  242 (489)
T ss_pred             HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCC-CCEEEeccccccCCHHHHHHHhcccccccccccc
Confidence            457889999999999999999999999999999999999998754 79999999999999999998543221100000  


Q ss_pred             ---CCCCcc----ccc-CCCCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHH
Q psy1664          82 ---NRLPLL----VQL-SDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSD  153 (524)
Q Consensus        82 ---~~~~~~----~~~-~~~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q  153 (524)
                         ......    ... +.....+|++||||+.    |.|+||||||.||||||||++++||++++|+++  ..+.||+|
T Consensus       243 ~~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~----g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g--~~v~LSeQ  316 (489)
T PTZ00021        243 KKSPRVINYDDVIKKYKPKDATFDHAKYDWRLH----NGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKN--ELVSLSEQ  316 (489)
T ss_pred             ccccccccccccccccccccccCCccccccccC----CCCCCcccccccccHHHHHHHHHHHHHHHHHcC--CCcccCHH
Confidence               000000    000 0011125999999998    899999999999999999999999999999976  46899999


Q ss_pred             HHHhhcCCCCCCCCCCChHHHHHHHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCc
Q psy1664         154 DLVSCCKDCGNGCQGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD  232 (524)
Q Consensus       154 ~lvdC~~~~~~gC~GG~~~~a~~~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~  232 (524)
                      ||+||+. .+.||+||++..||+|+.++ |+++|++|       ||.+.       ..+.|....         |...+.
T Consensus       317 qLVDCs~-~n~GC~GG~~~~Af~yi~~~gGl~tE~~Y-------PY~~~-------~~~~C~~~~---------~~~~~~  372 (489)
T PTZ00021        317 ELVDCSF-KNNGCYGGLIPNAFEDMIELGGLCSEDDY-------PYVSD-------TPELCNIDR---------CKEKYK  372 (489)
T ss_pred             HHhhhcc-CCCCCCCcchHhhhhhhhhccccCccccc-------CccCC-------CCCcccccc---------ccccce
Confidence            9999975 36799999999999999876 89998888       99864       125565321         111111


Q ss_pred             cccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCC---C
Q psy1664         233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE---G  309 (524)
Q Consensus       233 ~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~---g  309 (524)
                      +     ..|    ..++  +++|+++|+.+|||+|+|+++.+|++|++|||+.. |...++|||+|||||.+...+   +
T Consensus       373 i-----~~y----~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~-C~~~~nHAVlIVGYG~e~~~~~~~~  440 (489)
T PTZ00021        373 I-----KSY----VSIP--EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE-CGEEPNHAVILVGYGMEEIYNSDTK  440 (489)
T ss_pred             e-----eeE----EEec--HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCC-CCCccceEEEEEEecCcCCcccccc
Confidence            1     111    2333  57899999999999999999889999999999874 655689999999999763100   0


Q ss_pred             CccceeEEEEeCCCCCcccccCccccccccC--ccCCcCcCCcCCC
Q psy1664         310 TSSVVKYWLVANSFNTNWGENGLFRIGCRPY--EIPCERYMNGSRS  353 (524)
Q Consensus       310 ~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~--~~~c~~~~~~~~~  353 (524)
                      ...+.+|||||||||++|||+|||||+++..  ...||+.+.+.+|
T Consensus       441 ~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP  486 (489)
T PTZ00021        441 KMEKRYYYIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVP  486 (489)
T ss_pred             cCCCCCEEEEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccceeE
Confidence            0123579999999999999999999998753  2479998766554


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=7.3e-63  Score=514.71  Aligned_cols=298  Identities=24%  Similarity=0.418  Sum_probs=230.3

Q ss_pred             cHHHHHHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhC-CCCCCCC---
Q psy1664           4 STADAVATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMG-VHPDSKL---   79 (524)
Q Consensus         4 st~~~f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~-~~~~~~~---   79 (524)
                      .+...|++|+++|+|+|.+.+|...|+.+|.+|++.|++||..   .+|++|+|+|+|||.|||.+++. ...+...   
T Consensus       121 e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~---~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~  197 (448)
T PTZ00200        121 EVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD---EPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNST  197 (448)
T ss_pred             HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc---CCeEEeccccccCCHHHHHHHhccCCCccccccc
Confidence            3567899999999999999999999999999999999999963   58999999999999999998653 2211100   


Q ss_pred             -CCCC-------CCcccc---c-----C---CCCCCCCCccccccCCCCCCCCccCCCCC-CCccHHHHHHHHHHHHHHH
Q psy1664          80 -PQNR-------LPLLVQ---L-----S---DPLEELPEGFDARINWPYCPTIQEIRDQG-SCGSGWALGAVEAMSDRVC  139 (524)
Q Consensus        80 -~~~~-------~~~~~~---~-----~---~~~~~lP~s~DwR~~~~~~g~vtpvkdQg-~CGsCwAfA~~~~le~~~~  139 (524)
                       +...       .+....   .     .   .....+|++||||+.    +.|+|||||| .||||||||++++||++++
T Consensus       198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~----g~vtpVkdQG~~CGSCWAFat~~aiEs~~~  273 (448)
T PTZ00200        198 SHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRA----DAVTKVKDQGLNCGSCWAFSSVGSVESLYK  273 (448)
T ss_pred             ccccccccccccccccccccccccccccccccccccCCCCccCCCC----CCCCCcccCCCccchHHHHhHHHHHHHHHH
Confidence             0000       000000   0     0   001236999999998    8899999999 9999999999999999999


Q ss_pred             HHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCC
Q psy1664         140 IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPN  219 (524)
Q Consensus       140 i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~  219 (524)
                      |+++  ..+.||+|||+||+. .++||+||++..||+|++++|+++|++|       ||.+.        .+.|......
T Consensus       274 i~~~--~~~~LSeQqLvDC~~-~~~GC~GG~~~~A~~yi~~~Gi~~e~~Y-------PY~~~--------~~~C~~~~~~  335 (448)
T PTZ00200        274 IYRD--KSVDLSEQELVNCDT-KSQGCSGGYPDTALEYVKNKGLSSSSDV-------PYLAK--------DGKCVVSSTK  335 (448)
T ss_pred             HhcC--CCeecCHHHHhhccC-ccCCCCCCcHHHHHHHHhhcCccccccC-------CCCCC--------CCCCcCCCCC
Confidence            9865  468999999999975 3679999999999999999999998888       99876        6677643211


Q ss_pred             CccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEE
Q psy1664         220 TPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII  299 (524)
Q Consensus       220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iV  299 (524)
                      .          .     ....     +.+..+. +++++++.+|||+|+|.++.+|+.|++|||+++ |...++|||+||
T Consensus       336 ~----------~-----~i~~-----y~~~~~~-~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~-C~~~~nHaV~lV  393 (448)
T PTZ00200        336 K----------V-----YIDS-----YLVAKGK-DVLNKSLVISPTVVYIAVSRELLKYKSGVYNGE-CGKSLNHAVLLV  393 (448)
T ss_pred             e----------e-----Eecc-----eEecCHH-HHHHHHHhcCCEEEEeecccccccCCCCccccc-cCCCCcEEEEEE
Confidence            0          0     0111     1222333 455556678999999999989999999999875 555589999999


Q ss_pred             EeccCCCCCCCccceeEEEEeCCCCCcccccCccccccccC-ccCCcCcCCcCCC
Q psy1664         300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPY-EIPCERYMNGSRS  353 (524)
Q Consensus       300 Gyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~-~~~c~~~~~~~~~  353 (524)
                      |||.+.     ++|.+|||||||||++|||+|||||++... ...||+.+.+.+|
T Consensus       394 GyG~d~-----~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~~~P  443 (448)
T PTZ00200        394 GEGYDE-----KTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVGLTP  443 (448)
T ss_pred             EecccC-----CCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccceee
Confidence            999752     246899999999999999999999998642 2368887665444


No 5  
>KOG1543|consensus
Probab=100.00  E-value=4.3e-61  Score=486.13  Aligned_cols=285  Identities=31%  Similarity=0.550  Sum_probs=235.2

Q ss_pred             HHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCCCCCCCCCCCcccccCC
Q psy1664          13 LKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSD   92 (524)
Q Consensus        13 ~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~~~~~~~~~~~~~~~~~   92 (524)
                      +..|.+.|.+..+...|+.+|.+|++.|+.||.... .+|++++|+|+|++.+|+++..........    .........
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~-~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~----~~~~~~~~~  104 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYV-LSFLMGVNQFADLTTEEFKRKKTGKKPPEI----KRDKFTEKL  104 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhc-eeeeeccccccccchHHHHHhhccccCccc----ccccccccc
Confidence            445667777777788889999999999999999853 899999999999999999986544322210    111111223


Q ss_pred             CCCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664          93 PLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG  172 (524)
Q Consensus        93 ~~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~  172 (524)
                      ...+||++||||++|   .+++||||||.||||||||++++||++++|++++ .++.||+|+|+||+..+++||+||++.
T Consensus       105 ~~~~~p~s~DwR~~~---~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~-~l~sLSeq~lvdC~~~~~~GC~GG~~~  180 (325)
T KOG1543|consen  105 DGDDLPDSFDWRDKG---AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGG-KLLSLSEQDLVDCCGECGDGCNGGEPK  180 (325)
T ss_pred             chhhCCCCccccccC---CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCC-ccCccChhhhhhccCCCCCCcCCCCHH
Confidence            345899999999996   4567799999999999999999999999999976 689999999999987767899999999


Q ss_pred             HHHHHHHHhCCcc-CCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCC
Q psy1664         173 KAWKYWVTTGIVS-GGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPAN  251 (524)
Q Consensus       173 ~a~~~~~~~Gi~~-e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  251 (524)
                      .|++|++++|+++ +.+|       ||...        .+.|......                  ...+....+.++.+
T Consensus       181 ~A~~yi~~~G~~t~~~~Y-------py~~~--------~~~C~~~~~~------------------~~~~~~~~~~~~~~  227 (325)
T KOG1543|consen  181 NAFKYIKKNGGVTECENY-------PYIGK--------DGTCKSNKKD------------------KTVTIKGFYNVPAN  227 (325)
T ss_pred             HHHHHHHHhCCCCCCcCC-------CCcCC--------CCCccCCCcc------------------ceeEeeeeeecCcC
Confidence            9999999999888 8888       99877        5577654420                  11122224567888


Q ss_pred             HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCC-CCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCccccc
Q psy1664         252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN  330 (524)
Q Consensus       252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~-~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~  330 (524)
                      +++|+.+|+.+|||+|+|+++.+|+.|++|||.+++|.. .++|||+|||||+ .      ++.+|||||||||++|||+
T Consensus       228 e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~-~------~~~~YWivkNSWG~~WGe~  300 (325)
T KOG1543|consen  228 EEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGT-G------DGVDYWIVKNSWGTDWGEK  300 (325)
T ss_pred             HHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcC-C------CCceeEEEEcCCCCCcccC
Confidence            999999999999999999999999999999999999887 4999999999998 3      5689999999999999999


Q ss_pred             CccccccccCccCCcCcC
Q psy1664         331 GLFRIGCRPYEIPCERYM  348 (524)
Q Consensus       331 Gy~ri~~~~~~~~c~~~~  348 (524)
                      |||||.++..  .|++..
T Consensus       301 Gy~ri~r~~~--~~~I~~  316 (325)
T KOG1543|consen  301 GYFRIARGVN--KCGIAS  316 (325)
T ss_pred             ceEEEecCCC--chhhhc
Confidence            9999999877  355443


No 6  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=1.8e-54  Score=422.38  Aligned_cols=235  Identities=55%  Similarity=1.119  Sum_probs=186.0

Q ss_pred             CCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHH
Q psy1664          98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY  177 (524)
Q Consensus        98 P~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~  177 (524)
                      |++||||++|+++..|+||+|||.||||||||++++||++++|++++...+.||+|+|+||+...+.||+||++..||+|
T Consensus         1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~   80 (236)
T cd02620           1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY   80 (236)
T ss_pred             CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence            89999999988776678999999999999999999999999998764457899999999997654679999999999999


Q ss_pred             HHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHH
Q psy1664         178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMR  257 (524)
Q Consensus       178 ~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~  257 (524)
                      ++++|+++|++|       ||...        ...|...                                         
T Consensus        81 i~~~G~~~e~~y-------PY~~~--------~~~~~~~-----------------------------------------  104 (236)
T cd02620          81 LTTTGVVTGGCQ-------PYTIP--------PCGHHPE-----------------------------------------  104 (236)
T ss_pred             HHhcCCCcCCEe-------cCcCC--------CCccCCC-----------------------------------------
Confidence            999999998877       99754        1111000                                         


Q ss_pred             HHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCcccccc
Q psy1664         258 EIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGC  337 (524)
Q Consensus       258 ~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~  337 (524)
                                                         .|                                           
T Consensus       105 -----------------------------------~~-------------------------------------------  106 (236)
T cd02620         105 -----------------------------------GP-------------------------------------------  106 (236)
T ss_pred             -----------------------------------CC-------------------------------------------
Confidence                                               00                                           


Q ss_pred             ccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEeccccc
Q psy1664         338 RPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADM  417 (524)
Q Consensus       338 ~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f  417 (524)
                                     ..|..    ...|+..|.......|..+.+.....+.+..++++||.+|+++|||+++|.++++|
T Consensus       107 ---------------~~~~~----~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f  167 (236)
T cd02620         107 ---------------PPCCG----TPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDF  167 (236)
T ss_pred             ---------------CCCCC----CCCCCCCCCcCCccccceeeeeecceeeeCCHHHHHHHHHHHCCCeEEEEEechhh
Confidence                           00100    11111223222111133334444556666667899999999999999999998899


Q ss_pred             ccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCCCccCcccccee
Q psy1664         418 ILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITA  492 (524)
Q Consensus       418 ~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~  492 (524)
                      +.|++|||...+....++|||+|||||+++       +++|||||||||++||++|||||+||.|.|||+++++.
T Consensus       168 ~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~~-------g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~  235 (236)
T cd02620         168 LYYKSGVYQHTSGKQLGGHAVKIIGWGVEN-------GVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA  235 (236)
T ss_pred             hhcCCcEEeecCCCCcCCeEEEEEEEeccC-------CeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence            999999998765555679999999999886       88999999999999999999999999999999998764


No 7  
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=2.9e-51  Score=436.65  Aligned_cols=271  Identities=26%  Similarity=0.435  Sum_probs=191.4

Q ss_pred             CCCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCC--------ccccCCHHHHHhhcCCCCCC
Q psy1664          94 LEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGK--------RHVRLSSDDLVSCCKDCGNG  165 (524)
Q Consensus        94 ~~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~--------~~~~LS~q~lvdC~~~~~~g  165 (524)
                      ..+||++||||+.|+.++.++||+|||.||||||||++++||++++|+++..        ....||+|+||||+. .++|
T Consensus       378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~-~nqG  456 (693)
T PTZ00049        378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSF-YDQG  456 (693)
T ss_pred             cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCC-CCCC
Confidence            4589999999999999999999999999999999999999999999986421        123799999999975 4679


Q ss_pred             CCCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceee
Q psy1664         166 CQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIA  245 (524)
Q Consensus       166 C~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  245 (524)
                      |+||++..|++|++++||++|..|       ||++.        .+.|+.........                      
T Consensus       457 C~GG~~~~A~kya~~~GI~tEscY-------PY~a~--------~g~C~~~~~~~~~~----------------------  499 (693)
T PTZ00049        457 CNGGFPYLVSKMAKLQGIPLDKVF-------PYTAT--------EQTCPYQVDQSANS----------------------  499 (693)
T ss_pred             cCCCcHHHHHHHHHHCCCCcCCcc-------CCcCC--------CCCCCCCCCCcccc----------------------
Confidence            999999999999999999997777       99865        55675321100000                      


Q ss_pred             eecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCC
Q psy1664         246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT  325 (524)
Q Consensus       246 ~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~  325 (524)
                                                                          +.|+..                      
T Consensus       500 ----------------------------------------------------~~g~~~----------------------  505 (693)
T PTZ00049        500 ----------------------------------------------------MNGSAN----------------------  505 (693)
T ss_pred             ----------------------------------------------------cccccc----------------------
Confidence                                                                000000                      


Q ss_pred             cccccCccccccccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEc--CCCHHHHHHHHHh
Q psy1664         326 NWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSL--PANEETIMREIFR  403 (524)
Q Consensus       326 ~WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~--~~~~~~~~~~~~~  403 (524)
                               +.    +..+.+......+.|.     ...|...|... ...|.++..+...+|.+  ..++++||.+|++
T Consensus       506 ---------~~----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~-~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~  566 (693)
T PTZ00049        506 ---------LR----QINAVFFSSETQSDMH-----ADFEAPISSEP-ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYR  566 (693)
T ss_pred             ---------cc----cccccccccccccccc-----ccccccccccc-cceeeeeeEEecccccccCCCCHHHHHHHHHh
Confidence                     00    0000000000000010     00011111111 11122223333333443  2478999999999


Q ss_pred             CCCEEEEEecccccccccccEEeCC-------CCC--------------CccCeeEEEeeecCCCCCCCccCC--ccEEE
Q psy1664         404 HGPVEGSMTIYADMILYKTGIYKHV-------AGG--------------PLGEHAIRIIGWGQEPLGEGTSSV--VKYWL  460 (524)
Q Consensus       404 ~gPv~~~~~~~~~f~~y~~gi~~~~-------~~~--------------~~~~H~v~ivG~g~~~~~~~~~~~--~~ywi  460 (524)
                      +|||+|+|+++++|++|++|||+.+       |..              ..++|||+|||||.+..     .|  ++|||
T Consensus       567 ~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~e-----nG~~~~YWI  641 (693)
T PTZ00049        567 NGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEI-----NGKLYKYWI  641 (693)
T ss_pred             cCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccC-----CCcccCEEE
Confidence            9999999999889999999999853       211              13699999999998631     14  48999


Q ss_pred             EEcCCCCCCCCCcEEEEEeCCCccCccccceeccceeccc
Q psy1664         461 VANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIGLE  500 (524)
Q Consensus       461 v~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~~~~~  500 (524)
                      ||||||++||++|||||+||.|.||||++++++.|++..-
T Consensus       642 VRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~pd~~rg  681 (693)
T PTZ00049        642 GRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEPDFSRG  681 (693)
T ss_pred             EECCCCCCcccCceEEEEcCCCccCCccceeEEeeecccc
Confidence            9999999999999999999999999999999999998754


No 8  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=2.4e-51  Score=402.63  Aligned_cols=218  Identities=31%  Similarity=0.629  Sum_probs=173.9

Q ss_pred             CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCC----CccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664          97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRG----KRHVRLSSDDLVSCCKDCGNGCQGGFHG  172 (524)
Q Consensus        97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~----~~~~~LS~q~lvdC~~~~~~gC~GG~~~  172 (524)
                      ||++||||+.++++.+|+|||||+.||||||||++++||++++|+++.    ...+.||+|||+||+. .++||+||++.
T Consensus         1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~-~~~GC~GG~~~   79 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQ-YSQGCDGGFPF   79 (243)
T ss_pred             CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcC-CCCCCCCCCHH
Confidence            799999999977777999999999999999999999999999998764    2368999999999974 46799999999


Q ss_pred             HHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCcccccccccccee-eeecCCC
Q psy1664         173 KAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRI-AYSLPAN  251 (524)
Q Consensus       173 ~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~  251 (524)
                      .|++|++++|+++|++|       ||...       ..+.|......       +....      ...|... .+....+
T Consensus        80 ~a~~~~~~~Gi~~e~~y-------PY~~~-------~~~~C~~~~~~-------~~~~~------~~~~~~i~~~~~~~~  132 (243)
T cd02621          80 LVGKFAEDFGIVTEDYF-------PYTAD-------DDRPCKASPSE-------CRRYY------FSDYNYVGGCYGCTN  132 (243)
T ss_pred             HHHHHHHhcCcCCCcee-------CCCCC-------CCCCCCCCccc-------ccccc------ccceeEcccccccCC
Confidence            99999999999998777       99861       15667643200       00000      0011111 1112357


Q ss_pred             HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCC----CCC---------CCCcEEEEEEeccCCCCCCCccceeEEE
Q psy1664         252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----GGP---------LGEHAIRIIGWGQEPLGEGTSSVVKYWL  318 (524)
Q Consensus       252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~----~~~---------~~~HaV~iVGyg~~~~~~g~~~g~~YWi  318 (524)
                      +++||++|+++|||+|+|.++++|++|++|||+...    |..         .++|||+|||||++..     ++++|||
T Consensus       133 ~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-----~g~~YWi  207 (243)
T cd02621         133 EDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-----KGEKYWI  207 (243)
T ss_pred             HHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-----CCCcEEE
Confidence            889999999999999999999999999999998763    421         4799999999998631     3689999


Q ss_pred             EeCCCCCcccccCccccccccCccCCcCcCC
Q psy1664         319 VANSFNTNWGENGLFRIGCRPYEIPCERYMN  349 (524)
Q Consensus       319 vkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~  349 (524)
                      ||||||++|||+|||||+|+..  .|++...
T Consensus       208 irNSWG~~WGe~Gy~~i~~~~~--~cgi~~~  236 (243)
T cd02621         208 VKNSWGSSWGEKGYFKIRRGTN--ECGIESQ  236 (243)
T ss_pred             EEcCCCCCCCcCCeEEEecCCc--ccCcccc
Confidence            9999999999999999999764  6887654


No 9  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=3.4e-51  Score=400.11  Aligned_cols=214  Identities=31%  Similarity=0.600  Sum_probs=168.7

Q ss_pred             CCCccccccCCCCCCCCccCCCCC---CCccHHHHHHHHHHHHHHHHHcCCC-ccccCCHHHHHhhcCCCCCCCCCCChH
Q psy1664          97 LPEGFDARINWPYCPTIQEIRDQG---SCGSGWALGAVEAMSDRVCIASRGK-RHVRLSSDDLVSCCKDCGNGCQGGFHG  172 (524)
Q Consensus        97 lP~s~DwR~~~~~~g~vtpvkdQg---~CGsCwAfA~~~~le~~~~i~~~~~-~~~~LS~q~lvdC~~~~~~gC~GG~~~  172 (524)
                      ||++||||+++.. .+|+||||||   .||||||||++++||++++|++++. ..+.||+|||+||+.  +.||+||++.
T Consensus         1 lP~~~Dwr~~~~~-~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~--~~gC~GG~~~   77 (239)
T cd02698           1 LPKSWDWRNVNGV-NYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG--GGSCHGGDPG   77 (239)
T ss_pred             CCCCcccccCCCC-cccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC--CCCccCcCHH
Confidence            7999999998322 2799999998   8999999999999999999987653 357899999999975  6799999999


Q ss_pred             HHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCcccc--ccccCCCccccccccccceeeeecCC
Q psy1664         173 KAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI--RKCQPGYDVSYEDDLNFGRIAYSLPA  250 (524)
Q Consensus       173 ~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~  250 (524)
                      .|++|++++|+++|++|       ||...        ...|...... ..|.  ..|.............|    ..+ .
T Consensus        78 ~a~~~~~~~Gl~~e~~y-------PY~~~--------~~~C~~~~~~-~~c~~~~~c~~~~~~~~~~i~~~----~~~-~  136 (239)
T cd02698          78 GVYEYAHKHGIPDETCN-------PYQAK--------DGECNPFNRC-GTCNPFGECFAIKNYTLYFVSDY----GSV-S  136 (239)
T ss_pred             HHHHHHHHcCcCCCCee-------CCcCC--------CCCCcCCCCC-CCcccCcccccccccceEEeeec----eec-C
Confidence            99999999999998877       99865        4556432110 1111  11211100000011111    122 3


Q ss_pred             CHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCccccc
Q psy1664         251 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGEN  330 (524)
Q Consensus       251 ~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~  330 (524)
                      ++++||++|+++|||+|+|.++++|+.|++|||+..+|...++|||+|||||++.      ++++|||||||||++|||+
T Consensus       137 ~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~------~g~~YWiikNSWG~~WGe~  210 (239)
T cd02698         137 GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDE------NGVEYWIVRNSWGEPWGER  210 (239)
T ss_pred             CHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecC------CCCEEEEEEcCCCcccCcC
Confidence            5789999999999999999999999999999999888877889999999999863      2689999999999999999


Q ss_pred             CccccccccC
Q psy1664         331 GLFRIGCRPY  340 (524)
Q Consensus       331 Gy~ri~~~~~  340 (524)
                      |||||+++.+
T Consensus       211 Gy~~i~rg~~  220 (239)
T cd02698         211 GWFRIVTSSY  220 (239)
T ss_pred             ceEEEEccCC
Confidence            9999999873


No 10 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=3.3e-49  Score=417.57  Aligned_cols=231  Identities=26%  Similarity=0.462  Sum_probs=178.5

Q ss_pred             CCCCCCccccccCCCCCCCCccCCCCCC---CccHHHHHHHHHHHHHHHHHcCCC----ccccCCHHHHHhhcCCCCCCC
Q psy1664          94 LEELPEGFDARINWPYCPTIQEIRDQGS---CGSGWALGAVEAMSDRVCIASRGK----RHVRLSSDDLVSCCKDCGNGC  166 (524)
Q Consensus        94 ~~~lP~s~DwR~~~~~~g~vtpvkdQg~---CGsCwAfA~~~~le~~~~i~~~~~----~~~~LS~q~lvdC~~~~~~gC  166 (524)
                      ..+||++||||++. .+.+|+||||||.   ||||||||++++||++++|++++.    ..+.||+|+|+||+. .++||
T Consensus       202 ~~~LP~sfDWR~~g-g~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~-~n~GC  279 (548)
T PTZ00364        202 GDPPPAAWSWGDVG-GASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQ-YGQGC  279 (548)
T ss_pred             ccCCCCccccCcCC-CCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccC-CCCCC
Confidence            35799999999982 2247899999999   999999999999999999998542    358899999999974 47899


Q ss_pred             CCCChHHHHHHHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeee
Q psy1664         167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAY  246 (524)
Q Consensus       167 ~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  246 (524)
                      +||++..|++|++++|+++|++|     |.||...     .+....|+.....                           
T Consensus       280 dGG~p~~A~~yi~~~GI~tE~dY-----~~PY~~~-----dg~~~~Ck~~~~~---------------------------  322 (548)
T PTZ00364        280 AGGFPEEVGKFAETFGILTTDSY-----YIPYDSG-----DGVERACKTRRPS---------------------------  322 (548)
T ss_pred             CCCcHHHHHHHHHhCCccccccc-----CCCCCCC-----CCCCCCCCCCccc---------------------------
Confidence            99999999999999999997766     4588653     0001112210000                           


Q ss_pred             ecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCc
Q psy1664         247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN  326 (524)
Q Consensus       247 ~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~  326 (524)
                                                                                                      
T Consensus       323 --------------------------------------------------------------------------------  322 (548)
T PTZ00364        323 --------------------------------------------------------------------------------  322 (548)
T ss_pred             --------------------------------------------------------------------------------
Confidence                                                                                            


Q ss_pred             ccccCccccccccCccCCcCcCCcCCCCCCCCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCC
Q psy1664         327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGP  406 (524)
Q Consensus       327 WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gP  406 (524)
                                                                     ...+..+..+..+++.+..++++|+.+|+++||
T Consensus       323 -----------------------------------------------~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GP  355 (548)
T PTZ00364        323 -----------------------------------------------RRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGP  355 (548)
T ss_pred             -----------------------------------------------ceeeeeeeEEecceeecCCcHHHHHHHHHHcCC
Confidence                                                           000000111112233344578899999999999


Q ss_pred             EEEEEecccccccccccEEeCC---------CC----------CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCC
Q psy1664         407 VEGSMTIYADMILYKTGIYKHV---------AG----------GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT  467 (524)
Q Consensus       407 v~~~~~~~~~f~~y~~gi~~~~---------~~----------~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~  467 (524)
                      |+|+|+++.+|+.|++|||...         ++          ...++|+|+|||||++.+      |++|||||||||+
T Consensus       356 VsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~------G~~YWIVKNSWGt  429 (548)
T PTZ00364        356 VPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDEN------GGDYWLVLDPWGS  429 (548)
T ss_pred             eEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCC------CceEEEEECCCCC
Confidence            9999999889999999998521         11          134799999999997542      7899999999999


Q ss_pred             --CCCCCcEEEEEeCCCccCccccceeccce
Q psy1664         468 --NWGENGLFRIVRGQNECGIEADITAGLPK  496 (524)
Q Consensus       468 --~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~  496 (524)
                        +||++|||||+||.|+||||+.++++.|.
T Consensus       430 ~~~WGE~GYfRI~RG~N~CGIes~~v~~~~~  460 (548)
T PTZ00364        430 RRSWCDGGTRKIARGVNAYNIESEVVVMYWA  460 (548)
T ss_pred             CCCcccCCeEEEEcCCCcccccceeeeeeee
Confidence              99999999999999999999999988884


No 11 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=1.1e-48  Score=375.76  Aligned_cols=204  Identities=33%  Similarity=0.616  Sum_probs=170.4

Q ss_pred             CCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHH
Q psy1664          98 PEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKY  177 (524)
Q Consensus        98 P~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~  177 (524)
                      |++||||+.    +.++||+|||.||+|||||++++||++++++++  ..+.||+|+|++|....+.+|.||++..|+++
T Consensus         1 P~~~d~r~~----~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~--~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~   74 (210)
T cd02248           1 PESVDWREK----GAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTG--KLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEY   74 (210)
T ss_pred             CCcccCCcC----CCCCCCccCCCCcchHHhHHHHHHHHHHHHHcC--CCcccCHHHHhccCCCCCCCCCCCCHHHhHHH
Confidence            889999998    669999999999999999999999999999876  46889999999997644579999999999999


Q ss_pred             HHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCC-CHHHHH
Q psy1664         178 WVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPA-NEETIM  256 (524)
Q Consensus       178 ~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ik  256 (524)
                      +++.|+++|++|       ||...        ...|......         ..+     +...    ...+.. +.++||
T Consensus        75 ~~~~Gi~~e~~y-------PY~~~--------~~~C~~~~~~---------~~~-----~i~~----~~~i~~~~~~~ik  121 (210)
T cd02248          75 VKNGGLASESDY-------PYTGK--------DGTCKYNSSK---------VGA-----KITG----YSNVPPGDEEALK  121 (210)
T ss_pred             HHHCCcCccccC-------CccCC--------CCCccCCCCc---------ccE-----EEee----EEEcCCCcHHHHH
Confidence            999999998888       99864        4556543210         000     1111    123333 478999


Q ss_pred             HHHHHcCCeEEEEEecccccccCCceEEcCCC-CCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCcccc
Q psy1664         257 REIFRHGPVEGSMTIYADMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI  335 (524)
Q Consensus       257 ~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~-~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri  335 (524)
                      ++|+++|||+++|.++++|+.|++|||..+.+ ...++|||+|||||++       .+.+|||||||||++||++|||||
T Consensus       122 ~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~-------~~~~ywiv~NSWG~~WG~~Gy~~i  194 (210)
T cd02248         122 AALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTE-------NGVDYWIVKNSWGTSWGEKGYIRI  194 (210)
T ss_pred             HHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeec-------CCceEEEEEcCCCCccccCcEEEE
Confidence            99999999999999999999999999998877 4568999999999987       368999999999999999999999


Q ss_pred             ccccCccCCcCcCC
Q psy1664         336 GCRPYEIPCERYMN  349 (524)
Q Consensus       336 ~~~~~~~~c~~~~~  349 (524)
                      +++..  .|++...
T Consensus       195 ~~~~~--~cgi~~~  206 (210)
T cd02248         195 ARGSN--LCGIASY  206 (210)
T ss_pred             EcCCC--ccCceee
Confidence            98773  6887643


No 12 
>KOG1544|consensus
Probab=100.00  E-value=1.8e-48  Score=371.58  Aligned_cols=302  Identities=35%  Similarity=0.649  Sum_probs=232.1

Q ss_pred             hhhhhhcCCCCcccccccc-ccccchHHH-HHHHhCCCCCCCCCCCCCCcccccCCCCCCCCCccccccCCCCCCCCccC
Q psy1664          39 RVDHSILLPKLPFYGAEKN-ALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI  116 (524)
Q Consensus        39 ~I~~~N~~~~~~~~~~g~N-~fsd~t~eE-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~s~DwR~~~~~~g~vtpv  116 (524)
                      +|+++|+..  .+|+++.. +|=.||.++ |+-+||..+++....+.. +....-.+...||+.||.|.+||  +++.++
T Consensus       152 ~iE~in~G~--YgW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~sv~nMN-Ei~~~l~p~~~LPE~F~As~KWp--~liH~p  226 (470)
T KOG1544|consen  152 MIEAINQGN--YGWQAGNYSAFWGMTLDDGIKYRLGTLRPSSSVMNMN-EIYTVLNPGEVLPEAFEASEKWP--NLIHEP  226 (470)
T ss_pred             HHHHHhcCC--ccccccchhhhhcccccccceeeecccCchhhhhhHH-hHhhccCcccccchhhhhhhcCC--ccccCc
Confidence            499999877  79999844 688888876 666788766543211111 11112233468999999999999  679999


Q ss_pred             CCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCccCCCCCcc
Q psy1664         117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCR  196 (524)
Q Consensus       117 kdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~  196 (524)
                      .|||.|++.|||+++++..++++|.+.|+....||+|+|++|.....+||+||....|+-||++.|++.       ..|+
T Consensus       227 lDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKrGvVs-------dhCY  299 (470)
T KOG1544|consen  227 LDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKRGVVS-------DHCY  299 (470)
T ss_pred             cccCCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecccccc-------cccc
Confidence            999999999999999999999999999988899999999999876567999999999999999999997       5566


Q ss_pred             ccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEEEeccccc
Q psy1664         197 PYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMI  276 (524)
Q Consensus       197 PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~  276 (524)
                      ||....    +...+.|...+                                                           
T Consensus       300 P~~~dQ----~~~~~~C~m~s-----------------------------------------------------------  316 (470)
T KOG1544|consen  300 PFSGDQ----AGPAPPCMMHS-----------------------------------------------------------  316 (470)
T ss_pred             cccCCC----CCCCCCceeec-----------------------------------------------------------
Confidence            997540    00111111000                                                           


Q ss_pred             ccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCccccccccCccCCcCcCCcCCCCCC
Q psy1664         277 LYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQ  356 (524)
Q Consensus       277 ~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~~~~~C~  356 (524)
                                          ...|+|                                                      
T Consensus       317 --------------------R~~grg------------------------------------------------------  322 (470)
T KOG1544|consen  317 --------------------RAMGRG------------------------------------------------------  322 (470)
T ss_pred             --------------------cccCcc------------------------------------------------------
Confidence                                001222                                                      


Q ss_pred             CCCCCCcccccccccCccccccCcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCC------
Q psy1664         357 ANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG------  430 (524)
Q Consensus       357 ~~~~~~p~C~~~C~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~------  430 (524)
                           +.+.++-|++++..  .++.+.....|++.++|++|+++||++|||.+.|.|.++|+.|++|||.+.+.      
T Consensus       323 -----kRqat~~CPn~~~~--Sn~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e  395 (470)
T KOG1544|consen  323 -----KRQATAHCPNSYVN--SNDIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPE  395 (470)
T ss_pred             -----cccccCcCCCcccc--cCceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCch
Confidence                 12222334333321  12455666778999999999999999999999999999999999999987642      


Q ss_pred             --CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccceec
Q psy1664         431 --GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKIG  498 (524)
Q Consensus       431 --~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~~~  498 (524)
                        ...+.|+|.|.|||++....|.  ..+|||..||||+.||++|||||.||.|+|.||+.++++.-.++
T Consensus       396 ~yr~~gtHsVk~tGWG~~~~~~G~--~~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgAWGr~~  463 (470)
T KOG1544|consen  396 RYRRHGTHSVKITGWGEETLPDGR--TLKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGAWGRVG  463 (470)
T ss_pred             hhhhcccceEEEeecccccCCCCC--eeEEEEeecccccccccCceEEEeccccchhhhHhhhhhhhccc
Confidence              1347999999999998755555  78999999999999999999999999999999999988876554


No 13 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=2.8e-46  Score=360.66  Aligned_cols=210  Identities=35%  Similarity=0.649  Sum_probs=165.2

Q ss_pred             CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHH
Q psy1664          97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK  176 (524)
Q Consensus        97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~  176 (524)
                      ||++||||+.+   +.++||+||+.||+|||||++++||++++++.+ ...+.||+|+|++|....+.+|+||++..|++
T Consensus         1 lP~~~D~r~~~---~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~-~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~   76 (219)
T PF00112_consen    1 LPKSFDWRDKG---GRITPVRDQGSCGSCWAFAAAAALESRLAIQNN-GKNVDLSEQYLIDCSNKYNKGCDGGSPFDALK   76 (219)
T ss_dssp             STSSEEGGGTT---TCSG---BTTSSBTHHHHHHHHHHHHHHHHHHT-SSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHH
T ss_pred             CCCCEecccCC---CCcCccccCCcccccccchhccceecccccccc-ccccccccccccccccccccccccCcccccce
Confidence            89999999962   368999999999999999999999999999985 46799999999999864457999999999999


Q ss_pred             HHHH-hCCccCCccCCCCCccccccCcccccCCCC-CCCCCCCCCCccccccccCCCccccccccccceeeeecC-CCHH
Q psy1664         177 YWVT-TGIVSGGTYASKQGCRPYEIPCERYMNGSH-SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANEE  253 (524)
Q Consensus       177 ~~~~-~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~-~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~  253 (524)
                      ++++ +|+++|+.|       ||...        . ..|.......             ...+...|    ..+. .+.+
T Consensus        77 ~~~~~~Gi~~e~~~-------pY~~~--------~~~~c~~~~~~~-------------~~~~i~~~----~~~~~~~~~  124 (219)
T PF00112_consen   77 YIKNNNGIVTEEDY-------PYNGN--------ENPTCKSKKSNS-------------YYVKIKGY----GKVKDNDIE  124 (219)
T ss_dssp             HHHHHTSBEBTTTS---------SSS--------SSCSSCHSGGGE-------------EEBEESEE----EEEESTCHH
T ss_pred             eecccCcccccccc-------ccccc--------cccccccccccc-------------cccccccc----ccccccchh
Confidence            9999 899998888       99865        2 3454321100             00111111    1222 2589


Q ss_pred             HHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCC-CCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccC
Q psy1664         254 TIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG  331 (524)
Q Consensus       254 ~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~-~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~G  331 (524)
                      +||++|+++|||+++|.+.+ +|+.|++|||..+.+. ..++|||+|||||++       .+++|||||||||++||++|
T Consensus       125 ~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~-------~~~~~wiv~NSWG~~WG~~G  197 (219)
T PF00112_consen  125 DIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDE-------NGKGYWIVKNSWGTDWGDNG  197 (219)
T ss_dssp             HHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEE-------TTEEEEEEE-SBTTTSTBTT
T ss_pred             HHHHHHhhCceeeeeeeccccccccccceeeeccccccccccccccccccccc-------cceeeEeeehhhCCccCCCe
Confidence            99999999999999999998 6999999999998665 478999999999987       47899999999999999999


Q ss_pred             ccccccccCccCCcCcCCc
Q psy1664         332 LFRIGCRPYEIPCERYMNG  350 (524)
Q Consensus       332 y~ri~~~~~~~~c~~~~~~  350 (524)
                      ||||+++.. ..|++....
T Consensus       198 y~~i~~~~~-~~c~i~~~~  215 (219)
T PF00112_consen  198 YFRISYDYN-NECGIESQA  215 (219)
T ss_dssp             EEEEESSSS-SGGGTTSSE
T ss_pred             EEEEeeCCC-CcCccCcee
Confidence            999999765 257776544


No 14 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00  E-value=1.9e-43  Score=328.69  Aligned_cols=164  Identities=40%  Similarity=0.780  Sum_probs=140.0

Q ss_pred             CCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHH
Q psy1664          97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWK  176 (524)
Q Consensus        97 lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~  176 (524)
                      ||++||||++    ++++||+||+.||+|||||++++||++++++++.  .+.||+|+|++|....++||.||++..|++
T Consensus         1 lP~~~D~R~~----~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~--~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~   74 (174)
T smart00645        1 LPESFDWRKK----GAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGK--LVSLSEQQLVDCSTGGNNGCNGGLPDNAFE   74 (174)
T ss_pred             CCCcCccccc----CCCCccccCcccchHHHHHHHHHHHHHHHHhcCC--ccccCHHHHhhhcCCCCCCCCCcCHHHHHH
Confidence            7999999998    5789999999999999999999999999999863  689999999999765345999999999999


Q ss_pred             HHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHH
Q psy1664         177 YWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETI  255 (524)
Q Consensus       177 ~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i  255 (524)
                      |++++ |+++|+.|       ||+.                                                       
T Consensus        75 ~~~~~~Gi~~e~~~-------PY~~-------------------------------------------------------   92 (174)
T smart00645       75 YIKKNGGLETESCY-------PYTG-------------------------------------------------------   92 (174)
T ss_pred             HHHHcCCccccccc-------Cccc-------------------------------------------------------
Confidence            99998 99997777       9841                                                       


Q ss_pred             HHHHHHcCCeEEEEEecccccccCCceEEcCCCCC-CCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCccc
Q psy1664         256 MREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR  334 (524)
Q Consensus       256 k~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~-~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy~r  334 (524)
                                ++.+.+. +|+.|++|||+.+.|.. .++|+|+|||||.+.      ++++|||||||||+.|||+||||
T Consensus        93 ----------~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~------~g~~yWii~NSwG~~WG~~G~~~  155 (174)
T smart00645       93 ----------SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEE------NGKDYWIVKNSWGTDWGENGYFR  155 (174)
T ss_pred             ----------EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecC------CCeeEEEEECCCCCCcccCeEEE
Confidence                      3444443 69999999998865543 479999999999752      46899999999999999999999


Q ss_pred             cccccCccCCcC
Q psy1664         335 IGCRPYEIPCER  346 (524)
Q Consensus       335 i~~~~~~~~c~~  346 (524)
                      |+++.+ ..|++
T Consensus       156 i~~~~~-~~c~i  166 (174)
T smart00645      156 IARGKN-NECGI  166 (174)
T ss_pred             EEcCCC-CccCc
Confidence            998752 25666


No 15 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00  E-value=4.3e-41  Score=325.17  Aligned_cols=202  Identities=26%  Similarity=0.396  Sum_probs=157.2

Q ss_pred             ccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCCC----CCCCCCChHHHH
Q psy1664         100 GFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG----NGCQGGFHGKAW  175 (524)
Q Consensus       100 s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~~----~gC~GG~~~~a~  175 (524)
                      .||||+.    + ++||+|||.||+|||||++++||++++++......+.||+|+|++|.....    .+|.||.+..++
T Consensus         1 ~~d~r~~----~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~   75 (223)
T cd02619           1 SVDLRPL----R-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSAL   75 (223)
T ss_pred             CCcchhc----C-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHH
Confidence            4899998    6 899999999999999999999999999987533468999999999976532    699999999999


Q ss_pred             H-HHHHhCCccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecC-CCHH
Q psy1664         176 K-YWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLP-ANEE  253 (524)
Q Consensus       176 ~-~~~~~Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~  253 (524)
                      . +++++|+++|++|       ||...        ...|........       ...   ......|    ..+. .+++
T Consensus        76 ~~~~~~~Gi~~e~~~-------Py~~~--------~~~~~~~~~~~~-------~~~---~~~~~~y----~~~~~~~~~  126 (223)
T cd02619          76 LKLVALKGIPPEEDY-------PYGAE--------SDGEEPKSEAAL-------NAA---KVKLKDY----RRVLKNNIE  126 (223)
T ss_pred             HHHHHHcCCCccccC-------CCCCC--------CCCCCCCCccch-------hhc---ceeecce----eEeCchhHH
Confidence            8 8889999998888       99865        333322110000       000   0011111    1222 3478


Q ss_pred             HHHHHHHHcCCeEEEEEecccccccCCceEE-----c-CCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcc
Q psy1664         254 TIMREIFRHGPVEGSMTIYADMILYKTGIYK-----H-VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNW  327 (524)
Q Consensus       254 ~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~-----~-~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~W  327 (524)
                      +||++|+++|||+++|.++.+|+.|++|+|.     . .++...++|||+|||||++..     .+++|||||||||+.|
T Consensus       127 ~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-----~~~~~~i~~NSwG~~w  201 (223)
T cd02619         127 DIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-----EGKGAFIVKNSWGTDW  201 (223)
T ss_pred             HHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCC-----CCCCEEEEEeCCCCcc
Confidence            9999999999999999999999999999873     2 223446899999999998731     2689999999999999


Q ss_pred             cccCccccccccC
Q psy1664         328 GENGLFRIGCRPY  340 (524)
Q Consensus       328 Ge~Gy~ri~~~~~  340 (524)
                      |++||+||++...
T Consensus       202 g~~Gy~~i~~~~~  214 (223)
T cd02619         202 GDNGYGRISYEDV  214 (223)
T ss_pred             ccCCEEEEehhhh
Confidence            9999999998765


No 16 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=4.8e-40  Score=359.82  Aligned_cols=223  Identities=21%  Similarity=0.379  Sum_probs=155.6

Q ss_pred             CCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhcCCC-CCCCCCCC-hHHHHHHHHHhC-C
Q psy1664         107 WPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC-GNGCQGGF-HGKAWKYWVTTG-I  183 (524)
Q Consensus       107 ~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~~~~-~~gC~GG~-~~~a~~~~~~~G-i  183 (524)
                      ++.|....||||||.||+|||||++++||++++|+++  ..+.||+|+|+||+... ..||.||+ +..++.|++++| +
T Consensus       538 ~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg--~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgL  615 (1004)
T PTZ00462        538 ENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGY--EPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFL  615 (1004)
T ss_pred             CCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcC--CCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCC
Confidence            5777777899999999999999999999999999875  46899999999998643 46999997 556679998885 7


Q ss_pred             ccCCccCCCCCccccccCcccccCCCCCCCCCCCCCCccccccccC---C-Cccccccccccceeeee-----cCCCHHH
Q psy1664         184 VSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP---G-YDVSYEDDLNFGRIAYS-----LPANEET  254 (524)
Q Consensus       184 ~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~-----~~~~~~~  254 (524)
                      ++|++|       ||...      ...+.|+........+-.....   . ..........|....-.     +..-++.
T Consensus       616 ptESdY-------PYt~k------~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~  682 (1004)
T PTZ00462        616 PADSNY-------LYNYT------KVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKI  682 (1004)
T ss_pred             cccccC-------CCccC------CCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHH
Confidence            777777       99753      1245676432211111000000   0 00000011112111000     0011468


Q ss_pred             HHHHHHHcCCeEEEEEecccccccC-CceEEcCCCC-CCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCc
Q psy1664         255 IMREIFRHGPVEGSMTIYADMILYK-TGIYKHVAGG-PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL  332 (524)
Q Consensus       255 ik~~l~~~GPV~v~i~v~~~f~~Y~-sGIy~~~~~~-~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy  332 (524)
                      |+++|+.+|||+|+|++. +|+.|. +|||....|. ..++|||+|||||.+.+.++  .+++|||||||||+.|||+||
T Consensus       683 IK~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg--~gk~YWIVRNSWGt~WGEnGY  759 (1004)
T PTZ00462        683 IKDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDED--EKKSYWIVRNSWGKYWGDEGY  759 (1004)
T ss_pred             HHHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccC--CCCceEEEEcCCCCCcCCCeE
Confidence            999999999999999985 788885 8987766565 45799999999997532111  357999999999999999999


Q ss_pred             cccccccCccCCcCcC
Q psy1664         333 FRIGCRPYEIPCERYM  348 (524)
Q Consensus       333 ~ri~~~~~~~~c~~~~  348 (524)
                      |||+|+.. ..|++..
T Consensus       760 FKI~r~g~-n~CGin~  774 (1004)
T PTZ00462        760 FKVDMYGP-SHCEDNF  774 (1004)
T ss_pred             EEEEeCCC-CCCccch
Confidence            99999532 2688754


No 17 
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells, suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.95  E-value=3e-27  Score=231.18  Aligned_cols=99  Identities=34%  Similarity=0.682  Sum_probs=90.7

Q ss_pred             CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCC
Q psy1664         392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE  471 (524)
Q Consensus       392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~  471 (524)
                      .++++||.+|+++|||+++|.++.+|+.|++|||+..++...++|+|+|||||++..      +++|||||||||++||+
T Consensus       136 ~~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~------g~~YWiikNSWG~~WGe  209 (239)
T cd02698         136 SGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDEN------GVEYWIVRNSWGEPWGE  209 (239)
T ss_pred             CCHHHHHHHHHHcCCEEEEEEecccccccCCeEEccCCCCCcCCeEEEEEEEEecCC------CCEEEEEEcCCCcccCc
Confidence            468899999999999999999988999999999988776667899999999998652      78999999999999999


Q ss_pred             CcEEEEEeCC-----CccCccccceeccce
Q psy1664         472 NGLFRIVRGQ-----NECGIEADITAGLPK  496 (524)
Q Consensus       472 ~Gy~~i~~g~-----~~cgi~~~~~~~~p~  496 (524)
                      +|||||+||.     |+||||++++++.|.
T Consensus       210 ~Gy~~i~rg~~~~~~~~~~i~~~~~~~~~~  239 (239)
T cd02698         210 RGWFRIVTSSYKGARYNLAIEEDCAWADPI  239 (239)
T ss_pred             CceEEEEccCCcccccccccccceEEEeeC
Confidence            9999999999     999999999988873


No 18 
>KOG1543|consensus
Probab=99.95  E-value=2.1e-27  Score=240.58  Aligned_cols=111  Identities=42%  Similarity=0.809  Sum_probs=100.4

Q ss_pred             CcceeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCC-ccCeeEEEeeecCCCCCCCccCCcc
Q psy1664         379 DDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP-LGEHAIRIIGWGQEPLGEGTSSVVK  457 (524)
Q Consensus       379 ~~~~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~-~~~H~v~ivG~g~~~~~~~~~~~~~  457 (524)
                      .+.++..+.+.++.+|++|+++|+.+|||+|+|++..+|+.|++|||.++++.. .++|+|+|||||+.+       +.+
T Consensus       213 ~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~~~-------~~~  285 (325)
T KOG1543|consen  213 DKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGTGD-------GVD  285 (325)
T ss_pred             cceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCCCCCCCceEEEEEEcCCC-------Cce
Confidence            456777888889999999999999999999999998899999999999988776 499999999999933       789


Q ss_pred             EEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccce
Q psy1664         458 YWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK  496 (524)
Q Consensus       458 ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~  496 (524)
                      |||||||||+.||++|||||.|+.+.|+|++.+.++.|.
T Consensus       286 YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~~~p~~  324 (325)
T KOG1543|consen  286 YWIVKNSWGTDWGEKGYFRIARGVNKCGIASEASYGPIK  324 (325)
T ss_pred             eEEEEcCCCCCcccCceEEEecCCCchhhhcccccCCCC
Confidence            999999999999999999999999999999987765543


No 19 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopt the papain fold and form the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.94  E-value=9.3e-27  Score=228.50  Aligned_cols=101  Identities=42%  Similarity=0.931  Sum_probs=89.4

Q ss_pred             CCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCC----C-C--------CccCeeEEEeeecCCCCCCCccCCcc
Q psy1664         391 PANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA----G-G--------PLGEHAIRIIGWGQEPLGEGTSSVVK  457 (524)
Q Consensus       391 ~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~----~-~--------~~~~H~v~ivG~g~~~~~~~~~~~~~  457 (524)
                      ..++++||.+|+++|||+++|++.++|++|++|||+...    | .        ..++|+|+|||||++..     ++++
T Consensus       130 ~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-----~g~~  204 (243)
T cd02621         130 CTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-----KGEK  204 (243)
T ss_pred             cCCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-----CCCc
Confidence            357899999999999999999998899999999998752    1 1        24799999999998751     2779


Q ss_pred             EEEEEcCCCCCCCCCcEEEEEeCCCccCccccceeccce
Q psy1664         458 YWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPK  496 (524)
Q Consensus       458 ywiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~~p~  496 (524)
                      |||||||||++||++|||||+||.|.|||++.+++++|.
T Consensus       205 YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~~~  243 (243)
T cd02621         205 YWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAYPI  243 (243)
T ss_pred             EEEEEcCCCCCCCcCCeEEEecCCcccCcccceEeeccC
Confidence            999999999999999999999999999999999998884


No 20 
>KOG1542|consensus
Probab=99.94  E-value=1e-26  Score=226.44  Aligned_cols=105  Identities=29%  Similarity=0.581  Sum_probs=91.9

Q ss_pred             eeeeEEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeC---CCCCCccCeeEEEeeecCCCCCCCccCCccE
Q psy1664         382 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKH---VAGGPLGEHAIRIIGWGQEPLGEGTSSVVKY  458 (524)
Q Consensus       382 ~~~~~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~---~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~y  458 (524)
                      .+....+.++.||++|.+.|.++|||+|+|++. .+++|.+||+.+   .|....++|+|||||||.+..      ..+|
T Consensus       262 v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~-~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~------~~PY  334 (372)
T KOG1542|consen  262 VSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAK-PMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGY------EKPY  334 (372)
T ss_pred             EEEeccEecCCCHHHHHHHHHhcCCeEEEEchH-HHHHhcccccCCCcccCCccccCceEEEEeecCCCC------CCce
Confidence            344556677889999999999999999999974 599999999987   344455999999999998752      6899


Q ss_pred             EEEEcCCCCCCCCCcEEEEEeCCCccCccccceec
Q psy1664         459 WLVANSFNTNWGENGLFRIVRGQNECGIEADITAG  493 (524)
Q Consensus       459 wiv~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~~~  493 (524)
                      ||||||||++||++||+||.||.|.|||++.++++
T Consensus       335 WIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~  369 (372)
T KOG1542|consen  335 WIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSA  369 (372)
T ss_pred             EEEECCccccccccceEEEeccccccccccchhhh
Confidence            99999999999999999999999999999987654


No 21 
>PTZ00203 cathepsin L protease; Provisional
Probab=99.93  E-value=5.7e-26  Score=231.81  Aligned_cols=99  Identities=22%  Similarity=0.495  Sum_probs=86.8

Q ss_pred             EEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664         386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF  465 (524)
Q Consensus       386 ~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW  465 (524)
                      .+..++.++++|+.+|+++|||+++|++. +|++|++|||.. |....++|+|+|||||+++       +++||||||||
T Consensus       240 ~~~~i~~~e~~~~~~l~~~GPv~v~i~a~-~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~~-------g~~YWiikNSW  310 (348)
T PTZ00203        240 GYVSMESSERVMAAWLAKNGPISIAVDAS-SFMSYHSGVLTS-CIGEQLNHGVLLVGYNMTG-------EVPYWVIKNSW  310 (348)
T ss_pred             ceeecCcCHHHHHHHHHhCCCEEEEEEhh-hhcCccCceeec-cCCCCCCeEEEEEEEecCC-------CceEEEEEcCC
Confidence            44556668899999999999999999995 899999999975 4444579999999999876       88999999999


Q ss_pred             CCCCCCCcEEEEEeCCCccCccccceec
Q psy1664         466 NTNWGENGLFRIVRGQNECGIEADITAG  493 (524)
Q Consensus       466 G~~WG~~Gy~~i~~g~~~cgi~~~~~~~  493 (524)
                      |++||++|||||+||.|.|||++.++.+
T Consensus       311 G~~WGe~GY~ri~rg~n~Cgi~~~~~~~  338 (348)
T PTZ00203        311 GEDWGEKGYVRVTMGVNACLLTGYPVSV  338 (348)
T ss_pred             CCCcCcCceEEEEcCCCcccccceEEEE
Confidence            9999999999999999999999776543


No 22 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.92  E-value=5.7e-26  Score=223.77  Aligned_cols=205  Identities=24%  Similarity=0.251  Sum_probs=128.1

Q ss_pred             CCCCCccccccCCCCCCCCccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHhhc-CCCCCCC-----CC
Q psy1664          95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCC-KDCGNGC-----QG  168 (524)
Q Consensus        95 ~~lP~s~DwR~~~~~~g~vtpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvdC~-~~~~~gC-----~G  168 (524)
                      ..||+.||||..    |.|+||||||.||+||||+++++||+.+.-..    ...+|+-.+..-. ..+..+|     +|
T Consensus        97 ~s~~~~fd~r~~----g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~----~w~~s~~nm~~ll~~~ye~~fd~~~~d~  168 (372)
T COG4870          97 ASLPSYFDRRDE----GKVSPVKDQGSGGSCWAFATTRSLESYLNPES----AWDFSENNMKNLLGVPYEKGFDYTSNDG  168 (372)
T ss_pred             ccchhheeeecc----CCcccccccCcccceEeeeehhhhhheecccc----cccccccchhhhcCCCccccCCCccccC
Confidence            358999999999    88999999999999999999999999885532    3455554443221 1111222     36


Q ss_pred             CChHHHHHHHHHh-CCccCCccCCCCCccccccCcccccCCCCCCCCCCCC---CCccccccccCCCcccccccccccee
Q psy1664         169 GFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEP---NTPECIRKCQPGYDVSYEDDLNFGRI  244 (524)
Q Consensus       169 G~~~~a~~~~~~~-Gi~~e~~y~~~e~c~PY~~~~~~~~~~~~~~C~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~  244 (524)
                      |....+.-|+.+. |-+.+.+-       ||...        ...|....+   ....|...+..               
T Consensus       169 g~~~m~~a~l~e~sgpv~et~d-------~y~~~--------s~~~~~~~p~~k~~~~~~~i~~~---------------  218 (372)
T COG4870         169 GNADMSAAYLTEWSGPVYETDD-------PYSEN--------SYFSPTNLPVTKHVQEAQIIPSR---------------  218 (372)
T ss_pred             CccccccccccccCCcchhhcC-------ccccc--------cccCCcCCchhhccccceecccc---------------
Confidence            7766666666554 76666555       66543        222222111   11111100000               


Q ss_pred             eeecCCCHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCCCCCCcEEEEEEeccCCCC---CCCccceeEEEEe
Q psy1664         245 AYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG---EGTSSVVKYWLVA  320 (524)
Q Consensus       245 ~~~~~~~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~---~g~~~g~~YWivk  320 (524)
                        .-..+...|++++...|-++.+|.+.. .+.....+.|..... ...+|||+||||++....   +....|.+.||||
T Consensus       219 --~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~s~-~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiik  295 (372)
T COG4870         219 --KKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSG-ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIK  295 (372)
T ss_pred             --hhhhcccchHHHHhhhccccceeEEecccccccccCCCCCCcc-ccccceEEEEeccccccccccccCCCCCceEEEE
Confidence              001123347888888888876666542 222233344433322 567999999999986431   1234567799999


Q ss_pred             CCCCCcccccCccccccccC
Q psy1664         321 NSFNTNWGENGLFRIGCRPY  340 (524)
Q Consensus       321 NSWG~~WGe~Gy~ri~~~~~  340 (524)
                      ||||++||++|||||+.+..
T Consensus       296 NSWGt~wG~~GYfwisY~ya  315 (372)
T COG4870         296 NSWGTNWGENGYFWISYYYA  315 (372)
T ss_pred             CccccccccCceEEEEeeec
Confidence            99999999999999997643


No 23 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.92  E-value=1e-24  Score=209.25  Aligned_cols=101  Identities=35%  Similarity=0.645  Sum_probs=88.6

Q ss_pred             eeEEEEcCC-CHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCC-CCccCeeEEEeeecCCCCCCCccCCccEEEE
Q psy1664         384 GRIAYSLPA-NEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLV  461 (524)
Q Consensus       384 ~~~~~~~~~-~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~-~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv  461 (524)
                      ...+..+.. ++++||++|+++|||+++|.+.++|+.|++|||..+++ ...++|+|+|||||++.       +.+||||
T Consensus       106 i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~~-------~~~ywiv  178 (210)
T cd02248         106 ITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYGTEN-------GVDYWIV  178 (210)
T ss_pred             EeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCCCcCCEEEEEEEEeecC-------CceEEEE
Confidence            334445543 48899999999999999999988999999999988766 45689999999999987       7899999


Q ss_pred             EcCCCCCCCCCcEEEEEeCCCccCccccce
Q psy1664         462 ANSFNTNWGENGLFRIVRGQNECGIEADIT  491 (524)
Q Consensus       462 ~NSWG~~WG~~Gy~~i~~g~~~cgi~~~~~  491 (524)
                      |||||+.||++|||||.++.|.|||++.+.
T Consensus       179 ~NSWG~~WG~~Gy~~i~~~~~~cgi~~~~~  208 (210)
T cd02248         179 KNSWGTSWGEKGYIRIARGSNLCGIASYAS  208 (210)
T ss_pred             EcCCCCccccCcEEEEEcCCCccCceeeee
Confidence            999999999999999999999999997743


No 24 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.92  E-value=3.7e-25  Score=215.98  Aligned_cols=94  Identities=49%  Similarity=0.992  Sum_probs=82.5

Q ss_pred             ecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCc
Q psy1664         247 SLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTN  326 (524)
Q Consensus       247 ~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~  326 (524)
                      .+..++++||++|+++|||+|+|.++++|+.|++|||+..++...++|||+|||||++       ++++|||||||||++
T Consensus       139 ~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~-------~g~~YWivrNSWG~~  211 (236)
T cd02620         139 SVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVE-------NGVPYWLAANSWGTD  211 (236)
T ss_pred             eeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEecc-------CCeeEEEEEeCCCCC
Confidence            4455789999999999999999999999999999999876555567999999999986       468999999999999


Q ss_pred             ccccCccccccccCccCCcCcCC
Q psy1664         327 WGENGLFRIGCRPYEIPCERYMN  349 (524)
Q Consensus       327 WGe~Gy~ri~~~~~~~~c~~~~~  349 (524)
                      |||+|||||+++..  .|++.+.
T Consensus       212 WGe~Gy~ri~~~~~--~cgi~~~  232 (236)
T cd02620         212 WGENGYFRILRGSN--ECGIESE  232 (236)
T ss_pred             CCCCcEEEEEccCc--ccccccc
Confidence            99999999999764  6887653


No 25 
>PTZ00021 falcipain-2; Provisional
Probab=99.90  E-value=7.1e-24  Score=222.68  Aligned_cols=107  Identities=28%  Similarity=0.583  Sum_probs=86.2

Q ss_pred             EEEEcCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCC---CccCCccEEEEE
Q psy1664         386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGE---GTSSVVKYWLVA  462 (524)
Q Consensus       386 ~~~~~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~---~~~~~~~ywiv~  462 (524)
                      ++..++  +.+|+++|+.+|||+|+|++..+|++|++|||..+|.. .++|||+|||||++...+   +...+.+|||||
T Consensus       375 ~y~~i~--~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVK  451 (489)
T PTZ00021        375 SYVSIP--EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIK  451 (489)
T ss_pred             eEEEec--HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCCCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEE
Confidence            334444  57899999999999999999889999999999876543 479999999999764111   111246899999


Q ss_pred             cCCCCCCCCCcEEEEEeCC----CccCccccceecccee
Q psy1664         463 NSFNTNWGENGLFRIVRGQ----NECGIEADITAGLPKI  497 (524)
Q Consensus       463 NSWG~~WG~~Gy~~i~~g~----~~cgi~~~~~~~~p~~  497 (524)
                      ||||++||++|||||+|+.    |.|||.+.+.  +|.+
T Consensus       452 NSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~--yP~~  488 (489)
T PTZ00021        452 NSWGESWGEKGFIRIETDENGLMKTCSLGTEAY--VPLI  488 (489)
T ss_pred             CCCCCCcccCeEEEEEcCCCCCCCCCCCcccce--eEec
Confidence            9999999999999999986    5899999854  5654


No 26 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.90  E-value=1.6e-23  Score=219.32  Aligned_cols=95  Identities=27%  Similarity=0.610  Sum_probs=79.6

Q ss_pred             HHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcE
Q psy1664         395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL  474 (524)
Q Consensus       395 ~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy  474 (524)
                      .+++.+++.+|||+|+|.++.+|+.|++|||.++|.. .++|||+|||||.+.     +.|++|||||||||++||++||
T Consensus       348 ~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~-~~nHaV~lVGyG~d~-----~~g~~YWIIkNSWG~~WGe~GY  421 (448)
T PTZ00200        348 KDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGK-SLNHAVLLVGEGYDE-----KTKKRYWIIKNSWGTDWGENGY  421 (448)
T ss_pred             HHHHHHHHhcCCEEEEeecccccccCCCCccccccCC-CCcEEEEEEEecccC-----CCCCceEEEEcCCCCCcccCee
Confidence            3455566678999999999889999999999876544 489999999999642     1278999999999999999999


Q ss_pred             EEEEeC---CCccCccccceecccee
Q psy1664         475 FRIVRG---QNECGIEADITAGLPKI  497 (524)
Q Consensus       475 ~~i~~g---~~~cgi~~~~~~~~p~~  497 (524)
                      |||+|+   .|.|||++.+  .+|.+
T Consensus       422 ~ri~r~~~g~n~CGI~~~~--~~P~~  445 (448)
T PTZ00200        422 MRLERTNEGTDKCGILTVG--LTPVF  445 (448)
T ss_pred             EEEEeCCCCCCcCCccccc--eeeEE
Confidence            999995   5899999984  46765


No 27 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.89  E-value=2.8e-23  Score=200.16  Aligned_cols=94  Identities=39%  Similarity=0.747  Sum_probs=85.6

Q ss_pred             CHHHHHHHHHhCCCEEEEEeccc-ccccccccEEeCCCC-CCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCC
Q psy1664         393 NEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG-GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG  470 (524)
Q Consensus       393 ~~~~~~~~~~~~gPv~~~~~~~~-~f~~y~~gi~~~~~~-~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG  470 (524)
                      +.++||++|+++|||+++|.+.+ +|..|++|||..+.+ ...++|+|+|||||++.       +..|||||||||+.||
T Consensus       122 ~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~~-------~~~~wiv~NSWG~~WG  194 (219)
T PF00112_consen  122 DIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDEN-------GKGYWIVKNSWGTDWG  194 (219)
T ss_dssp             CHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEET-------TEEEEEEE-SBTTTST
T ss_pred             chhHHHHHHhhCceeeeeeeccccccccccceeeecccccccccccccccccccccc-------ceeeEeeehhhCCccC
Confidence            58999999999999999999988 699999999998744 45789999999999987       8899999999999999


Q ss_pred             CCcEEEEEeCCC-ccCccccceec
Q psy1664         471 ENGLFRIVRGQN-ECGIEADITAG  493 (524)
Q Consensus       471 ~~Gy~~i~~g~~-~cgi~~~~~~~  493 (524)
                      ++|||||.++.+ +|||+++++++
T Consensus       195 ~~Gy~~i~~~~~~~c~i~~~~~~~  218 (219)
T PF00112_consen  195 DNGYFRISYDYNNECGIESQAVYP  218 (219)
T ss_dssp             BTTEEEEESSSSSGGGTTSSEEEE
T ss_pred             CCeEEEEeeCCCCcCccCceeeec
Confidence            999999999987 99999998754


No 28 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.89  E-value=3.5e-23  Score=227.76  Aligned_cols=125  Identities=23%  Similarity=0.425  Sum_probs=99.5

Q ss_pred             HHHHHHHHHhCCCEEEEEeccccccccc-ccEEeCC-CCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCC
Q psy1664         394 EETIMREIFRHGPVEGSMTIYADMILYK-TGIYKHV-AGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE  471 (524)
Q Consensus       394 ~~~~~~~~~~~gPv~~~~~~~~~f~~y~-~gi~~~~-~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~  471 (524)
                      ++.|+.+|+++|||+|+|++. +|+.|. +|||... |+...++|||+|||||.+.+..++  +++|||||||||+.||+
T Consensus       680 i~~IK~eI~~kGPVaV~IdAs-df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~--gk~YWIVRNSWGt~WGE  756 (1004)
T PTZ00462        680 IKIIKDEIMNKGSVIAYIKAE-NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDE--KKSYWIVRNSWGKYWGD  756 (1004)
T ss_pred             HHHHHHHHHhcCCEEEEEEee-hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccCC--CCceEEEEcCCCCCcCC
Confidence            468999999999999999985 688885 8986554 444568999999999986421122  67899999999999999


Q ss_pred             CcEEEEEe-CCCccCccccceeccceeccccCCcccccCcc-----cCCCCCCCCCCC
Q psy1664         472 NGLFRIVR-GQNECGIEADITAGLPKIGLEIDSNEINLGKM-----MTLPLTNRDTYT  523 (524)
Q Consensus       472 ~Gy~~i~~-g~~~cgi~~~~~~~~p~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~  523 (524)
                      +|||||.| |.|.|||...  ..+|.+..+++-...+..+.     .++++.++|+|.
T Consensus       757 nGYFKI~r~g~n~CGin~i--~t~~~fn~d~~~~~~~~~~~~~~~~~y~~k~spdf~~  812 (1004)
T PTZ00462        757 EGYFKVDMYGPSHCEDNFI--HSVVIFNIDLPKNKKSPKKESFKIYDYYLKASPDFYH  812 (1004)
T ss_pred             CeEEEEEeCCCCCCccchh--eeeeeEeeccccccCCccccccchheeeeccChhHhh
Confidence            99999998 7899999775  44677777776666655443     588999999884


No 29 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.85  E-value=7.8e-22  Score=211.63  Aligned_cols=94  Identities=32%  Similarity=0.577  Sum_probs=78.6

Q ss_pred             CCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcC------CCCC---------------CCCcEEEEEEeccCCCCC
Q psy1664         250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV------AGGP---------------LGEHAIRIIGWGQEPLGE  308 (524)
Q Consensus       250 ~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~------~~~~---------------~~~HaV~iVGyg~~~~~~  308 (524)
                      .++++||++|+.+|||+|+|+++++|++|++|||+.+      .|..               .++|||+|||||.+.   
T Consensus       555 ~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~---  631 (693)
T PTZ00049        555 NGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEE---  631 (693)
T ss_pred             CCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEecccc---
Confidence            3688999999999999999999989999999999864      2421               369999999999762   


Q ss_pred             CCccc--eeEEEEeCCCCCcccccCccccccccCccCCcCcCCc
Q psy1664         309 GTSSV--VKYWLVANSFNTNWGENGLFRIGCRPYEIPCERYMNG  350 (524)
Q Consensus       309 g~~~g--~~YWivkNSWG~~WGe~Gy~ri~~~~~~~~c~~~~~~  350 (524)
                        .+|  .+|||||||||++|||+|||||+|+..  .|++...+
T Consensus       632 --enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N--~CGIEs~a  671 (693)
T PTZ00049        632 --INGKLYKYWIGRNSWGKNWGKEGYFKIIRGKN--FSGIESQS  671 (693)
T ss_pred             --CCCcccCEEEEECCCCCCcccCceEEEEcCCC--ccCCccce
Confidence              123  479999999999999999999999865  68876543


No 30 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.84  E-value=3.1e-21  Score=205.06  Aligned_cols=95  Identities=25%  Similarity=0.471  Sum_probs=79.5

Q ss_pred             cCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcC---------CC----------CCCCCcEEEEEEeccCCCCC
Q psy1664         248 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHV---------AG----------GPLGEHAIRIIGWGQEPLGE  308 (524)
Q Consensus       248 ~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~---------~~----------~~~~~HaV~iVGyg~~~~~~  308 (524)
                      +..++++||++|+++|||+|+|+++.+|+.|++|||...         ++          ...++|||+|||||.++   
T Consensus       339 ~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de---  415 (548)
T PTZ00364        339 AVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE---  415 (548)
T ss_pred             cCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC---
Confidence            345688999999999999999999999999999998631         11          13579999999999853   


Q ss_pred             CCccceeEEEEeCCCCC--cccccCccccccccCccCCcCcCCc
Q psy1664         309 GTSSVVKYWLVANSFNT--NWGENGLFRIGCRPYEIPCERYMNG  350 (524)
Q Consensus       309 g~~~g~~YWivkNSWG~--~WGe~Gy~ri~~~~~~~~c~~~~~~  350 (524)
                         ++.+|||||||||+  +|||+|||||+|+.+  .|++.+.+
T Consensus       416 ---~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N--~CGIes~~  454 (548)
T PTZ00364        416 ---NGGDYWLVLDPWGSRRSWCDGGTRKIARGVN--AYNIESEV  454 (548)
T ss_pred             ---CCceEEEEECCCCCCCCcccCCeEEEEcCCC--ccccccee
Confidence               46899999999999  999999999999865  68876543


No 31 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.84  E-value=3.4e-20  Score=192.99  Aligned_cols=213  Identities=17%  Similarity=0.196  Sum_probs=132.4

Q ss_pred             ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHh----------------hcCC-----------CCCCC
Q psy1664         114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS----------------CCKD-----------CGNGC  166 (524)
Q Consensus       114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvd----------------C~~~-----------~~~gC  166 (524)
                      .||+||++-|.||.||+...|+..+....+. ..+.||+.+|..                +...           ...-.
T Consensus        55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~-~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~  133 (437)
T cd00585          55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNL-KEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQN  133 (437)
T ss_pred             CCcccCCCCchhHHHHCHHHHHHHHHHHcCC-CCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcC
Confidence            4899999999999999999999998876554 369999988765                2110           12356


Q ss_pred             CCCChHHHHHHHHHhCCccCCccCCCCCccccccC-c------cc-c--------cCC----------------------
Q psy1664         167 QGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIP-C------ER-Y--------MNG----------------------  208 (524)
Q Consensus       167 ~GG~~~~a~~~~~~~Gi~~e~~y~~~e~c~PY~~~-~------~~-~--------~~~----------------------  208 (524)
                      +||.-..+...+++.|+++.+.|+..... ..+.. .      .+ +        ..+                      
T Consensus       134 DGGqw~m~~~li~KYGvVPk~~~pet~~s-~~t~~~n~~L~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~  212 (437)
T cd00585         134 DGGQWDMLVNLIEKYGLVPKSVMPESFNS-ENSRRLNYLLNRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILA  212 (437)
T ss_pred             CCCchHHHHHHHHHcCCCcccccCCCcCc-cchHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHH
Confidence            89999999999999999998887421100 00000 0      00 0        000                      


Q ss_pred             -CCCCCC---------------CCCCCCccc-ccc---ccCCCcccccccc----ccce----------------eeeec
Q psy1664         209 -SHSSCQ---------------DNEPNTPEC-IRK---CQPGYDVSYEDDL----NFGR----------------IAYSL  248 (524)
Q Consensus       209 -~~~~C~---------------~~~~~~~~~-~~~---~~~~~~~~~~~~~----~~~~----------------~~~~~  248 (524)
                       .-|..+               ....-+|.- ...   +...-.+.+....    .|.+                ..+++
T Consensus       213 ~~lG~pP~~F~~~y~dkd~~~~~~~~~TP~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nv  292 (437)
T cd00585         213 IALGEPPEKFDWEYRDKDKKYHEIKELTPLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNV  292 (437)
T ss_pred             HHcCCCCceEEEEEEeCCCCeeeCCCcCHHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEec
Confidence             000000               000011100 000   1110001110000    0110                11222


Q ss_pred             CCCHHHHH----HHHHHcCCeEEEEEecccccccCCceEEcC---------------------CCCCCCCcEEEEEEecc
Q psy1664         249 PANEETIM----REIFRHGPVEGSMTIYADMILYKTGIYKHV---------------------AGGPLGEHAIRIIGWGQ  303 (524)
Q Consensus       249 ~~~~~~ik----~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~---------------------~~~~~~~HaV~iVGyg~  303 (524)
                        ..++++    ++|.+++||.++++|. .|+.|++||++..                     ++.+..+|||+|||||.
T Consensus       293 --p~d~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~  369 (437)
T cd00585         293 --PMDVLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDL  369 (437)
T ss_pred             --CHHHHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEe
Confidence              234444    5677889999999996 5789999999653                     34456799999999998


Q ss_pred             CCCCCCCccce-eEEEEeCCCCCcccccCcccccc
Q psy1664         304 EPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIGC  337 (524)
Q Consensus       304 ~~~~~g~~~g~-~YWivkNSWG~~WGe~Gy~ri~~  337 (524)
                      +.      +|+ .||+||||||+.||++|||+|+.
T Consensus       370 D~------~g~p~yw~VkNSWG~~~G~~Gy~~ms~  398 (437)
T cd00585         370 DE------DGKPVKWKVENSWGEKVGKKGYFVMSD  398 (437)
T ss_pred             cC------CCCcceEEEEcccCCCCCCCcceehhH
Confidence            74      344 69999999999999999999984


No 32 
>KOG1544|consensus
Probab=99.80  E-value=3.6e-20  Score=178.03  Aligned_cols=98  Identities=43%  Similarity=0.846  Sum_probs=84.0

Q ss_pred             eecCCCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCC--------CCCCcEEEEEEeccCCCCCCCccceeEE
Q psy1664         246 YSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG--------PLGEHAIRIIGWGQEPLGEGTSSVVKYW  317 (524)
Q Consensus       246 ~~~~~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~--------~~~~HaV~iVGyg~~~~~~g~~~g~~YW  317 (524)
                      |++.+++++||++||++|||.+.|.|.++|+.|++|||.+..-.        ..+.|+|.|.|||++....|  ...+||
T Consensus       347 YrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G--~~~KyW  424 (470)
T KOG1544|consen  347 YRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDG--RTLKYW  424 (470)
T ss_pred             eeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCC--CeeEEE
Confidence            56778999999999999999999999999999999999886532        24799999999998753223  557899


Q ss_pred             EEeCCCCCcccccCccccccccCccCCc
Q psy1664         318 LVANSFNTNWGENGLFRIGCRPYEIPCE  345 (524)
Q Consensus       318 ivkNSWG~~WGe~Gy~ri~~~~~~~~c~  345 (524)
                      |..||||+.|||+|||||.++.++.+.+
T Consensus       425 ~aANSWG~~WGE~GYFriLRGvNecdIE  452 (470)
T KOG1544|consen  425 TAANSWGPAWGERGYFRILRGVNECDIE  452 (470)
T ss_pred             EeecccccccccCceEEEeccccchhhh
Confidence            9999999999999999999998854333


No 33 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.79  E-value=1.3e-19  Score=168.49  Aligned_cols=75  Identities=49%  Similarity=0.975  Sum_probs=63.9

Q ss_pred             EEEEecccccccccccEEeCC-CCCCccCeeEEEeeecCC-CCCCCccCCccEEEEEcCCCCCCCCCcEEEEEeCC-Ccc
Q psy1664         408 EGSMTIYADMILYKTGIYKHV-AGGPLGEHAIRIIGWGQE-PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NEC  484 (524)
Q Consensus       408 ~~~~~~~~~f~~y~~gi~~~~-~~~~~~~H~v~ivG~g~~-~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~g~-~~c  484 (524)
                      ++.+.+. +|+.|++|||+.+ +....++|+|+|||||.+ +       +++|||||||||+.||++|||||.|+. |.|
T Consensus        93 ~~~~~~~-~f~~Y~~Gi~~~~~~~~~~~~Hav~ivGyg~~~~-------g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c  164 (174)
T smart00645       93 SVAIDAS-DFQFYKSGIYDHPGCGSGTLDHAVLIVGYGTEEN-------GKDYWIVKNSWGTDWGENGYFRIARGKNNEC  164 (174)
T ss_pred             EEEEEcc-cccCCcCeEECCCCCCCCcccEEEEEEEEeecCC-------CeeEEEEECCCCCCcccCeEEEEEcCCCCcc
Confidence            4555554 6999999999875 433447999999999987 4       789999999999999999999999998 999


Q ss_pred             Cccccc
Q psy1664         485 GIEADI  490 (524)
Q Consensus       485 gi~~~~  490 (524)
                      ||+...
T Consensus       165 ~i~~~~  170 (174)
T smart00645      165 GIEASV  170 (174)
T ss_pred             Cceeee
Confidence            997764


No 34 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.76  E-value=4e-18  Score=164.56  Aligned_cols=84  Identities=32%  Similarity=0.547  Sum_probs=72.9

Q ss_pred             CHHHHHHHHHhCCCEEEEEecccccccccccEEe------CCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCCC
Q psy1664         393 NEETIMREIFRHGPVEGSMTIYADMILYKTGIYK------HVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN  466 (524)
Q Consensus       393 ~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~------~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG  466 (524)
                      ++++||++|+++|||+++|.+..+|..|++|++.      ..+....++|||+|||||++..     .+++|||||||||
T Consensus       124 ~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-----~~~~~~i~~NSwG  198 (223)
T cd02619         124 NIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-----EGKGAFIVKNSWG  198 (223)
T ss_pred             hHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCC-----CCCCEEEEEeCCC
Confidence            4789999999999999999999999999999863      2234456899999999998752     2678999999999


Q ss_pred             CCCCCCcEEEEEeCC
Q psy1664         467 TNWGENGLFRIVRGQ  481 (524)
Q Consensus       467 ~~WG~~Gy~~i~~g~  481 (524)
                      +.||++||+||.++.
T Consensus       199 ~~wg~~Gy~~i~~~~  213 (223)
T cd02619         199 TDWGDNGYGRISYED  213 (223)
T ss_pred             CccccCCEEEEehhh
Confidence            999999999999984


No 35 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.46  E-value=9.6e-14  Score=144.95  Aligned_cols=86  Identities=21%  Similarity=0.279  Sum_probs=68.0

Q ss_pred             EEEEcCCCHHHHH----HHHHhCCCEEEEEecccccccccccEEeCC---------------------CCCCccCeeEEE
Q psy1664         386 IAYSLPANEETIM----REIFRHGPVEGSMTIYADMILYKTGIYKHV---------------------AGGPLGEHAIRI  440 (524)
Q Consensus       386 ~~~~~~~~~~~~~----~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~---------------------~~~~~~~H~v~i  440 (524)
                      .+++++.+  +++    ++|...+||.++++|. .|+.|++||+...                     ++....+|||+|
T Consensus       288 ~y~Nvp~d--~l~~~~~~~L~~g~pV~~g~Dv~-~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~i  364 (437)
T cd00585         288 LYLNVPMD--VLKKAAIAQLKDGEPVWFGCDVG-KFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVL  364 (437)
T ss_pred             eEEecCHH--HHHHHHHHHHhcCCCEEEEEEcC-hhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEE
Confidence            44566543  343    5777888999999996 5789999999653                     233457899999


Q ss_pred             eeecCCCCCCCccCCc-cEEEEEcCCCCCCCCCcEEEEEeC
Q psy1664         441 IGWGQEPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIVRG  480 (524)
Q Consensus       441 vG~g~~~~~~~~~~~~-~ywiv~NSWG~~WG~~Gy~~i~~g  480 (524)
                      ||||.+.+      |. .||+|+||||+.||++||++|+++
T Consensus       365 vGv~~D~~------g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         365 TGVDLDED------GKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             EEEEecCC------CCcceEEEEcccCCCCCCCcceehhHH
Confidence            99998652      54 699999999999999999999875


No 36 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.45  E-value=8.5e-13  Score=137.95  Aligned_cols=76  Identities=16%  Similarity=0.245  Sum_probs=48.7

Q ss_pred             ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHH----------------hhcCC-----------CCCCC
Q psy1664         114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV----------------SCCKD-----------CGNGC  166 (524)
Q Consensus       114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lv----------------dC~~~-----------~~~gC  166 (524)
                      .||.||.+-|-||.||+...|+..+..+.+. ..+.||+.+|.                ++...           ...-.
T Consensus        56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l-~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~  134 (438)
T PF03051_consen   56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNL-KDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVS  134 (438)
T ss_dssp             -S--B--BSSTHHHHHHHHHHHHHHHHHCT--SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-
T ss_pred             CCCCCCCCCCCcchhhchHHHHHHHHHHcCC-CceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCC
Confidence            4899999999999999999999999887654 47999998875                22211           01246


Q ss_pred             CCCChHHHHHHHHHhCCccCCccC
Q psy1664         167 QGGFHGKAWKYWVTTGIVSGGTYA  190 (524)
Q Consensus       167 ~GG~~~~a~~~~~~~Gi~~e~~y~  190 (524)
                      +||.-..+...++.+|+++.+.|+
T Consensus       135 DGGqw~~~~nli~KYGvVPk~~mp  158 (438)
T PF03051_consen  135 DGGQWDMVVNLIKKYGVVPKSVMP  158 (438)
T ss_dssp             S-B-HHHHHHHHHHH---BGGGST
T ss_pred             CCCchHHHHHHHHHcCcCcHhhCC
Confidence            799999999999999999999884


No 37 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.35  E-value=1.8e-12  Score=128.76  Aligned_cols=124  Identities=23%  Similarity=0.134  Sum_probs=79.9

Q ss_pred             HHHHHHHHHhCCCEEEEEeccc-ccccccccEEeCCCCCCccCeeEEEeeecCCCCC---CCccCCccEEEEEcCCCCCC
Q psy1664         394 EETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG---EGTSSVVKYWLVANSFNTNW  469 (524)
Q Consensus       394 ~~~~~~~~~~~gPv~~~~~~~~-~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~---~~~~~~~~ywiv~NSWG~~W  469 (524)
                      ...|++++..+|-++.+|.+.. .+..-.-+.|..... ...+|||+||||++.-..   .....|...||||||||++|
T Consensus       224 nG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~s~-~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~w  302 (372)
T COG4870         224 NGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSG-ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNW  302 (372)
T ss_pred             ccchHHHHhhhccccceeEEecccccccccCCCCCCcc-ccccceEEEEeccccccccccccCCCCCceEEEECcccccc
Confidence            4457888888888776665421 122212233333222 457999999999986321   11222345999999999999


Q ss_pred             CCCcEEEEEeCCCccCccccceeccceeccccCCcccccCcccCCCCCCCCCCCC
Q psy1664         470 GENGLFRIVRGQNECGIEADITAGLPKIGLEIDSNEINLGKMMTLPLTNRDTYTM  524 (524)
Q Consensus       470 G~~Gy~~i~~g~~~cgi~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  524 (524)
                      |++|||||....-.-| +    +.-++ .-++...+.|+++.....|+..++|.|
T Consensus       303 G~~GYfwisY~ya~~g-~----a~~~D-~y~~i~qydpl~wv~~~~y~~~~~w~~  351 (372)
T COG4870         303 GENGYFWISYYYALNG-D----AEALD-FYVYIYQYDPLGWVITSGYGLNTAWMA  351 (372)
T ss_pred             ccCceEEEEeeecccc-c----ccccC-cceEEeeccCcceEeecCcCcchhhhh
Confidence            9999999999743233 1    11111 234456778899998888888777653


No 38 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.28  E-value=1.5e-12  Score=98.31  Aligned_cols=58  Identities=19%  Similarity=0.203  Sum_probs=50.5

Q ss_pred             HHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHHH
Q psy1664           9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSEL   67 (524)
Q Consensus         9 f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~   67 (524)
                      |+.|+++|+|.|.+.++...|+.+|.+|++.|++||+... .+|++++|+|+|||.+||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~-~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGN-NTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTS-SSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCC-CCeEEeCccccCcChhhC
Confidence            6899999999999999999999999999999999995543 899999999999999997


No 39 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a s
Probab=98.98  E-value=1e-10  Score=87.76  Aligned_cols=57  Identities=18%  Similarity=0.110  Sum_probs=52.6

Q ss_pred             HHHHHHHHccccccccccccccchhHHhhhhhhhhhcCCCCccccccccccccchHHH
Q psy1664           9 VATFLKDLDLSQSSRNHSNGVFCDLSKAFDRVDHSILLPKLPFYGAEKNALSKLTLSE   66 (524)
Q Consensus         9 f~~f~~~~~k~y~~~~~~~~r~~~f~~nl~~I~~~N~~~~~~~~~~g~N~fsd~t~eE   66 (524)
                      |..|+.+|+|.|.+.++...|+.+|.+|++.|+.||+... .+|++++|+|+|||.+|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~-~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKND-HSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCC-CCeEecCcccccCCCCC
Confidence            5689999999999999999999999999999999998654 79999999999999876


No 40 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.39  E-value=9.3e-07  Score=92.93  Aligned_cols=87  Identities=23%  Similarity=0.286  Sum_probs=57.2

Q ss_pred             EEEEcCCCH--HHHHHHHHhCCCEEEEEecccccccccccEEeCCC---------------------CCCccCeeEEEee
Q psy1664         386 IAYSLPANE--ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIG  442 (524)
Q Consensus       386 ~~~~~~~~~--~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~---------------------~~~~~~H~v~ivG  442 (524)
                      .+++++.++  ..++++|...-||..+-+|.. +..-+.||.+...                     .....+|||+|||
T Consensus       289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k-~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itG  367 (438)
T PF03051_consen  289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGK-FFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITG  367 (438)
T ss_dssp             EEEE--HHHHHHHHHHHHHTT--EEEEEETTT-TEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEE
T ss_pred             eEeccCHHHHHHHHHHHHHcCCcEEEeccCCc-cccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEE
Confidence            346666543  334444544559999999975 5566788875431                     1223689999999


Q ss_pred             ecCCCCCCCccCCc-cEEEEEcCCCCCCCCCcEEEEEe
Q psy1664         443 WGQEPLGEGTSSVV-KYWLVANSFNTNWGENGLFRIVR  479 (524)
Q Consensus       443 ~g~~~~~~~~~~~~-~ywiv~NSWG~~WG~~Gy~~i~~  479 (524)
                      ...+..      |. .+|+|+||||+..|.+||+.++.
T Consensus       368 v~~D~~------g~p~~wkVeNSWG~~~g~kGy~~msd  399 (438)
T PF03051_consen  368 VDLDED------GKPVRWKVENSWGTDNGDKGYFYMSD  399 (438)
T ss_dssp             EEE-TT------SSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred             EEeccC------CCeeEEEEEcCCCCCCCCCcEEEECH
Confidence            998652      55 59999999999999999999874


No 41 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.30  E-value=3.9e-06  Score=82.61  Aligned_cols=80  Identities=24%  Similarity=0.287  Sum_probs=55.0

Q ss_pred             CHHHHHHHHH----HcCCeEEEEEecccccccCCceEEcCC---------------------CCCCCCcEEEEEEeccCC
Q psy1664         251 NEETIMREIF----RHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIGWGQEP  305 (524)
Q Consensus       251 ~~~~ik~~l~----~~GPV~v~i~v~~~f~~Y~sGIy~~~~---------------------~~~~~~HaV~iVGyg~~~  305 (524)
                      ..+.++++.+    .+-||=.+-+|. .+..-+.||.+..-                     ..+...|||+|.|.+.++
T Consensus       296 ~me~lkkl~~~q~qagetVwFG~dvg-q~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~  374 (444)
T COG3579         296 DMERLKKLAIKQMQAGETVWFGCDVG-QLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDE  374 (444)
T ss_pred             cHHHHHHHHHHHHhcCCcEEeecCch-hhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhcccccc
Confidence            3455555433    345787777763 45566667653210                     112358999999999886


Q ss_pred             CCCCCccceeEEEEeCCCCCcccccCccccc
Q psy1664         306 LGEGTSSVVKYWLVANSFNTNWGENGLFRIG  336 (524)
Q Consensus       306 ~~~g~~~g~~YWivkNSWG~~WGe~Gy~ri~  336 (524)
                      .|     ..--|.|.||||.+=|.+|||-++
T Consensus       375 ~g-----~p~rwkVENSWG~d~G~~GyfvaS  400 (444)
T COG3579         375 TG-----NPLRWKVENSWGKDVGKKGYFVAS  400 (444)
T ss_pred             CC-----CceeeEeecccccccCCCceEeeh
Confidence            43     245799999999999999999887


No 42 
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminus of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B enzymes are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both endo- and exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide.; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=97.19  E-value=0.00019  Score=49.52  Aligned_cols=36  Identities=19%  Similarity=0.290  Sum_probs=24.6

Q ss_pred             hhhhhhhcCCCCccccccccccccchHHHHHHHhCCCCC
Q psy1664          38 DRVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPD   76 (524)
Q Consensus        38 ~~I~~~N~~~~~~~~~~g~N~fsd~t~eE~~~~~~~~~~   76 (524)
                      ++|+.+|+.+  .+|+||.| |.+++.+.++.++|..+.
T Consensus         4 e~I~~IN~~~--~tWkAG~N-F~~~~~~~ik~LlGv~~~   39 (41)
T PF08127_consen    4 EFIDYINSKN--TTWKAGRN-FENTSIEYIKRLLGVLPD   39 (41)
T ss_dssp             HHHHHHHHCT---SEEE-----SSB-HHHHHHCS-B-TT
T ss_pred             HHHHHHHcCC--CcccCCCC-CCCCCHHHHHHHcCCCCC
Confidence            3499999986  89999999 799999999999998653


No 43 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=96.22  E-value=0.041  Score=50.23  Aligned_cols=120  Identities=14%  Similarity=0.105  Sum_probs=72.8

Q ss_pred             CCCCCccHHHHHHHHHHHHHHH--------HHcCCCccccCCHHHHHhhcCCCCCCCCCCChHHHHHHHHHhCCccCCcc
Q psy1664         118 DQGSCGSGWALGAVEAMSDRVC--------IASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTY  189 (524)
Q Consensus       118 dQg~CGsCwAfA~~~~le~~~~--------i~~~~~~~~~LS~q~lvdC~~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~y  189 (524)
                      .||.=+=|-+||.++.|-....        |...  ..+.+|+++|.+++.         .+...++|.+..|...    
T Consensus        18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~--~yPn~s~~~l~~~~~---------~~~~~i~y~ks~g~~~----   82 (175)
T PF05543_consen   18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRY--LYPNVSEEQLKFTSL---------TPNQMIKYAKSQGRNP----   82 (175)
T ss_dssp             --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHH--HSTTS-CCCHHH--B----------HHHHHHHHHHTTEEE----
T ss_pred             ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHH--HCCCCCHHHHhhcCC---------CHHHHHHHHHHcCcch----
Confidence            3788888999999988765421        1110  246788888877742         4678999988888653    


Q ss_pred             CCCCCccccccCcccccCCCCCCCCCCCCCCccccccccCCCccccccccccceeeeecCCCHHHHHHHHHHcCCeEEEE
Q psy1664         190 ASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSM  269 (524)
Q Consensus       190 ~~~e~c~PY~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ik~~l~~~GPV~v~i  269 (524)
                             -|..                                               -..+-+++++.+-++.|+.+..
T Consensus        83 -------~~~n-----------------------------------------------~~~s~~eV~~~~~~nk~i~i~~  108 (175)
T PF05543_consen   83 -------QYNN-----------------------------------------------RMPSFDEVKKLIDNNKGIAILA  108 (175)
T ss_dssp             -------EEEC-----------------------------------------------S---HHHHHHHHHTT-EEEEEE
T ss_pred             -------hHhc-----------------------------------------------CCCCHHHHHHHHHcCCCeEEEe
Confidence                   1110                                               0124678999999999998776


Q ss_pred             EecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCC
Q psy1664         270 TIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFN  324 (524)
Q Consensus       270 ~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG  324 (524)
                      ...+            .......+||++||||-.-.      +|.++.++=|=|-
T Consensus       109 ~~v~------------~~~~~~~gHAlavvGya~~~------~g~~~y~~WNPW~  145 (175)
T PF05543_consen  109 DRVE------------QTNGPHAGHALAVVGYAKPN------NGQKTYYFWNPWW  145 (175)
T ss_dssp             EETT------------SCTTB--EEEEEEEEEEEET------TSEEEEEEE-TT-
T ss_pred             cccc------------cCCCCccceeEEEEeeeecC------CCCeEEEEeCCcc
Confidence            6421            11234568999999998642      4688999988774


No 44 
>KOG4128|consensus
Probab=95.35  E-value=0.023  Score=56.42  Aligned_cols=75  Identities=17%  Similarity=0.160  Sum_probs=52.2

Q ss_pred             ccCCCCCCCccHHHHHHHHHHHHHHHHHcCCCccccCCHHHHHh--------------------hcCC---------CCC
Q psy1664         114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVS--------------------CCKD---------CGN  164 (524)
Q Consensus       114 tpvkdQg~CGsCwAfA~~~~le~~~~i~~~~~~~~~LS~q~lvd--------------------C~~~---------~~~  164 (524)
                      .||-+|.+-|-||.|+....|--.+..+-+- ....||..+|+-                    |-..         .+.
T Consensus        63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl-~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP  141 (457)
T KOG4128|consen   63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNL-PEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNP  141 (457)
T ss_pred             cccccCcCCCceEEEechhHHHHHHHhcCCc-chhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCC
Confidence            6999999999999999998776555544432 247888877752                    1110         012


Q ss_pred             CCCCCChHHHHHHHHHhCCccCCcc
Q psy1664         165 GCQGGFHGKAWKYWVTTGIVSGGTY  189 (524)
Q Consensus       165 gC~GG~~~~a~~~~~~~Gi~~e~~y  189 (524)
                      .-+||.-..-.+.++..|+.+..-|
T Consensus       142 ~~DGGqw~MfvNlVkKYGviPKkcy  166 (457)
T KOG4128|consen  142 VPDGGQWQMFVNLVKKYGVIPKKCY  166 (457)
T ss_pred             CCCCchHHHHHHHHHHhCCCcHHhc
Confidence            3358888888888888998875555


No 45 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=95.28  E-value=0.014  Score=58.16  Aligned_cols=72  Identities=25%  Similarity=0.283  Sum_probs=50.6

Q ss_pred             HHhCCCEEEEEecccccccccccEEeCCC---------------------CCCccCeeEEEeeecCCCCCCCccCCccEE
Q psy1664         401 IFRHGPVEGSMTIYADMILYKTGIYKHVA---------------------GGPLGEHAIRIIGWGQEPLGEGTSSVVKYW  459 (524)
Q Consensus       401 ~~~~gPv~~~~~~~~~f~~y~~gi~~~~~---------------------~~~~~~H~v~ivG~g~~~~~~~~~~~~~yw  459 (524)
                      +...-||-.+-+|.. +..-+.||.+-..                     +.....|||+|.|.+.+..+     ..-=|
T Consensus       308 ~qagetVwFG~dvgq-~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g-----~p~rw  381 (444)
T COG3579         308 MQAGETVWFGCDVGQ-LSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETG-----NPLRW  381 (444)
T ss_pred             HhcCCcEEeecCchh-hcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCC-----Cceee
Confidence            333448888877743 6667777753210                     11236799999999877631     23379


Q ss_pred             EEEcCCCCCCCCCcEEEEE
Q psy1664         460 LVANSFNTNWGENGLFRIV  478 (524)
Q Consensus       460 iv~NSWG~~WG~~Gy~~i~  478 (524)
                      .|.||||..=|.+|||-.+
T Consensus       382 kVENSWG~d~G~~GyfvaS  400 (444)
T COG3579         382 KVENSWGKDVGKKGYFVAS  400 (444)
T ss_pred             EeecccccccCCCceEeeh
Confidence            9999999999999999765


No 46 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=94.62  E-value=0.44  Score=41.59  Aligned_cols=57  Identities=28%  Similarity=0.343  Sum_probs=33.5

Q ss_pred             CCHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCC
Q psy1664         250 ANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF  323 (524)
Q Consensus       250 ~~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSW  323 (524)
                      .+.+.|+++|.++.||.+.+...- ..   ....+.    .....|.|+|+||..+         . +++|-.+|
T Consensus        87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~----~~~~~H~vvi~Gy~~~---------~-~~~v~DP~  144 (144)
T PF13529_consen   87 ASFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYD----GTYGGHYVVIIGYDED---------G-YVYVNDPW  144 (144)
T ss_dssp             S-HHHHHHHHHTT--EEEEEETTSS-----TTEEEE----E-TTEEEEEEEEE-SS---------E--EEEE-TT
T ss_pred             CcHHHHHHHHHCCCcEEEEEEcccccC---CCCCcC----CCcCCEEEEEEEEeCC---------C-EEEEeCCC
Confidence            356889999999999999997421 01   111111    1246899999999974         2 77777766


No 47 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=82.49  E-value=5.7  Score=34.34  Aligned_cols=60  Identities=27%  Similarity=0.285  Sum_probs=34.1

Q ss_pred             cCCCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664         390 LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF  465 (524)
Q Consensus       390 ~~~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW  465 (524)
                      ...+..+|+++|.+..||.+.+....  .......+.    .....|.|+|+||..+        +  +++|..+|
T Consensus        85 ~~~~~~~i~~~i~~G~Pvi~~~~~~~--~~~~~~~~~----~~~~~H~vvi~Gy~~~--------~--~~~v~DP~  144 (144)
T PF13529_consen   85 SDASFDDIKQEIDAGRPVIVSVNSGW--RPPNGDGYD----GTYGGHYVVIIGYDED--------G--YVYVNDPW  144 (144)
T ss_dssp             TTS-HHHHHHHHHTT--EEEEEETTS--S--TTEEEE----E-TTEEEEEEEEE-SS--------E---EEEE-TT
T ss_pred             cCCcHHHHHHHHHCCCcEEEEEEccc--ccCCCCCcC----CCcCCEEEEEEEEeCC--------C--EEEEeCCC
Confidence            34567899999998889999987421  000111111    1236899999999763        2  67776666


No 48 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=76.23  E-value=6.1  Score=39.99  Aligned_cols=47  Identities=21%  Similarity=0.360  Sum_probs=31.9

Q ss_pred             HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664         252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE  304 (524)
Q Consensus       252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~  304 (524)
                      .+.|+++|.++.||.+.++++  +..|...-|    ......|.|+|+||+.+
T Consensus        78 ~~~l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~  124 (317)
T PF14399_consen   78 WEELKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEE  124 (317)
T ss_pred             HHHHHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCC
Confidence            457777888888999998874  222332211    12346899999999975


No 49 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=75.50  E-value=7  Score=35.93  Aligned_cols=56  Identities=14%  Similarity=0.202  Sum_probs=36.8

Q ss_pred             CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCCCCCCccCCccEEEEEcCC
Q psy1664         392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSF  465 (524)
Q Consensus       392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSW  465 (524)
                      .+.+++++++-++-||.+..+.-+            ........||++||||-.-.+      |.++.++=|=|
T Consensus        89 ~s~~eV~~~~~~nk~i~i~~~~v~------------~~~~~~~gHAlavvGya~~~~------g~~~y~~WNPW  144 (175)
T PF05543_consen   89 PSFDEVKKLIDNNKGIAILADRVE------------QTNGPHAGHALAVVGYAKPNN------GQKTYYFWNPW  144 (175)
T ss_dssp             --HHHHHHHHHTT-EEEEEEEETT------------SCTTB--EEEEEEEEEEEETT------SEEEEEEE-TT
T ss_pred             CCHHHHHHHHHcCCCeEEEecccc------------cCCCCccceeEEEEeeeecCC------CCeEEEEeCCc
Confidence            357889999999889887665321            112234789999999976542      68899997766


No 50 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=74.70  E-value=8.8  Score=36.61  Aligned_cols=54  Identities=15%  Similarity=0.222  Sum_probs=34.2

Q ss_pred             CHHHHHHHHHHcCCeEEEEEecc-cccccCCceEEc---CC-C--CCCCCcEEEEEEeccC
Q psy1664         251 NEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKH---VA-G--GPLGEHAIRIIGWGQE  304 (524)
Q Consensus       251 ~~~~ik~~l~~~GPV~v~i~v~~-~f~~Y~sGIy~~---~~-~--~~~~~HaV~iVGyg~~  304 (524)
                      ..++|...|.++||++|-++..- .-..-++-....   .+ +  ....+|-|+|+||+.+
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~  172 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA  172 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence            57899999999999998888641 000002222111   11 1  2346899999999976


No 51 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=70.14  E-value=8.9  Score=35.37  Aligned_cols=39  Identities=18%  Similarity=0.244  Sum_probs=30.7

Q ss_pred             CCHHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664         250 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE  304 (524)
Q Consensus       250 ~~~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~  304 (524)
                      .+.++|+..|.+..||.+-....   -.             ..-|+|+|+||++.
T Consensus       121 ksl~~ik~ql~kg~PV~iw~T~~---~~-------------~s~H~v~itgyDk~  159 (195)
T COG4990         121 KSLSDIKGQLLKGRPVVIWVTNF---HS-------------YSIHSVLITGYDKY  159 (195)
T ss_pred             CcHHHHHHHHhcCCcEEEEEecc---cc-------------cceeeeEeeccccc
Confidence            46899999999999998766652   11             23699999999964


No 52 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2.
Probab=66.43  E-value=57  Score=29.61  Aligned_cols=38  Identities=21%  Similarity=0.196  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccC
Q psy1664         252 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE  304 (524)
Q Consensus       252 ~~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~  304 (524)
                      .+.+...|.++||+-++....               ......|+++|.|-..+
T Consensus        98 ~e~~~~LL~~yGPLwv~~~~P---------------~~~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen   98 AEGLANLLREYGPLWVAWEAP---------------GDSWVAHASVITGIDGD  135 (166)
T ss_pred             HHHHHHHHHHcCCeEEEecCC---------------CCcceeeEEEEEeecCC
Confidence            578889999999999886542               12234699999998865


No 53 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=65.95  E-value=13  Score=37.44  Aligned_cols=47  Identities=21%  Similarity=0.387  Sum_probs=30.6

Q ss_pred             HHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664         395 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP  447 (524)
Q Consensus       395 ~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~  447 (524)
                      +.|+++|.+.-||.+.++++  +..|...-|    ......|.|+|+||+++.
T Consensus        79 ~~l~~~l~~g~pv~~~~D~~--~lpy~~~~~----~~~~~~H~i~v~G~d~~~  125 (317)
T PF14399_consen   79 EELKEALDAGRPVIVWVDMY--YLPYRPNYY----KKHHADHYIVVYGYDEEE  125 (317)
T ss_pred             HHHHHHHhCCCceEEEeccc--cCCCCcccc----ccccCCcEEEEEEEeCCC
Confidence            46666666666999998763  333433322    223468999999998754


No 54 
>KOG4128|consensus
Probab=55.75  E-value=1.8  Score=43.46  Aligned_cols=42  Identities=19%  Similarity=0.307  Sum_probs=31.6

Q ss_pred             cCeeEEEeeecCCCCCCCccCCccEEEEEcCCCCCCCCCcEEEEE
Q psy1664         434 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIV  478 (524)
Q Consensus       434 ~~H~v~ivG~g~~~~~~~~~~~~~ywiv~NSWG~~WG~~Gy~~i~  478 (524)
                      ..||++|.|-|.-..   .+.+-.=|-|.||||.+-|.+|+..+.
T Consensus       371 mthAml~T~v~~kd~---~~g~~~~~rVenswgkd~gkkg~~~mt  412 (457)
T KOG4128|consen  371 MTHAMLLTSVGLKDP---ATGGLNEHRVENSWGKDLGKKGVNKMT  412 (457)
T ss_pred             HHHHHHhhhccccCc---ccCCchhhhhhchhhhhccccchhhhh
Confidence            579999999984221   122445799999999999999996553


No 55 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=50.38  E-value=56  Score=31.24  Aligned_cols=53  Identities=11%  Similarity=0.220  Sum_probs=33.7

Q ss_pred             CHHHHHHHHHhCCCEEEEEeccccccc---ccccEEeC---CC---CCCccCeeEEEeeecCCC
Q psy1664         393 NEETIMREIFRHGPVEGSMTIYADMIL---YKTGIYKH---VA---GGPLGEHAIRIIGWGQEP  447 (524)
Q Consensus       393 ~~~~~~~~~~~~gPv~~~~~~~~~f~~---y~~gi~~~---~~---~~~~~~H~v~ivG~g~~~  447 (524)
                      ..++|...|...||+.+.++..  +..   -+.-....   .+   .....+|-|+|+||+...
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~--~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~  173 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDAS--LLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAAT  173 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccc--cccChhhcccccccccccccCCCCCccEEEEEEEeecCCC
Confidence            4789999999999888877653  222   02222111   11   123468999999998754


No 56 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=43.63  E-value=43  Score=28.94  Aligned_cols=34  Identities=24%  Similarity=0.335  Sum_probs=25.2

Q ss_pred             HHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEec
Q psy1664         255 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG  302 (524)
Q Consensus       255 ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg  302 (524)
                      +++.|....||.+.+...        .      .....+|.|+|+||.
T Consensus        70 ~~~~l~~~~Pvi~~~~~~--------~------~~~~~gH~vVv~g~~  103 (141)
T cd02549          70 LLRQLAAGHPVIVSVNLG--------V------SITPSGHAMVVIGYD  103 (141)
T ss_pred             HHHHHHCCCeEEEEEecC--------c------ccCCCCeEEEEEEEc
Confidence            677788888999887751        0      112358999999998


No 57 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.67  E-value=98  Score=28.74  Aligned_cols=40  Identities=18%  Similarity=0.202  Sum_probs=29.7

Q ss_pred             CCHHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664         392 ANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP  447 (524)
Q Consensus       392 ~~~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~  447 (524)
                      .+..+|+..|.+..||.+-...   |..             ..-|+|+|+||++..
T Consensus       121 ksl~~ik~ql~kg~PV~iw~T~---~~~-------------~s~H~v~itgyDk~n  160 (195)
T COG4990         121 KSLSDIKGQLLKGRPVVIWVTN---FHS-------------YSIHSVLITGYDKYN  160 (195)
T ss_pred             CcHHHHHHHHhcCCcEEEEEec---ccc-------------cceeeeEeecccccc
Confidence            3678999999999999776543   322             235999999997653


No 58 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2. 
Probab=31.50  E-value=1.1e+02  Score=27.89  Aligned_cols=39  Identities=21%  Similarity=0.177  Sum_probs=27.8

Q ss_pred             HHHHHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeecCCC
Q psy1664         394 EETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEP  447 (524)
Q Consensus       394 ~~~~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g~~~  447 (524)
                      .+.+...|.++|||-++....               ......|+++|.|-..+.
T Consensus        98 ~e~~~~LL~~yGPLwv~~~~P---------------~~~~~~H~~ViTGI~~dg  136 (166)
T PF12385_consen   98 AEGLANLLREYGPLWVAWEAP---------------GDSWVAHASVITGIDGDG  136 (166)
T ss_pred             HHHHHHHHHHcCCeEEEecCC---------------CCcceeeEEEEEeecCCC
Confidence            578899999999999885442               112235888888886543


No 59 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=29.89  E-value=1.9e+02  Score=24.71  Aligned_cols=34  Identities=24%  Similarity=0.335  Sum_probs=24.2

Q ss_pred             HHHHHHhCCCEEEEEecccccccccccEEeCCCCCCccCeeEEEeeec
Q psy1664         397 IMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWG  444 (524)
Q Consensus       397 ~~~~~~~~gPv~~~~~~~~~f~~y~~gi~~~~~~~~~~~H~v~ivG~g  444 (524)
                      ++..+...-||.+.+...        .      .....+|.|+|+||.
T Consensus        70 ~~~~l~~~~Pvi~~~~~~--------~------~~~~~gH~vVv~g~~  103 (141)
T cd02549          70 LLRQLAAGHPVIVSVNLG--------V------SITPSGHAMVVIGYD  103 (141)
T ss_pred             HHHHHHCCCeEEEEEecC--------c------ccCCCCeEEEEEEEc
Confidence            667777777998877640        0      112368999999997


No 60 
>PF01640 Peptidase_C10:  Peptidase C10 family classification.;  InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1). The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=29.47  E-value=1.5e+02  Score=27.80  Aligned_cols=52  Identities=27%  Similarity=0.356  Sum_probs=31.5

Q ss_pred             HHHHHHHHHcCCeEEEEEecccccccCCceEEcCCCCCCCCcEEEEEEeccCCCCCCCccceeEEEEeCCCCCcccccCc
Q psy1664         253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL  332 (524)
Q Consensus       253 ~~ik~~l~~~GPV~v~i~v~~~f~~Y~sGIy~~~~~~~~~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~~WGe~Gy  332 (524)
                      +.|+.+|.+..||...-.-.                  ...||-+|=||..+          .|+-+==.||-.  .+||
T Consensus       141 ~~i~~el~~~rPV~~~g~~~------------------~~GHawViDGy~~~----------~~~H~NwGW~G~--~nGy  190 (192)
T PF01640_consen  141 DMIRNELDNGRPVLYSGNSK------------------SGGHAWVIDGYDSD----------GYFHCNWGWGGS--SNGY  190 (192)
T ss_dssp             HHHHHHHHTT--EEEEEEET------------------TEEEEEEEEEEESS----------SEEEEE-SSTTT--T-EE
T ss_pred             HHHHHHHHcCCCEEEEEecC------------------CCCeEEEEcCccCC----------CeEEEeeCccCC--CCCc
Confidence            56888899999997554421                  12899999999643          466554344422  4677


Q ss_pred             cc
Q psy1664         333 FR  334 (524)
Q Consensus       333 ~r  334 (524)
                      |+
T Consensus       191 y~  192 (192)
T PF01640_consen  191 YR  192 (192)
T ss_dssp             EE
T ss_pred             cC
Confidence            64


No 61 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=29.38  E-value=68  Score=32.57  Aligned_cols=29  Identities=10%  Similarity=0.182  Sum_probs=22.8

Q ss_pred             CCcEEEEEEeccCCCCCCCccceeEEEEeCCCCC
Q psy1664         292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNT  325 (524)
Q Consensus       292 ~~HaV~iVGyg~~~~~~g~~~g~~YWivkNSWG~  325 (524)
                      .+||=.|++.-.-.     ..+.+...+||-||.
T Consensus       235 ~~HaY~Vl~~~~~~-----~~~~~lv~lrNPWg~  263 (315)
T cd00044         235 KGHAYSVLDVREVQ-----EEGLRLLRLRNPWGV  263 (315)
T ss_pred             cCcceEEeEEEEEc-----cCceEEEEecCCccC
Confidence            58999999998641     026789999999994


Done!