Query         psy4960
Match_columns 341
No_of_seqs    254 out of 1759
Neff          7.7 
Searched_HMMs 46136
Date          Fri Aug 16 22:17:42 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4960.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4960hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0 1.4E-83   3E-88  590.4  21.8  298   31-340    59-371 (372)
  2 PTZ00203 cathepsin L protease; 100.0 3.6E-76 7.7E-81  563.5  29.7  293   36-340    31-340 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0 5.3E-74 1.2E-78  564.2  28.4  297   34-341   160-489 (489)
  4 PTZ00200 cysteine proteinase;  100.0 6.4E-73 1.4E-77  554.7  28.9  293   36-340   119-445 (448)
  5 KOG1543|consensus              100.0 1.3E-66 2.9E-71  494.2  24.6  274   47-337    30-320 (325)
  6 cd02621 Peptidase_C1A_Cathepsi 100.0 6.3E-59 1.4E-63  427.1  21.8  210  126-339     1-241 (243)
  7 cd02698 Peptidase_C1A_Cathepsi 100.0   2E-58 4.3E-63  422.6  21.9  207  126-340     1-238 (239)
  8 cd02620 Peptidase_C1A_Cathepsi 100.0 1.2E-58 2.6E-63  423.3  20.2  207  127-337     1-235 (236)
  9 cd02248 Peptidase_C1A Peptidas 100.0 4.7E-58   1E-62  411.4  20.9  204  127-338     1-210 (210)
 10 PF00112 Peptidase_C1:  Papain  100.0   5E-56 1.1E-60  399.2  19.0  206  126-339     1-219 (219)
 11 PTZ00049 cathepsin C-like prot 100.0 6.9E-55 1.5E-59  437.8  22.3  213  124-340   379-676 (693)
 12 PTZ00364 dipeptidyl-peptidase  100.0 9.9E-55 2.2E-59  432.4  22.7  215  124-338   203-457 (548)
 13 smart00645 Pept_C1 Papain fami 100.0 1.1E-50 2.4E-55  354.2  17.9  167  126-336     1-171 (174)
 14 cd02619 Peptidase_C1 C1 Peptid 100.0 7.6E-47 1.7E-51  340.2  19.3  191  129-326     1-213 (223)
 15 PTZ00462 Serine-repeat antigen 100.0 5.9E-46 1.3E-50  382.8  20.8  196  140-341   544-782 (1004)
 16 KOG1544|consensus              100.0 3.4E-44 7.3E-49  326.1   5.9  254   79-338   167-458 (470)
 17 COG4870 Cysteine protease [Pos 100.0 7.5E-30 1.6E-34  237.8   7.1  193  125-326    98-314 (372)
 18 cd00585 Peptidase_C1B Peptidas  99.9 4.6E-25 9.9E-30  215.7  14.7  184  141-325    55-399 (437)
 19 PF03051 Peptidase_C1_2:  Pepti  99.8 4.4E-18 9.6E-23  166.7  13.8  183  141-324    56-399 (438)
 20 PF08246 Inhibitor_I29:  Cathep  99.4 1.6E-13 3.4E-18   97.9   5.8   49   43-91      1-58  (58)
 21 smart00848 Inhibitor_I29 Cathe  99.2 1.8E-11   4E-16   86.7   3.7   48   43-90      1-57  (57)
 22 COG3579 PepC Aminopeptidase C   99.1 3.1E-10 6.7E-15  105.3   8.1  183  141-323    58-400 (444)
 23 KOG4128|consensus               97.6 1.1E-05 2.4E-10   75.3  -0.2   76  141-216    63-169 (457)
 24 PF05543 Peptidase_C47:  Stapho  96.4   0.044 9.5E-07   47.3  10.5  117  145-311    18-145 (175)
 25 PF13529 Peptidase_C39_2:  Pept  96.3   0.025 5.3E-07   46.3   8.3   52  246-310    91-144 (144)
 26 PF09778 Guanylate_cyc_2:  Guan  83.6     3.4 7.4E-05   37.1   6.3   64  243-308   111-180 (212)
 27 PF14399 Transpep_BrtH:  NlpC/p  80.0     4.1 8.9E-05   38.3   6.0   53  247-308    81-133 (317)
 28 PF12385 Peptidase_C70:  Papain  76.2      37 0.00081   29.1   9.8   34  247-297   101-134 (166)
 29 PF08127 Propeptide_C1:  Peptid  73.6     3.8 8.1E-05   26.8   2.6   32   67-99      5-38  (41)
 30 cd02549 Peptidase_C39A A sub-f  68.8      13 0.00029   30.0   5.7   45  247-310    70-114 (141)
 31 COG4990 Uncharacterized protei  66.0      10 0.00022   33.2   4.3   44  246-311   125-168 (195)
 32 cd00044 CysPc Calpains, domain  65.0      11 0.00024   35.8   5.0   42  285-326   234-303 (315)
 33 smart00230 CysPc Calpain-like   34.0      72  0.0016   30.4   5.0   28  285-312   226-255 (318)
 34 KOG4702|consensus               25.6 1.3E+02  0.0028   22.0   3.8   33   40-73     28-60  (77)
 35 PF01640 Peptidase_C10:  Peptid  24.7 2.6E+02  0.0057   24.3   6.6   48  247-321   143-192 (192)
 36 cd03527 RuBisCO_small Ribulose  21.6      73  0.0016   25.1   2.1   52  247-298    21-86  (99)
 37 KOG4621|consensus               20.8 2.6E+02  0.0056   23.2   5.1   73  246-323    61-143 (167)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=1.4e-83  Score=590.37  Aligned_cols=298  Identities=36%  Similarity=0.685  Sum_probs=265.5

Q ss_pred             cccccchhHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh---------hccccccCCCCCHHHHHHh-ccccCC
Q psy4960          31 DLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD---------EYYGTSGSSDRSPQEILQR-TGLRLT  100 (341)
Q Consensus        31 ~~~~~~~~~~~~f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~---------~~yg~N~fsD~t~eE~~~~-l~~~~~  100 (341)
                      ++....+..++.|..|+.+|+|+|.+.+|...|+.||+.|++.++         +.||+|+|||||+|||+++ |+.+..
T Consensus        59 ~~~~~~l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~  138 (372)
T KOG1542|consen   59 DLNPRGLGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRR  138 (372)
T ss_pred             ccCCcccchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccc
Confidence            556666667899999999999999999999999999999999997         5679999999999999999 654442


Q ss_pred             CchhhhhhhhhhhhhhhhcccCCCCCCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHH
Q psy4960         101 GKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQL  180 (341)
Q Consensus       101 ~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l  180 (341)
                      ...   ........+    ..+..+||++||||++|  .||||||||+||||||||+++++|++++|++|+.++||||+|
T Consensus       139 ~~~---~~~~~~~~~----~~~~~~lP~~fDWR~kg--aVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeL  209 (372)
T KOG1542|consen  139 GSK---LPGDAAEAP----IEPGESLPESFDWRDKG--AVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQEL  209 (372)
T ss_pred             ccc---CccccccCc----CCCCCCCCcccchhccC--CccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhh
Confidence            110   000000000    12345699999999999  999999999999999999999999999999999999999999


Q ss_pred             hhcCCCCCCCCCCcHHHHHHHHHHc-CCCCCCCCCCcCCCCCccccccccccceeeeccceeechH--H-HHHHHHhcCC
Q psy4960         181 VECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV--D-HMMHLLQSGP  256 (341)
Q Consensus       181 ~dc~~~~~gC~GG~~~~a~~~~~~~-Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~y~~~~~--d-ik~~l~~~gP  256 (341)
                      +||+..++||+||.+..|++|+++. |+..|.+|||.++.+.  .|...+....+.|++ |..++.  + |.+.|.++||
T Consensus       210 vDCD~~d~gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~--~C~~~~~~~~v~I~~-f~~l~~nE~~ia~wLv~~GP  286 (372)
T KOG1542|consen  210 VDCDSCDNGCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGN--QCHFDKSKIVVSIKD-FSMLSNNEDQIAAWLVTFGP  286 (372)
T ss_pred             hcccCcCCcCCCCChhHHHHHHHHhCCccccccCCccccCCC--ccccchhhceEEEec-cEecCCCHHHHHHHHHhcCC
Confidence            9999999999999999999997666 9999999999998774  899999999999999 999987  3 9999999999


Q ss_pred             eEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecC-CeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCce
Q psy4960         257 IGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA  335 (341)
Q Consensus       257 v~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~-g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~  335 (341)
                      |+|+|++..++.|++||+.+....|++..++|||+|||||... .++|||||||||++|||+||+||.||.|.|||++++
T Consensus       287 i~vgiNa~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mv  366 (372)
T KOG1542|consen  287 LSVGINAKPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMV  366 (372)
T ss_pred             eEEEEchHHHHHhcccccCCCcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccccch
Confidence            9999999999999999999977799988899999999999887 899999999999999999999999999999999999


Q ss_pred             eEEee
Q psy4960         336 YLASV  340 (341)
Q Consensus       336 ~~~~~  340 (341)
                      ..+++
T Consensus       367 ss~~v  371 (372)
T KOG1542|consen  367 SSAAV  371 (372)
T ss_pred             hhhhc
Confidence            98875


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=3.6e-76  Score=563.54  Aligned_cols=293  Identities=27%  Similarity=0.509  Sum_probs=242.7

Q ss_pred             chhHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh------hcc--ccccCCCCCHHHHHHh-ccccC-CCchhh
Q psy4960          36 SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD------EYY--GTSGSSDRSPQEILQR-TGLRL-TGKEKE  105 (341)
Q Consensus        36 ~~~~~~~f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~------~~y--g~N~fsD~t~eE~~~~-l~~~~-~~~~~~  105 (341)
                      ..++..+|++|+++|+|.|.+.+|+.+|+.||++|+++|+      .+|  |+|+|+|||+|||+++ ++... ....+.
T Consensus        31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~~~  110 (348)
T PTZ00203         31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAARYLNGAAYFAAAKQ  110 (348)
T ss_pred             ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEeccccccCCHHHHHHHhcCCCcccccccc
Confidence            3445678999999999999998888899999999999999      256  9999999999999976 43211 110000


Q ss_pred             hhhhhhhhhhhhhcccCCCCCCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHHhhcCC
Q psy4960         106 RLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH  185 (341)
Q Consensus       106 ~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l~dc~~  185 (341)
                      . .......    ......+||++||||++|  +|+||||||.||||||||++++||+++++++++.+.||+|+|+||+.
T Consensus       111 ~-~~~~~~~----~~~~~~~lP~~~DWR~~g--~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~  183 (348)
T PTZ00203        111 H-AGQHYRK----ARADLSAVPDAVDWREKG--AVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDH  183 (348)
T ss_pred             c-ccccccc----cccccccCCCCCcCCcCC--CCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccC
Confidence            0 0000000    011123589999999999  99999999999999999999999999999999999999999999998


Q ss_pred             CCCCCCCCcHHHHHHHHHHc---CCCCCCCCCCcCCCCCccccccccc-cceeeeccceeechH--H-HHHHHHhcCCeE
Q psy4960         186 GNLNCNGGNIDVAFEYVKQY---GLESQADYPYRNKENITFRCTYEKE-KAKVFVQDTWVTSGV--D-HMMHLLQSGPIG  258 (341)
Q Consensus       186 ~~~gC~GG~~~~a~~~~~~~---Gi~~e~~yPY~~~~~~~~~C~~~~~-~~~~~i~~~y~~~~~--d-ik~~l~~~gPv~  258 (341)
                      .+.||+||++..|++|++++   |+++|++|||.+..+..+.|..... ...+++.+ |..++.  + |+.+|++.|||+
T Consensus       184 ~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~-~~~i~~~e~~~~~~l~~~GPv~  262 (348)
T PTZ00203        184 VDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDG-YVSMESSERVMAAWLAKNGPIS  262 (348)
T ss_pred             CCCCCCCCCHHHHHHHHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecc-eeecCcCHHHHHHHHHhCCCEE
Confidence            78899999999999999864   5899999999987664446864332 23467888 887765  3 899999999999


Q ss_pred             EEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeEE
Q psy4960         259 VYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLA  338 (341)
Q Consensus       259 v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~~  338 (341)
                      |+|++.+|++|++|||..    |....++|||+|||||+++|.+|||||||||++||++|||||+||.|.|||+++++.+
T Consensus       263 v~i~a~~f~~Y~~GIy~~----c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cgi~~~~~~~  338 (348)
T PTZ00203        263 IAVDASSFMSYHSGVLTS----CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYPVSV  338 (348)
T ss_pred             EEEEhhhhcCccCceeec----cCCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCcccccceEEEE
Confidence            999998899999999964    7555689999999999888999999999999999999999999999999999998876


Q ss_pred             ee
Q psy4960         339 SV  340 (341)
Q Consensus       339 ~~  340 (341)
                      .|
T Consensus       339 ~~  340 (348)
T PTZ00203        339 HV  340 (348)
T ss_pred             ec
Confidence            53


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=5.3e-74  Score=564.24  Aligned_cols=297  Identities=29%  Similarity=0.524  Sum_probs=246.5

Q ss_pred             ccchhHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh-------hcc--ccccCCCCCHHHHHHh-ccccCC-Cc
Q psy4960          34 YDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD-------EYY--GTSGSSDRSPQEILQR-TGLRLT-GK  102 (341)
Q Consensus        34 ~~~~~~~~~f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~-------~~y--g~N~fsD~t~eE~~~~-l~~~~~-~~  102 (341)
                      -..++...+|++|+.+|+|+|.+.+|+..|+.||++|+++|+       .+|  |+|+|+|||+|||+.+ ++.... ..
T Consensus       160 ~~n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~  239 (489)
T PTZ00021        160 MTNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFK  239 (489)
T ss_pred             ccChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccc
Confidence            445556778999999999999999999999999999999999       356  9999999999999987 543311 00


Q ss_pred             hhhh-hhh-hhh--hhhhhhcccCCCCCCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChh
Q psy4960         103 EKER-LEA-DRE--RVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKS  178 (341)
Q Consensus       103 ~~~~-~~~-~~~--~~~~~~~~~~~~~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q  178 (341)
                      .... ... ...  ....+. ......+|++||||+.|  .|+||||||.||||||||++++||++++|+++..+.||+|
T Consensus       240 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~P~s~DWR~~g--~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQ  316 (489)
T PTZ00021        240 SNGKKSPRVINYDDVIKKYK-PKDATFDHAKYDWRLHN--GVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQ  316 (489)
T ss_pred             cccccccccccccccccccc-cccccCCccccccccCC--CCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHH
Confidence            0000 000 000  000000 00011249999999999  9999999999999999999999999999999999999999


Q ss_pred             HHhhcCCCCCCCCCCcHHHHHHHHHHc-CCCCCCCCCCcCCC-CCccccccccccceeeeccceeechH-HHHHHHHhcC
Q psy4960         179 QLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKE-NITFRCTYEKEKAKVFVQDTWVTSGV-DHMMHLLQSG  255 (341)
Q Consensus       179 ~l~dc~~~~~gC~GG~~~~a~~~~~~~-Gi~~e~~yPY~~~~-~~~~~C~~~~~~~~~~i~~~y~~~~~-dik~~l~~~g  255 (341)
                      +|+||+..+.||+||++..|+.|+.+. ||++|++|||.+.. +   .|........+++.+ |..++. +|+.+|+..|
T Consensus       317 qLVDCs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~~---~C~~~~~~~~~~i~~-y~~i~~~~lk~al~~~G  392 (489)
T PTZ00021        317 ELVDCSFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTPE---LCNIDRCKEKYKIKS-YVSIPEDKFKEAIRFLG  392 (489)
T ss_pred             HHhhhccCCCCCCCcchHhhhhhhhhccccCcccccCccCCCCC---ccccccccccceeee-EEEecHHHHHHHHHhcC
Confidence            999999888999999999999999877 99999999999863 5   798765566688999 998887 5999999999


Q ss_pred             CeEEEEec-cccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecC----------CeeEEEEEcCCCCCCCCCcEEEEEe
Q psy4960         256 PIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN----------GILTWIVRNSWGDIGPDHGYFQIER  324 (341)
Q Consensus       256 Pv~v~~~~-~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~----------g~~ywivkNSWG~~WG~~GY~~i~r  324 (341)
                      ||+|+|++ .+|++|++|||++   .|+. .++|||+|||||+++          +.+|||||||||++|||+|||||+|
T Consensus       393 PVsv~i~a~~~f~~YkgGIy~~---~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r  468 (489)
T PTZ00021        393 PISVSIAVSDDFAFYKGGIFDG---ECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIET  468 (489)
T ss_pred             CeEEEEEeecccccCCCCcCCC---CCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEc
Confidence            99999999 8999999999986   6865 489999999999653          2479999999999999999999999


Q ss_pred             CC----CcccccCceeEEeeC
Q psy4960         325 GA----NACGIESYAYLASVK  341 (341)
Q Consensus       325 ~~----n~Cgi~~~~~~~~~~  341 (341)
                      +.    |+|||++.+.+|++.
T Consensus       469 ~~~g~~n~CGI~t~a~yP~~~  489 (489)
T PTZ00021        469 DENGLMKTCSLGTEAYVPLIE  489 (489)
T ss_pred             CCCCCCCCCCCcccceeEecC
Confidence            96    589999999999874


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=6.4e-73  Score=554.68  Aligned_cols=293  Identities=29%  Similarity=0.532  Sum_probs=240.3

Q ss_pred             chhHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh-----hcc--ccccCCCCCHHHHHHh-ccccCCCchhh--
Q psy4960          36 SIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD-----EYY--GTSGSSDRSPQEILQR-TGLRLTGKEKE--  105 (341)
Q Consensus        36 ~~~~~~~f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~-----~~y--g~N~fsD~t~eE~~~~-l~~~~~~~~~~--  105 (341)
                      +.+...+|++|+++|+|.|.+.+|+..|+.||++|++.|+     .+|  |+|+|+|||+|||.++ ++...+.....  
T Consensus       119 e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~  198 (448)
T PTZ00200        119 EFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTS  198 (448)
T ss_pred             hHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCcCCeEEeccccccCCHHHHHHHhccCCCcccccccc
Confidence            3445678999999999999999999999999999999999     466  9999999999999887 44332211000  


Q ss_pred             hhhhhh---hhhhhhhc----------cc--CCCCCCCeeeccccCccccccccccC-CccchHHHHHHHHHHHHHHHHh
Q psy4960         106 RLEADR---ERVKKFLN----------ER--KKGPLPKSLDWRQSKVKVLNPVESQG-RCGSCWAFATTAILESQVALLK  169 (341)
Q Consensus       106 ~~~~~~---~~~~~~~~----------~~--~~~~lP~~~Dwr~~g~~~v~pV~dQg-~cGsCwAfA~~~~le~~~~~~~  169 (341)
                      ......   .....+..          +.  ....+|++||||+.|  .|+|||||| .||||||||+++++|+++++++
T Consensus       199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g--~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~  276 (448)
T PTZ00200        199 HNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRAD--AVTKVKDQGLNCGSCWAFSSVGSVESLYKIYR  276 (448)
T ss_pred             cccccccccccccccccccccccccccccccccccCCCCccCCCCC--CCCCcccCCCccchHHHHhHHHHHHHHHHHhc
Confidence            000000   00000000          00  011269999999999  999999999 9999999999999999999999


Q ss_pred             CCCCcCChhHHhhcCCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCcCCCCCccccccccccceeeeccceeechH-H-H
Q psy4960         170 KTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV-D-H  247 (341)
Q Consensus       170 ~~~~~lS~q~l~dc~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~y~~~~~-d-i  247 (341)
                      +..+.||+|+|+||+..+.||+||++..|++|++++||++|++|||.+..+   .|.... ...++|.+ |..++. + +
T Consensus       277 ~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~Gi~~e~~YPY~~~~~---~C~~~~-~~~~~i~~-y~~~~~~~~l  351 (448)
T PTZ00200        277 DKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDVPYLAKDG---KCVVSS-TKKVYIDS-YLVAKGKDVL  351 (448)
T ss_pred             CCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhcCccccccCCCCCCCC---CCcCCC-CCeeEecc-eEecCHHHHH
Confidence            999999999999999878999999999999999989999999999999888   897644 33466888 887765 5 5


Q ss_pred             HHHHHhcCCeEEEEec-cccccCCCCcccCCCCCCCCCCCCeEEEEEEEee--cCCeeEEEEEcCCCCCCCCCcEEEEEe
Q psy4960         248 MMHLLQSGPIGVYLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE--KNGILTWIVRNSWGDIGPDHGYFQIER  324 (341)
Q Consensus       248 k~~l~~~gPv~v~~~~-~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~--~~g~~ywivkNSWG~~WG~~GY~~i~r  324 (341)
                      +.++ ..|||+|+|++ .+|+.|++|||.+   .|+. .++|||+|||||.  ++|.+|||||||||++||++|||||+|
T Consensus       352 ~~~l-~~GPV~v~i~~~~~f~~Yk~GIy~~---~C~~-~~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r  426 (448)
T PTZ00200        352 NKSL-VISPTVVYIAVSRELLKYKSGVYNG---ECGK-SLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLER  426 (448)
T ss_pred             HHHH-hcCCEEEEeecccccccCCCCcccc---ccCC-CCcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEe
Confidence            5555 58999999999 8999999999987   6865 4899999999994  468899999999999999999999999


Q ss_pred             C---CCcccccCceeEEee
Q psy4960         325 G---ANACGIESYAYLASV  340 (341)
Q Consensus       325 ~---~n~Cgi~~~~~~~~~  340 (341)
                      +   .|.|||++.+.+|++
T Consensus       427 ~~~g~n~CGI~~~~~~P~~  445 (448)
T PTZ00200        427 TNEGTDKCGILTVGLTPVF  445 (448)
T ss_pred             CCCCCCcCCccccceeeEE
Confidence            6   489999999999986


No 5  
>KOG1543|consensus
Probab=100.00  E-value=1.3e-66  Score=494.16  Aligned_cols=274  Identities=34%  Similarity=0.624  Sum_probs=235.9

Q ss_pred             HHHhCCccCChHHHHHHHHHHHHHHHhhh-------hcc--ccccCCCCCHHHHHHh-ccccCCCchhhhhhhhhhhhhh
Q psy4960          47 IVKWNRTYTDDNEIKTRFEYFKQDGKETD-------EYY--GTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKK  116 (341)
Q Consensus        47 ~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~-------~~y--g~N~fsD~t~eE~~~~-l~~~~~~~~~~~~~~~~~~~~~  116 (341)
                      +.+|.+.|.+..|+..|+.+|.+|++.|+       .+|  |+|+|+|++.+|++.. ++.+.+..     ....     
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~-----~~~~-----   99 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEI-----KRDK-----   99 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccc-----cccc-----
Confidence            66777777777788899999999999888       455  9999999999999987 44433221     0000     


Q ss_pred             hhcccCCCCCCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhC-CCCcCChhHHhhcCCC-CCCCCCCc
Q psy4960         117 FLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK-TLYPLSKSQLVECDHG-NLNCNGGN  194 (341)
Q Consensus       117 ~~~~~~~~~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~-~~~~lS~q~l~dc~~~-~~gC~GG~  194 (341)
                      +.......+||++||||++| ..++||||||.||||||||++++||++++|.++ ..+.||+|+|+||+.. +.||.||.
T Consensus       100 ~~~~~~~~~~p~s~DwR~~~-~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~  178 (325)
T KOG1543|consen  100 FTEKLDGDDLPDSFDWRDKG-AVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGE  178 (325)
T ss_pred             cccccchhhCCCCccccccC-CcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCC
Confidence            11122334699999999997 356669999999999999999999999999999 8999999999999974 88999999


Q ss_pred             HHHHHHHHHHcCCCC-CCCCCCcCCCCCccccccccccceeeeccceeechH---HHHHHHHhcCCeEEEEec-cccccC
Q psy4960         195 IDVAFEYVKQYGLES-QADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV---DHMMHLLQSGPIGVYLNH-RLIESY  269 (341)
Q Consensus       195 ~~~a~~~~~~~Gi~~-e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~y~~~~~---dik~~l~~~gPv~v~~~~-~~f~~y  269 (341)
                      +..|++|+.++|+++ +.+|||.+..+   .|..........+.+ +..++.   +|+.+|+.+|||+|+|++ .+|+.|
T Consensus       179 ~~~A~~yi~~~G~~t~~~~Ypy~~~~~---~C~~~~~~~~~~~~~-~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y  254 (325)
T KOG1543|consen  179 PKNAFKYIKKNGGVTECENYPYIGKDG---TCKSNKKDKTVTIKG-FYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLY  254 (325)
T ss_pred             HHHHHHHHHHhCCCCCCcCCCCcCCCC---CccCCCccceeEeee-eeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhc
Confidence            999999999998888 99999999999   999887766777888 777776   399999999999999999 999999


Q ss_pred             CCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeE
Q psy4960         270 DGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYL  337 (341)
Q Consensus       270 ~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~  337 (341)
                      ++|||.+  ..|....++|||+|||||+.++.+|||||||||+.|||+|||||.|+.|.|+|++.+.+
T Consensus       255 ~~GVy~~--~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~~~~  320 (325)
T KOG1543|consen  255 KGGVYAE--EKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIASEASY  320 (325)
T ss_pred             cCceEeC--CCCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhhccccc
Confidence            9999999  34433259999999999996678999999999999999999999999999999999998


No 6  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=6.3e-59  Score=427.06  Aligned_cols=210  Identities=31%  Similarity=0.625  Sum_probs=180.8

Q ss_pred             CCCeeeccccC--ccccccccccCCccchHHHHHHHHHHHHHHHHhCC------CCcCChhHHhhcCCCCCCCCCCcHHH
Q psy4960         126 LPKSLDWRQSK--VKVLNPVESQGRCGSCWAFATTAILESQVALLKKT------LYPLSKSQLVECDHGNLNCNGGNIDV  197 (341)
Q Consensus       126 lP~~~Dwr~~g--~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~------~~~lS~q~l~dc~~~~~gC~GG~~~~  197 (341)
                      ||++||||+.+  ..+|+||+|||.||||||||++++||++++|+++.      .+.||+|+|+||+..+.||+||++..
T Consensus         1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~   80 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL   80 (243)
T ss_pred             CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence            79999999965  33899999999999999999999999999998776      78999999999998788999999999


Q ss_pred             HHHHHHHcCCCCCCCCCCcC-CCCCcccccccc-ccceeeeccceeec------hH--HHHHHHHhcCCeEEEEec-ccc
Q psy4960         198 AFEYVKQYGLESQADYPYRN-KENITFRCTYEK-EKAKVFVQDTWVTS------GV--DHMMHLLQSGPIGVYLNH-RLI  266 (341)
Q Consensus       198 a~~~~~~~Gi~~e~~yPY~~-~~~~~~~C~~~~-~~~~~~i~~~y~~~------~~--dik~~l~~~gPv~v~~~~-~~f  266 (341)
                      |++|++++|+++|++|||.. ...   .|.... ....+++.+ |..+      +.  +|+++|+++|||+++|++ ++|
T Consensus        81 a~~~~~~~Gi~~e~~yPY~~~~~~---~C~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F  156 (243)
T cd02621          81 VGKFAEDFGIVTEDYFPYTADDDR---PCKASPSECRRYYFSD-YNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDF  156 (243)
T ss_pred             HHHHHHhcCcCCCceeCCCCCCCC---CCCCCccccccccccc-eeEcccccccCCHHHHHHHHHHcCCEEEEEEecccc
Confidence            99999999999999999998 555   787654 334444554 4443      22  299999999999999999 899


Q ss_pred             ccCCCCcccCCC--CCCCC--------CCCCeEEEEEEEeecC--CeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCc
Q psy4960         267 ESYDGNPIRRND--WACNP--------HKLDHAVAIVGYGEKN--GILTWIVRNSWGDIGPDHGYFQIERGANACGIESY  334 (341)
Q Consensus       267 ~~y~~Gv~~~~~--~~~~~--------~~~~Hav~iVGyg~~~--g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~  334 (341)
                      .+|++|||+...  ..|+.        ..++|||+|||||++.  +.+|||||||||++||++|||||+|+.|.|||++.
T Consensus       157 ~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~  236 (243)
T cd02621         157 DFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQ  236 (243)
T ss_pred             cccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCcccc
Confidence            999999998731  12532        2479999999999875  89999999999999999999999999999999999


Q ss_pred             eeEEe
Q psy4960         335 AYLAS  339 (341)
Q Consensus       335 ~~~~~  339 (341)
                      ++++.
T Consensus       237 ~~~~~  241 (243)
T cd02621         237 AVFAY  241 (243)
T ss_pred             eEeec
Confidence            98764


No 7  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=2e-58  Score=422.65  Aligned_cols=207  Identities=30%  Similarity=0.615  Sum_probs=181.1

Q ss_pred             CCCeeeccccC-ccccccccccC---CccchHHHHHHHHHHHHHHHHhC---CCCcCChhHHhhcCCCCCCCCCCcHHHH
Q psy4960         126 LPKSLDWRQSK-VKVLNPVESQG---RCGSCWAFATTAILESQVALLKK---TLYPLSKSQLVECDHGNLNCNGGNIDVA  198 (341)
Q Consensus       126 lP~~~Dwr~~g-~~~v~pV~dQg---~cGsCwAfA~~~~le~~~~~~~~---~~~~lS~q~l~dc~~~~~gC~GG~~~~a  198 (341)
                      ||++||||+.+ .++|+||||||   .||||||||++++||++++|+++   ..+.||+|+|+||+. +.||+||++..|
T Consensus         1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~~~~a   79 (239)
T cd02698           1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGDPGGV   79 (239)
T ss_pred             CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcCHHHH
Confidence            69999999864 44899999998   89999999999999999999875   367899999999987 789999999999


Q ss_pred             HHHHHHcCCCCCCCCCCcCCCCCcccccc---------------ccccceeeeccceeechH-H-HHHHHHhcCCeEEEE
Q psy4960         199 FEYVKQYGLESQADYPYRNKENITFRCTY---------------EKEKAKVFVQDTWVTSGV-D-HMMHLLQSGPIGVYL  261 (341)
Q Consensus       199 ~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~---------------~~~~~~~~i~~~y~~~~~-d-ik~~l~~~gPv~v~~  261 (341)
                      ++|++++|+++|++|||.....   .|..               .+....+++.+ |..++. + |+++|.++|||+|+|
T Consensus        80 ~~~~~~~Gl~~e~~yPY~~~~~---~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~-~~~~~~~~~i~~~l~~~GPV~v~i  155 (239)
T cd02698          80 YEYAHKHGIPDETCNPYQAKDG---ECNPFNRCGTCNPFGECFAIKNYTLYFVSD-YGSVSGRDKMMAEIYARGPISCGI  155 (239)
T ss_pred             HHHHHHcCcCCCCeeCCcCCCC---CCcCCCCCCCcccCcccccccccceEEeee-ceecCCHHHHHHHHHHcCCEEEEE
Confidence            9999999999999999987655   4432               11223467777 877765 4 999999999999999


Q ss_pred             ec-cccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecC-CeeEEEEEcCCCCCCCCCcEEEEEeCC-----CcccccCc
Q psy4960         262 NH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN-GILTWIVRNSWGDIGPDHGYFQIERGA-----NACGIESY  334 (341)
Q Consensus       262 ~~-~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~-g~~ywivkNSWG~~WG~~GY~~i~r~~-----n~Cgi~~~  334 (341)
                      .+ .+|+.|++|||+.  ..| ...++|||+|||||+++ +.+|||||||||++||++|||||+|+.     |+||||+.
T Consensus       156 ~~~~~f~~Y~~GIy~~--~~~-~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~  232 (239)
T cd02698         156 MATEALENYTGGVYKE--YVQ-DPLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEED  232 (239)
T ss_pred             EecccccccCCeEEcc--CCC-CCcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCcccccccccccc
Confidence            99 8999999999987  345 34689999999999876 999999999999999999999999999     99999999


Q ss_pred             eeEEee
Q psy4960         335 AYLASV  340 (341)
Q Consensus       335 ~~~~~~  340 (341)
                      ++++..
T Consensus       233 ~~~~~~  238 (239)
T cd02698         233 CAWADP  238 (239)
T ss_pred             eEEEee
Confidence            999864


No 8  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=1.2e-58  Score=423.32  Aligned_cols=207  Identities=30%  Similarity=0.544  Sum_probs=176.9

Q ss_pred             CCeeeccccCcccc--ccccccCCccchHHHHHHHHHHHHHHHHhC--CCCcCChhHHhhcCCC-CCCCCCCcHHHHHHH
Q psy4960         127 PKSLDWRQSKVKVL--NPVESQGRCGSCWAFATTAILESQVALLKK--TLYPLSKSQLVECDHG-NLNCNGGNIDVAFEY  201 (341)
Q Consensus       127 P~~~Dwr~~g~~~v--~pV~dQg~cGsCwAfA~~~~le~~~~~~~~--~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~  201 (341)
                      |++||||+++.+++  +||+|||.||||||||++++||++++++++  +.+.||+|+|+||+.. +.||+||++..||+|
T Consensus         1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~   80 (236)
T cd02620           1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY   80 (236)
T ss_pred             CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence            88999999743354  599999999999999999999999999987  7899999999999976 789999999999999


Q ss_pred             HHHcCCCCCCCCCCcCCCCC---------------cccccccc----ccceeeeccceeechH---HHHHHHHhcCCeEE
Q psy4960         202 VKQYGLESQADYPYRNKENI---------------TFRCTYEK----EKAKVFVQDTWVTSGV---DHMMHLLQSGPIGV  259 (341)
Q Consensus       202 ~~~~Gi~~e~~yPY~~~~~~---------------~~~C~~~~----~~~~~~i~~~y~~~~~---dik~~l~~~gPv~v  259 (341)
                      ++++|+++|++|||......               +..|....    ....+++.. +..+..   +||.+|+++|||++
T Consensus        81 i~~~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~-~~~~~~~~~~ik~~l~~~GPv~v  159 (236)
T cd02620          81 LTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKS-AYSVPSDETDIMKEIMTNGPVQA  159 (236)
T ss_pred             HHhcCCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecc-eeeeCCHHHHHHHHHHHCCCeEE
Confidence            99999999999999876531               11354322    122345555 555543   39999999999999


Q ss_pred             EEec-cccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeE
Q psy4960         260 YLNH-RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYL  337 (341)
Q Consensus       260 ~~~~-~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~  337 (341)
                      +|.+ ++|+.|++|||..   .|+...++|||+|||||++++.+|||||||||++||++|||||+|+.|+|||++.++.
T Consensus       160 ~i~~~~~f~~Y~~Giy~~---~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~  235 (236)
T cd02620         160 AFTVYEDFLYYKSGVYQH---TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA  235 (236)
T ss_pred             EEEechhhhhcCCcEEee---cCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence            9999 8999999999986   3555568999999999988999999999999999999999999999999999998874


No 9  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=4.7e-58  Score=411.39  Aligned_cols=204  Identities=43%  Similarity=0.814  Sum_probs=187.9

Q ss_pred             CCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHHhhcCCC-CCCCCCCcHHHHHHHHHHc
Q psy4960         127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQY  205 (341)
Q Consensus       127 P~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~~~~~  205 (341)
                      |++||||+.+  .++||+|||.||+|||||++++||++++++++....||+|+|++|... +.+|.||.+..|++++.+.
T Consensus         1 P~~~d~r~~~--~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~   78 (210)
T cd02248           1 PESVDWREKG--AVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG   78 (210)
T ss_pred             CCcccCCcCC--CCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC
Confidence            7899999988  899999999999999999999999999999999999999999999975 7899999999999999999


Q ss_pred             CCCCCCCCCCcCCCCCccccccccccceeeeccceeechH----HHHHHHHhcCCeEEEEec-cccccCCCCcccCCCCC
Q psy4960         206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV----DHMMHLLQSGPIGVYLNH-RLIESYDGNPIRRNDWA  280 (341)
Q Consensus       206 Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~y~~~~~----dik~~l~~~gPv~v~~~~-~~f~~y~~Gv~~~~~~~  280 (341)
                      |+++|++|||.....   .|........+++.+ |..++.    +||++|+++|||+++|.+ ++|..|++|||..  +.
T Consensus        79 Gi~~e~~yPY~~~~~---~C~~~~~~~~~~i~~-~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~--~~  152 (210)
T cd02248          79 GLASESDYPYTGKDG---TCKYNSSKVGAKITG-YSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSG--PC  152 (210)
T ss_pred             CcCccccCCccCCCC---CccCCCCcccEEEee-EEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeC--CC
Confidence            999999999998766   898776667889999 888865    299999999999999999 8999999999988  34


Q ss_pred             CCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeEE
Q psy4960         281 CNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLA  338 (341)
Q Consensus       281 ~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~~  338 (341)
                      |....++|||+|||||++.+.+|||||||||++||++|||||+|+.|.|||++.+.+|
T Consensus       153 ~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~~cgi~~~~~~~  210 (210)
T cd02248         153 CSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP  210 (210)
T ss_pred             CCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCCccCceeeeecC
Confidence            5455689999999999988999999999999999999999999999999999888765


No 10 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity. Members of the papain family are widespread, found in baculovirus, eubacteria, yeast, and practically all protozoa, plants and mammals. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides are their ability to inhibit the activity of their cognate enzymes and the high selectivity that certain propeptides exhibit for inhibition of the peptidases from which they originate.
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=5e-56  Score=399.23  Aligned_cols=206  Identities=38%  Similarity=0.687  Sum_probs=180.9

Q ss_pred             CCCeeecccc-CccccccccccCCccchHHHHHHHHHHHHHHHHh-CCCCcCChhHHhhcCC-CCCCCCCCcHHHHHHHH
Q psy4960         126 LPKSLDWRQS-KVKVLNPVESQGRCGSCWAFATTAILESQVALLK-KTLYPLSKSQLVECDH-GNLNCNGGNIDVAFEYV  202 (341)
Q Consensus       126 lP~~~Dwr~~-g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~-~~~~~lS~q~l~dc~~-~~~gC~GG~~~~a~~~~  202 (341)
                      ||++||||+. +  .++||+|||.||+|||||++++||++++++. ...+.||+|+|++|.. .+.+|+||++..|++++
T Consensus         1 lP~~~D~r~~~~--~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~   78 (219)
T PF00112_consen    1 LPKSFDWRDKGG--RITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYI   78 (219)
T ss_dssp             STSSEEGGGTTT--CSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHH
T ss_pred             CCCCEecccCCC--CcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceee
Confidence            7999999996 5  7999999999999999999999999999999 7899999999999997 57899999999999999


Q ss_pred             HH-cCCCCCCCCCCcCCC-CCccccccccccc-eeeeccceeechH----HHHHHHHhcCCeEEEEec-c-ccccCCCCc
Q psy4960         203 KQ-YGLESQADYPYRNKE-NITFRCTYEKEKA-KVFVQDTWVTSGV----DHMMHLLQSGPIGVYLNH-R-LIESYDGNP  273 (341)
Q Consensus       203 ~~-~Gi~~e~~yPY~~~~-~~~~~C~~~~~~~-~~~i~~~y~~~~~----dik~~l~~~gPv~v~~~~-~-~f~~y~~Gv  273 (341)
                      ++ .|+++|++|||.... .   .|....... ..++.. |..+..    +|+++|.++|||++++.+ . +|..|++||
T Consensus        79 ~~~~Gi~~e~~~pY~~~~~~---~c~~~~~~~~~~~i~~-~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi  154 (219)
T PF00112_consen   79 KNNNGIVTEEDYPYNGNENP---TCKSKKSNSYYVKIKG-YGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGI  154 (219)
T ss_dssp             HHHTSBEBTTTS--SSSSSC---SSCHSGGGEEEBEESE-EEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSE
T ss_pred             cccCcccccccccccccccc---cccccccccccccccc-cccccccchhHHHHHHhhCceeeeeeecccccccccccee
Confidence            99 899999999999876 4   788654443 367787 877764    399999999999999999 6 699999999


Q ss_pred             ccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCCCCcEEEEEeCCC-cccccCceeEEe
Q psy4960         274 IRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGAN-ACGIESYAYLAS  339 (341)
Q Consensus       274 ~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~GY~~i~r~~n-~Cgi~~~~~~~~  339 (341)
                      |..  ..|....++|||+|||||++.+++|||||||||++||++||+||+|+.+ +|||++.+++|+
T Consensus       155 ~~~--~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~~~~~~~~  219 (219)
T PF00112_consen  155 YDP--PDCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIESQAVYPI  219 (219)
T ss_dssp             ECS--TSSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTTSSEEEEE
T ss_pred             eec--cccccccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccCceeeecC
Confidence            998  4676667999999999999999999999999999999999999999997 999999999997


No 11 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=6.9e-55  Score=437.84  Aligned_cols=213  Identities=29%  Similarity=0.507  Sum_probs=177.4

Q ss_pred             CCCCCeeeccccCc--cccccccccCCccchHHHHHHHHHHHHHHHHhCCC----------CcCChhHHhhcCCCCCCCC
Q psy4960         124 GPLPKSLDWRQSKV--KVLNPVESQGRCGSCWAFATTAILESQVALLKKTL----------YPLSKSQLVECDHGNLNCN  191 (341)
Q Consensus       124 ~~lP~~~Dwr~~g~--~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~----------~~lS~q~l~dc~~~~~gC~  191 (341)
                      .+||++||||+..+  ..++||+|||.||||||||++++||++++|+++..          ..||+|+|+||+..+.||+
T Consensus       379 ~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC~  458 (693)
T PTZ00049        379 DELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGCN  458 (693)
T ss_pred             ccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCcC
Confidence            46999999998521  17999999999999999999999999999986431          2799999999998889999


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCCCcCCCCCccccccccc---------------------------------------cc
Q psy4960         192 GGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKE---------------------------------------KA  232 (341)
Q Consensus       192 GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~~~---------------------------------------~~  232 (341)
                      ||++..|++|+++.||++|++|||.+..+   .|.....                                       ..
T Consensus       459 GG~~~~A~kya~~~GI~tEscYPY~a~~g---~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  535 (693)
T PTZ00049        459 GGFPYLVSKMAKLQGIPLDKVFPYTATEQ---TCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPA  535 (693)
T ss_pred             CCcHHHHHHHHHHCCCCcCCccCCcCCCC---CCCCCCCCcccccccccccccccccccccccccccccccccccccccc
Confidence            99999999999999999999999988766   6653211                                       11


Q ss_pred             eeeeccceeech----------H-HHHHHHHhcCCeEEEEec-cccccCCCCcccCCC----CCCCCC------------
Q psy4960         233 KVFVQDTWVTSG----------V-DHMMHLLQSGPIGVYLNH-RLIESYDGNPIRRND----WACNPH------------  284 (341)
Q Consensus       233 ~~~i~~~y~~~~----------~-dik~~l~~~gPv~v~~~~-~~f~~y~~Gv~~~~~----~~~~~~------------  284 (341)
                      ++++++ |..++          . +|+++|+.+|||+|+|++ ++|++|++|||+...    ..|...            
T Consensus       536 r~y~k~-y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G  614 (693)
T PTZ00049        536 RWYAKD-YNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITG  614 (693)
T ss_pred             ceeeee-eEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccc
Confidence            234555 55542          2 399999999999999999 799999999998621    136321            


Q ss_pred             --CCCeEEEEEEEeec--CCe--eEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeEEee
Q psy4960         285 --KLDHAVAIVGYGEK--NGI--LTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV  340 (341)
Q Consensus       285 --~~~Hav~iVGyg~~--~g~--~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~~~~  340 (341)
                        ..+|||+|||||.+  +|.  +|||||||||++||++|||||+||.|.|||++.++++..
T Consensus       615 ~e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~~~~p  676 (693)
T PTZ00049        615 WEKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFIEP  676 (693)
T ss_pred             cccCceEEEEEEeccccCCCcccCEEEEECCCCCCcccCceEEEEcCCCccCCccceeEEee
Confidence              36999999999964  453  799999999999999999999999999999999999864


No 12 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=9.9e-55  Score=432.39  Aligned_cols=215  Identities=21%  Similarity=0.438  Sum_probs=179.6

Q ss_pred             CCCCCeeeccccC-ccccccccccCC---ccchHHHHHHHHHHHHHHHHhC------CCCcCChhHHhhcCCCCCCCCCC
Q psy4960         124 GPLPKSLDWRQSK-VKVLNPVESQGR---CGSCWAFATTAILESQVALLKK------TLYPLSKSQLVECDHGNLNCNGG  193 (341)
Q Consensus       124 ~~lP~~~Dwr~~g-~~~v~pV~dQg~---cGsCwAfA~~~~le~~~~~~~~------~~~~lS~q~l~dc~~~~~gC~GG  193 (341)
                      .+||++||||+.+ .++|+||||||.   ||||||||++++||++++|+++      ..+.||+|+|+||+..+.||+||
T Consensus       203 ~~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdGG  282 (548)
T PTZ00364        203 DPPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAGG  282 (548)
T ss_pred             cCCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCCC
Confidence            4699999999975 347999999999   9999999999999999999873      46889999999999778999999


Q ss_pred             cHHHHHHHHHHcCCCCCCCC--CCcCCCCCccccccccccceeeecc-----ceeechH---HHHHHHHhcCCeEEEEec
Q psy4960         194 NIDVAFEYVKQYGLESQADY--PYRNKENITFRCTYEKEKAKVFVQD-----TWVTSGV---DHMMHLLQSGPIGVYLNH  263 (341)
Q Consensus       194 ~~~~a~~~~~~~Gi~~e~~y--PY~~~~~~~~~C~~~~~~~~~~i~~-----~y~~~~~---dik~~l~~~gPv~v~~~~  263 (341)
                      ++..|++|++++||++|++|  ||.+.++..+.|+.......+++.+     .|..+..   +|+.+|+++|||+|+|++
T Consensus       283 ~p~~A~~yi~~~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GPVsVaIda  362 (548)
T PTZ00364        283 FPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYA  362 (548)
T ss_pred             cHHHHHHHHHhCCcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHHHHHHHHHHcCCeEEEEEe
Confidence            99999999999999999999  9987765334587654444444443     0333322   399999999999999999


Q ss_pred             -cccccCCCCcccCC-----C-CCCC----------CCCCCeEEEEEEEee-cCCeeEEEEEcCCCC--CCCCCcEEEEE
Q psy4960         264 -RLIESYDGNPIRRN-----D-WACN----------PHKLDHAVAIVGYGE-KNGILTWIVRNSWGD--IGPDHGYFQIE  323 (341)
Q Consensus       264 -~~f~~y~~Gv~~~~-----~-~~~~----------~~~~~Hav~iVGyg~-~~g~~ywivkNSWG~--~WG~~GY~~i~  323 (341)
                       .+|+.|++|||...     . ..|.          ...++|||+|||||+ ++|.+|||||||||+  +|||+|||||+
T Consensus       363 ~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~  442 (548)
T PTZ00364        363 NSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIA  442 (548)
T ss_pred             chHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEE
Confidence             89999999998631     1 0111          235799999999996 578999999999999  99999999999


Q ss_pred             eCCCcccccCceeEE
Q psy4960         324 RGANACGIESYAYLA  338 (341)
Q Consensus       324 r~~n~Cgi~~~~~~~  338 (341)
                      ||.|+|||++.++..
T Consensus       443 RG~N~CGIes~~v~~  457 (548)
T PTZ00364        443 RGVNAYNIESEVVVM  457 (548)
T ss_pred             cCCCcccccceeeee
Confidence            999999999999854


No 13 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00  E-value=1.1e-50  Score=354.18  Aligned_cols=167  Identities=47%  Similarity=0.887  Sum_probs=151.6

Q ss_pred             CCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHHhhcCCC-CCCCCCCcHHHHHHHHHH
Q psy4960         126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG-NLNCNGGNIDVAFEYVKQ  204 (341)
Q Consensus       126 lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~~~~  204 (341)
                      ||++||||+.+  +++||+|||.||+|||||++++||++++++++..+.||+|+|++|... +.||+||++..|++|+.+
T Consensus         1 lP~~~D~R~~~--~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~   78 (174)
T smart00645        1 LPESFDWRKKG--AVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKK   78 (174)
T ss_pred             CCCcCcccccC--CCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHH
Confidence            69999999998  999999999999999999999999999999998999999999999975 669999999999999999


Q ss_pred             c-CCCCCCCCCCcCCCCCccccccccccceeeeccceeechHHHHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCC
Q psy4960         205 Y-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNP  283 (341)
Q Consensus       205 ~-Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~y~~~~~dik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~  283 (341)
                      + |+++|++|||+.                                        ++.+.+.+|+.|++|||+.  +.|..
T Consensus        79 ~~Gi~~e~~~PY~~----------------------------------------~~~~~~~~f~~Y~~Gi~~~--~~~~~  116 (174)
T smart00645       79 NGGLETESCYPYTG----------------------------------------SVAIDASDFQFYKSGIYDH--PGCGS  116 (174)
T ss_pred             cCCcccccccCccc----------------------------------------EEEEEcccccCCcCeEECC--CCCCC
Confidence            8 999999999965                                        4455556799999999987  35765


Q ss_pred             CCCCeEEEEEEEeec-CCeeEEEEEcCCCCCCCCCcEEEEEeCC-CcccccCcee
Q psy4960         284 HKLDHAVAIVGYGEK-NGILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYAY  336 (341)
Q Consensus       284 ~~~~Hav~iVGyg~~-~g~~ywivkNSWG~~WG~~GY~~i~r~~-n~Cgi~~~~~  336 (341)
                      ..++|||+|||||.+ ++++|||||||||+.||++|||||.|+. |.|||+....
T Consensus       117 ~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~~~~  171 (174)
T smart00645      117 GTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA  171 (174)
T ss_pred             CcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCceeeee
Confidence            558999999999986 8899999999999999999999999998 9999976653


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00  E-value=7.6e-47  Score=340.21  Aligned_cols=191  Identities=30%  Similarity=0.423  Sum_probs=164.8

Q ss_pred             eeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhC--CCCcCChhHHhhcCCCC-----CCCCCCcHHHHHH-
Q psy4960         129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK--TLYPLSKSQLVECDHGN-----LNCNGGNIDVAFE-  200 (341)
Q Consensus       129 ~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~--~~~~lS~q~l~dc~~~~-----~gC~GG~~~~a~~-  200 (341)
                      .+|||+.+   ++||+|||.||+|||||+++++|++++++..  ..+.||+|+|++|....     .+|.||.+..++. 
T Consensus         1 ~~d~r~~~---~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~   77 (223)
T cd02619           1 SVDLRPLR---LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLK   77 (223)
T ss_pred             CCcchhcC---CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHH
Confidence            47999964   7999999999999999999999999999987  78999999999998543     6899999999998 


Q ss_pred             HHHHcCCCCCCCCCCcCCCCCcccccc----ccccceeeeccceeechH---H-HHHHHHhcCCeEEEEec-cccccCCC
Q psy4960         201 YVKQYGLESQADYPYRNKENITFRCTY----EKEKAKVFVQDTWVTSGV---D-HMMHLLQSGPIGVYLNH-RLIESYDG  271 (341)
Q Consensus       201 ~~~~~Gi~~e~~yPY~~~~~~~~~C~~----~~~~~~~~i~~~y~~~~~---d-ik~~l~~~gPv~v~~~~-~~f~~y~~  271 (341)
                      +++.+|+++|++|||.....   .|..    ......+++.+ |..+..   + ||++|.++|||+++|.+ ..|..|++
T Consensus        78 ~~~~~Gi~~e~~~Py~~~~~---~~~~~~~~~~~~~~~~~~~-y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~  153 (223)
T cd02619          78 LVALKGIPPEEDYPYGAESD---GEEPKSEAALNAAKVKLKD-YRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKE  153 (223)
T ss_pred             HHHHcCCCccccCCCCCCCC---CCCCCCccchhhcceeecc-eeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccC
Confidence            88888999999999998776   4432    23345678888 887765   2 99999999999999999 89999999


Q ss_pred             Cccc---CCCCCCCCCCCCeEEEEEEEeecC--CeeEEEEEcCCCCCCCCCcEEEEEeCC
Q psy4960         272 NPIR---RNDWACNPHKLDHAVAIVGYGEKN--GILTWIVRNSWGDIGPDHGYFQIERGA  326 (341)
Q Consensus       272 Gv~~---~~~~~~~~~~~~Hav~iVGyg~~~--g~~ywivkNSWG~~WG~~GY~~i~r~~  326 (341)
                      |++.   .....+....++|||+|||||++.  +++|||||||||+.||++||+||+++.
T Consensus       154 ~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         154 GIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             ccccccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence            9873   111245566799999999999876  899999999999999999999999985


No 15 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=5.9e-46  Score=382.85  Aligned_cols=196  Identities=23%  Similarity=0.426  Sum_probs=160.8

Q ss_pred             cccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHHhhcCCC--CCCCCCCc-HHHHHHHHHHc-CCCCCCCCCC
Q psy4960         140 LNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG--NLNCNGGN-IDVAFEYVKQY-GLESQADYPY  215 (341)
Q Consensus       140 v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l~dc~~~--~~gC~GG~-~~~a~~~~~~~-Gi~~e~~yPY  215 (341)
                      ..||+|||.||+|||||+++++|++++++++..+.||+|+|+||+..  +.||.||+ +..++.|++++ |+++|++|||
T Consensus       544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPY  623 (1004)
T PTZ00462        544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLY  623 (1004)
T ss_pred             CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCC
Confidence            47899999999999999999999999999999999999999999853  68999997 55667999888 5899999999


Q ss_pred             cC--CCCCccccccccc------------------cceeeeccceeech-----------H-HHHHHHHhcCCeEEEEec
Q psy4960         216 RN--KENITFRCTYEKE------------------KAKVFVQDTWVTSG-----------V-DHMMHLLQSGPIGVYLNH  263 (341)
Q Consensus       216 ~~--~~~~~~~C~~~~~------------------~~~~~i~~~y~~~~-----------~-dik~~l~~~gPv~v~~~~  263 (341)
                      ..  ..+   .|+....                  ...+.+.+ |..+.           . .|+.+|+..|||+|+|++
T Consensus       624 t~k~~~g---~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kg-Y~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdA  699 (1004)
T PTZ00462        624 NYTKVGE---DCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKA-YRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKA  699 (1004)
T ss_pred             ccCCCCC---CCCCCcccccccccccccccccccccceeeccc-eEEecccccccchhhHHHHHHHHHHhcCCEEEEEEe
Confidence            75  344   6764211                  01223344 54332           1 289999999999999999


Q ss_pred             cccccC-CCCcccCCCCCCCCCCCCeEEEEEEEeec-----CCeeEEEEEcCCCCCCCCCcEEEEEe-CCCcccccCcee
Q psy4960         264 RLIESY-DGNPIRRNDWACNPHKLDHAVAIVGYGEK-----NGILTWIVRNSWGDIGPDHGYFQIER-GANACGIESYAY  336 (341)
Q Consensus       264 ~~f~~y-~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~-----~g~~ywivkNSWG~~WG~~GY~~i~r-~~n~Cgi~~~~~  336 (341)
                      .+|+.| .+|||..  ..|+...++|||+|||||++     ++++|||||||||+.||++|||||.| +.|+|||.....
T Consensus       700 sdf~~Y~~sGIyv~--~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t  777 (1004)
T PTZ00462        700 ENVLGYEFNGKKVQ--NLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHS  777 (1004)
T ss_pred             ehHHhhhcCCcccc--CCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchhee
Confidence            778888 4898776  36876668999999999963     25799999999999999999999998 679999999998


Q ss_pred             EEeeC
Q psy4960         337 LASVK  341 (341)
Q Consensus       337 ~~~~~  341 (341)
                      +|+++
T Consensus       778 ~~~fn  782 (1004)
T PTZ00462        778 VVIFN  782 (1004)
T ss_pred             eeeEe
Confidence            88874


No 16 
>KOG1544|consensus
Probab=100.00  E-value=3.4e-44  Score=326.14  Aligned_cols=254  Identities=25%  Similarity=0.428  Sum_probs=196.9

Q ss_pred             ccccCCCCCHHHHHHh-ccccCCCchhhhhhhhhhhhhhhhcccCCCCCCCeeeccccCccccccccccCCccchHHHHH
Q psy4960          79 GTSGSSDRSPQEILQR-TGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT  157 (341)
Q Consensus        79 g~N~fsD~t~eE~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~  157 (341)
                      ...+|-.||.++-.+. ||+.+|...-.++++....    ++  +..+||+.||-|++.++.+.||.|||+|++.|||++
T Consensus       167 NYSaFWGmtL~DGiKyRLGTL~Ps~sv~nMNEi~~~----l~--p~~~LPE~F~As~KWp~liH~plDQgnCa~SWafST  240 (470)
T KOG1544|consen  167 NYSAFWGMTLDDGIKYRLGTLRPSSSVMNMNEIYTV----LN--PGEVLPEAFEASEKWPNLIHEPLDQGNCAGSWAFST  240 (470)
T ss_pred             chhhhhcccccccceeeecccCchhhhhhHHhHhhc----cC--cccccchhhhhhhcCCccccCccccCCcccceeeee
Confidence            3458999998886666 8776665422233322111    11  235699999999998889999999999999999999


Q ss_pred             HHHHHHHHHHHhC-C-CCcCChhHHhhcCC-CCCCCCCCcHHHHHHHHHHcCCCCCCCCCCcCCCC-Cccccccccc---
Q psy4960         158 TAILESQVALLKK-T-LYPLSKSQLVECDH-GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKEN-ITFRCTYEKE---  230 (341)
Q Consensus       158 ~~~le~~~~~~~~-~-~~~lS~q~l~dc~~-~~~gC~GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~-~~~~C~~~~~---  230 (341)
                      +++...+++|... . ...||+|+|++|.. ...||.||.+..|+-|+.+.|++...+|||.+.+. ..+.|...+.   
T Consensus       241 aavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKrGvVsdhCYP~~~dQ~~~~~~C~m~sR~~g  320 (470)
T KOG1544|consen  241 AAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKRGVVSDHCYPFSGDQAGPAPPCMMHSRAMG  320 (470)
T ss_pred             ehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecccccccccccccCCCCCCCCCceeeccccC
Confidence            9999999998753 3 67899999999984 46899999999999999999999999999986433 2234543211   


Q ss_pred             -----------------cceeeeccceeechH--HHHHHHHhcCCeEEEEec-cccccCCCCcccCCCCCCC-----CCC
Q psy4960         231 -----------------KAKVFVQDTWVTSGV--DHMMHLLQSGPIGVYLNH-RLIESYDGNPIRRNDWACN-----PHK  285 (341)
Q Consensus       231 -----------------~~~~~i~~~y~~~~~--dik~~l~~~gPv~v~~~~-~~f~~y~~Gv~~~~~~~~~-----~~~  285 (341)
                                       ...++..-.|.....  +|++.|+++|||.+.|.| ++|+.|++|||.+.+..-.     ...
T Consensus       321 rgkRqat~~CPn~~~~Sn~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~  400 (470)
T KOG1544|consen  321 RGKRQATAHCPNSYVNSNDIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRH  400 (470)
T ss_pred             cccccccCcCCCcccccCceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhc
Confidence                             122333332222222  399999999999999999 9999999999998533221     124


Q ss_pred             CCeEEEEEEEeecC-----CeeEEEEEcCCCCCCCCCcEEEEEeCCCcccccCceeEE
Q psy4960         286 LDHAVAIVGYGEKN-----GILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLA  338 (341)
Q Consensus       286 ~~Hav~iVGyg~~~-----g~~ywivkNSWG~~WG~~GY~~i~r~~n~Cgi~~~~~~~  338 (341)
                      +.|+|.|.|||++.     ..+|||..||||+.|||+|||||-||.|.|.||++++.+
T Consensus       401 gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgA  458 (470)
T KOG1544|consen  401 GTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGA  458 (470)
T ss_pred             ccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccchhhhHhhhhh
Confidence            88999999999752     368999999999999999999999999999999998865


No 17 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.96  E-value=7.5e-30  Score=237.82  Aligned_cols=193  Identities=25%  Similarity=0.327  Sum_probs=131.1

Q ss_pred             CCCCeeeccccCccccccccccCCccchHHHHHHHHHHHHHHHHhCCCCcCChhHHhhcC--CCCCCC-----CCCcHHH
Q psy4960         125 PLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD--HGNLNC-----NGGNIDV  197 (341)
Q Consensus       125 ~lP~~~Dwr~~g~~~v~pV~dQg~cGsCwAfA~~~~le~~~~~~~~~~~~lS~q~l~dc~--~~~~gC-----~GG~~~~  197 (341)
                      .||+.||||+.|  .|+||||||.||+|||||+++++|+.+.-..  ...+|+-.+..-.  ....+|     +||....
T Consensus        98 s~~~~fd~r~~g--~vs~v~dQg~~Gscwaf~t~~sles~l~~~~--~w~~s~~nm~~ll~~~ye~~fd~~~~d~g~~~m  173 (372)
T COG4870          98 SLPSYFDRRDEG--KVSPVKDQGSGGSCWAFATTRSLESYLNPES--AWDFSENNMKNLLGVPYEKGFDYTSNDGGNADM  173 (372)
T ss_pred             cchhheeeeccC--CcccccccCcccceEeeeehhhhhheecccc--cccccccchhhhcCCCccccCCCccccCCcccc
Confidence            389999999999  9999999999999999999999999874332  4455554443221  112222     2666766


Q ss_pred             HHHHHHHc-CCCCCCCCCCcCCCCCccccccccc-c--ceeeeccceeechH-HHHHHHHhcCCeEEEEec--cccccCC
Q psy4960         198 AFEYVKQY-GLESQADYPYRNKENITFRCTYEKE-K--AKVFVQDTWVTSGV-DHMMHLLQSGPIGVYLNH--RLIESYD  270 (341)
Q Consensus       198 a~~~~~~~-Gi~~e~~yPY~~~~~~~~~C~~~~~-~--~~~~i~~~y~~~~~-dik~~l~~~gPv~v~~~~--~~f~~y~  270 (341)
                      +..|+... |-+.+.+-||.......+.|..... .  ..+.... ...+.. +|+.++...|-+...|.+  ..+....
T Consensus       174 ~~a~l~e~sgpv~et~d~y~~~s~~~~~~~p~~k~~~~~~~i~~~-~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~~~  252 (372)
T COG4870         174 SAAYLTEWSGPVYETDDPYSENSYFSPTNLPVTKHVQEAQIIPSR-KKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGIC  252 (372)
T ss_pred             ccccccccCCcchhhcCccccccccCCcCCchhhccccceecccc-hhhhcccchHHHHhhhccccceeEEecccccccc
Confidence            66677666 8888888888876662222221110 0  0111111 222222 399999999988866665  3333323


Q ss_pred             CCcccCCCCCCCCCCCCeEEEEEEEeec----------CCeeEEEEEcCCCCCCCCCcEEEEEeCC
Q psy4960         271 GNPIRRNDWACNPHKLDHAVAIVGYGEK----------NGILTWIVRNSWGDIGPDHGYFQIERGA  326 (341)
Q Consensus       271 ~Gv~~~~~~~~~~~~~~Hav~iVGyg~~----------~g~~ywivkNSWG~~WG~~GY~~i~r~~  326 (341)
                      .+.+..    .+....+|||+||||++.          .|.+.||||||||++||++|||||++..
T Consensus       253 ~~~~~~----~s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~y  314 (372)
T COG4870         253 IPYPYV----DSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYY  314 (372)
T ss_pred             cCCCCC----CccccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeee
Confidence            344433    122468999999999974          3567999999999999999999999986


No 18 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.92  E-value=4.6e-25  Score=215.70  Aligned_cols=184  Identities=21%  Similarity=0.364  Sum_probs=136.4

Q ss_pred             ccccccCCccchHHHHHHHHHHHHHHHH-hCCCCcCChhHHhh----------------cC------------CCCCCCC
Q psy4960         141 NPVESQGRCGSCWAFATTAILESQVALL-KKTLYPLSKSQLVE----------------CD------------HGNLNCN  191 (341)
Q Consensus       141 ~pV~dQg~cGsCwAfA~~~~le~~~~~~-~~~~~~lS~q~l~d----------------c~------------~~~~gC~  191 (341)
                      .||+||++.|.||.||+...|++.+.++ +...+.||+.++.-                +.            -.....+
T Consensus        55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D  134 (437)
T cd00585          55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND  134 (437)
T ss_pred             CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence            5999999999999999999999988875 45689999987764                21            0245688


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCCCcCCCCCc-------------------------------------------------
Q psy4960         192 GGNIDVAFEYVKQYGLESQADYPYRNKENIT-------------------------------------------------  222 (341)
Q Consensus       192 GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~-------------------------------------------------  222 (341)
                      ||....++..++++|+++.+.||-+.....+                                                 
T Consensus       135 GGqw~m~~~li~KYGvVPk~~~pet~~s~~t~~~n~~L~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~~~  214 (437)
T cd00585         135 GGQWDMLVNLIEKYGLVPKSVMPESFNSENSRRLNYLLNRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILAIA  214 (437)
T ss_pred             CCchHHHHHHHHHcCCCcccccCCCcCccchHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHH
Confidence            9999999999999999999999964211000                                                 


Q ss_pred             ----c------------------------------ccccc--------cc--c---cee-----------eeccceeech
Q psy4960         223 ----F------------------------------RCTYE--------KE--K---AKV-----------FVQDTWVTSG  244 (341)
Q Consensus       223 ----~------------------------------~C~~~--------~~--~---~~~-----------~i~~~y~~~~  244 (341)
                          |                              .|...        +.  .   ..+           +... |.++|
T Consensus       215 lG~pP~~F~~~y~dkd~~~~~~~~~TP~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~-y~Nvp  293 (437)
T cd00585         215 LGEPPEKFDWEYRDKDKKYHEIKELTPLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPIL-YLNVP  293 (437)
T ss_pred             cCCCCceEEEEEEeCCCCeeeCCCcCHHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccce-EEecC
Confidence                0                              00000        00  0   011           1223 66777


Q ss_pred             HH-----HHHHHHhcCCeEEEEeccccccCCCCcccCCCC------------------CCCCCCCCeEEEEEEEeec-CC
Q psy4960         245 VD-----HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW------------------ACNPHKLDHAVAIVGYGEK-NG  300 (341)
Q Consensus       245 ~d-----ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~------------------~~~~~~~~Hav~iVGyg~~-~g  300 (341)
                      .+     +.++|..++||.+++++..|..|++||++....                  .|.....+|||+|||||.+ +|
T Consensus       294 ~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g  373 (437)
T cd00585         294 MDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDEDG  373 (437)
T ss_pred             HHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCCC
Confidence            63     347888899999999997778999999965210                  1334457899999999964 47


Q ss_pred             e-eEEEEEcCCCCCCCCCcEEEEEeC
Q psy4960         301 I-LTWIVRNSWGDIGPDHGYFQIERG  325 (341)
Q Consensus       301 ~-~ywivkNSWG~~WG~~GY~~i~r~  325 (341)
                      + .||+|+||||+.||++||++|+++
T Consensus       374 ~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         374 KPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             CcceEEEEcccCCCCCCCcceehhHH
Confidence            6 699999999999999999999875


No 19 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.76  E-value=4.4e-18  Score=166.74  Aligned_cols=183  Identities=22%  Similarity=0.393  Sum_probs=114.6

Q ss_pred             ccccccCCccchHHHHHHHHHHHHHHHHhC-CCCcCChhHHh----------------hcCC------------CCCCCC
Q psy4960         141 NPVESQGRCGSCWAFATTAILESQVALLKK-TLYPLSKSQLV----------------ECDH------------GNLNCN  191 (341)
Q Consensus       141 ~pV~dQg~cGsCwAfA~~~~le~~~~~~~~-~~~~lS~q~l~----------------dc~~------------~~~gC~  191 (341)
                      .||.||.+.|.||.||+..+|+..+.++.+ ..+.||+.+|.                ++..            .....+
T Consensus        56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D  135 (438)
T PF03051_consen   56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD  135 (438)
T ss_dssp             -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred             CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence            599999999999999999999999988766 68999998865                2221            134578


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCCCcCCCCCc-------------------------------------------------
Q psy4960         192 GGNIDVAFEYVKQYGLESQADYPYRNKENIT-------------------------------------------------  222 (341)
Q Consensus       192 GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~-------------------------------------------------  222 (341)
                      ||....+...++++||++.+.||-+.....+                                                 
T Consensus       136 GGqw~~~~nli~KYGvVPk~~mpet~~s~~t~~~n~~l~~~Lr~~a~~LR~~~~~~~~~~~l~~~k~~~l~~iy~il~~~  215 (438)
T PF03051_consen  136 GGQWDMVVNLIKKYGVVPKSVMPETFSSSNTSEMNEMLNTKLREYALELRKLVKAGKSEEELRKLKEEMLAEIYRILAIY  215 (438)
T ss_dssp             -B-HHHHHHHHHHH---BGGGSTTGCGCHBHHHHHHHHHHHHHHHHHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCchHHHHHHHHHcCcCcHhhCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            9999999999999999999999865311100                                                 


Q ss_pred             --------------------------c------ccccc----------c--c---cceeee-----------ccceeech
Q psy4960         223 --------------------------F------RCTYE----------K--E---KAKVFV-----------QDTWVTSG  244 (341)
Q Consensus       223 --------------------------~------~C~~~----------~--~---~~~~~i-----------~~~y~~~~  244 (341)
                                                |      .+...          +  .   ...+.+           .. |.++|
T Consensus       216 lG~PP~~F~~ey~dkd~~~~~~~~~TP~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~ve~~~Nv~~g~~~~-ylNvp  294 (438)
T PF03051_consen  216 LGEPPEKFTWEYRDKDKKYHRGKNYTPLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYTVEYLGNVVGGRPVR-YLNVP  294 (438)
T ss_dssp             H---SSSEEEEEE-TTS-EEEEEEE-HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEEETTTTSSTT-EEEE-EEE--
T ss_pred             cCCCChheeEEEeccccccccccccCchhHHHHHhCCCCcceEEEeeCCCccCccceeEEEccCCCEECCccee-EeccC
Confidence                                      0      00000          0  0   011111           12 66777


Q ss_pred             HH-----HHHHHHhcCCeEEEEeccccccCCCCcccCCCC------------------CCCCCCCCeEEEEEEEee-cCC
Q psy4960         245 VD-----HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW------------------ACNPHKLDHAVAIVGYGE-KNG  300 (341)
Q Consensus       245 ~d-----ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~------------------~~~~~~~~Hav~iVGyg~-~~g  300 (341)
                      .|     ++.+|..+.||..+-+|..+...+.||.+....                  ....+..+|||+|||.+. ++|
T Consensus       295 id~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~g  374 (438)
T PF03051_consen  295 IDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDEDG  374 (438)
T ss_dssp             HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TTS
T ss_pred             HHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccCC
Confidence            63     888888999999999994455678888755221                  011234789999999995 667


Q ss_pred             e-eEEEEEcCCCCCCCCCcEEEEEe
Q psy4960         301 I-LTWIVRNSWGDIGPDHGYFQIER  324 (341)
Q Consensus       301 ~-~ywivkNSWG~~WG~~GY~~i~r  324 (341)
                      + .+|+|+||||+..|.+||+.|+.
T Consensus       375 ~p~~wkVeNSWG~~~g~kGy~~msd  399 (438)
T PF03051_consen  375 KPVRWKVENSWGTDNGDKGYFYMSD  399 (438)
T ss_dssp             SEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred             CeeEEEEEcCCCCCCCCCcEEEECH
Confidence            6 69999999999999999999974


No 20 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.44  E-value=1.6e-13  Score=97.90  Aligned_cols=49  Identities=29%  Similarity=0.588  Sum_probs=42.6

Q ss_pred             HHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh-------hcc--ccccCCCCCHHHH
Q psy4960          43 FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD-------EYY--GTSGSSDRSPQEI   91 (341)
Q Consensus        43 f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~-------~~y--g~N~fsD~t~eE~   91 (341)
                      |++|+++|+|.|.+.+|+..|+.+|++|++.|.       .+|  |+|+|||||++||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence            899999999999999999999999999999999       456  9999999999997


No 21 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a s
Probab=99.18  E-value=1.8e-11  Score=86.65  Aligned_cols=48  Identities=29%  Similarity=0.531  Sum_probs=44.6

Q ss_pred             HHHHHHHhCCccCChHHHHHHHHHHHHHHHhhh-------hcc--ccccCCCCCHHH
Q psy4960          43 FKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETD-------EYY--GTSGSSDRSPQE   90 (341)
Q Consensus        43 f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~I~-------~~y--g~N~fsD~t~eE   90 (341)
                      |++|+++|+|.|.+.+|...|+.+|.+|++.|.       .+|  |+|+|||||++|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence            689999999999999999999999999999999       456  999999999886


No 22 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=99.08  E-value=3.1e-10  Score=105.35  Aligned_cols=183  Identities=20%  Similarity=0.345  Sum_probs=118.6

Q ss_pred             ccccccCCccchHHHHHHHHHHHHHHHHhC-CCCcCChhHHh----------------hcC------------CCCCCCC
Q psy4960         141 NPVESQGRCGSCWAFATTAILESQVALLKK-TLYPLSKSQLV----------------ECD------------HGNLNCN  191 (341)
Q Consensus       141 ~pV~dQg~cGsCwAfA~~~~le~~~~~~~~-~~~~lS~q~l~----------------dc~------------~~~~gC~  191 (341)
                      .||.||...|-||.||+...+...+...-+ +.+.||..++.                .-+            -...--+
T Consensus        58 d~vtNQk~SGRCWmFAAlNtfRhk~~~el~le~fElSQaytfFwDKlEKaN~FleqIi~tadq~ldsRlv~~LL~~PqqD  137 (444)
T COG3579          58 DKVTNQKQSGRCWMFAALNTFRHKLISELKLEDFELSQAYTFFWDKLEKANWFLEQIIETADQELDSRLVSFLLATPQQD  137 (444)
T ss_pred             CccccccccceehHHHHHHHHHHHHHHhcCcceeehhhHHHHHHHHHHHhhHHHHHHHhhcccchHHHHHHHHHcCcccc
Confidence            389999999999999999988776655444 46777765443                111            0133457


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCCCcCCCCC--------------------------------------------------
Q psy4960         192 GGNIDVAFEYVKQYGLESQADYPYRNKENI--------------------------------------------------  221 (341)
Q Consensus       192 GG~~~~a~~~~~~~Gi~~e~~yPY~~~~~~--------------------------------------------------  221 (341)
                      ||--......+.++|+++-+.||-.-....                                                  
T Consensus       138 GGQwdM~v~l~eKYGvVpK~~ypes~sSS~Sr~ln~~Ln~~LR~dAqiLR~a~~eg~~~~~v~~~kEe~l~eif~~l~~~  217 (444)
T COG3579         138 GGQWDMFVSLFEKYGVVPKSVYPESFSSSNSRELNALLNKLLRQDAQILRDALKEGADDDTVEALKEELLQEIFNFLAMT  217 (444)
T ss_pred             CchHHHHHHHHHHhCCCchhhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHH
Confidence            898888999999999999999986521110                                                  


Q ss_pred             -------------------------cc---ccccc-------------cc--c---ceeeecc----------ceeechH
Q psy4960         222 -------------------------TF---RCTYE-------------KE--K---AKVFVQD----------TWVTSGV  245 (341)
Q Consensus       222 -------------------------~~---~C~~~-------------~~--~---~~~~i~~----------~y~~~~~  245 (341)
                                               +|   .|++.             +.  +   ..+++.-          .|.+++.
T Consensus       218 lg~PP~~Fdf~YrdKd~~~h~~k~lTP~eFy~kyv~ldl~~yVslInaPtadkPygk~ytV~~LGnVvgg~~v~ylNv~m  297 (444)
T COG3579         218 LGLPPEKFDFAYRDKDNKYHKEKGLTPQEFYKKYVGLDLKDYVSLINAPTADKPYGKSYTVEFLGNVVGGRAVKYLNVDM  297 (444)
T ss_pred             cCCCchhcceEEeccccchhhhcCCCHHHHHHHhcCCCcccceeeccCCcCCCCCcceeehhhhccccCCceeEEecCcH
Confidence                                     00   01100             00  0   0111110          1444444


Q ss_pred             H-----HHHHHHhcCCeEEEEeccccccCCCCcccCCCC------------------CCCCCCCCeEEEEEEEe-ecCC-
Q psy4960         246 D-----HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW------------------ACNPHKLDHAVAIVGYG-EKNG-  300 (341)
Q Consensus       246 d-----ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~------------------~~~~~~~~Hav~iVGyg-~~~g-  300 (341)
                      +     ....+..+-||-.+-++..+..-+.||.+..-.                  ..+.....|||+|.|.+ +++| 
T Consensus       298 e~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~  377 (444)
T COG3579         298 ERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGN  377 (444)
T ss_pred             HHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCC
Confidence            3     444566677999998887777777777543100                  01122256999999999 4454 


Q ss_pred             eeEEEEEcCCCCCCCCCcEEEEE
Q psy4960         301 ILTWIVRNSWGDIGPDHGYFQIE  323 (341)
Q Consensus       301 ~~ywivkNSWG~~WG~~GY~~i~  323 (341)
                      .--|.|.||||.+=|.+|||-++
T Consensus       378 p~rwkVENSWG~d~G~~GyfvaS  400 (444)
T COG3579         378 PLRWKVENSWGKDVGKKGYFVAS  400 (444)
T ss_pred             ceeeEeecccccccCCCceEeeh
Confidence            45899999999999999999876


No 23 
>KOG4128|consensus
Probab=97.63  E-value=1.1e-05  Score=75.29  Aligned_cols=76  Identities=26%  Similarity=0.477  Sum_probs=59.2

Q ss_pred             ccccccCCccchHHHHHHHHHHHHHHHHhC-CCCcCChhHHhh--------------------cCC----------CCCC
Q psy4960         141 NPVESQGRCGSCWAFATTAILESQVALLKK-TLYPLSKSQLVE--------------------CDH----------GNLN  189 (341)
Q Consensus       141 ~pV~dQg~cGsCwAfA~~~~le~~~~~~~~-~~~~lS~q~l~d--------------------c~~----------~~~g  189 (341)
                      +||.+|.+.|-||.|+.+..+---+.++-+ ....||..+|+-                    |..          .+..
T Consensus        63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~  142 (457)
T KOG4128|consen   63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPV  142 (457)
T ss_pred             cccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCC
Confidence            699999999999999999988766655544 367888877651                    211          1445


Q ss_pred             CCCCcHHHHHHHHHHcCCCCCCCCCCc
Q psy4960         190 CNGGNIDVAFEYVKQYGLESQADYPYR  216 (341)
Q Consensus       190 C~GG~~~~a~~~~~~~Gi~~e~~yPY~  216 (341)
                      -+||....-++.++++|+.+..+||-.
T Consensus       143 ~DGGqw~MfvNlVkKYGviPKkcy~~s  169 (457)
T KOG4128|consen  143 PDGGQWQMFVNLVKKYGVIPKKCYLHS  169 (457)
T ss_pred             CCCchHHHHHHHHHHhCCCcHHhcccc
Confidence            679999999999999999999999743


No 24 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=96.41  E-value=0.044  Score=47.31  Aligned_cols=117  Identities=23%  Similarity=0.302  Sum_probs=67.7

Q ss_pred             ccCCccchHHHHHHHHHHHHHHHH--------hCCCCcCChhHHhhcCCCCCCCCCCcHHHHHHHHHHcCCCCCCCCCCc
Q psy4960         145 SQGRCGSCWAFATTAILESQVALL--------KKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYR  216 (341)
Q Consensus       145 dQg~cGsCwAfA~~~~le~~~~~~--------~~~~~~lS~q~l~dc~~~~~gC~GG~~~~a~~~~~~~Gi~~e~~yPY~  216 (341)
                      .||.-+=|-+||.+++|.......        +.....+|+++|..++        -.+...++|.+..|...       
T Consensus        18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~--------~~~~~~i~y~ks~g~~~-------   82 (175)
T PF05543_consen   18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTS--------LTPNQMIKYAKSQGRNP-------   82 (175)
T ss_dssp             --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH----------B-HHHHHHHHHHTTEEE-------
T ss_pred             ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcC--------CCHHHHHHHHHHcCcch-------
Confidence            589999999999999887653211        1124567777776663        24567777776544221       


Q ss_pred             CCCCCccccccccccceeeeccceeechH-H-HHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEE
Q psy4960         217 NKENITFRCTYEKEKAKVFVQDTWVTSGV-D-HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVG  294 (341)
Q Consensus       217 ~~~~~~~~C~~~~~~~~~~i~~~y~~~~~-d-ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVG  294 (341)
                                       -...+    .+. | +++.+..+-|+.+..+.     ..+        . .....+|||+|||
T Consensus        83 -----------------~~~n~----~~s~~eV~~~~~~nk~i~i~~~~-----v~~--------~-~~~~~gHAlavvG  127 (175)
T PF05543_consen   83 -----------------QYNNR----MPSFDEVKKLIDNNKGIAILADR-----VEQ--------T-NGPHAGHALAVVG  127 (175)
T ss_dssp             -----------------EEECS-------HHHHHHHHHTT-EEEEEEEE-----TTS--------C-TTB--EEEEEEEE
T ss_pred             -----------------hHhcC----CCCHHHHHHHHHcCCCeEEEecc-----ccc--------C-CCCccceeEEEEe
Confidence                             00111    122 4 88888888888887652     111        1 1235799999999


Q ss_pred             Eee-cCCeeEEEEEcCCC
Q psy4960         295 YGE-KNGILTWIVRNSWG  311 (341)
Q Consensus       295 yg~-~~g~~ywivkNSWG  311 (341)
                      |-. .+|.++.++=|-|-
T Consensus       128 ya~~~~g~~~y~~WNPW~  145 (175)
T PF05543_consen  128 YAKPNNGQKTYYFWNPWW  145 (175)
T ss_dssp             EEEETTSEEEEEEE-TT-
T ss_pred             eeecCCCCeEEEEeCCcc
Confidence            986 56799999999984


No 25 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=96.28  E-value=0.025  Score=46.26  Aligned_cols=52  Identities=27%  Similarity=0.387  Sum_probs=31.6

Q ss_pred             HHHHHHHhcCCeEEEEec--cccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCC
Q psy4960         246 DHMMHLLQSGPIGVYLNH--RLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW  310 (341)
Q Consensus       246 dik~~l~~~gPv~v~~~~--~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSW  310 (341)
                      +|+++|.++.||.+.+..  ...   .++.+.       ....+|.|+|+||+.+.   +++|..+|
T Consensus        91 ~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~  144 (144)
T PF13529_consen   91 DIKQEIDAGRPVIVSVNSGWRPP---NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW  144 (144)
T ss_dssp             HHHHHHHTT--EEEEEETTSS-----TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred             HHHHHHHCCCcEEEEEEcccccC---CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence            399999999999999973  111   111122       23479999999998532   78888877


No 26 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=83.63  E-value=3.4  Score=37.08  Aligned_cols=64  Identities=22%  Similarity=0.375  Sum_probs=37.3

Q ss_pred             chH-HHHHHHHhcCCeEEEEeccccc--cCCCCcccCCCCCC---CCCCCCeEEEEEEEeecCCeeEEEEEc
Q psy4960         243 SGV-DHMMHLLQSGPIGVYLNHRLIE--SYDGNPIRRNDWAC---NPHKLDHAVAIVGYGEKNGILTWIVRN  308 (341)
Q Consensus       243 ~~~-dik~~l~~~gPv~v~~~~~~f~--~y~~Gv~~~~~~~~---~~~~~~Hav~iVGyg~~~g~~ywivkN  308 (341)
                      ++. +|..+|..+||+.+.++..-..  .-++-........|   .....+|-|+|+||+..  .+-++++|
T Consensus       111 vs~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~--~~~~~yrd  180 (212)
T PF09778_consen  111 VSIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA--TKEFEYRD  180 (212)
T ss_pred             ccHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC--CCeEEEeC
Confidence            444 4999999999888888862221  00222221110122   23458999999999943  23455555


No 27 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=79.97  E-value=4.1  Score=38.35  Aligned_cols=53  Identities=21%  Similarity=0.350  Sum_probs=34.5

Q ss_pred             HHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEc
Q psy4960         247 HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRN  308 (341)
Q Consensus       247 ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkN  308 (341)
                      |++.|..+.||.+.++.-.+ .|...-+       ......|.|+|+||+++ +..+.++-+
T Consensus        81 l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~~~~~H~i~v~G~d~~-~~~~~v~D~  133 (317)
T PF14399_consen   81 LKEALDAGRPVIVWVDMYYL-PYRPNYY-------KKHHADHYIVVYGYDEE-EDVFYVSDP  133 (317)
T ss_pred             HHHHHhCCCceEEEeccccC-CCCcccc-------ccccCCcEEEEEEEeCC-CCEEEEEcC
Confidence            99999987799999875222 2221111       12346899999999864 345666544


No 28 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2. 
Probab=76.25  E-value=37  Score=29.05  Aligned_cols=34  Identities=24%  Similarity=0.216  Sum_probs=25.6

Q ss_pred             HHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEee
Q psy4960         247 HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGE  297 (341)
Q Consensus       247 ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~  297 (341)
                      +...|.++||+-++...            +     .+....|+++|.|-+.
T Consensus       101 ~~~LL~~yGPLwv~~~~------------P-----~~~~~~H~~ViTGI~~  134 (166)
T PF12385_consen  101 LANLLREYGPLWVAWEA------------P-----GDSWVAHASVITGIDG  134 (166)
T ss_pred             HHHHHHHcCCeEEEecC------------C-----CCcceeeEEEEEeecC
Confidence            89999999999998542            1     2234579999999874


No 29 
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminus of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B enzymes are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both endo- and exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide.; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=73.57  E-value=3.8  Score=26.81  Aligned_cols=32  Identities=16%  Similarity=0.138  Sum_probs=17.1

Q ss_pred             HHHHHHhhhhcc--ccccCCCCCHHHHHHhccccC
Q psy4960          67 FKQDGKETDEYY--GTSGSSDRSPQEILQRTGLRL   99 (341)
Q Consensus        67 F~~n~~~I~~~y--g~N~fsD~t~eE~~~~l~~~~   99 (341)
                      |++.+.....+|  |.| |.++|.+.++.++|...
T Consensus         5 ~I~~IN~~~~tWkAG~N-F~~~~~~~ik~LlGv~~   38 (41)
T PF08127_consen    5 FIDYINSKNTTWKAGRN-FENTSIEYIKRLLGVLP   38 (41)
T ss_dssp             HHHHHHHCT-SEEE-----SSB-HHHHHHCS-B-T
T ss_pred             HHHHHHcCCCcccCCCC-CCCCCHHHHHHHcCCCC
Confidence            344444445788  999 89999999999887654


No 30 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=68.83  E-value=13  Score=30.01  Aligned_cols=45  Identities=20%  Similarity=0.234  Sum_probs=30.9

Q ss_pred             HHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCC
Q psy4960         247 HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW  310 (341)
Q Consensus       247 ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSW  310 (341)
                      +++.|....||.+.+..        +   .     .....+|.|+|+||+.   .+..+|.+.|
T Consensus        70 ~~~~l~~~~Pvi~~~~~--------~---~-----~~~~~gH~vVv~g~~~---~~~~~i~DP~  114 (141)
T cd02549          70 LLRQLAAGHPVIVSVNL--------G---V-----SITPSGHAMVVIGYDR---KGNVYVNDPG  114 (141)
T ss_pred             HHHHHHCCCeEEEEEec--------C---c-----ccCCCCeEEEEEEEcC---CCCEEEECCC
Confidence            66888888999998764        1   1     0124799999999971   2335666665


No 31 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=66.04  E-value=10  Score=33.17  Aligned_cols=44  Identities=25%  Similarity=0.437  Sum_probs=32.9

Q ss_pred             HHHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCC
Q psy4960         246 DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG  311 (341)
Q Consensus       246 dik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG  311 (341)
                      ||+..|.++.||.+-...  |..                ..-|+|+|+||+    +.++..-++||
T Consensus       125 ~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyD----k~n~yynDpyG  168 (195)
T COG4990         125 DIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYD----KYNIYYNDPYG  168 (195)
T ss_pred             HHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeeccc----ccceEeccccc
Confidence            499999999999987642  211                357999999998    44666777775


No 32 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=64.98  E-value=11  Score=35.77  Aligned_cols=42  Identities=26%  Similarity=0.352  Sum_probs=34.2

Q ss_pred             CCCeEEEEEEEeecC--CeeEEEEEcCCCCC--C------------------------CCCcEEEEEeCC
Q psy4960         285 KLDHAVAIVGYGEKN--GILTWIVRNSWGDI--G------------------------PDHGYFQIERGA  326 (341)
Q Consensus       285 ~~~Hav~iVGyg~~~--g~~ywivkNSWG~~--W------------------------G~~GY~~i~r~~  326 (341)
                      ..+||-.|++.-.-+  +.+...|||-||..  |                        .++|-|||+..+
T Consensus       234 ~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~~~d  303 (315)
T cd00044         234 VKGHAYSVLDVREVQEEGLRLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMSFED  303 (315)
T ss_pred             ccCcceEEeEEEEEccCceEEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEEhHH
Confidence            479999999998755  88999999999952  2                        368999998764


No 33 
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=34.01  E-value=72  Score=30.39  Aligned_cols=28  Identities=29%  Similarity=0.458  Sum_probs=22.5

Q ss_pred             CCCeEEEEEEEeecCCee--EEEEEcCCCC
Q psy4960         285 KLDHAVAIVGYGEKNGIL--TWIVRNSWGD  312 (341)
Q Consensus       285 ~~~Hav~iVGyg~~~g~~--ywivkNSWG~  312 (341)
                      ..+||=.|++...-++.+  ...|||-||.
T Consensus       226 v~~HaYsVl~v~~~~~~~~~Ll~lrNPWg~  255 (318)
T smart00230      226 VKGHAYSVTDVREVQGRRQELLRLRNPWGQ  255 (318)
T ss_pred             ccCccEEEEEEEEEecCCeEEEEEECCCCC
Confidence            369999999998644444  9999999993


No 34 
>KOG4702|consensus
Probab=25.63  E-value=1.3e+02  Score=22.05  Aligned_cols=33  Identities=18%  Similarity=0.331  Sum_probs=24.9

Q ss_pred             HHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHh
Q psy4960          40 VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKE   73 (341)
Q Consensus        40 ~~~f~~f~~~~~K~Y~~~~e~~~R~~iF~~n~~~   73 (341)
                      -.-|++|+..|.+.-.++ |..+|..-|.+-++.
T Consensus        28 pe~Fee~v~~~krel~pp-e~~~~~EE~~~~lRe   60 (77)
T KOG4702|consen   28 PEIFEEFVRGYKRELSPP-EATKRKEEYENFLRE   60 (77)
T ss_pred             hHHHHHHHHhccccCCCh-HHHhhHHHHHHHHHH
Confidence            346899999999988766 666677777666654


No 35 
>PF01640 Peptidase_C10:  Peptidase C10 family classification.;  InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1). The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=24.67  E-value=2.6e+02  Score=24.34  Aligned_cols=48  Identities=25%  Similarity=0.326  Sum_probs=30.2

Q ss_pred             HHHHHHhcCCeEEEEeccccccCCCCcccCCCCCCCCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCC--CCcEEE
Q psy4960         247 HMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP--DHGYFQ  321 (341)
Q Consensus       247 ik~~l~~~gPv~v~~~~~~f~~y~~Gv~~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG--~~GY~~  321 (341)
                      |+..|.++.||...-.           -.         ..+||.+|=||.   ...||  .--||  ||  .+||++
T Consensus       143 i~~el~~~rPV~~~g~-----------~~---------~~GHawViDGy~---~~~~~--H~NwG--W~G~~nGyy~  192 (192)
T PF01640_consen  143 IRNELDNGRPVLYSGN-----------SK---------SGGHAWVIDGYD---SDGYF--HCNWG--WGGSSNGYYR  192 (192)
T ss_dssp             HHHHHHTT--EEEEEE-----------ET---------TEEEEEEEEEEE---SSSEE--EEE-S--STTTT-EEEE
T ss_pred             HHHHHHcCCCEEEEEe-----------cC---------CCCeEEEEcCcc---CCCeE--EEeeC--ccCCCCCccC
Confidence            8889998999987632           11         129999999996   33465  44455  55  569885


No 36 
>cd03527 RuBisCO_small Ribulose bisphosphate carboxylase/oxygenase (Rubisco), small subunit. Rubisco is a bifunctional enzyme that catalyzes the initial steps of two opposing metabolic pathways: photosynthetic carbon fixation and the competing process of photorespiration. Rubisco Form I, present in plants and green algae, is composed of eight large and eight small subunits. The nearly identical small subunits are encoded by a family of nuclear genes. After translation, the small subunits are translocated across the chloroplast membrane, where an N-terminal signal peptide is cleaved off. While the large subunits contain the catalytic activities, it has been shown that the small subunits are important for catalysis by enhancing the catalytic rate through inducing conformational changes in the large subunits.
Probab=21.55  E-value=73  Score=25.05  Aligned_cols=52  Identities=15%  Similarity=0.109  Sum_probs=31.2

Q ss_pred             HHHHHHhcCCeEEEEec-c-----ccccCCCCcccCCCC--------CCCCCCCCeEEEEEEEeec
Q psy4960         247 HMMHLLQSGPIGVYLNH-R-----LIESYDGNPIRRNDW--------ACNPHKLDHAVAIVGYGEK  298 (341)
Q Consensus       247 ik~~l~~~gPv~v~~~~-~-----~f~~y~~Gv~~~~~~--------~~~~~~~~Hav~iVGyg~~  298 (341)
                      |..+|.++--+.+.+.- .     .|..++-..|...+.        .|.....+|-|-|||+|..
T Consensus        21 I~yll~qG~~~~lE~ad~~~~~~~yW~mwklP~f~~~d~~~Vl~ei~~C~~~~p~~YVRliG~D~~   86 (99)
T cd03527          21 IDYIISNGWAPCLEFTEPEHYDNRYWTMWKLPMFGCTDPAQVLREIEACRKAYPDHYVRVVGFDNY   86 (99)
T ss_pred             HHHHHhCCCEEEEEcccCCCCCCCEEeeccCCCCCCCCHHHHHHHHHHHHHHCCCCeEEEEEEeCC
Confidence            77778777677777654 2     333333333322111        3545568999999999943


No 37 
>KOG4621|consensus
Probab=20.85  E-value=2.6e+02  Score=23.17  Aligned_cols=73  Identities=22%  Similarity=0.308  Sum_probs=40.6

Q ss_pred             HHHHHHHhcCCeEEEEec-c----ccc--cCCCCcccCCCCC--C-CCCCCCeEEEEEEEeecCCeeEEEEEcCCCCCCC
Q psy4960         246 DHMMHLLQSGPIGVYLNH-R----LIE--SYDGNPIRRNDWA--C-NPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGP  315 (341)
Q Consensus       246 dik~~l~~~gPv~v~~~~-~----~f~--~y~~Gv~~~~~~~--~-~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG  315 (341)
                      ||..+|+++.-|++.+-- .    ++-  -.+++.+.+..-.  | .+...+|-|+|-||+-  -.+-+.++|-   ...
T Consensus        61 dIqahLaqGnhiAIaLVdq~~Lhcdlceeplk~ccfspnghhcfcrtp~YqGHfiVi~GYd~--a~~c~~~ndP---A~a  135 (167)
T KOG4621|consen   61 DIQAHLAQGNHIAIALVDQDKLHCDLCEEPLKSCCFSPNGHHCFCRTPCYQGHFIVICGYDA--ARDCFEINDP---ASA  135 (167)
T ss_pred             HHHHHHhcCCeEEEEEecCCceehHHHHhHHHHhccCCCCccccccCCcccccEEEEecccc--ccCeEEEcCc---ccC
Confidence            688888865466655432 1    121  2345666653322  2 2335799999999983  3445666553   233


Q ss_pred             CCcEEEEE
Q psy4960         316 DHGYFQIE  323 (341)
Q Consensus       316 ~~GY~~i~  323 (341)
                      +-|--||+
T Consensus       136 dpg~c~~S  143 (167)
T KOG4621|consen  136 DPGHCRIS  143 (167)
T ss_pred             CCcceeeh
Confidence            34555554


Done!