Query         psy14862
Match_columns 263
No_of_seqs    212 out of 1464
Neff          6.9 
Searched_HMMs 46136
Date          Fri Aug 16 22:43:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14862.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14862hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0 5.2E-52 1.1E-56  375.5  12.3  180   75-260    67-329 (372)
  2 PTZ00203 cathepsin L protease; 100.0 3.1E-47 6.8E-52  354.2  15.8  145   74-221    33-178 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0 1.4E-45   3E-50  354.0  16.4  181   74-261   164-434 (489)
  4 PTZ00200 cysteine proteinase;  100.0 7.4E-44 1.6E-48  340.3  16.7  182   69-259   116-398 (448)
  5 KOG1543|consensus              100.0 5.1E-38 1.1E-42  290.2  13.3  166   83-259    30-281 (325)
  6 smart00645 Pept_C1 Papain fami  99.9 2.7E-26 5.9E-31  194.5   7.4   89  168-261     1-134 (174)
  7 cd02621 Peptidase_C1A_Cathepsi  99.9 2.1E-25 4.5E-30  198.2   7.1   53  168-221     1-63  (243)
  8 cd02698 Peptidase_C1A_Cathepsi  99.9 2.9E-25 6.2E-30  197.1   7.0   53  168-221     1-62  (239)
  9 cd02248 Peptidase_C1A Peptidas  99.9 1.9E-24 4.2E-29  186.5   7.4   88  169-261     1-172 (210)
 10 cd02620 Peptidase_C1A_Cathepsi  99.9 2.8E-24 6.1E-29  190.4   6.0   52  169-221     1-58  (236)
 11 PF00112 Peptidase_C1:  Papain   99.9 7.1E-23 1.5E-27  176.6   3.9   89  168-261     1-179 (219)
 12 PTZ00364 dipeptidyl-peptidase   99.9 2.5E-22 5.4E-27  195.6   6.8   56  165-221   202-269 (548)
 13 PTZ00049 cathepsin C-like prot  99.8 1.1E-21 2.4E-26  193.8   7.1   56  165-221   378-447 (693)
 14 cd02619 Peptidase_C1 C1 Peptid  99.8 1.4E-19   3E-24  156.3   6.0   49  171-221     1-51  (223)
 15 PF08246 Inhibitor_I29:  Cathep  99.7 5.6E-18 1.2E-22  118.5   7.1   58   79-136     1-58  (58)
 16 KOG1544|consensus               99.7 2.3E-17   5E-22  149.2   1.8  110  108-221   151-265 (470)
 17 PTZ00462 Serine-repeat antigen  99.6 1.8E-16 3.8E-21  161.3   5.0   41  180-221   544-584 (1004)
 18 smart00848 Inhibitor_I29 Cathe  99.6 1.2E-15 2.5E-20  105.8   5.6   57   79-135     1-57  (57)
 19 COG4870 Cysteine protease [Pos  99.2 6.8E-12 1.5E-16  115.8   1.9   41  166-207    97-137 (372)
 20 cd00585 Peptidase_C1B Peptidas  98.1 8.3E-06 1.8E-10   78.6   6.9   40  181-221    55-95  (437)
 21 PF08127 Propeptide_C1:  Peptid  95.5   0.012 2.7E-07   38.1   2.6   31  108-140     4-34  (41)
 22 PF03051 Peptidase_C1_2:  Pepti  93.0    0.07 1.5E-06   51.8   2.7   39  182-221    57-96  (438)
 23 PF05543 Peptidase_C47:  Stapho  70.2     3.9 8.5E-05   34.8   2.6   36  186-221    18-61  (175)
 24 cd00042 CY Substituted updates  64.5       9 0.00019   28.9   3.5   22   28-50     80-101 (105)
 25 COG3579 PepC Aminopeptidase C   63.7     6.1 0.00013   37.3   2.7   22  182-204    59-80  (444)
 26 smart00043 CY Cystatin-like do  44.7      12 0.00027   28.3   1.3   15   32-46     85-99  (107)
 27 PHA02088 hypothetical protein   29.0      22 0.00049   27.6   0.5   43   22-90      6-48  (125)
 28 KOG2735|consensus               26.6      38 0.00083   32.6   1.6   27  184-212   368-394 (466)
 29 PF00031 Cystatin:  Cystatin do  25.5      37 0.00081   24.8   1.1   17   26-42     78-94  (94)
 30 KOG3554|consensus               24.4 2.5E+02  0.0054   27.9   6.6   19   75-93    292-310 (693)
 31 PF13529 Peptidase_C39_2:  Pept  24.3      56  0.0012   24.9   2.0   14  246-259   122-135 (144)
 32 KOG4128|consensus               24.2      24 0.00052   33.3  -0.2   40  181-221    63-103 (457)
 33 KOG4702|consensus               20.8   2E+02  0.0044   20.9   4.0   32   77-109    29-60  (77)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=5.2e-52  Score=375.55  Aligned_cols=180  Identities=37%  Similarity=0.632  Sum_probs=155.6

Q ss_pred             HHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhhhhcccc
Q psy14862         75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST  154 (263)
Q Consensus        75 ~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~~~~~~~  154 (263)
                      ..++|..|+.+|+|+|.+.+|..+|+.||++|+..++++++.+.++..+|+|+|||||+|||++++++.+....   +..
T Consensus        67 ~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~---~~~  143 (372)
T KOG1542|consen   67 LEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGS---KLP  143 (372)
T ss_pred             hHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccc---cCc
Confidence            47799999999999999999999999999999999999999987899999999999999999999988765311   011


Q ss_pred             CcccccCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc-------------
Q psy14862        155 FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP-------------  221 (263)
Q Consensus       155 ~~~~~~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL-------------  221 (263)
                      .............+|++||||++|+||| |||||+||||||||+||+||++++|++|++++||||||             
T Consensus       144 ~~~~~~~~~~~~~lP~~fDWR~kgaVTp-VKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~GG  222 (372)
T KOG1542|consen  144 GDAAEAPIEPGESLPESFDWRDKGAVTP-VKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCNGG  222 (372)
T ss_pred             cccccCcCCCCCCCCcccchhccCCccc-cccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCCCC
Confidence            1111111133567999999999999999 99999999999999999999999999999999999999             


Q ss_pred             -------------------------c---------------------------------------------ccCCCCCCc
Q psy14862        222 -------------------------K---------------------------------------------THGRFYKGG  231 (263)
Q Consensus       222 -------------------------K---------------------------------------------~~f~~Y~~G  231 (263)
                                               |                                             ..||+|++|
T Consensus       223 l~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgG  302 (372)
T KOG1542|consen  223 LMDNAFKYIKKAGGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGG  302 (372)
T ss_pred             ChhHHHHHHHHhCCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhccc
Confidence                                     0                                             789999999


Q ss_pred             cccCCCCcCCCCCCCCCeEEEEEEEecCC
Q psy14862        232 VMNLPHMLCSKGPYSLNHAVLNVGYDNES  260 (263)
Q Consensus       232 V~~~~~c~~~~~~~~~nHaVliVGYG~~~  260 (263)
                      |+.+..-.|+  +..+||||||||||...
T Consensus       303 V~~P~~~~Cs--~~~~~HaVLlvGyG~~g  329 (372)
T KOG1542|consen  303 VSCPSKYICS--PKLLNHAVLLVGYGSSG  329 (372)
T ss_pred             ccCCCcccCC--ccccCceEEEEeecCCC
Confidence            9999444447  77799999999999875


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=3.1e-47  Score=354.16  Aligned_cols=145  Identities=28%  Similarity=0.519  Sum_probs=118.5

Q ss_pred             HHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCch-hhhhhcc
Q psy14862         74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW-EQIENLK  152 (263)
Q Consensus        74 ~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~-~~~~~~~  152 (263)
                      .+..+|++|+++|+|.|.+.+|+.+|+.||++|+++|++||+++ .+|++|+|+|+|||+|||.+++++... ...+ ..
T Consensus        33 ~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~-~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~-~~  110 (348)
T PTZ00203         33 PAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARN-PHARFGITKFFDLSEAEFAARYLNGAAYFAAA-KQ  110 (348)
T ss_pred             HHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccC-CCeEEeccccccCCHHHHHHHhcCCCcccccc-cc
Confidence            36678999999999999998899999999999999999999875 699999999999999999987653211 1000 00


Q ss_pred             ccCcccccCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc
Q psy14862        153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP  221 (263)
Q Consensus       153 ~~~~~~~~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL  221 (263)
                      ..............++|++||||++|+|+| |||||.||||||||++++||++++|++|+++.||+|||
T Consensus       111 ~~~~~~~~~~~~~~~lP~~~DWR~~g~Vtp-VkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqL  178 (348)
T PTZ00203        111 HAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQL  178 (348)
T ss_pred             cccccccccccccccCCCCCcCCcCCCCCC-ccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHH
Confidence            000111111112236899999999999999 99999999999999999999999999999999999999


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=1.4e-45  Score=354.03  Aligned_cols=181  Identities=34%  Similarity=0.514  Sum_probs=146.9

Q ss_pred             HHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchh-hhhhcc
Q psy14862         74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLK  152 (263)
Q Consensus        74 ~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~-~~~~~~  152 (263)
                      +....|++|+.+|+|+|.+.+|+.+|+.||++|+++|++||+.++.+|++|+|+|+|||.|||++++++.... ......
T Consensus       164 e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~~~  243 (489)
T PTZ00021        164 ENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSNGK  243 (489)
T ss_pred             HHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccccccc
Confidence            3557899999999999999999999999999999999999987667999999999999999999887764321 000000


Q ss_pred             ccC--ccc----c-cCCCCCCCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc----
Q psy14862        153 STF--ETY----S-FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP----  221 (263)
Q Consensus       153 ~~~--~~~----~-~~~~~~~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL----  221 (263)
                      ...  ...    . .........|++||||++|+||| |||||.||||||||++++||++++|++|+++.||+|||    
T Consensus       244 ~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~g~Vtp-VKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVDCs  322 (489)
T PTZ00021        244 KSPRVINYDDVIKKYKPKDATFDHAKYDWRLHNGVTP-VKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCS  322 (489)
T ss_pred             cccccccccccccccccccccCCccccccccCCCCCC-cccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhhhc
Confidence            000  000    0 00001112499999999999999 99999999999999999999999999999999999999    


Q ss_pred             --------------------------------------------------------------c----------------c
Q psy14862        222 --------------------------------------------------------------K----------------T  223 (263)
Q Consensus       222 --------------------------------------------------------------K----------------~  223 (263)
                                                                                    |                .
T Consensus       323 ~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~  402 (489)
T PTZ00021        323 FKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSD  402 (489)
T ss_pred             cCCCCCCCcchHhhhhhhhhccccCcccccCccCCCCCccccccccccceeeeEEEecHHHHHHHHHhcCCeEEEEEeec
Confidence                                                                          0                5


Q ss_pred             cCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecCCC
Q psy14862        224 HGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNEST  261 (263)
Q Consensus       224 ~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~~g  261 (263)
                      +|++|++|||+++ |     ...+||||+|||||.++|
T Consensus       403 ~f~~YkgGIy~~~-C-----~~~~nHAVlIVGYG~e~~  434 (489)
T PTZ00021        403 DFAFYKGGIFDGE-C-----GEEPNHAVILVGYGMEEI  434 (489)
T ss_pred             ccccCCCCcCCCC-C-----CCccceEEEEEEecCcCC
Confidence            8999999999874 7     445899999999998753


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=7.4e-44  Score=340.30  Aligned_cols=182  Identities=30%  Similarity=0.421  Sum_probs=146.0

Q ss_pred             chhHhHHHHHHHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhh
Q psy14862         69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQI  148 (263)
Q Consensus        69 l~~~~~~~~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~  148 (263)
                      ...+.++..+|++|+++|+|.|.+.+|+.+|+.||++|+++|++||..  .+|++|+|+|+|||+|||.+++++...+..
T Consensus       116 ~~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~--~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~  193 (448)
T PTZ00200        116 PKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD--EPYSKEINKFSDLTEEEFRKLFPVIKVPPK  193 (448)
T ss_pred             ccchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc--CCeEEeccccccCCHHHHHHHhccCCCccc
Confidence            334455888999999999999999999999999999999999999964  589999999999999999987765432100


Q ss_pred             hhc--cc--------cCccc----ccC-----CC--CCCCCCcccccccCCCCCCcccCCC-CCchhHHHHhhhHHHHHH
Q psy14862        149 ENL--KS--------TFETY----SFN-----SS--NSYGLAESINYKDKGKVLPKVQDQH-LCGSCWAHSAVACLESAY  206 (263)
Q Consensus       149 ~~~--~~--------~~~~~----~~~-----~~--~~~~lP~s~DWR~~g~Vtp~VknQg-~CGSCWAFat~~~iEs~~  206 (263)
                      ...  ..        ....+    +..     ..  ....+|++||||+.|+|+| ||||| .||||||||++++||+++
T Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtp-VkdQG~~CGSCWAFat~~aiEs~~  272 (448)
T PTZ00200        194 SNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTK-VKDQGLNCGSCWAFSSVGSVESLY  272 (448)
T ss_pred             ccccccccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCC-cccCCCccchHHHHhHHHHHHHHH
Confidence            000  00        00000    000     00  0123699999999999999 99999 999999999999999999


Q ss_pred             HHHcCcceecCCCCc----------------------------------------------------------------c
Q psy14862        207 AIKHNELIELSKQPP----------------------------------------------------------------K  222 (263)
Q Consensus       207 ~I~tg~~~~LSeQqL----------------------------------------------------------------K  222 (263)
                      +|++++++.||+|||                                                                +
T Consensus       273 ~i~~~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~Gi~~e~~YPY~~~~~~C~~~~~~~~~i~~y~~~~~~~~l~  352 (448)
T PTZ00200        273 KIYRDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDVPYLAKDGKCVVSSTKKVYIDSYLVAKGKDVLN  352 (448)
T ss_pred             HHhcCCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhcCccccccCCCCCCCCCCcCCCCCeeEecceEecCHHHHHH
Confidence            999999999999999                                                                0


Q ss_pred             ---------------ccCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecC
Q psy14862        223 ---------------THGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE  259 (263)
Q Consensus       223 ---------------~~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~  259 (263)
                                     .+|++|++|||+++ |     +..+||||+|||||.+
T Consensus       353 ~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~-C-----~~~~nHaV~lVGyG~d  398 (448)
T PTZ00200        353 KSLVISPTVVYIAVSRELLKYKSGVYNGE-C-----GKSLNHAVLLVGEGYD  398 (448)
T ss_pred             HHHhcCCEEEEeecccccccCCCCccccc-c-----CCCCcEEEEEEEeccc
Confidence                           68999999999874 7     3458999999999964


No 5  
>KOG1543|consensus
Probab=100.00  E-value=5.1e-38  Score=290.19  Aligned_cols=166  Identities=33%  Similarity=0.515  Sum_probs=139.5

Q ss_pred             HHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHHhcccCCCchhhhhhccccCcccccCC
Q psy14862         83 VREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNS  162 (263)
Q Consensus        83 ~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~  162 (263)
                      +.+|.+.|.+..|...|+.+|.+|++.|+.||.....+|++|+|+|+|++.+||++.+.+..++..     ...... ..
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~-----~~~~~~-~~  103 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEI-----KRDKFT-EK  103 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccc-----cccccc-cc
Confidence            677777887778999999999999999999999856899999999999999999998777654421     001111 11


Q ss_pred             CCCCCCCcccccccCC-CCCCcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc-------------------
Q psy14862        163 SNSYGLAESINYKDKG-KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP-------------------  221 (263)
Q Consensus       163 ~~~~~lP~s~DWR~~g-~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL-------------------  221 (263)
                      ....++|++||||++| .++| |||||.||||||||++++||++++|++| .+++||+|||                   
T Consensus       104 ~~~~~~p~s~DwR~~~~~~~~-vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~~~~A  182 (325)
T KOG1543|consen  104 LDGDDLPDSFDWRDKGAVTPP-VKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGEPKNA  182 (325)
T ss_pred             cchhhCCCCccccccCCcCCC-cCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCCHHHH
Confidence            2245799999999996 5666 9999999999999999999999999999 8999999999                   


Q ss_pred             -----------------------------------------------------------------cccCCCCCCccccCC
Q psy14862        222 -----------------------------------------------------------------KTHGRFYKGGVMNLP  236 (263)
Q Consensus       222 -----------------------------------------------------------------K~~f~~Y~~GV~~~~  236 (263)
                                                                                       ..+|++|++|||.++
T Consensus       183 ~~yi~~~G~~t~~~~Ypy~~~~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~  262 (325)
T KOG1543|consen  183 FKYIKKNGGVTECENYPYIGKDGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEE  262 (325)
T ss_pred             HHHHHHhCCCCCCcCCCCcCCCCCccCCCccceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCC
Confidence                                                                             078999999999999


Q ss_pred             CCcCCCCCCCCCeEEEEEEEecC
Q psy14862        237 HMLCSKGPYSLNHAVLNVGYDNE  259 (263)
Q Consensus       237 ~c~~~~~~~~~nHaVliVGYG~~  259 (263)
                      .|.+    ..++|||+|||||+.
T Consensus       263 ~~~~----~~~~Hav~iVGyG~~  281 (325)
T KOG1543|consen  263 KGDD----KEGDHAVLIVGYGTG  281 (325)
T ss_pred             CCCC----CCCCceEEEEEEcCC
Confidence            8822    269999999999993


No 6  
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.93  E-value=2.7e-26  Score=194.54  Aligned_cols=89  Identities=47%  Similarity=0.809  Sum_probs=81.1

Q ss_pred             CCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc-----------------------c--
Q psy14862        168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP-----------------------K--  222 (263)
Q Consensus       168 lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL-----------------------K--  222 (263)
                      ||++||||+.++++| |+|||.||+|||||++++||++++|++++.+.||+|+|                       +  
T Consensus         1 lP~~~D~R~~~~~~~-v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~   79 (174)
T smart00645        1 LPESFDWRKKGAVTP-VKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKN   79 (174)
T ss_pred             CCCcCcccccCCCCc-cccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHc
Confidence            699999999999999 99999999999999999999999999999999999999                       0  


Q ss_pred             -------------------ccCCCCCCccccCCCCcCCCCCCCCCeEEEEEEEecC-CC
Q psy14862        223 -------------------THGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE-ST  261 (263)
Q Consensus       223 -------------------~~f~~Y~~GV~~~~~c~~~~~~~~~nHaVliVGYG~~-~g  261 (263)
                                         .+|++|++|||+.+.|  .  ...++|+|+|||||.+ +|
T Consensus        80 ~Gi~~e~~~PY~~~~~~~~~~f~~Y~~Gi~~~~~~--~--~~~~~Hav~ivGyg~~~~g  134 (174)
T smart00645       80 GGLETESCYPYTGSVAIDASDFQFYKSGIYDHPGC--G--SGTLDHAVLIVGYGTEENG  134 (174)
T ss_pred             CCcccccccCcccEEEEEcccccCCcCeEECCCCC--C--CCcccEEEEEEEEeecCCC
Confidence                               5799999999998767  4  3447999999999987 55


No 7  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.92  E-value=2.1e-25  Score=198.19  Aligned_cols=53  Identities=38%  Similarity=0.528  Sum_probs=50.9

Q ss_pred             CCcccccccCC----CCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCc------ceecCCCCc
Q psy14862        168 LAESINYKDKG----KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE------LIELSKQPP  221 (263)
Q Consensus       168 lP~s~DWR~~g----~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~------~~~LSeQqL  221 (263)
                      ||++||||+.+    +|+| |+|||.||||||||++++||++++|++++      .+.||+|||
T Consensus         1 lP~~fDwr~~~~~~~~v~~-v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l   63 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSP-VRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHV   63 (243)
T ss_pred             CCCcccccccCCCCccccc-CCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHh
Confidence            69999999998    9999 99999999999999999999999999876      789999999


No 8  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.91  E-value=2.9e-25  Score=197.08  Aligned_cols=53  Identities=34%  Similarity=0.641  Sum_probs=49.5

Q ss_pred             CCcccccccCC---CCCCcccCCC---CCchhHHHHhhhHHHHHHHHHcC---cceecCCCCc
Q psy14862        168 LAESINYKDKG---KVLPKVQDQH---LCGSCWAHSAVACLESAYAIKHN---ELIELSKQPP  221 (263)
Q Consensus       168 lP~s~DWR~~g---~Vtp~VknQg---~CGSCWAFat~~~iEs~~~I~tg---~~~~LSeQqL  221 (263)
                      ||++||||+.+   +|+| |||||   .||||||||++++||++++|+++   ..+.||+|||
T Consensus         1 lP~~~Dwr~~~~~~~v~~-vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~l   62 (239)
T cd02698           1 LPKSWDWRNVNGVNYVSP-TRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVV   62 (239)
T ss_pred             CCCCcccccCCCCcccCc-cccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHH
Confidence            69999999988   9999 99998   89999999999999999999875   3678999999


No 9  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.90  E-value=1.9e-24  Score=186.49  Aligned_cols=88  Identities=52%  Similarity=0.862  Sum_probs=80.8

Q ss_pred             CcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc---------------------------
Q psy14862        169 AESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP---------------------------  221 (263)
Q Consensus       169 P~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL---------------------------  221 (263)
                      |++||||+.+.++| |+|||.||+|||||+++++|++++|+++..+.||+|+|                           
T Consensus         1 P~~~d~r~~~~~~~-v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~G   79 (210)
T cd02248           1 PESVDWREKGAVTP-VKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNGG   79 (210)
T ss_pred             CCcccCCcCCCCCC-CccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHCC
Confidence            88999999999999 99999999999999999999999999999999999999                           


Q ss_pred             -----------------------------------------c----------------ccCCCCCCccccCCCCcCCCCC
Q psy14862        222 -----------------------------------------K----------------THGRFYKGGVMNLPHMLCSKGP  244 (263)
Q Consensus       222 -----------------------------------------K----------------~~f~~Y~~GV~~~~~c~~~~~~  244 (263)
                                                               |                ++|+.|++|||..+.|  +  .
T Consensus        80 i~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~--~--~  155 (210)
T cd02248          80 LASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCC--S--N  155 (210)
T ss_pred             cCccccCCccCCCCCccCCCCcccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCC--C--C
Confidence                                                     0                6999999999999866  3  4


Q ss_pred             CCCCeEEEEEEEecCCC
Q psy14862        245 YSLNHAVLNVGYDNEST  261 (263)
Q Consensus       245 ~~~nHaVliVGYG~~~g  261 (263)
                      ...+|||+|||||.++|
T Consensus       156 ~~~~Hav~iVGy~~~~~  172 (210)
T cd02248         156 TNLNHAVLLVGYGTENG  172 (210)
T ss_pred             CcCCEEEEEEEEeecCC
Confidence            56799999999998764


No 10 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.90  E-value=2.8e-24  Score=190.41  Aligned_cols=52  Identities=38%  Similarity=0.603  Sum_probs=48.2

Q ss_pred             CcccccccC--CCC--CCcccCCCCCchhHHHHhhhHHHHHHHHHcC--cceecCCCCc
Q psy14862        169 AESINYKDK--GKV--LPKVQDQHLCGSCWAHSAVACLESAYAIKHN--ELIELSKQPP  221 (263)
Q Consensus       169 P~s~DWR~~--g~V--tp~VknQg~CGSCWAFat~~~iEs~~~I~tg--~~~~LSeQqL  221 (263)
                      |++||||++  +++  +| |+|||.||||||||++++||++++|+++  +.+.||+|||
T Consensus         1 p~~~DwR~~~~~~~~v~~-v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~l   58 (236)
T cd02620           1 PESFDAREKWPNCISIGE-IRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDL   58 (236)
T ss_pred             CCcccchhhCCCCCCccc-cCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHH
Confidence            899999997  554  59 9999999999999999999999999998  7889999999


No 11 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.86  E-value=7.1e-23  Score=176.59  Aligned_cols=89  Identities=46%  Similarity=0.747  Sum_probs=77.2

Q ss_pred             CCcccccccC-CCCCCcccCCCCCchhHHHHhhhHHHHHHHHHc-CcceecCCCCc------------------------
Q psy14862        168 LAESINYKDK-GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKH-NELIELSKQPP------------------------  221 (263)
Q Consensus       168 lP~s~DWR~~-g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~t-g~~~~LSeQqL------------------------  221 (263)
                      ||++||||+. +.++| |+|||.||+|||||+++++|++++++. +..+.||+|+|                        
T Consensus         1 lP~~~D~r~~~~~~~~-v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~   79 (219)
T PF00112_consen    1 LPKSFDWRDKGGRITP-VRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIK   79 (219)
T ss_dssp             STSSEEGGGTTTCSG----BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHH
T ss_pred             CCCCEecccCCCCcCc-cccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeec
Confidence            7999999998 48999 999999999999999999999999999 78999999999                        


Q ss_pred             -----------------------------------------------c----------------c-cCCCCCCccccCCC
Q psy14862        222 -----------------------------------------------K----------------T-HGRFYKGGVMNLPH  237 (263)
Q Consensus       222 -----------------------------------------------K----------------~-~f~~Y~~GV~~~~~  237 (263)
                                                                     |                . +|+.|++|||..+.
T Consensus        80 ~~~Gi~~e~~~pY~~~~~~~c~~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~  159 (219)
T PF00112_consen   80 NNNGIVTEEDYPYNGNENPTCKSKKSNSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPD  159 (219)
T ss_dssp             HHTSBEBTTTS--SSSSSCSSCHSGGGEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTS
T ss_pred             ccCcccccccccccccccccccccccccccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeeccc
Confidence                                                           0                5 69999999999997


Q ss_pred             CcCCCCCCCCCeEEEEEEEecCCC
Q psy14862        238 MLCSKGPYSLNHAVLNVGYDNEST  261 (263)
Q Consensus       238 c~~~~~~~~~nHaVliVGYG~~~g  261 (263)
                      |  .  ....+|||+|||||.+.|
T Consensus       160 ~--~--~~~~~Hav~iVGy~~~~~  179 (219)
T PF00112_consen  160 C--S--NESGGHAVLIVGYDDENG  179 (219)
T ss_dssp             S--S--SSSEEEEEEEEEEEEETT
T ss_pred             c--c--cccccccccccccccccc
Confidence            7  4  467899999999998854


No 12 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.86  E-value=2.5e-22  Score=195.64  Aligned_cols=56  Identities=16%  Similarity=0.196  Sum_probs=51.7

Q ss_pred             CCCCCcccccccCC---CCCCcccCCCC---CchhHHHHhhhHHHHHHHHHcC------cceecCCCCc
Q psy14862        165 SYGLAESINYKDKG---KVLPKVQDQHL---CGSCWAHSAVACLESAYAIKHN------ELIELSKQPP  221 (263)
Q Consensus       165 ~~~lP~s~DWR~~g---~Vtp~VknQg~---CGSCWAFat~~~iEs~~~I~tg------~~~~LSeQqL  221 (263)
                      ..++|++||||++|   +|+| |||||.   ||||||||++++||++++|+++      +.+.||+|||
T Consensus       202 ~~~LP~sfDWR~~gg~~~Vtp-VrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqL  269 (548)
T PTZ00364        202 GDPPPAAWSWGDVGGASFLPA-APPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHV  269 (548)
T ss_pred             ccCCCCccccCcCCCCccCCC-CcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHH
Confidence            35799999999997   7999 999999   9999999999999999999984      4688999999


No 13 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.85  E-value=1.1e-21  Score=193.75  Aligned_cols=56  Identities=25%  Similarity=0.351  Sum_probs=49.2

Q ss_pred             CCCCCcccccccC----CCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcCc-----c-----eecCCCCc
Q psy14862        165 SYGLAESINYKDK----GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE-----L-----IELSKQPP  221 (263)
Q Consensus       165 ~~~lP~s~DWR~~----g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~-----~-----~~LSeQqL  221 (263)
                      ..+||++||||+.    ++|+| |+|||.||||||||++++||++++|++++     .     ..||+|||
T Consensus       378 ~~~LP~sfDWRd~~~~~~~vtp-VkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqL  447 (693)
T PTZ00049        378 IDELPKNFTWGDPFNNNTREYD-VTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTV  447 (693)
T ss_pred             cccCCCCEecCcCCCCCCcccC-CCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHh
Confidence            3579999999985    68999 99999999999999999999999999743     1     26899998


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.78  E-value=1.4e-19  Score=156.27  Aligned_cols=49  Identities=41%  Similarity=0.639  Sum_probs=47.0

Q ss_pred             ccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHHHHcC--cceecCCCCc
Q psy14862        171 SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN--ELIELSKQPP  221 (263)
Q Consensus       171 s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg--~~~~LSeQqL  221 (263)
                      ++|||+.+ ++| |+|||.||+|||||+++++|++++++++  +.+.||+|+|
T Consensus         1 ~~d~r~~~-~~~-v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l   51 (223)
T cd02619           1 SVDLRPLR-LTP-VKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYL   51 (223)
T ss_pred             CCcchhcC-CCC-cccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHH
Confidence            48999998 999 9999999999999999999999999998  8899999999


No 15 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.74  E-value=5.6e-18  Score=118.54  Aligned_cols=58  Identities=40%  Similarity=0.734  Sum_probs=51.8

Q ss_pred             HHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHHH
Q psy14862         79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF  136 (263)
Q Consensus        79 F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eEf  136 (263)
                      |++|+++|+|.|.+.+|+.+|+.+|++|++.|++||+.+..+|++|+|+|||||.+||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence            8899999999999999999999999999999999997666899999999999999998


No 16 
>KOG1544|consensus
Probab=99.65  E-value=2.3e-17  Score=149.16  Aligned_cols=110  Identities=18%  Similarity=0.055  Sum_probs=81.4

Q ss_pred             HHHHHhccCCCCceEEEe-ccCCCCChHHHhcccCCCchhhhhhccccCcccccCCCCCCCCCcccccccC--CCCCCcc
Q psy14862        108 KTIDYYTKHEQGTATYGV-NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK--GKVLPKV  184 (263)
Q Consensus       108 ~~I~~~N~~~~~s~~~gi-N~FsDlT~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~s~DWR~~--g~Vtp~V  184 (263)
                      .+|++.|+.. .+|+++- .+|-.||.+.=.+..+|..++...  ......+........+||+.||.|++  +.+.| |
T Consensus       151 d~iE~in~G~-YgW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~s--v~nMNEi~~~l~p~~~LPE~F~As~KWp~liH~-p  226 (470)
T KOG1544|consen  151 DMIEAINQGN-YGWQAGNYSAFWGMTLDDGIKYRLGTLRPSSS--VMNMNEIYTVLNPGEVLPEAFEASEKWPNLIHE-P  226 (470)
T ss_pred             HHHHHHhcCC-ccccccchhhhhcccccccceeeecccCchhh--hhhHHhHhhccCcccccchhhhhhhcCCccccC-c
Confidence            3678888664 6777754 489999998877766676654331  11111111111223679999999998  88999 9


Q ss_pred             cCCCCCchhHHHHhhhHHHHHHHHHcCc--ceecCCCCc
Q psy14862        185 QDQHLCGSCWAHSAVACLESAYAIKHNE--LIELSKQPP  221 (263)
Q Consensus       185 knQg~CGSCWAFat~~~iEs~~~I~tg~--~~~LSeQqL  221 (263)
                      -|||+||++|||||+++..++++|.+..  ...||+|||
T Consensus       227 lDQgnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnL  265 (470)
T KOG1544|consen  227 LDQGNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNL  265 (470)
T ss_pred             cccCCcccceeeeeehhccceeEEeeccccccccChHHh
Confidence            9999999999999999999999998743  467999999


No 17 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.63  E-value=1.8e-16  Score=161.32  Aligned_cols=41  Identities=22%  Similarity=0.289  Sum_probs=39.5

Q ss_pred             CCCcccCCCCCchhHHHHhhhHHHHHHHHHcCcceecCCCCc
Q psy14862        180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP  221 (263)
Q Consensus       180 Vtp~VknQg~CGSCWAFat~~~iEs~~~I~tg~~~~LSeQqL  221 (263)
                      ..| |+|||.||||||||+++++|++++|++++.+.||+|||
T Consensus       544 ~i~-VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqL  584 (1004)
T PTZ00462        544 KIQ-IEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYI  584 (1004)
T ss_pred             CCC-cccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHH
Confidence            578 99999999999999999999999999999999999999


No 18 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.60  E-value=1.2e-15  Score=105.82  Aligned_cols=57  Identities=40%  Similarity=0.738  Sum_probs=54.0

Q ss_pred             HHHHHHHhCCccCChHHHHHHHHHHHHHHHHHHHhccCCCCceEEEeccCCCCChHH
Q psy14862         79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSE  135 (263)
Q Consensus        79 F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~I~~~N~~~~~s~~~giN~FsDlT~eE  135 (263)
                      |++|+.+|+|.|.+.+|...|+.+|++|++.|+.||+.+..+|++|+|+|||||++|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence            688999999999999999999999999999999999876689999999999999986


No 19 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.18  E-value=6.8e-12  Score=115.78  Aligned_cols=41  Identities=41%  Similarity=0.664  Sum_probs=38.1

Q ss_pred             CCCCcccccccCCCCCCcccCCCCCchhHHHHhhhHHHHHHH
Q psy14862        166 YGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA  207 (263)
Q Consensus       166 ~~lP~s~DWR~~g~Vtp~VknQg~CGSCWAFat~~~iEs~~~  207 (263)
                      ..+|+.||||+.|-|+| ||+||.||+||||++++++|+...
T Consensus        97 ~s~~~~fd~r~~g~vs~-v~dQg~~Gscwaf~t~~sles~l~  137 (372)
T COG4870          97 ASLPSYFDRRDEGKVSP-VKDQGSGGSCWAFATTRSLESYLN  137 (372)
T ss_pred             ccchhheeeeccCCccc-ccccCcccceEeeeehhhhhheec
Confidence            34899999999999999 999999999999999999998654


No 20 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=98.06  E-value=8.3e-06  Score=78.56  Aligned_cols=40  Identities=28%  Similarity=0.490  Sum_probs=35.0

Q ss_pred             CCcccCCCCCchhHHHHhhhHHHHHHHHH-cCcceecCCCCc
Q psy14862        181 LPKVQDQHLCGSCWAHSAVACLESAYAIK-HNELIELSKQPP  221 (263)
Q Consensus       181 tp~VknQg~CGSCWAFat~~~iEs~~~I~-tg~~~~LSeQqL  221 (263)
                      .| |+||+.-|-||.||+..+++.....+ +.+.+.||+--|
T Consensus        55 ~~-vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl   95 (437)
T cd00585          55 EP-VTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYL   95 (437)
T ss_pred             CC-cccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHH
Confidence            38 99999999999999999999988875 456788998877


No 21 
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=95.55  E-value=0.012  Score=38.13  Aligned_cols=31  Identities=26%  Similarity=0.288  Sum_probs=21.6

Q ss_pred             HHHHHhccCCCCceEEEeccCCCCChHHHhccc
Q psy14862        108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGL  140 (263)
Q Consensus       108 ~~I~~~N~~~~~s~~~giN~FsDlT~eEf~~~~  140 (263)
                      ++|+..|+. +.+|++|.| |.+.|.+.+++++
T Consensus         4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~Ll   34 (41)
T PF08127_consen    4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLL   34 (41)
T ss_dssp             HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS
T ss_pred             HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHc
Confidence            578999998 589999999 8999999998864


No 22 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=93.01  E-value=0.07  Score=51.75  Aligned_cols=39  Identities=33%  Similarity=0.521  Sum_probs=28.0

Q ss_pred             CcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc
Q psy14862        182 PKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP  221 (263)
Q Consensus       182 p~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL  221 (263)
                      | |+||+.-|-||.||+..+++..+..+.+ +...||+--|
T Consensus        57 ~-vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl   96 (438)
T PF03051_consen   57 P-VTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYL   96 (438)
T ss_dssp             S---B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHH
T ss_pred             C-CCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHH
Confidence            8 9999999999999999999999888876 6677777665


No 23 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=70.20  E-value=3.9  Score=34.81  Aligned_cols=36  Identities=17%  Similarity=0.083  Sum_probs=21.5

Q ss_pred             CCCCCchhHHHHhhhHHHHHHHHH--------cCcceecCCCCc
Q psy14862        186 DQHLCGSCWAHSAVACLESAYAIK--------HNELIELSKQPP  221 (263)
Q Consensus       186 nQg~CGSCWAFat~~~iEs~~~I~--------tg~~~~LSeQqL  221 (263)
                      .||.=+=|-+|+.++.|-...-.+        ..--..+|+|||
T Consensus        18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l   61 (175)
T PF05543_consen   18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQL   61 (175)
T ss_dssp             --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCH
T ss_pred             ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHH
Confidence            468778899999999887653211        111356899999


No 24 
>cd00042 CY Substituted updates: Jan 30, 2002
Probab=64.55  E-value=9  Score=28.93  Aligned_cols=22  Identities=18%  Similarity=0.351  Sum_probs=17.1

Q ss_pred             hccceeEeeecccccccchhhhh
Q psy14862         28 ASGCVHTYLGWHPRWTGRVHNLI   50 (263)
Q Consensus        28 ~~~~~~~~~~~~~~w~~~~~~l~   50 (263)
                      ...-.|.++||..||++. ..++
T Consensus        80 ~~~~~C~~~V~~~pw~~~-~~l~  101 (105)
T cd00042          80 GKKKFCTAKVWEKPWENF-KELL  101 (105)
T ss_pred             CCCEEEEEEEEecCCCCc-eeee
Confidence            456789999999999977 4444


No 25 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=63.66  E-value=6.1  Score=37.25  Aligned_cols=22  Identities=27%  Similarity=0.679  Sum_probs=19.4

Q ss_pred             CcccCCCCCchhHHHHhhhHHHH
Q psy14862        182 PKVQDQHLCGSCWAHSAVACLES  204 (263)
Q Consensus       182 p~VknQg~CGSCWAFat~~~iEs  204 (263)
                      | |-||..-|-||.||+...+--
T Consensus        59 ~-vtNQk~SGRCWmFAAlNtfRh   80 (444)
T COG3579          59 K-VTNQKQSGRCWMFAALNTFRH   80 (444)
T ss_pred             c-cccccccceehHHHHHHHHHH
Confidence            7 999999999999999887643


No 26 
>smart00043 CY Cystatin-like domain. Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as  kininogen, His-rich glycoprotein and fetuin also contain these domains.
Probab=44.68  E-value=12  Score=28.27  Aligned_cols=15  Identities=20%  Similarity=0.603  Sum_probs=13.2

Q ss_pred             eeEeeecccccccch
Q psy14862         32 VHTYLGWHPRWTGRV   46 (263)
Q Consensus        32 ~~~~~~~~~~w~~~~   46 (263)
                      .|.+.||..||+...
T Consensus        85 ~C~~~V~~~pw~~~~   99 (107)
T smart00043       85 FCTAKVWEKPWENKI   99 (107)
T ss_pred             EEEEEEEecCCCCcc
Confidence            799999999999773


No 27 
>PHA02088 hypothetical protein
Probab=29.02  E-value=22  Score=27.56  Aligned_cols=43  Identities=9%  Similarity=0.288  Sum_probs=30.3

Q ss_pred             eechhhhccceeEeeecccccccchhhhhhhccCCCCCCccccccccchhHhHHHHHHHHHHHHhCCcc
Q psy14862         22 QYEAEWASGCVHTYLGWHPRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQY   90 (263)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~w~~~~~~l~~~~~~~~~~~~~~~~~~~l~~~~~~~~~F~~f~~~y~K~Y   90 (263)
                      ..-...+.-..|-|+.||..|..+-.                          .-...-..|.++|+-+.
T Consensus         6 fi~~~~~~liac~~s~wd~~w~~k~~--------------------------~e~~~~n~wl~ky~~sm   48 (125)
T PHA02088          6 FIAGLVLGLIACCFSTWDSRWRKKAR--------------------------NEQAKINTWLAKYGISM   48 (125)
T ss_pred             EeHHHHHHHHHhhhHHhhHHHHHHHH--------------------------HHHHHHHHHHHHhceEE
Confidence            34445666678999999999998831                          12345678999998543


No 28 
>KOG2735|consensus
Probab=26.59  E-value=38  Score=32.60  Aligned_cols=27  Identities=26%  Similarity=0.456  Sum_probs=22.4

Q ss_pred             ccCCCCCchhHHHHhhhHHHHHHHHHcCc
Q psy14862        184 VQDQHLCGSCWAHSAVACLESAYAIKHNE  212 (263)
Q Consensus       184 VknQg~CGSCWAFat~~~iEs~~~I~tg~  212 (263)
                      +|--|  --||.|.++.++|...|||.|.
T Consensus       368 ~KrvG--~qcWv~~aI~~~El~IciKfg~  394 (466)
T KOG2735|consen  368 CKRVG--MQCWVFLAICALELLICIKFGS  394 (466)
T ss_pred             hhhcc--hhHHHHHHHHHHHhhhheeeCC
Confidence            44444  2699999999999999999986


No 29 
>PF00031 Cystatin:  Cystatin domain;  InterPro: IPR000010 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  The cystatins are cysteine proteinase inhibitors belonging to MEROPS inhibitor family I25, clan IH [, , ]. They mainly inhibit peptidases belonging to peptidase families C1 (papain family) and C13 (legumain family). The cystatin family includes:   The Type 1 cystatins, which are intracellular cystatins that are present in the cytosol of many cell types, but can also appear in body fluids at significant concentrations. They are single-chain polypeptides of about 100 residues, which have neither disulphide bonds nor carbohydrate side chains.  The Type 2 cystatins, which are mainly extracellular secreted polypeptides synthesised with a 19-28 residue signal peptide. They are broadly distributed and found in most body fluids.  The Type 3 cystatins, which are multidomain proteins. The mammalian representatives of this group are the kininogens. There are three different kininogens in mammals: H- (high molecular mass, IPR002395 from INTERPRO) and L- (low molecular mass) kininogen which are found in a number of species, and T-kininogen that is found only in rat.  Unclassified cystatins. These are cystatin-like proteins found in a range of organisms: plant phytocystatins, fetuin in mammals, insect cystatins and a puff adder venom cystatin which inhibits metalloproteases of the MEROPS peptidase family M12 (astacin/adamalysin). Also a number of the cystatins-like proteins have been shown to be devoid of inhibitory activity.   All true cystatins inhibit cysteine peptidases of the papain family (MEROPS peptidase family C1), and some also inhibit legumain family enzymes (MEROPS peptidase family C13). These peptidases play key roles in physiological processes, such as intracellular protein degradation (cathepsins B, H and L), are pivotal in the remodelling of bone (cathepsin K), and may be important in the control of antigen presentation (cathepsin S, mammalian legumain). Moreover, the activities of such peptidases are increased in pathophysiological conditions, such as cancer metastasis and inflammation. Additionally, such peptidases are essential for several pathogenic parasites and bacteria. Thus in animals cystatins not only have capacity to regulate normal body processes and perhaps cause disease when down-regulated, but in other organisms may also participate in defence against biotic and abiotic stress. ; GO: 0004869 cysteine-type endopeptidase inhibitor activity; PDB: 3L0R_B 2W9P_K 2W9Q_A 3S67_A 3QRD_D 1R4C_G 3GAX_A 1TIJ_B 1G96_A 3NX0_A ....
Probab=25.52  E-value=37  Score=24.83  Aligned_cols=17  Identities=18%  Similarity=0.401  Sum_probs=13.9

Q ss_pred             hhhccceeEeeeccccc
Q psy14862         26 EWASGCVHTYLGWHPRW   42 (263)
Q Consensus        26 ~~~~~~~~~~~~~~~~w   42 (263)
                      .....-.|.+.||.+||
T Consensus        78 ~~~~~~~C~~~v~~~pW   94 (94)
T PF00031_consen   78 EQPWTKFCKFTVWERPW   94 (94)
T ss_dssp             TTSSEEEEEEEEEEECG
T ss_pred             cCCceeeEEEEEEECCC
Confidence            35566789999999999


No 30 
>KOG3554|consensus
Probab=24.44  E-value=2.5e+02  Score=27.88  Aligned_cols=19  Identities=16%  Similarity=0.522  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHhCCccCCh
Q psy14862         75 HGNQFKDFVREYERQYDSD   93 (263)
Q Consensus        75 ~~~~F~~f~~~y~K~Y~s~   93 (263)
                      -..+|++=+.+|||.+.+.
T Consensus       292 EanLFEeALeKyGKDFndI  310 (693)
T KOG3554|consen  292 EANLFEEALEKYGKDFNDI  310 (693)
T ss_pred             hhHHHHHHHHHhcccHHHH
Confidence            4578999999999998664


No 31 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=24.27  E-value=56  Score=24.92  Aligned_cols=14  Identities=36%  Similarity=0.622  Sum_probs=11.1

Q ss_pred             CCCeEEEEEEEecC
Q psy14862        246 SLNHAVLNVGYDNE  259 (263)
Q Consensus       246 ~~nHaVliVGYG~~  259 (263)
                      ...|.|+|+||...
T Consensus       122 ~~~H~vvi~Gy~~~  135 (144)
T PF13529_consen  122 YGGHYVVIIGYDED  135 (144)
T ss_dssp             TTEEEEEEEEE-SS
T ss_pred             cCCEEEEEEEEeCC
Confidence            46999999999874


No 32 
>KOG4128|consensus
Probab=24.24  E-value=24  Score=33.33  Aligned_cols=40  Identities=30%  Similarity=0.451  Sum_probs=26.7

Q ss_pred             CCcccCCCCCchhHHHHhhhHHHHHHHHHcC-cceecCCCCc
Q psy14862        181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQPP  221 (263)
Q Consensus       181 tp~VknQg~CGSCWAFat~~~iEs~~~I~tg-~~~~LSeQqL  221 (263)
                      +| |-||.+-|-||.|+.+..+---+..+-+ +-..||...|
T Consensus        63 ~p-vtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayL  103 (457)
T KOG4128|consen   63 QP-VTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYL  103 (457)
T ss_pred             cc-cccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHH
Confidence            58 9999999999999998765432222222 2345666655


No 33 
>KOG4702|consensus
Probab=20.78  E-value=2e+02  Score=20.86  Aligned_cols=32  Identities=31%  Similarity=0.619  Sum_probs=22.9

Q ss_pred             HHHHHHHHHhCCccCChHHHHHHHHHHHHHHHH
Q psy14862         77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKT  109 (263)
Q Consensus        77 ~~F~~f~~~y~K~Y~s~~E~~~R~~iF~~Nl~~  109 (263)
                      ..|++|+..|.+.-..+ |...|.+-|.+-++.
T Consensus        29 e~Fee~v~~~krel~pp-e~~~~~EE~~~~lRe   60 (77)
T KOG4702|consen   29 EIFEEFVRGYKRELSPP-EATKRKEEYENFLRE   60 (77)
T ss_pred             HHHHHHHHhccccCCCh-HHHhhHHHHHHHHHH
Confidence            47999999999876554 666676666655443


Done!