Query         psy274
Match_columns 187
No_of_seqs    117 out of 1154
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 23:59:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy274.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/274hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0 3.1E-54 6.7E-59  349.7  12.8  180    1-185   189-372 (372)
  2 cd02621 Peptidase_C1A_Cathepsi 100.0 2.1E-50 4.5E-55  324.2  17.5  176    1-183    37-241 (243)
  3 cd02620 Peptidase_C1A_Cathepsi 100.0 3.3E-50 7.1E-55  321.8  16.7  176    1-181    36-235 (236)
  4 PTZ00203 cathepsin L protease; 100.0   7E-50 1.5E-54  334.5  18.4  178    1-184   158-340 (348)
  5 cd02698 Peptidase_C1A_Cathepsi 100.0 9.7E-50 2.1E-54  319.6  18.3  174    1-183    39-237 (239)
  6 cd02248 Peptidase_C1A Peptidas 100.0 1.4E-49 3.1E-54  311.9  17.5  175    1-182    32-210 (210)
  7 KOG1543|consensus              100.0 1.3E-48 2.9E-53  324.5  14.9  175    1-182   142-321 (325)
  8 PTZ00021 falcipain-2; Provisio 100.0   3E-48 6.5E-53  334.4  16.6  175    1-185   298-489 (489)
  9 PTZ00200 cysteine proteinase;  100.0   2E-47 4.4E-52  327.7  16.7  173    1-184   267-445 (448)
 10 PF00112 Peptidase_C1:  Papain  100.0 8.3E-47 1.8E-51  296.8  14.9  176    1-183    34-219 (219)
 11 PTZ00049 cathepsin C-like prot 100.0 1.6E-46 3.6E-51  330.8  17.5  181    1-186   417-678 (693)
 12 PTZ00364 dipeptidyl-peptidase  100.0 4.7E-46   1E-50  324.1  17.5  182    1-184   243-459 (548)
 13 PTZ00462 Serine-repeat antigen 100.0 2.4E-41 5.2E-46  305.8  17.0  182    1-186   564-783 (1004)
 14 cd02619 Peptidase_C1 C1 Peptid 100.0 7.3E-40 1.6E-44  257.8  15.7  165    1-170    29-213 (223)
 15 smart00645 Pept_C1 Papain fami 100.0 1.1E-39 2.4E-44  249.8  13.8  135    1-180    33-171 (174)
 16 KOG1544|consensus              100.0 4.8E-38   1E-42  253.4   7.7  177    4-182   246-458 (470)
 17 COG4870 Cysteine protease [Pos  99.7 2.1E-17 4.5E-22  136.3   9.4  132   33-174   168-318 (372)
 18 cd00585 Peptidase_C1B Peptidas  99.7 2.2E-17 4.8E-22  141.7   9.4   83   87-169   293-399 (437)
 19 PF03051 Peptidase_C1_2:  Pepti  99.0 2.5E-09 5.4E-14   92.3   8.9   79   89-169   300-400 (438)
 20 COG3579 PepC Aminopeptidase C   97.6 6.3E-05 1.4E-09   62.4   3.3   82   87-168   296-401 (444)
 21 PF13529 Peptidase_C39_2:  Pept  96.3  0.0067 1.5E-07   43.5   4.4   55   87-154    88-144 (144)
 22 PF09778 Guanylate_cyc_2:  Guan  92.9    0.59 1.3E-05   36.9   7.3   64   87-152   112-180 (212)
 23 PF05543 Peptidase_C47:  Stapho  91.0     1.6 3.4E-05   33.3   7.5   67   87-172    90-157 (175)
 24 PF14399 Transpep_BrtH:  NlpC/p  90.9    0.57 1.2E-05   38.6   5.5   55   87-150    77-131 (317)
 25 KOG4128|consensus               88.1   0.044 9.5E-07   45.8  -3.1   95   87-185   305-426 (457)
 26 COG4990 Uncharacterized protei  87.5     1.5 3.2E-05   33.7   5.0   47   87-155   122-168 (195)
 27 cd02549 Peptidase_C39A A sub-f  77.1     9.3  0.0002   27.1   5.8   45   91-154    70-114 (141)
 28 cd00044 CysPc Calpains, domain  74.2     5.7 0.00012   33.0   4.5   28  129-156   234-263 (315)
 29 PF12385 Peptidase_C70:  Papain  69.6      16 0.00034   27.6   5.4   39   87-142    97-135 (166)
 30 PF01640 Peptidase_C10:  Peptid  51.0      90   0.002   23.9   7.0   53   88-165   140-192 (192)
 31 cd00585 Peptidase_C1B Peptidas  46.3      19 0.00041   31.6   2.8   30   28-58    130-159 (437)
 32 smart00230 CysPc Calpain-like   42.7      60  0.0013   27.1   5.2   27  130-156   227-255 (318)
 33 KOG4621|consensus               38.9 1.1E+02  0.0023   22.4   5.2   77   87-168    58-144 (167)
 34 PF03051 Peptidase_C1_2:  Pepti  33.1      46   0.001   29.3   3.1   55    3-58     77-160 (438)
 35 PF07157 DNA_circ_N:  DNA circu  24.2 1.2E+02  0.0027   20.6   3.4   46   52-101    30-79  (93)
 36 PF00648 Peptidase_C2:  Calpain  23.4      69  0.0015   26.2   2.4   28  129-156   212-243 (298)
 37 smart00276 GLECT Galectin. Gal  21.0      56  0.0012   23.2   1.3   13  148-160    51-64  (128)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=3.1e-54  Score=349.67  Aligned_cols=180  Identities=42%  Similarity=0.856  Sum_probs=172.3

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCCCCCCCCCccchhhHHH-HHhCcccccccccccCCCCCcc-cccCCCCcceeee
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEY-THQAGLESEKDYPYRNGNGEKF-KCAYDKSKVKLFT   78 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~-~~~~Gi~~e~~yPY~~~~~~~~-~C~~~~~~~~~~i   78 (187)
                      ++|++++|++++.++||||||+||+..++||+||. +.+||+| ++..|+.+|.+|||.   +..+ .|........+.|
T Consensus       189 ~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~GGl-~~nA~~~~~~~gGL~~E~dYPY~---g~~~~~C~~~~~~~~v~I  264 (372)
T KOG1542|consen  189 AVEGAWAIATGKLVSLSEQELVDCDSCDNGCNGGL-MDNAFKYIKKAGGLEKEKDYPYT---GKKGNQCHFDKSKIVVSI  264 (372)
T ss_pred             hhhhHHHhhcCcccccchhhhhcccCcCCcCCCCC-hhHHHHHHHHhCCccccccCCcc---ccCCCccccchhhceEEE
Confidence            58999999999999999999999999899999999 9999999 888999999999999   8877 9999999999999


Q ss_pred             ccceeEeC-ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecC-CccEEEEEccCCC
Q psy274           79 GKDFLYFN-GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQD-DIPYWLARNSWGP  156 (187)
Q Consensus        79 ~~~~~~i~-~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~-g~~ywivkNSWG~  156 (187)
                      . +|..++ ++++|.+.|.+.|||+|+|++..+|.|.+||..+.+..|++..++|+|+|||||.++ .++|||||||||+
T Consensus       265 ~-~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~  343 (372)
T KOG1542|consen  265 K-DFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGT  343 (372)
T ss_pred             e-ccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccCCccccCceEEEEeecCCCCCCceEEEECCccc
Confidence            9 999999 999999999999999999999999999999999877889998899999999999888 9999999999999


Q ss_pred             CCCCCcEEEEEecCCeecccceeeEEEEE
Q psy274          157 IGPDEGFFKIERGNNACGIETIAGYATID  185 (187)
Q Consensus       157 ~WG~~Gy~~i~~~~n~cgi~~~~~~~~~~  185 (187)
                      +||++||+||.||.|.|||++.+..|.++
T Consensus       344 ~WGE~GY~~l~RG~N~CGi~~mvss~~v~  372 (372)
T KOG1542|consen  344 SWGEKGYYKLCRGSNACGIADMVSSAAVN  372 (372)
T ss_pred             cccccceEEEeccccccccccchhhhhcC
Confidence            99999999999999999999999888653


No 2  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=2.1e-50  Score=324.18  Aligned_cols=176  Identities=29%  Similarity=0.574  Sum_probs=153.2

Q ss_pred             CchhhHHHHhCC------CccCCHHHHhhhcCCCCCCCCCccchhhHHHHHhCcccccccccccCCCC-CcccccCCC-C
Q psy274            1 MLEGQYAIKTGK------LVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNG-EKFKCAYDK-S   72 (187)
Q Consensus         1 a~E~~~~i~~~~------~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~-~~~~C~~~~-~   72 (187)
                      +||++++|++++      .+.||+|+|+||+..+.||+||+ +..|++|++++|+++|++|||.   . ..+.|.... .
T Consensus        37 ~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~-~~~a~~~~~~~Gi~~e~~yPY~---~~~~~~C~~~~~~  112 (243)
T cd02621          37 ALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGF-PFLVGKFAEDFGIVTEDYFPYT---ADDDRPCKASPSE  112 (243)
T ss_pred             HHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCC-HHHHHHHHHhcCcCCCceeCCC---CCCCCCCCCCccc
Confidence            589999999887      79999999999997788999999 9999999999999999999999   5 677887654 3


Q ss_pred             cceeeeccceeEe------CChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCC----CCC--------CcccEE
Q psy274           73 KVKLFTGKDFLYF------NGSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEI----CSP--------NAIGHA  133 (187)
Q Consensus        73 ~~~~~i~~~~~~i------~~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~----~~~--------~~~~Ha  133 (187)
                      ...+++. .|..+      .++++||++|+++|||+++|++ ++|++|++|||+.  +.    |+.        ..++|+
T Consensus       113 ~~~~~~~-~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~--~~~~~~C~~~~~~~~~~~~~~Ha  189 (243)
T cd02621         113 CRRYYFS-DYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHH--TDNDEVSDGDNDNFNPFELTNHA  189 (243)
T ss_pred             ccccccc-ceeEcccccccCCHHHHHHHHHHcCCEEEEEEecccccccCCeEECc--CCcccccccccccccCcccCCeE
Confidence            3444555 55544      3789999999999999999999 8999999999987  32    643        247999


Q ss_pred             EEEEEEeecC--CccEEEEEccCCCCCCCCcEEEEEecCCeecccceeeEEE
Q psy274          134 VLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIETIAGYAT  183 (187)
Q Consensus       134 v~iVGyg~~~--g~~ywivkNSWG~~WG~~Gy~~i~~~~n~cgi~~~~~~~~  183 (187)
                      |+|||||+++  +.+|||||||||++||++|||||+|+.|.|||+++++++.
T Consensus       190 V~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~~~~  241 (243)
T cd02621         190 VLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAY  241 (243)
T ss_pred             EEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCcccceEeec
Confidence            9999999876  8999999999999999999999999999999999998864


No 3  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=3.3e-50  Score=321.79  Aligned_cols=176  Identities=25%  Similarity=0.458  Sum_probs=150.0

Q ss_pred             CchhhHHHHhC--CCccCCHHHHhhhcCC-CCCCCCCccchhhHHHHHhCcccccccccccCCCCC--------------
Q psy274            1 MLEGQYAIKTG--KLVEFSKSQLVECAKQ-CSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGE--------------   63 (187)
Q Consensus         1 a~E~~~~i~~~--~~~~lS~q~l~dc~~~-~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~~--------------   63 (187)
                      +||++++|+++  +.+.||+|+|+||+.. +.||+||+ +..||+|++++|+++|++|||......              
T Consensus        36 ~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~-~~~a~~~i~~~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~  114 (236)
T cd02620          36 AFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGY-PDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPY  114 (236)
T ss_pred             HHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCC-HHHHHHHHHhcCCCcCCEecCcCCCCccCCCCCCCCCCCCC
Confidence            58999999988  7899999999999987 88999999 999999998999999999999832211              


Q ss_pred             -cccccCCCC----cceeeeccceeEeC-ChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEE
Q psy274           64 -KFKCAYDKS----KVKLFTGKDFLYFN-GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLL  136 (187)
Q Consensus        64 -~~~C~~~~~----~~~~~i~~~~~~i~-~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~i  136 (187)
                       ...|.....    ...+++. .+..+. ++++||++|+++|||+++|.+ ++|+.|++|||+.   .|+...++|||+|
T Consensus       115 ~~~~C~~~~~~~~~~~~~~~~-~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~---~~~~~~~~HaV~i  190 (236)
T cd02620         115 CTPKCQDGCEKTYEEDKHKGK-SAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQH---TSGKQLGGHAVKI  190 (236)
T ss_pred             CCCCCCcCCccccceeeeeec-ceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEee---cCCCCcCCeEEEE
Confidence             113432221    2234555 666665 679999999999999999999 8999999999986   4666568999999


Q ss_pred             EEEeecCCccEEEEEccCCCCCCCCcEEEEEecCCeecccceeeE
Q psy274          137 VGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIETIAGY  181 (187)
Q Consensus       137 VGyg~~~g~~ywivkNSWG~~WG~~Gy~~i~~~~n~cgi~~~~~~  181 (187)
                      ||||++++++|||||||||++||++|||||+|+.|.|||++.++.
T Consensus       191 VGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~  235 (236)
T cd02620         191 IGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA  235 (236)
T ss_pred             EEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence            999998999999999999999999999999999999999999875


No 4  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=7e-50  Score=334.53  Aligned_cols=178  Identities=30%  Similarity=0.595  Sum_probs=154.7

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCCCCCCCCCccchhhHHHHHh---CcccccccccccCCCCCcccccCCCC-ccee
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQ---AGLESEKDYPYRNGNGEKFKCAYDKS-KVKL   76 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~---~Gi~~e~~yPY~~~~~~~~~C~~~~~-~~~~   76 (187)
                      ++|++++|++++.++||+|+|+||+..+.||+||+ +..||+|+++   .|+++|++|||....+..+.|..... ....
T Consensus       158 aiEs~~~i~~~~~~~LSeQqLvdC~~~~~GC~GG~-~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~  236 (348)
T PTZ00203        158 NIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGL-MLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGA  236 (348)
T ss_pred             HHHHHHHHhcCCCccCCHHHHHhccCCCCCCCCCC-HHHHHHHHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccce
Confidence            58999999999999999999999998788999999 9999999653   46899999999933233336864332 2346


Q ss_pred             eeccceeEeC-ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccCC
Q psy274           77 FTGKDFLYFN-GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSWG  155 (187)
Q Consensus        77 ~i~~~~~~i~-~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG  155 (187)
                      ++. +|..+. +++.|+++|++.|||+++|++++|++|++|||..    |.....+|+|+|||||+++|.+|||||||||
T Consensus       237 ~i~-~~~~i~~~e~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy~~----c~~~~~nHaVliVGYG~~~g~~YWiikNSWG  311 (348)
T PTZ00203        237 RID-GYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIGEQLNHGVLLVGYNMTGEVPYWVIKNSWG  311 (348)
T ss_pred             Eec-ceeecCcCHHHHHHHHHhCCCEEEEEEhhhhcCccCceeec----cCCCCCCeEEEEEEEecCCCceEEEEEcCCC
Confidence            788 888887 7889999999999999999998899999999964    7655679999999999988999999999999


Q ss_pred             CCCCCCcEEEEEecCCeecccceeeEEEE
Q psy274          156 PIGPDEGFFKIERGNNACGIETIAGYATI  184 (187)
Q Consensus       156 ~~WG~~Gy~~i~~~~n~cgi~~~~~~~~~  184 (187)
                      ++||++|||||+|+.|.|||+++++.+.+
T Consensus       312 ~~WGe~GY~ri~rg~n~Cgi~~~~~~~~~  340 (348)
T PTZ00203        312 EDWGEKGYVRVTMGVNACLLTGYPVSVHV  340 (348)
T ss_pred             CCcCcCceEEEEcCCCcccccceEEEEec
Confidence            99999999999999999999999998865


No 5  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=9.7e-50  Score=319.61  Aligned_cols=174  Identities=27%  Similarity=0.513  Sum_probs=151.3

Q ss_pred             CchhhHHHHhC---CCccCCHHHHhhhcCCCCCCCCCccchhhHHHHHhCcccccccccccCCCCCcccccCC-------
Q psy274            1 MLEGQYAIKTG---KLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYD-------   70 (187)
Q Consensus         1 a~E~~~~i~~~---~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~-------   70 (187)
                      +||++++|+++   +.+.||+|||+||+. +.||+||+ +..||+|++++|+++|++|||.   +....|...       
T Consensus        39 aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~-~~~a~~~~~~~Gl~~e~~yPY~---~~~~~C~~~~~~~~c~  113 (239)
T cd02698          39 ALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGD-PGGVYEYAHKHGIPDETCNPYQ---AKDGECNPFNRCGTCN  113 (239)
T ss_pred             HHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcC-HHHHHHHHHHcCcCCCCeeCCc---CCCCCCcCCCCCCCcc
Confidence            58999999876   368999999999998 78999999 9999999988999999999998   555555421       


Q ss_pred             --------CCcceeeeccceeEeCChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEEEEEee
Q psy274           71 --------KSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGK  141 (187)
Q Consensus        71 --------~~~~~~~i~~~~~~i~~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~  141 (187)
                              +....++++ +|..+.+.++||++|+++|||+++|.+ ++|+.|++|||+.  +.|. ..++|+|+|||||+
T Consensus       114 ~~~~c~~~~~~~~~~i~-~~~~~~~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~--~~~~-~~~~HaV~IVGyG~  189 (239)
T cd02698         114 PFGECFAIKNYTLYFVS-DYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKE--YVQD-PLINHIISVAGWGV  189 (239)
T ss_pred             cCcccccccccceEEee-eceecCCHHHHHHHHHHcCCEEEEEEecccccccCCeEEcc--CCCC-CcCCeEEEEEEEEe
Confidence                    112345677 787777788999999999999999999 8999999999987  4454 45899999999998


Q ss_pred             cC-CccEEEEEccCCCCCCCCcEEEEEecC-----CeecccceeeEEE
Q psy274          142 QD-DIPYWLARNSWGPIGPDEGFFKIERGN-----NACGIETIAGYAT  183 (187)
Q Consensus       142 ~~-g~~ywivkNSWG~~WG~~Gy~~i~~~~-----n~cgi~~~~~~~~  183 (187)
                      ++ +++|||||||||++||++|||||+|+.     |+|+||+.++++.
T Consensus       190 ~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~~~~~~  237 (239)
T cd02698         190 DENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEEDCAWAD  237 (239)
T ss_pred             cCCCCEEEEEEcCCCcccCcCceEEEEccCCcccccccccccceEEEe
Confidence            76 999999999999999999999999999     9999999999885


No 6  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=1.4e-49  Score=311.89  Aligned_cols=175  Identities=42%  Similarity=0.840  Sum_probs=161.3

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCC-CCCCCCCccchhhHHHHHhCcccccccccccCCCCCcccccCCCCcceeeec
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQ-CSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTG   79 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~-~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~   79 (187)
                      +||++++++++...+||+|+|++|... +.+|+||. +..||+++++.|+++|++|||.   .....|........+++.
T Consensus        32 ~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~-~~~a~~~~~~~Gi~~e~~yPY~---~~~~~C~~~~~~~~~~i~  107 (210)
T cd02248          32 ALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGN-PDNAFEYVKNGGLASESDYPYT---GKDGTCKYNSSKVGAKIT  107 (210)
T ss_pred             HHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCC-HHHhHHHHHHCCcCccccCCcc---CCCCCccCCCCcccEEEe
Confidence            589999999999999999999999986 88999999 9999999999999999999999   777889877767889999


Q ss_pred             cceeEeC--ChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccCCC
Q psy274           80 KDFLYFN--GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSWGP  156 (187)
Q Consensus        80 ~~~~~i~--~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~  156 (187)
                       +|..+.  +.++||++|+++|||+++|.+ ++|+.|++|||..  +.|....++|+|+|||||++.+.+|||||||||+
T Consensus       108 -~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~--~~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~  184 (210)
T cd02248         108 -GYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSG--PCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGT  184 (210)
T ss_pred             -eEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeC--CCCCCCcCCEEEEEEEEeecCCceEEEEEcCCCC
Confidence             999998  589999999999999999999 8999999999988  5564556899999999999889999999999999


Q ss_pred             CCCCCcEEEEEecCCeecccceeeEE
Q psy274          157 IGPDEGFFKIERGNNACGIETIAGYA  182 (187)
Q Consensus       157 ~WG~~Gy~~i~~~~n~cgi~~~~~~~  182 (187)
                      .||++|||||+|+.|.|||++.+.+|
T Consensus       185 ~WG~~Gy~~i~~~~~~cgi~~~~~~~  210 (210)
T cd02248         185 SWGEKGYIRIARGSNLCGIASYASYP  210 (210)
T ss_pred             ccccCcEEEEEcCCCccCceeeeecC
Confidence            99999999999999999999887764


No 7  
>KOG1543|consensus
Probab=100.00  E-value=1.3e-48  Score=324.52  Aligned_cols=175  Identities=34%  Similarity=0.687  Sum_probs=160.8

Q ss_pred             CchhhHHHHhC-CCccCCHHHHhhhcCC-CCCCCCCccchhhHHHHHhCcccc-cccccccCCCCCcccccCCCCcceee
Q psy274            1 MLEGQYAIKTG-KLVEFSKSQLVECAKQ-CSGCGGCDGLEQPIEYTHQAGLES-EKDYPYRNGNGEKFKCAYDKSKVKLF   77 (187)
Q Consensus         1 a~E~~~~i~~~-~~~~lS~q~l~dc~~~-~~gC~gG~~~~~a~~~~~~~Gi~~-e~~yPY~~~~~~~~~C~~~~~~~~~~   77 (187)
                      |||++++|+++ ..+.||+|||+||+.. ++||+||. +..||+|++++|+++ +.+|||.   +....|........+.
T Consensus       142 aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~-~~~A~~yi~~~G~~t~~~~Ypy~---~~~~~C~~~~~~~~~~  217 (325)
T KOG1543|consen  142 ALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGE-PKNAFKYIKKNGGVTECENYPYI---GKDGTCKSNKKDKTVT  217 (325)
T ss_pred             HHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCC-HHHHHHHHHHhCCCCCCcCCCCc---CCCCCccCCCccceeE
Confidence            68999999999 8999999999999996 89999999 999999988888888 9999999   7788999887777788


Q ss_pred             eccceeEeC-ChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccCC
Q psy274           78 TGKDFLYFN-GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSWG  155 (187)
Q Consensus        78 i~~~~~~i~-~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG  155 (187)
                      +. ++..++ ++++|+++|+.+|||+++|.+ .+|+.|++|||..  +.|....++|||+|||||+.++.+|||||||||
T Consensus       218 ~~-~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~--~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG  294 (325)
T KOG1543|consen  218 IK-GFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAE--EKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWG  294 (325)
T ss_pred             ee-eeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeC--CCCCCCCCCceEEEEEEcCCCCceeEEEEcCCC
Confidence            88 888777 999999999999999999999 9999999999999  455443589999999999966789999999999


Q ss_pred             CCCCCCcEEEEEecCCeecccceeeEE
Q psy274          156 PIGPDEGFFKIERGNNACGIETIAGYA  182 (187)
Q Consensus       156 ~~WG~~Gy~~i~~~~n~cgi~~~~~~~  182 (187)
                      +.||++|||||.|+.|.|+|++.+.++
T Consensus       295 ~~WGe~Gy~ri~r~~~~~~I~~~~~~~  321 (325)
T KOG1543|consen  295 TDWGEKGYFRIARGVNKCGIASEASYG  321 (325)
T ss_pred             CCcccCceEEEecCCCchhhhcccccC
Confidence            999999999999999999999998883


No 8  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=3e-48  Score=334.39  Aligned_cols=175  Identities=33%  Similarity=0.594  Sum_probs=156.2

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCCCCCCCCCccchhhHHHH-HhCcccccccccccCCCCC-cccccCCCCcceeee
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT-HQAGLESEKDYPYRNGNGE-KFKCAYDKSKVKLFT   78 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~-~~~Gi~~e~~yPY~~~~~~-~~~C~~~~~~~~~~i   78 (187)
                      |+|++++|++++.++||+|+|+||+..+.||+||+ +..||+|+ +..|+++|++|||.   +. ++.|........+++
T Consensus       298 alEs~~~I~~g~~v~LSeQqLVDCs~~n~GC~GG~-~~~Af~yi~~~gGl~tE~~YPY~---~~~~~~C~~~~~~~~~~i  373 (489)
T PTZ00021        298 VVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGL-IPNAFEDMIELGGLCSEDDYPYV---SDTPELCNIDRCKEKYKI  373 (489)
T ss_pred             HHHHHHHHHcCCCcccCHHHHhhhccCCCCCCCcc-hHhhhhhhhhccccCcccccCcc---CCCCCcccccccccccee
Confidence            58999999999999999999999998789999999 99999995 56699999999999   65 478976555566889


Q ss_pred             ccceeEeCChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCC----------ccE
Q psy274           79 GKDFLYFNGSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDD----------IPY  147 (187)
Q Consensus        79 ~~~~~~i~~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g----------~~y  147 (187)
                      . +|..++ ..+|+++|+..|||+++|.+ ++|++|++|||..   .|+. .++|||+|||||++++          .+|
T Consensus       374 ~-~y~~i~-~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~---~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~Y  447 (489)
T PTZ00021        374 K-SYVSIP-EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDG---ECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYY  447 (489)
T ss_pred             e-eEEEec-HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCC---CCCC-ccceEEEEEEecCcCCcccccccCCCCCE
Confidence            8 998885 67899999999999999999 8999999999976   6876 4799999999997642          479


Q ss_pred             EEEEccCCCCCCCCcEEEEEecC----CeecccceeeEEEEE
Q psy274          148 WLARNSWGPIGPDEGFFKIERGN----NACGIETIAGYATID  185 (187)
Q Consensus       148 wivkNSWG~~WG~~Gy~~i~~~~----n~cgi~~~~~~~~~~  185 (187)
                      ||||||||++||++|||||+|+.    |.|||.+.+.+|++|
T Consensus       448 WIVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a~yP~~~  489 (489)
T PTZ00021        448 YIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVPLIE  489 (489)
T ss_pred             EEEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccceeEecC
Confidence            99999999999999999999986    589999999999875


No 9  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=2e-47  Score=327.73  Aligned_cols=173  Identities=30%  Similarity=0.583  Sum_probs=151.9

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCCCCCCCCCccchhhHHHHHhCcccccccccccCCCCCcccccCCCCcceeeecc
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGK   80 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~   80 (187)
                      ++|++++|+++..++||+|||+||+..+.||+||+ +..||+|++++|+++|++|||.   +..+.|..... ...+|. 
T Consensus       267 aiEs~~~i~~~~~~~LSeQqLvDC~~~~~GC~GG~-~~~A~~yi~~~Gi~~e~~YPY~---~~~~~C~~~~~-~~~~i~-  340 (448)
T PTZ00200        267 SVESLYKIYRDKSVDLSEQELVNCDTKSQGCSGGY-PDTALEYVKNKGLSSSSDVPYL---AKDGKCVVSST-KKVYID-  340 (448)
T ss_pred             HHHHHHHHhcCCCeecCHHHHhhccCccCCCCCCc-HHHHHHHHhhcCccccccCCCC---CCCCCCcCCCC-CeeEec-
Confidence            58999999999999999999999997789999999 9999999988999999999999   88889986543 345688 


Q ss_pred             ceeEeCChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCCCcccEEEEEEEEee--cCCccEEEEEccCCCC
Q psy274           81 DFLYFNGSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGK--QDDIPYWLARNSWGPI  157 (187)
Q Consensus        81 ~~~~i~~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~--~~g~~ywivkNSWG~~  157 (187)
                      +|..+...+.++++| ..|||+++|.+ .+|+.|++|||..   +|+.. ++|||+|||||.  ++|.+|||||||||++
T Consensus       341 ~y~~~~~~~~l~~~l-~~GPV~v~i~~~~~f~~Yk~GIy~~---~C~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~  415 (448)
T PTZ00200        341 SYLVAKGKDVLNKSL-VISPTVVYIAVSRELLKYKSGVYNG---ECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTD  415 (448)
T ss_pred             ceEecCHHHHHHHHH-hcCCEEEEeecccccccCCCCcccc---ccCCC-CcEEEEEEEecccCCCCCceEEEEcCCCCC
Confidence            887665445555555 57999999999 8999999999976   68764 899999999994  4688999999999999


Q ss_pred             CCCCcEEEEEec---CCeecccceeeEEEE
Q psy274          158 GPDEGFFKIERG---NNACGIETIAGYATI  184 (187)
Q Consensus       158 WG~~Gy~~i~~~---~n~cgi~~~~~~~~~  184 (187)
                      ||++|||||+|+   .|.|||++.+.+|++
T Consensus       416 WGe~GY~ri~r~~~g~n~CGI~~~~~~P~~  445 (448)
T PTZ00200        416 WGENGYMRLERTNEGTDKCGILTVGLTPVF  445 (448)
T ss_pred             cccCeeEEEEeCCCCCCcCCccccceeeEE
Confidence            999999999996   489999999999987


No 10 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=8.3e-47  Score=296.82  Aligned_cols=176  Identities=34%  Similarity=0.646  Sum_probs=156.4

Q ss_pred             CchhhHHHHh-CCCccCCHHHHhhhcC-CCCCCCCCccchhhHHHHHh-CcccccccccccCCCCCc-ccccCCCCcc-e
Q psy274            1 MLEGQYAIKT-GKLVEFSKSQLVECAK-QCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEK-FKCAYDKSKV-K   75 (187)
Q Consensus         1 a~E~~~~i~~-~~~~~lS~q~l~dc~~-~~~gC~gG~~~~~a~~~~~~-~Gi~~e~~yPY~~~~~~~-~~C~~~~~~~-~   75 (187)
                      ++|++++++. ...++||+|+|++|.. .+.+|+||+ +.+|++++++ .|+++|++|||.   ... +.|....... .
T Consensus        34 ~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~-~~~a~~~~~~~~Gi~~e~~~pY~---~~~~~~c~~~~~~~~~  109 (219)
T PF00112_consen   34 ALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGS-PFDALKYIKNNNGIVTEEDYPYN---GNENPTCKSKKSNSYY  109 (219)
T ss_dssp             HHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBE-HHHHHHHHHHHTSBEBTTTS--S---SSSSCSSCHSGGGEEE
T ss_pred             ceeccccccccccccccccccccccccccccccccCc-ccccceeecccCccccccccccc---cccccccccccccccc
Confidence            4899999999 7899999999999998 578999999 9999999777 999999999999   666 7898765443 4


Q ss_pred             eeeccceeEeC--ChHHHHHHHHhCCCEEEEEEC-c-cccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEE
Q psy274           76 LFTGKDFLYFN--GSETMKKILYKYGPLSVGLNG-H-LIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLAR  151 (187)
Q Consensus        76 ~~i~~~~~~i~--~~~~ik~~l~~~gPV~v~~~~-~-~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivk  151 (187)
                      .++. .|..+.  ++++||++|+++|||+++|.+ + +|+.|++|||..  +.|+...++|+|+|||||++.+++|||||
T Consensus       110 ~~i~-~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~--~~~~~~~~~Hav~iVGy~~~~~~~~wiv~  186 (219)
T PF00112_consen  110 VKIK-GYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDP--PDCSNESGGHAVLIVGYDDENGKGYWIVK  186 (219)
T ss_dssp             BEES-EEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECS--TSSSSSSEEEEEEEEEEEEETTEEEEEEE
T ss_pred             cccc-ccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeec--cccccccccccccccccccccceeeEeee
Confidence            7888 898888  589999999999999999999 7 699999999998  67777789999999999999999999999


Q ss_pred             ccCCCCCCCCcEEEEEecCC-eecccceeeEEE
Q psy274          152 NSWGPIGPDEGFFKIERGNN-ACGIETIAGYAT  183 (187)
Q Consensus       152 NSWG~~WG~~Gy~~i~~~~n-~cgi~~~~~~~~  183 (187)
                      ||||++||++|||||+|+.+ .|||++++++|+
T Consensus       187 NSWG~~WG~~Gy~~i~~~~~~~c~i~~~~~~~~  219 (219)
T PF00112_consen  187 NSWGTDWGDNGYFRISYDYNNECGIESQAVYPI  219 (219)
T ss_dssp             -SBTTTSTBTTEEEEESSSSSGGGTTSSEEEEE
T ss_pred             hhhCCccCCCeEEEEeeCCCCcCccCceeeecC
Confidence            99999999999999999987 999999999985


No 11 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=1.6e-46  Score=330.79  Aligned_cols=181  Identities=27%  Similarity=0.522  Sum_probs=150.4

Q ss_pred             CchhhHHHHhCC----------CccCCHHHHhhhcCCCCCCCCCccchhhHHHHHhCcccccccccccCCCCCcccccCC
Q psy274            1 MLEGQYAIKTGK----------LVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYD   70 (187)
Q Consensus         1 a~E~~~~i~~~~----------~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~~~~~~~C~~~   70 (187)
                      +||+|++|++++          ...||+|+|+||+..+.||+||+ +..|++|+++.||++|++|||.   +..+.|...
T Consensus       417 alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC~GG~-~~~A~kya~~~GI~tEscYPY~---a~~g~C~~~  492 (693)
T PTZ00049        417 AFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGCNGGF-PYLVSKMAKLQGIPLDKVFPYT---ATEQTCPYQ  492 (693)
T ss_pred             HHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCcCCCc-HHHHHHHHHHCCCCcCCccCCc---CCCCCCCCC
Confidence            589999999753          13799999999998789999999 9999999988999999999998   666667532


Q ss_pred             CC---------------------------------------cceeeeccceeEeC---------ChHHHHHHHHhCCCEE
Q psy274           71 KS---------------------------------------KVKLFTGKDFLYFN---------GSETMKKILYKYGPLS  102 (187)
Q Consensus        71 ~~---------------------------------------~~~~~i~~~~~~i~---------~~~~ik~~l~~~gPV~  102 (187)
                      ..                                       ..+++++ +|..+.         ++++||++|+.+|||+
T Consensus       493 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~y~k-~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVs  571 (693)
T PTZ00049        493 VDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPARWYAK-DYNYIGGCYGCNQCNGEKIMMNEIYRNGPIV  571 (693)
T ss_pred             CCCccccccccccccccccccccccccccccccccccccccccceeee-eeEEecccccccCCCCHHHHHHHHHhcCCEE
Confidence            11                                       1223455 666552         5789999999999999


Q ss_pred             EEEEC-ccccccCCceeeCCC----CCCCCC--------------cccEEEEEEEEeec--CCc--cEEEEEccCCCCCC
Q psy274          103 VGLNG-HLIHFYNGTPIKKND----EICSPN--------------AIGHAVLLVGYGKQ--DDI--PYWLARNSWGPIGP  159 (187)
Q Consensus       103 v~~~~-~~f~~y~~giy~~~~----~~~~~~--------------~~~Hav~iVGyg~~--~g~--~ywivkNSWG~~WG  159 (187)
                      ++|.+ ++|++|++|||+..+    ..|+..              ..+|||+|||||++  +|+  +|||||||||+.||
T Consensus       572 VsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WG  651 (693)
T PTZ00049        572 ASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWG  651 (693)
T ss_pred             EEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccCCCcccCEEEEECCCCCCcc
Confidence            99999 899999999998621    136421              36999999999964  454  79999999999999


Q ss_pred             CCcEEEEEecCCeecccceeeEEEEEe
Q psy274          160 DEGFFKIERGNNACGIETIAGYATIDV  186 (187)
Q Consensus       160 ~~Gy~~i~~~~n~cgi~~~~~~~~~~~  186 (187)
                      ++|||||+|+.|.||||++++++..|+
T Consensus       652 enGYfKI~RG~N~CGIEs~a~~~~pd~  678 (693)
T PTZ00049        652 KEGYFKIIRGKNFSGIESQSLFIEPDF  678 (693)
T ss_pred             cCceEEEEcCCCccCCccceeEEeeec
Confidence            999999999999999999999999986


No 12 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=4.7e-46  Score=324.10  Aligned_cols=182  Identities=20%  Similarity=0.405  Sum_probs=149.5

Q ss_pred             CchhhHHHHhC------CCccCCHHHHhhhcCCCCCCCCCccchhhHHHHHhCcccccccc--cccCCCCCcccccCCCC
Q psy274            1 MLEGQYAIKTG------KLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDY--PYRNGNGEKFKCAYDKS   72 (187)
Q Consensus         1 a~E~~~~i~~~------~~~~lS~q~l~dc~~~~~gC~gG~~~~~a~~~~~~~Gi~~e~~y--PY~~~~~~~~~C~~~~~   72 (187)
                      |||++++|+++      +.+.||+|+|+||+..++||+||+ +..|++|++++|+++|++|  ||....+..+.|.....
T Consensus       243 alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdGG~-p~~A~~yi~~~GI~tE~dY~~PY~~~dg~~~~Ck~~~~  321 (548)
T PTZ00364        243 AMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAGGF-PEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRP  321 (548)
T ss_pred             HHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCCCc-HHHHHHHHHhCCcccccccCCCCCCCCCCCCCCCCCcc
Confidence            58999999984      468999999999997789999999 9999999988999999999  99843233346876544


Q ss_pred             cceeeeccce------eEeC-ChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeC-----CC-CCCC----------CC
Q psy274           73 KVKLFTGKDF------LYFN-GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKK-----ND-EICS----------PN  128 (187)
Q Consensus        73 ~~~~~i~~~~------~~i~-~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~-----~~-~~~~----------~~  128 (187)
                      ...++++ ++      ..+. ++++||++|+++|||+++|++ ++|+.|++|||..     .. ..|.          ..
T Consensus       322 ~~~y~~~-~~~~I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~  400 (548)
T PTZ00364        322 SRRYYFT-NYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFAS  400 (548)
T ss_pred             cceeeee-eeEEecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccc
Confidence            4444444 44      3333 678899999999999999999 8999999999862     00 1111          13


Q ss_pred             cccEEEEEEEEee-cCCccEEEEEccCCC--CCCCCcEEEEEecCCeecccceeeEEEE
Q psy274          129 AIGHAVLLVGYGK-QDDIPYWLARNSWGP--IGPDEGFFKIERGNNACGIETIAGYATI  184 (187)
Q Consensus       129 ~~~Hav~iVGyg~-~~g~~ywivkNSWG~--~WG~~Gy~~i~~~~n~cgi~~~~~~~~~  184 (187)
                      ..+|||+|||||+ ++|.+|||||||||+  +|||+|||||+|+.|+||||+.++.++.
T Consensus       401 ~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~~v~~~~  459 (548)
T PTZ00364        401 NVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVVVMYW  459 (548)
T ss_pred             cCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccceeeeeee
Confidence            5799999999996 578899999999999  9999999999999999999999997764


No 13 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=2.4e-41  Score=305.76  Aligned_cols=182  Identities=21%  Similarity=0.385  Sum_probs=146.6

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCC--CCCCCCCccchhhHHHHHhC-cccccccccccCCCCCcccccCCCC-----
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQ--CSGCGGCDGLEQPIEYTHQA-GLESEKDYPYRNGNGEKFKCAYDKS-----   72 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~--~~gC~gG~~~~~a~~~~~~~-Gi~~e~~yPY~~~~~~~~~C~~~~~-----   72 (187)
                      |+|++++|++++.+.||+|+|+||+..  +.||.||+.+..++.|+++. |+++|++|||... ...+.|+....     
T Consensus       564 aLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPYt~k-~~~g~Cp~~~~~w~n~  642 (1004)
T PTZ00462        564 HLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLYNYT-KVGEDCPDEEDHWMNL  642 (1004)
T ss_pred             HHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCCccC-CCCCCCCCCccccccc
Confidence            589999999999999999999999864  68999998344556887665 5899999999731 14557864321     


Q ss_pred             -------------cceeeeccceeEeCC----------hHHHHHHHHhCCCEEEEEECccccccC-CceeeCCCCCCCCC
Q psy274           73 -------------KVKLFTGKDFLYFNG----------SETMKKILYKYGPLSVGLNGHLIHFYN-GTPIKKNDEICSPN  128 (187)
Q Consensus        73 -------------~~~~~i~~~~~~i~~----------~~~ik~~l~~~gPV~v~~~~~~f~~y~-~giy~~~~~~~~~~  128 (187)
                                   ...+.+. +|..+..          ++.||++|+..|||+|+|.+++|+.|. +|||..  ..|+..
T Consensus       643 ~~~~kll~~~~~~~~~i~~k-gY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv~--~~Cgs~  719 (1004)
T PTZ00462        643 LDHGKILNHNKKEPNSLDGK-AYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVLGYEFNGKKVQ--NLCGDD  719 (1004)
T ss_pred             ccccccccccccccceeecc-ceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehHHhhhcCCcccc--CCCCCC
Confidence                         0123344 5655541          368999999999999999996688884 899876  579876


Q ss_pred             cccEEEEEEEEeec-----CCccEEEEEccCCCCCCCCcEEEEEe-cCCeecccceeeEEEEEe
Q psy274          129 AIGHAVLLVGYGKQ-----DDIPYWLARNSWGPIGPDEGFFKIER-GNNACGIETIAGYATIDV  186 (187)
Q Consensus       129 ~~~Hav~iVGyg~~-----~g~~ywivkNSWG~~WG~~Gy~~i~~-~~n~cgi~~~~~~~~~~~  186 (187)
                      .++|||+|||||++     ++++|||||||||+.||++|||||.| +.|.|||.....+|+++|
T Consensus       720 ~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~n~CGin~i~t~~~fn~  783 (1004)
T PTZ00462        720 TADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIHSVVIFNI  783 (1004)
T ss_pred             cCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCCCCCccchheeeeeEee
Confidence            78999999999964     25789999999999999999999998 689999999999999986


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00  E-value=7.3e-40  Score=257.77  Aligned_cols=165  Identities=27%  Similarity=0.359  Sum_probs=141.9

Q ss_pred             CchhhHHHHhC--CCccCCHHHHhhhcCCC-----CCCCCCccchhhHH-HHHhCcccccccccccCCCCCcccccC---
Q psy274            1 MLEGQYAIKTG--KLVEFSKSQLVECAKQC-----SGCGGCDGLEQPIE-YTHQAGLESEKDYPYRNGNGEKFKCAY---   69 (187)
Q Consensus         1 a~E~~~~i~~~--~~~~lS~q~l~dc~~~~-----~gC~gG~~~~~a~~-~~~~~Gi~~e~~yPY~~~~~~~~~C~~---   69 (187)
                      ++|++++++.+  +.++||+|+|++|....     .+|.||. +..++. ++++.|+++|++|||.   .....|..   
T Consensus        29 ~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~-~~~~~~~~~~~~Gi~~e~~~Py~---~~~~~~~~~~~  104 (223)
T cd02619          29 ALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGG-PLSALLKLVALKGIPPEEDYPYG---AESDGEEPKSE  104 (223)
T ss_pred             HHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCc-HHHHHHHHHHHcCCCccccCCCC---CCCCCCCCCCc
Confidence            58999999998  89999999999999763     6999999 999998 7889999999999999   55555542   


Q ss_pred             -CCCcceeeeccceeEeC--ChHHHHHHHHhCCCEEEEEEC-ccccccCCceee---CCCCCCCCCcccEEEEEEEEeec
Q psy274           70 -DKSKVKLFTGKDFLYFN--GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIK---KNDEICSPNAIGHAVLLVGYGKQ  142 (187)
Q Consensus        70 -~~~~~~~~i~~~~~~i~--~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~---~~~~~~~~~~~~Hav~iVGyg~~  142 (187)
                       ......+++. .|..+.  ++++||++|+++|||+++|.+ +.|..|++|++.   .....+....++|||+|||||++
T Consensus       105 ~~~~~~~~~~~-~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~  183 (223)
T cd02619         105 AALNAAKVKLK-DYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDN  183 (223)
T ss_pred             cchhhcceeec-ceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCC
Confidence             2344667888 998887  579999999999999999999 999999999873   11134556678999999999988


Q ss_pred             C--CccEEEEEccCCCCCCCCcEEEEEecC
Q psy274          143 D--DIPYWLARNSWGPIGPDEGFFKIERGN  170 (187)
Q Consensus       143 ~--g~~ywivkNSWG~~WG~~Gy~~i~~~~  170 (187)
                      .  +++|||||||||+.||++||+||+++.
T Consensus       184 ~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         184 YVEGKGAFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             CCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence            6  899999999999999999999999984


No 15 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00  E-value=1.1e-39  Score=249.83  Aligned_cols=135  Identities=43%  Similarity=0.870  Sum_probs=120.5

Q ss_pred             CchhhHHHHhCCCccCCHHHHhhhcCC-CCCCCCCccchhhHHHHHhC-cccccccccccCCCCCcccccCCCCcceeee
Q psy274            1 MLEGQYAIKTGKLVEFSKSQLVECAKQ-CSGCGGCDGLEQPIEYTHQA-GLESEKDYPYRNGNGEKFKCAYDKSKVKLFT   78 (187)
Q Consensus         1 a~E~~~~i~~~~~~~lS~q~l~dc~~~-~~gC~gG~~~~~a~~~~~~~-Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i   78 (187)
                      +||+++++++++.++||+|+|++|... +.+|+||+ +..|+++++++ |+++|++|||+   .                
T Consensus        33 ~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~-~~~a~~~~~~~~Gi~~e~~~PY~---~----------------   92 (174)
T smart00645       33 ALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGL-PDNAFEYIKKNGGLETESCYPYT---G----------------   92 (174)
T ss_pred             HHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcC-HHHHHHHHHHcCCcccccccCcc---c----------------
Confidence            589999999999999999999999986 67999999 99999998776 99999999999   3                


Q ss_pred             ccceeEeCChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeec-CCccEEEEEccCCCC
Q psy274           79 GKDFLYFNGSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQ-DDIPYWLARNSWGPI  157 (187)
Q Consensus        79 ~~~~~~i~~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~-~g~~ywivkNSWG~~  157 (187)
                                             ++.+.+.+|+.|++|||+.  +.|....++|+|+|||||.+ ++++|||||||||+.
T Consensus        93 -----------------------~~~~~~~~f~~Y~~Gi~~~--~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~  147 (174)
T smart00645       93 -----------------------SVAIDASDFQFYKSGIYDH--PGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTD  147 (174)
T ss_pred             -----------------------EEEEEcccccCCcCeEECC--CCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCC
Confidence                                   4555556699999999987  56766557999999999987 889999999999999


Q ss_pred             CCCCcEEEEEecC-Ceecccceee
Q psy274          158 GPDEGFFKIERGN-NACGIETIAG  180 (187)
Q Consensus       158 WG~~Gy~~i~~~~-n~cgi~~~~~  180 (187)
                      ||++|||||.|+. |.|+|+....
T Consensus       148 WG~~G~~~i~~~~~~~c~i~~~~~  171 (174)
T smart00645      148 WGENGYFRIARGKNNECGIEASVA  171 (174)
T ss_pred             cccCeEEEEEcCCCCccCceeeee
Confidence            9999999999997 9999977653


No 16 
>KOG1544|consensus
Probab=100.00  E-value=4.8e-38  Score=253.37  Aligned_cols=177  Identities=25%  Similarity=0.445  Sum_probs=144.1

Q ss_pred             hhHHHHhCC--CccCCHHHHhhhcCC-CCCCCCCccchhhHHHHHhCcccccccccccCC-CCCcccccCCCC-------
Q psy274            4 GQYAIKTGK--LVEFSKSQLVECAKQ-CSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNG-NGEKFKCAYDKS-------   72 (187)
Q Consensus         4 ~~~~i~~~~--~~~lS~q~l~dc~~~-~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~~~-~~~~~~C~~~~~-------   72 (187)
                      .|++|++..  ...||+|+|++|... ..||+||. +..|+=|+++.|++...||||... .+..+.|.....       
T Consensus       246 DRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~-lDRAWWYlRKrGvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkR  324 (470)
T KOG1544|consen  246 DRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGR-LDRAWWYLRKRGVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKR  324 (470)
T ss_pred             ceeEEeeccccccccChHHhcchhhhhhccCccCc-ccchheeeecccccccccccccCCCCCCCCCceeeccccCcccc
Confidence            456666543  688999999999876 79999999 999999999999999999999842 234444532221       


Q ss_pred             -------------cceeeeccceeEeC-ChHHHHHHHHhCCCEEEEEEC-ccccccCCceeeCCCCCCCC-----CcccE
Q psy274           73 -------------KVKLFTGKDFLYFN-GSETMKKILYKYGPLSVGLNG-HLIHFYNGTPIKKNDEICSP-----NAIGH  132 (187)
Q Consensus        73 -------------~~~~~i~~~~~~i~-~~~~ik~~l~~~gPV~v~~~~-~~f~~y~~giy~~~~~~~~~-----~~~~H  132 (187)
                                   ...++++ .-++|+ ++++||++||++|||-+.|.+ ++|+.|++|||+..+.....     ..+.|
T Consensus       325 qat~~CPn~~~~Sn~iyq~t-PPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtH  403 (470)
T KOG1544|consen  325 QATAHCPNSYVNSNDIYQVT-PPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTH  403 (470)
T ss_pred             cccCcCCCcccccCceeeec-CCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccc
Confidence                         1456666 444455 999999999999999999999 99999999999884322222     25799


Q ss_pred             EEEEEEEeecC-----CccEEEEEccCCCCCCCCcEEEEEecCCeecccceeeEE
Q psy274          133 AVLLVGYGKQD-----DIPYWLARNSWGPIGPDEGFFKIERGNNACGIETIAGYA  182 (187)
Q Consensus       133 av~iVGyg~~~-----g~~ywivkNSWG~~WG~~Gy~~i~~~~n~cgi~~~~~~~  182 (187)
                      +|.|.|||++.     ..+||+..||||+.||++|||||.||.|+|.||++++.|
T Consensus       404 sVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNecdIEsfvIgA  458 (470)
T KOG1544|consen  404 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNECDIESFVIGA  458 (470)
T ss_pred             eEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccchhhhHhhhhh
Confidence            99999999763     257999999999999999999999999999999998765


No 17 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.72  E-value=2.1e-17  Score=136.30  Aligned_cols=132  Identities=23%  Similarity=0.262  Sum_probs=85.4

Q ss_pred             CCccchhhHHH-HHhCcccccccccccCCCCCcccccCCCCcceeeeccceeEeC------ChHHHHHHHHhCCCEEEEE
Q psy274           33 GCDGLEQPIEY-THQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN------GSETMKKILYKYGPLSVGL  105 (187)
Q Consensus        33 gG~~~~~a~~~-~~~~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~i~------~~~~ik~~l~~~gPV~v~~  105 (187)
                      +|. ...+..| .+..|-+.|.+-||.   ...-.|.... +..-++. .-..++      +...||+++...|-+...|
T Consensus       168 ~g~-~~m~~a~l~e~sgpv~et~d~y~---~~s~~~~~~~-p~~k~~~-~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~  241 (372)
T COG4870         168 GGN-ADMSAAYLTEWSGPVYETDDPYS---ENSYFSPTNL-PVTKHVQ-EAQIIPSRKKYLDNGNIKAMFGFYGAVSSSM  241 (372)
T ss_pred             CCc-cccccccccccCCcchhhcCccc---cccccCCcCC-chhhccc-cceecccchhhhcccchHHHHhhhcccccee
Confidence            455 5555556 566788888888888   4444443211 1111111 111122      6777999999999888777


Q ss_pred             EC--ccccccCCceeeCCCCCCCCCcccEEEEEEEEeec----------CCccEEEEEccCCCCCCCCcEEEEEecCCee
Q psy274          106 NG--HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQ----------DDIPYWLARNSWGPIGPDEGFFKIERGNNAC  173 (187)
Q Consensus       106 ~~--~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~----------~g~~ywivkNSWG~~WG~~Gy~~i~~~~n~c  173 (187)
                      .+  ..+....-+.+..    .+....+|||+||||++.          .|.+.||||||||+.||++|||||++..-.-
T Consensus       242 ~id~~~~~~~~~~~~~~----~s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~ya~~  317 (372)
T COG4870         242 YIDATNSLGICIPYPYV----DSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYYALN  317 (372)
T ss_pred             EEecccccccccCCCCC----CccccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeeeccc
Confidence            66  3333333333433    121468999999999964          2467999999999999999999999976333


Q ss_pred             c
Q psy274          174 G  174 (187)
Q Consensus       174 g  174 (187)
                      |
T Consensus       318 g  318 (372)
T COG4870         318 G  318 (372)
T ss_pred             c
Confidence            4


No 18 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.72  E-value=2.2e-17  Score=141.68  Aligned_cols=83  Identities=19%  Similarity=0.268  Sum_probs=66.5

Q ss_pred             ChHHHH----HHHHhCCCEEEEEECccccccCCceeeCCCC------------------CCCCCcccEEEEEEEEeec-C
Q psy274           87 GSETMK----KILYKYGPLSVGLNGHLIHFYNGTPIKKNDE------------------ICSPNAIGHAVLLVGYGKQ-D  143 (187)
Q Consensus        87 ~~~~ik----~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~------------------~~~~~~~~Hav~iVGyg~~-~  143 (187)
                      ..+.++    +.|..++||.++.++..|+.|++||+...--                  .|.....+|||+||||+.+ +
T Consensus       293 p~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~  372 (437)
T cd00585         293 PMDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDED  372 (437)
T ss_pred             CHHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCC
Confidence            455565    6778888999999997678999999965100                  1333356899999999964 4


Q ss_pred             Cc-cEEEEEccCCCCCCCCcEEEEEec
Q psy274          144 DI-PYWLARNSWGPIGPDEGFFKIERG  169 (187)
Q Consensus       144 g~-~ywivkNSWG~~WG~~Gy~~i~~~  169 (187)
                      |+ .||+|+||||+.||++||++|+++
T Consensus       373 g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         373 GKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             CCcceEEEEcccCCCCCCCcceehhHH
Confidence            76 699999999999999999999986


No 19 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.97  E-value=2.5e-09  Score=92.32  Aligned_cols=79  Identities=20%  Similarity=0.354  Sum_probs=56.2

Q ss_pred             HHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCC--------------------CCcccEEEEEEEEee-cCCc-c
Q psy274           89 ETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICS--------------------PNAIGHAVLLVGYGK-QDDI-P  146 (187)
Q Consensus        89 ~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~--------------------~~~~~Hav~iVGyg~-~~g~-~  146 (187)
                      +.+...|..+-||..+-++..+...+.||.+.  +...                    ....+|||+|+|... ++|+ .
T Consensus       300 ~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~--~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~p~  377 (438)
T PF03051_consen  300 DAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDT--DLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDEDGKPV  377 (438)
T ss_dssp             HHHHHHHHTT--EEEEEETTTTEETTTTEE-T--TSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TTSSEE
T ss_pred             HHHHHHHHcCCcEEEeccCCccccccchhhcc--chhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccCCCee
Confidence            34455677878999999995456667888765  2110                    113589999999995 6776 5


Q ss_pred             EEEEEccCCCCCCCCcEEEEEec
Q psy274          147 YWLARNSWGPIGPDEGFFKIERG  169 (187)
Q Consensus       147 ywivkNSWG~~WG~~Gy~~i~~~  169 (187)
                      +|+|+||||+..|.+||+.++..
T Consensus       378 ~wkVeNSWG~~~g~kGy~~msd~  400 (438)
T PF03051_consen  378 RWKVENSWGTDNGDKGYFYMSDD  400 (438)
T ss_dssp             EEEEE-SBTTTSTBTTEEEEEHH
T ss_pred             EEEEEcCCCCCCCCCcEEEECHH
Confidence            99999999999999999999864


No 20 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=97.56  E-value=6.3e-05  Score=62.43  Aligned_cols=82  Identities=20%  Similarity=0.291  Sum_probs=56.0

Q ss_pred             ChHHHHHH----HHhCCCEEEEEECccccccCCceeeCCC---C----------CC-----CCCcccEEEEEEEEe-ecC
Q psy274           87 GSETMKKI----LYKYGPLSVGLNGHLIHFYNGTPIKKND---E----------IC-----SPNAIGHAVLLVGYG-KQD  143 (187)
Q Consensus        87 ~~~~ik~~----l~~~gPV~v~~~~~~f~~y~~giy~~~~---~----------~~-----~~~~~~Hav~iVGyg-~~~  143 (187)
                      +.+-+|++    +..+-||-.+-++..++.-+.||...+-   +          ..     +.....||++|.|.+ +++
T Consensus       296 ~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~  375 (444)
T COG3579         296 DMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDET  375 (444)
T ss_pred             cHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccC
Confidence            44455443    4455688888777777777777664410   0          00     111247999999999 555


Q ss_pred             Cc-cEEEEEccCCCCCCCCcEEEEEe
Q psy274          144 DI-PYWLARNSWGPIGPDEGFFKIER  168 (187)
Q Consensus       144 g~-~ywivkNSWG~~WG~~Gy~~i~~  168 (187)
                      |. --|.|.||||.+=|.+|||-.+-
T Consensus       376 g~p~rwkVENSWG~d~G~~GyfvaSd  401 (444)
T COG3579         376 GNPLRWKVENSWGKDVGKKGYFVASD  401 (444)
T ss_pred             CCceeeEeecccccccCCCceEeehH
Confidence            44 47999999999999999997763


No 21 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=96.30  E-value=0.0067  Score=43.46  Aligned_cols=55  Identities=24%  Similarity=0.449  Sum_probs=34.1

Q ss_pred             ChHHHHHHHHhCCCEEEEEEC--ccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccC
Q psy274           87 GSETMKKILYKYGPLSVGLNG--HLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSW  154 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~--~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSW  154 (187)
                      +.+.|+++|.++.||++.+..  ...   ....+.       .....|.|+|+||.++.   +++|..+|
T Consensus        88 ~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~  144 (144)
T PF13529_consen   88 SFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW  144 (144)
T ss_dssp             -HHHHHHHHHTT--EEEEEETTSS-----TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred             cHHHHHHHHHCCCcEEEEEEcccccC---CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence            789999999998899999974  122   111121       22479999999999533   78887776


No 22 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=92.91  E-value=0.59  Score=36.85  Aligned_cols=64  Identities=23%  Similarity=0.367  Sum_probs=38.4

Q ss_pred             ChHHHHHHHHhCCCEEEEEECcccccc--CCceeeCCCCCC--C-CCcccEEEEEEEEeecCCccEEEEEc
Q psy274           87 GSETMKKILYKYGPLSVGLNGHLIHFY--NGTPIKKNDEIC--S-PNAIGHAVLLVGYGKQDDIPYWLARN  152 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~~~f~~y--~~giy~~~~~~~--~-~~~~~Hav~iVGyg~~~g~~ywivkN  152 (187)
                      +.++|..+|..++|+++-++..-+.-.  +.-......+.|  . ....+|.|+|+||+...++  +.+||
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~~~--~~yrd  180 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAATKE--FEYRD  180 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCCCe--EEEeC
Confidence            799999999999988887776333200  111111100122  1 2356899999999954432  55554


No 23 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=91.05  E-value=1.6  Score=33.33  Aligned_cols=67  Identities=18%  Similarity=0.322  Sum_probs=43.7

Q ss_pred             ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEee-cCCccEEEEEccCCCCCCCCcEEE
Q psy274           87 GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGK-QDDIPYWLARNSWGPIGPDEGFFK  165 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~-~~g~~ywivkNSWG~~WG~~Gy~~  165 (187)
                      +.+++++.+.++.|+.+....          ...    ......+||++||||-. .+|.++.++-|=|     +++++-
T Consensus        90 s~~eV~~~~~~nk~i~i~~~~----------v~~----~~~~~~gHAlavvGya~~~~g~~~y~~WNPW-----~~~~~~  150 (175)
T PF05543_consen   90 SFDEVKKLIDNNKGIAILADR----------VEQ----TNGPHAGHALAVVGYAKPNNGQKTYYFWNPW-----WNDVMI  150 (175)
T ss_dssp             -HHHHHHHHHTT-EEEEEEEE----------TTS----CTTB--EEEEEEEEEEEETTSEEEEEEE-TT------SS-EE
T ss_pred             CHHHHHHHHHcCCCeEEEecc----------ccc----CCCCccceeEEEEeeeecCCCCeEEEEeCCc-----cCCcEE
Confidence            788999999999999887652          111    12335799999999997 4568999999988     566777


Q ss_pred             EEecCCe
Q psy274          166 IERGNNA  172 (187)
Q Consensus       166 i~~~~n~  172 (187)
                      ++...+.
T Consensus       151 ~sa~s~~  157 (175)
T PF05543_consen  151 QSAKSNI  157 (175)
T ss_dssp             EETT--E
T ss_pred             EecCCCc
Confidence            7665443


No 24 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=90.89  E-value=0.57  Score=38.60  Aligned_cols=55  Identities=16%  Similarity=0.310  Sum_probs=35.6

Q ss_pred             ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEE
Q psy274           87 GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLA  150 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywiv  150 (187)
                      ..+.|++.|.++.||.+.++.-.+. |... |      .......|.|+|+||++++ ..+.++
T Consensus        77 ~~~~l~~~l~~g~pv~~~~D~~~lp-y~~~-~------~~~~~~~H~i~v~G~d~~~-~~~~v~  131 (317)
T PF14399_consen   77 AWEELKEALDAGRPVIVWVDMYYLP-YRPN-Y------YKKHHADHYIVVYGYDEEE-DVFYVS  131 (317)
T ss_pred             HHHHHHHHHhCCCceEEEeccccCC-CCcc-c------cccccCCcEEEEEEEeCCC-CEEEEE
Confidence            5678999999977999997752221 1111 1      1233468999999999654 334444


No 25 
>KOG4128|consensus
Probab=88.15  E-value=0.044  Score=45.84  Aligned_cols=95  Identities=19%  Similarity=0.190  Sum_probs=57.4

Q ss_pred             ChHHHHHHH----HhCCCEEEEEECccccccCCceeeCCC---C--------C---CC-----CCcccEEEEEEEEe-ec
Q psy274           87 GSETMKKIL----YKYGPLSVGLNGHLIHFYNGTPIKKND---E--------I---CS-----PNAIGHAVLLVGYG-KQ  142 (187)
Q Consensus        87 ~~~~ik~~l----~~~gPV~v~~~~~~f~~y~~giy~~~~---~--------~---~~-----~~~~~Hav~iVGyg-~~  142 (187)
                      +.+-|++.+    ..+.||-.+-++..+..-++|+.+-.-   +        .   ..     ...-.||+++.|.+ .+
T Consensus       305 ~~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~kd  384 (457)
T KOG4128|consen  305 SMDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLKD  384 (457)
T ss_pred             CHHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhccccC
Confidence            455555543    345577777666556666666432200   0        0   00     01247999999999 32


Q ss_pred             ---CCccEEEEEccCCCCCCCCcEEEEEecCCeecccceeeEEEEE
Q psy274          143 ---DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIETIAGYATID  185 (187)
Q Consensus       143 ---~g~~ywivkNSWG~~WG~~Gy~~i~~~~n~cgi~~~~~~~~~~  185 (187)
                         .+-.-|.|.||||++-|.+|+.++...    -.+.+++-.+||
T Consensus       385 ~~~g~~~~~rVenswgkd~gkkg~~~mt~e----wf~EY~feiVVd  426 (457)
T KOG4128|consen  385 PATGGLNEHRVENSWGKDLGKKGVNKMTAE----WFREYAFEIVVD  426 (457)
T ss_pred             cccCCchhhhhhchhhhhccccchhhhhHH----HHHhhheeEEee
Confidence               344579999999999999999665432    234555555555


No 26 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.54  E-value=1.5  Score=33.70  Aligned_cols=47  Identities=23%  Similarity=0.314  Sum_probs=35.8

Q ss_pred             ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccCC
Q psy274           87 GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSWG  155 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG  155 (187)
                      ++.+||..|.++.||.+..-.  |..                ..-|+|+|+||+    +.++..-+.||
T Consensus       122 sl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyD----k~n~yynDpyG  168 (195)
T COG4990         122 SLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYD----KYNIYYNDPYG  168 (195)
T ss_pred             cHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeeccc----ccceEeccccc
Confidence            889999999999999876653  211                347999999999    44566667774


No 27 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=77.05  E-value=9.3  Score=27.08  Aligned_cols=45  Identities=22%  Similarity=0.272  Sum_probs=31.1

Q ss_pred             HHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccC
Q psy274           91 MKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSW  154 (187)
Q Consensus        91 ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSW  154 (187)
                      ++++|....||++.+..           ..     .....+|+|+|+||. +  ....++.+.|
T Consensus        70 ~~~~l~~~~Pvi~~~~~-----------~~-----~~~~~gH~vVv~g~~-~--~~~~~i~DP~  114 (141)
T cd02549          70 LLRQLAAGHPVIVSVNL-----------GV-----SITPSGHAMVVIGYD-R--KGNVYVNDPG  114 (141)
T ss_pred             HHHHHHCCCeEEEEEec-----------Cc-----ccCCCCeEEEEEEEc-C--CCCEEEECCC
Confidence            78899998899998774           01     111368999999998 1  1225566665


No 28 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=74.23  E-value=5.7  Score=33.00  Aligned_cols=28  Identities=25%  Similarity=0.451  Sum_probs=24.6

Q ss_pred             cccEEEEEEEEeecC--CccEEEEEccCCC
Q psy274          129 AIGHAVLLVGYGKQD--DIPYWLARNSWGP  156 (187)
Q Consensus       129 ~~~Hav~iVGyg~~~--g~~ywivkNSWG~  156 (187)
                      ..+||-.|++.-+.+  +.+...+||-||.
T Consensus       234 ~~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~  263 (315)
T cd00044         234 VKGHAYSVLDVREVQEEGLRLLRLRNPWGV  263 (315)
T ss_pred             ccCcceEEeEEEEEccCceEEEEecCCccC
Confidence            358999999999766  8899999999994


No 29 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=69.64  E-value=16  Score=27.60  Aligned_cols=39  Identities=26%  Similarity=0.471  Sum_probs=28.6

Q ss_pred             ChHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeec
Q psy274           87 GSETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQ  142 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~  142 (187)
                      +.+.+...|.++||+-++...                + .+....|+++|+|-..+
T Consensus        97 t~e~~~~LL~~yGPLwv~~~~----------------P-~~~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen   97 TAEGLANLLREYGPLWVAWEA----------------P-GDSWVAHASVITGIDGD  135 (166)
T ss_pred             CHHHHHHHHHHcCCeEEEecC----------------C-CCcceeeEEEEEeecCC
Confidence            688999999999999988552                1 22234688888887743


No 30 
>PF01640 Peptidase_C10:  Peptidase C10 family classification.;  InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=50.99  E-value=90  Score=23.91  Aligned_cols=53  Identities=25%  Similarity=0.467  Sum_probs=31.8

Q ss_pred             hHHHHHHHHhCCCEEEEEECccccccCCceeeCCCCCCCCCcccEEEEEEEEeecCCccEEEEEccCCCCCCCCcEEE
Q psy274           88 SETMKKILYKYGPLSVGLNGHLIHFYNGTPIKKNDEICSPNAIGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK  165 (187)
Q Consensus        88 ~~~ik~~l~~~gPV~v~~~~~~f~~y~~giy~~~~~~~~~~~~~Hav~iVGyg~~~g~~ywivkNSWG~~WG~~Gy~~  165 (187)
                      ...|+..|.++.||...-.           - .        ...||.+|=||.   ...||-+-=.||-.  .+||++
T Consensus       140 ~~~i~~el~~~rPV~~~g~-----------~-~--------~~GHawViDGy~---~~~~~H~NwGW~G~--~nGyy~  192 (192)
T PF01640_consen  140 MDMIRNELDNGRPVLYSGN-----------S-K--------SGGHAWVIDGYD---SDGYFHCNWGWGGS--SNGYYR  192 (192)
T ss_dssp             HHHHHHHHHTT--EEEEEE-----------E-T--------TEEEEEEEEEEE---SSSEEEEE-SSTTT--T-EEEE
T ss_pred             HHHHHHHHHcCCCEEEEEe-----------c-C--------CCCeEEEEcCcc---CCCeEEEeeCccCC--CCCccC
Confidence            3578889989889875432           1 1        129999999995   23466555444422  568875


No 31 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=46.26  E-value=19  Score=31.64  Aligned_cols=30  Identities=13%  Similarity=0.171  Sum_probs=26.3

Q ss_pred             CCCCCCCccchhhHHHHHhCccccccccccc
Q psy274           28 CSGCGGCDGLEQPIEYTHQAGLESEKDYPYR   58 (187)
Q Consensus        28 ~~gC~gG~~~~~a~~~~~~~Gi~~e~~yPY~   58 (187)
                      ..-.+||. -..++..+++.|+++.+.||-+
T Consensus       130 ~~~~DGGq-w~m~~~li~KYGvVPk~~~pet  159 (437)
T cd00585         130 NPQNDGGQ-WDMLVNLIEKYGLVPKSVMPES  159 (437)
T ss_pred             CCcCCCCc-hHHHHHHHHHcCCCcccccCCC
Confidence            45678898 8889999999999999999976


No 32 
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=42.67  E-value=60  Score=27.08  Aligned_cols=27  Identities=26%  Similarity=0.400  Sum_probs=22.6

Q ss_pred             ccEEEEEEEEeecCCcc--EEEEEccCCC
Q psy274          130 IGHAVLLVGYGKQDDIP--YWLARNSWGP  156 (187)
Q Consensus       130 ~~Hav~iVGyg~~~g~~--ywivkNSWG~  156 (187)
                      .+||-.|++...-++.+  ...+||-||.
T Consensus       227 ~~HaYsVl~v~~~~~~~~~Ll~lrNPWg~  255 (318)
T smart00230      227 KGHAYSVTDVREVQGRRQELLRLRNPWGQ  255 (318)
T ss_pred             cCccEEEEEEEEEecCCeEEEEEECCCCC
Confidence            58999999998666555  8999999993


No 33 
>KOG4621|consensus
Probab=38.95  E-value=1.1e+02  Score=22.45  Aligned_cols=77  Identities=16%  Similarity=0.274  Sum_probs=45.8

Q ss_pred             ChHHHHHHHHhCCCEEEEEEC-cccc------ccCCceeeCCCCCC--CC-CcccEEEEEEEEeecCCccEEEEEccCCC
Q psy274           87 GSETMKKILYKYGPLSVGLNG-HLIH------FYNGTPIKKNDEIC--SP-NAIGHAVLLVGYGKQDDIPYWLARNSWGP  156 (187)
Q Consensus        87 ~~~~ik~~l~~~gPV~v~~~~-~~f~------~y~~giy~~~~~~~--~~-~~~~Hav~iVGyg~~~g~~ywivkNSWG~  156 (187)
                      ++.+|+.+|+.+--|++.+.- +.+.      -.+++.+.++...|  .. -..+|.|+|-||+..  .+.+.++|-   
T Consensus        58 Si~dIqahLaqGnhiAIaLVdq~~Lhcdlceeplk~ccfspnghhcfcrtp~YqGHfiVi~GYd~a--~~c~~~ndP---  132 (167)
T KOG4621|consen   58 SIHDIQAHLAQGNHIAIALVDQDKLHCDLCEEPLKSCCFSPNGHHCFCRTPCYQGHFIVICGYDAA--RDCFEINDP---  132 (167)
T ss_pred             eHHHHHHHHhcCCeEEEEEecCCceehHHHHhHHHHhccCCCCccccccCCcccccEEEEeccccc--cCeEEEcCc---
Confidence            688999999985577776644 3221      22445565532222  22 246899999999933  334666653   


Q ss_pred             CCCCCcEEEEEe
Q psy274          157 IGPDEGFFKIER  168 (187)
Q Consensus       157 ~WG~~Gy~~i~~  168 (187)
                      .--+-|..||+.
T Consensus       133 A~adpg~c~~Si  144 (167)
T KOG4621|consen  133 ASADPGHCRISI  144 (167)
T ss_pred             ccCCCcceeehh
Confidence            333446666654


No 34 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=33.12  E-value=46  Score=29.30  Aligned_cols=55  Identities=16%  Similarity=0.193  Sum_probs=31.3

Q ss_pred             hhhHHHHhC-CCccCCHHHHh----------------hhcCC------------CCCCCCCccchhhHHHHHhCcccccc
Q psy274            3 EGQYAIKTG-KLVEFSKSQLV----------------ECAKQ------------CSGCGGCDGLEQPIEYTHQAGLESEK   53 (187)
Q Consensus         3 E~~~~i~~~-~~~~lS~q~l~----------------dc~~~------------~~gC~gG~~~~~a~~~~~~~Gi~~e~   53 (187)
                      +..+..+.+ +..+||+.+|.                ++...            ..-.+||. -..+...+++.|+++.+
T Consensus        77 R~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~DGGq-w~~~~nli~KYGvVPk~  155 (438)
T PF03051_consen   77 RHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSDGGQ-WDMVVNLIKKYGVVPKS  155 (438)
T ss_dssp             HHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S-B--HHHHHHHHHHH---BGG
T ss_pred             HHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCCCCc-hHHHHHHHHHcCcCcHh
Confidence            344444444 58999988875                32221            35577888 77777779999999999


Q ss_pred             ccccc
Q psy274           54 DYPYR   58 (187)
Q Consensus        54 ~yPY~   58 (187)
                      .||-+
T Consensus       156 ~mpet  160 (438)
T PF03051_consen  156 VMPET  160 (438)
T ss_dssp             GSTTG
T ss_pred             hCCCC
Confidence            99977


No 35 
>PF07157 DNA_circ_N:  DNA circularisation protein N-terminus;  InterPro: IPR009826 This entry represents the N terminus (approximately 100 residues) of a number of phage DNA circulation proteins.
Probab=24.24  E-value=1.2e+02  Score=20.62  Aligned_cols=46  Identities=17%  Similarity=0.322  Sum_probs=27.5

Q ss_pred             cccccccCCCCCcccccCCCCcceeeeccceeEeC----ChHHHHHHHHhCCCE
Q psy274           52 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN----GSETMKKILYKYGPL  101 (187)
Q Consensus        52 e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~i~----~~~~ik~~l~~~gPV  101 (187)
                      +-.|||.+   ....=+.-...+.++++ .|..=+    ..+.+.++|.+.||=
T Consensus        30 ~heyP~rd---~~~vEDlG~~~r~~~~~-a~~~G~dy~~~~~~L~~al~~~G~G   79 (93)
T PF07157_consen   30 VHEYPYRD---GPWVEDLGRKARRIRVT-AFFVGDDYEAQRDALIAALEAPGPG   79 (93)
T ss_pred             EEecCCCC---CcCeeecCCCCcEEEEE-EEEECCcHHHHHHHHHHHHcCCCCe
Confidence            34789983   23333334455667776 554433    556788888887774


No 36 
>PF00648 Peptidase_C2:  Calpain family cysteine protease This is family C2 in the peptidase classification. ;  InterPro: IPR001300 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only [].  All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit:    A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb.  Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related [].  Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad [].   Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma [].  Calpains are a family of cytosolic cysteine proteinases (see PDOC00126 from PROSITEDOC). Members of the calpain family are believed to function in various biological processes, including integrin-mediated cell migration, cytoskeletal remodeling, cell differentiation and apoptosis [, ]. The calpain family includes numerous members from C. elegans to mammals and with homologues in yeast and bacteria. The best characterised members are the m- and mu-calpains, both proteins are heterodimer composed of a large catalytic subunit and a small regulatory subunit. The large subunit comprises four domains (dI-dIV) while the small subunit has two domains (dV-dVI). Domain dI is a short region cleaved by autolysis, dII is the catalytic core, dIII is a C2-like domain, dIV consists of five calcium binding EF-hand motifs []. The crystal structure of calpain has been solved [, ]. The catalytic region consists of two distinct structural domains (dIIa and dIIb). dIIa contains a central helix flanked on three faces by a cluster of alpha-helices and is entirely unrelated to the corresponding domain in the typical thiol proteinases. The fold of dIIb is similar to the corresponding domain in other cysteine proteinases and contains two three-stranded anti-parallel beta-sheets. The catalytic triad residues (C,H,N) are located in dIIa and dIIb. The activation of the domain is dependent on the binding of two calcium atoms in two non EF-hand calcium binding sites located in the catalytic core, one close to the Cys active site in dIIa and one at the end of dIIb. Calcium-binding induced conformational changes in the catalytic domain which align the active site [][]. The profile covers the whole catalytic domain.; GO: 0004198 calcium-dependent cysteine-type endopeptidase activity, 0006508 proteolysis, 0005622 intracellular; PDB: 2NQA_A 1KFU_L 1KFX_L 1QXP_B 2R9C_A 1TL9_A 2G8E_A 1KXR_B 2G8J_A 2NQG_A ....
Probab=23.35  E-value=69  Score=26.16  Aligned_cols=28  Identities=25%  Similarity=0.417  Sum_probs=20.2

Q ss_pred             cccEEEEEEEEeecCC----ccEEEEEccCCC
Q psy274          129 AIGHAVLLVGYGKQDD----IPYWLARNSWGP  156 (187)
Q Consensus       129 ~~~Hav~iVGyg~~~g----~~ywivkNSWG~  156 (187)
                      ..+||-.|++....++    .+.-.+||-||.
T Consensus       212 ~~~HaY~Vl~~~~~~~~~~~~~lv~LrNPwg~  243 (298)
T PF00648_consen  212 VPGHAYAVLDVREVNGNGEGHRLVKLRNPWGS  243 (298)
T ss_dssp             BTTS-EEEEEEEEEEETTEEEEEEEEE-TTSS
T ss_pred             ccceeEEEEEEEeeccccceeEEEEEcCCCcc
Confidence            4689999999986433    567789999994


No 37 
>smart00276 GLECT Galectin. Galectin - galactose-binding lectin
Probab=21.00  E-value=56  Score=23.15  Aligned_cols=13  Identities=15%  Similarity=-0.212  Sum_probs=10.6

Q ss_pred             EEEEccC-CCCCCC
Q psy274          148 WLARNSW-GPIGPD  160 (187)
Q Consensus       148 wivkNSW-G~~WG~  160 (187)
                      =+|+||+ +-.||.
T Consensus        51 ~iV~Ns~~~g~Wg~   64 (128)
T smart00276       51 KIVCNSKLNGSWGS   64 (128)
T ss_pred             EEEEeCccCCccch
Confidence            6899998 668984


Done!