Query         psy17191
Match_columns 377
No_of_seqs    221 out of 1599
Neff          5.9 
Searched_HMMs 46136
Date          Sat Aug 17 00:11:12 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy17191.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17191hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF03051 Peptidase_C1_2:  Pepti 100.0  2E-103  5E-108  798.4  22.1  316   22-373    96-438 (438)
  2 COG3579 PepC Aminopeptidase C  100.0  1E-102  2E-107  751.9  18.4  320   22-377    98-444 (444)
  3 cd00585 Peptidase_C1B Peptidas 100.0  5E-101  1E-105  780.8  27.0  316   22-373    95-437 (437)
  4 KOG4128|consensus              100.0 1.5E-93 3.2E-98  687.3  13.3  323   22-377   103-457 (457)
  5 KOG1543|consensus               99.8 1.4E-20 3.1E-25  186.4  14.2  165  118-318   137-310 (325)
  6 KOG1542|consensus               99.8 1.1E-20 2.3E-25  184.7  11.2  157  120-316   186-356 (372)
  7 PTZ00203 cathepsin L protease;  99.8 4.5E-20 9.8E-25  184.3  14.5  160  119-316   154-325 (348)
  8 cd02620 Peptidase_C1A_Cathepsi  99.8   1E-18 2.2E-23  165.4  14.3  159  122-316    35-223 (236)
  9 cd02698 Peptidase_C1A_Cathepsi  99.8 1.3E-18 2.8E-23  165.0  14.6  149  136-322    55-224 (239)
 10 PTZ00021 falcipain-2; Provisio  99.8   1E-18 2.2E-23  180.6  14.9  160  119-316   294-469 (489)
 11 cd02248 Peptidase_C1A Peptidas  99.8 2.6E-18 5.6E-23  157.8  15.3  157  126-316    35-197 (210)
 12 PTZ00200 cysteine proteinase;   99.8 2.2E-18 4.7E-23  177.1  14.1  157  119-316   263-427 (448)
 13 cd02621 Peptidase_C1A_Cathepsi  99.8 5.6E-18 1.2E-22  160.7  14.0  156  134-316    54-227 (243)
 14 cd02619 Peptidase_C1 C1 Peptid  99.7 3.3E-17 7.1E-22  150.2  15.1  179  120-326    26-222 (223)
 15 PTZ00462 Serine-repeat antigen  99.7 2.1E-16 4.5E-21  172.3  14.7  165  118-315   559-764 (1004)
 16 PF00112 Peptidase_C1:  Papain   99.7 7.1E-17 1.5E-21  147.9   9.3  162  122-318    33-206 (219)
 17 PTZ00364 dipeptidyl-peptidase   99.7 2.7E-16 5.9E-21  164.5  13.9  167  133-316   259-444 (548)
 18 PTZ00049 cathepsin C-like prot  99.7 8.5E-16 1.9E-20  163.2  13.9   97  215-316   560-661 (693)
 19 smart00645 Pept_C1 Papain fami  99.6 1.8E-15 3.9E-20  137.1  12.0  127  119-316    29-159 (174)
 20 COG3579 PepC Aminopeptidase C   99.3 4.7E-13   1E-17  131.0   2.7   74    1-92    348-424 (444)
 21 PF03051 Peptidase_C1_2:  Pepti  99.1 1.2E-11 2.5E-16  127.1   1.1   78    1-96    346-426 (438)
 22 cd00585 Peptidase_C1B Peptidas  99.0 2.1E-10 4.4E-15  118.0   2.5   77    1-95    345-424 (437)
 23 KOG1544|consensus               98.8 7.1E-09 1.5E-13  101.5   6.5   89  212-316   354-445 (470)
 24 COG4870 Cysteine protease [Pos  98.7   1E-08 2.2E-13  101.9   4.3   47  273-321   263-318 (372)
 25 KOG4128|consensus               98.6 2.2E-08 4.7E-13   98.4   2.9   70    1-86    358-431 (457)
 26 PF14399 Transpep_BrtH:  NlpC/p  77.8     7.5 0.00016   37.8   7.2   61  207-299    73-133 (317)
 27 PF13529 Peptidase_C39_2:  Pept  77.2     7.3 0.00016   32.2   6.0   19  213-231    90-108 (144)
 28 PF05543 Peptidase_C47:  Stapho  70.8      21 0.00045   32.9   7.6   30  273-304   118-147 (175)
 29 PF09778 Guanylate_cyc_2:  Guan  65.6      11 0.00025   35.7   5.0  116  149-300    63-181 (212)
 30 cd00044 CysPc Calpains, domain  60.5      52  0.0011   32.4   9.0   51  274-326   235-312 (315)
 31 PF12385 Peptidase_C70:  Papain  48.4      28 0.00061   31.7   4.3   19  213-231    99-117 (166)
 32 COG4990 Uncharacterized protei  47.2      26 0.00057   32.6   4.0   24  275-305   148-171 (195)
 33 cd02549 Peptidase_C39A A sub-f  45.2   1E+02  0.0022   25.7   7.2   59  215-320    70-128 (141)
 34 COG5572 Predicted integral mem  42.8     8.3 0.00018   31.8   0.1   18    5-22      3-20  (104)
 35 PF09713 A_thal_3526:  Plant pr  35.8      25 0.00054   26.3   1.7   30    1-32     13-44  (54)
 36 PF01640 Peptidase_C10:  Peptid  35.5 1.5E+02  0.0033   27.1   7.2   59  206-312   134-192 (192)
 37 PF09028 Mac-1:  Mac 1;  InterP  33.2 1.5E+02  0.0032   30.1   7.0   47  206-290   227-273 (333)
 38 PF04046 PSP:  PSP;  InterPro:   27.0      38 0.00082   24.8   1.3   22  348-369     2-23  (48)
 39 cd02620 Peptidase_C1A_Cathepsi  26.4      18  0.0004   34.1  -0.5   39   14-71    184-222 (236)
 40 smart00581 PSP proline-rich do  25.9      42 0.00091   25.2   1.4   23  347-369     5-27  (54)
 41 PF07910 Peptidase_C78:  Peptid  25.5      72  0.0016   30.3   3.3   22  274-297   155-176 (218)
 42 PTZ00200 cysteine proteinase;   24.3      16 0.00035   38.4  -1.4   57   13-86    385-443 (448)
 43 PF00648 Peptidase_C2:  Calpain  23.7 4.8E+02    0.01   25.2   8.8   30  274-303   213-243 (298)
 44 smart00230 CysPc Calpain-like   23.2 4.8E+02    0.01   25.8   8.8   54  274-328   227-307 (318)
 45 COG3423 Nlp Predicted transcri  23.2      71  0.0015   25.7   2.3   44   96-139    15-64  (82)
 46 PTZ00203 cathepsin L protease;  20.9      27 0.00058   35.4  -0.6   40   13-71    285-324 (348)

No 1  
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=100.00  E-value=2.1e-103  Score=798.44  Aligned_cols=316  Identities=38%  Similarity=0.695  Sum_probs=272.7

Q ss_pred             hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191         22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY   94 (377)
Q Consensus        22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L   94 (377)
                      ++|||||||||+||++||+|++  +|+|+|+++++++.|++|||+|+|+.++|+||||||+++||       |+.||.+|
T Consensus        96 l~F~DKlEKaN~fLe~ii~~~~--~~~d~R~v~~ll~~~~~DGGqw~~~~nli~KYGvVPk~~mpet~~s~~t~~~n~~l  173 (438)
T PF03051_consen   96 LFFWDKLEKANYFLENIIDTAD--EPLDDRLVRFLLKNPVSDGGQWDMVVNLIKKYGVVPKSVMPETFSSSNTSEMNEML  173 (438)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCCT--S-TTSHHHHHHHHSTT-S-B-HHHHHHHHHHH---BGGGSTTGCGCHBHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHhc--CCcchHHHHHHHhcCCCCCCchHHHHHHHHHcCcCcHhhCCCCCCCCChHHHHHHH
Confidence            6999999999999999999986  99999999999999999999999999999999999999999       89999999


Q ss_pred             hhhHHHHHHHHHH--hhccChhhh---hHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191         95 NNQPVELLMKLAA--ESIKNNEAV---WFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ  169 (377)
Q Consensus        95 ~~~Lr~~a~~lr~--~~~~~~~~l---~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T  169 (377)
                      +++||++|++||+  ++|++.++|   .++||++|||||++|||+||++|+|+|+||+++||+            .++||
T Consensus       174 ~~~Lr~~a~~LR~~~~~~~~~~~l~~~k~~~l~~iy~il~~~lG~PP~~F~~ey~dkd~~~~~------------~~~~T  241 (438)
T PF03051_consen  174 NTKLREYALELRKLVKAGKSEEELRKLKEEMLAEIYRILAIYLGEPPEKFTWEYRDKDKKYHR------------GKNYT  241 (438)
T ss_dssp             HHHHHHHHHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHHHH---SSSEEEEEE-TTS-EEE------------EEEE-
T ss_pred             HHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHcCCCChheeEEEecccccccc------------ccccC
Confidence            9999999999999  566654444   499999999999999999999999999999999999            99999


Q ss_pred             cHHHHHHhhc-----------c---cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191        170 PVELLMKLAA-----------E---SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR  234 (377)
Q Consensus       170 P~eF~~k~v~-----------D---~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~  234 (377)
                      |++||++||+           |   +|||++.|++++++||+ |+++.|+|||+++|+++++++|++|+||||||||+++
T Consensus       242 P~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~ve~~~Nv~~g~~~~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~  321 (438)
T PF03051_consen  242 PLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYTVEYLGNVVGGRPVRYLNVPIDELKDAAIKSLKAGYPVWFGCDVGKF  321 (438)
T ss_dssp             HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEEETTTTSSTT-EEEEEEE--HHHHHHHHHHHHHTT--EEEEEETTTT
T ss_pred             chhHHHHHhCCCCcceEEEeeCCCccCccceeEEEccCCCEECCcceeEeccCHHHHHHHHHHHHHcCCcEEEeccCCcc
Confidence            9999999997           2   79999999999999999 9999999999999999999999999999999999999


Q ss_pred             CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191        235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT  314 (377)
Q Consensus       235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms  314 (377)
                      ++++.||+|.+++|++.+||+++.  |+|++|+++++|.+||||+|||+++|++|+ |.+|+||||||++.|++|||+||
T Consensus       322 ~~~k~Gi~D~~~~d~~~~fg~~~~--~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~-p~~wkVeNSWG~~~g~kGy~~ms  398 (438)
T PF03051_consen  322 FDRKNGIMDTDLYDYDSLFGVDFN--MSKAERLDYGESTMTHAMVITGVDLDEDGK-PVRWKVENSWGTDNGDKGYFYMS  398 (438)
T ss_dssp             EETTTTEE-TTSB-HHHHHT--S---S-HHHHHHTTSS--EEEEEEEEEEE-TTSS-EEEEEEE-SBTTTSTBTTEEEEE
T ss_pred             ccccchhhccchhhhhhhhccccc--cCHHHHHHhCCCCCceeEEEEEEEeccCCC-eeEEEEEcCCCCCCCCCcEEEEC
Confidence            999999999999999999999976  999999999999999999999999999999 89999999999999999999999


Q ss_pred             hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccC
Q psy17191        315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG  373 (377)
Q Consensus       315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~  373 (377)
                      ++||++|||+|||||                   +|||++++++|++|||+||||||||
T Consensus       399 d~wf~e~v~~vvV~K-------------------~~lp~~~~~~~~~~p~~l~~wdpmg  438 (438)
T PF03051_consen  399 DDWFDEYVYQVVVHK-------------------KYLPKELLKKLEKEPIVLPPWDPMG  438 (438)
T ss_dssp             HHHHHHHEEEEEEEG-------------------GGS-HHHHHHHHS--EEE-TTSCCC
T ss_pred             HHHHHhhhheEEech-------------------hhCCHHHHHHhcCCCeECCCCCCCC
Confidence            999999999999999                   9999999999999999999999998


No 2  
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=100.00  E-value=1.1e-102  Score=751.94  Aligned_cols=320  Identities=32%  Similarity=0.578  Sum_probs=309.7

Q ss_pred             hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191         22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY   94 (377)
Q Consensus        22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L   94 (377)
                      +||||||||||||||+||+||+  +++++|++.+++..|..|||||+|+.++|+|||+|||++||       |+++|.+|
T Consensus        98 tfFwDKlEKaN~FleqIi~tad--q~ldsRlv~~LL~~PqqDGGQwdM~v~l~eKYGvVpK~~ypes~sSS~Sr~ln~~L  175 (444)
T COG3579          98 TFFWDKLEKANWFLEQIIETAD--QELDSRLVSFLLATPQQDGGQWDMFVSLFEKYGVVPKSVYPESFSSSNSRELNALL  175 (444)
T ss_pred             HHHHHHHHHhhHHHHHHHhhcc--cchHHHHHHHHHcCccccCchHHHHHHHHHHhCCCchhhcccccccccHHHHHHHH
Confidence            6999999999999999999997  99999999999999999999999999999999999999999       88999999


Q ss_pred             hhhHHHHHHHHHH--hhccC---hhhhhHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191         95 NNQPVELLMKLAA--ESIKN---NEAVWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ  169 (377)
Q Consensus        95 ~~~Lr~~a~~lr~--~~~~~---~~~l~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T  169 (377)
                      |++||+.|.-||.  .+|++   .+++.++++.+||++|+..||.||+.|+|+||||+++||.            +..+|
T Consensus       176 n~~LR~dAqiLR~a~~eg~~~~~v~~~kEe~l~eif~~l~~~lg~PP~~Fdf~YrdKd~~~h~------------~k~lT  243 (444)
T COG3579         176 NKLLRQDAQILRDALKEGADDDTVEALKEELLQEIFNFLAMTLGLPPEKFDFAYRDKDNKYHK------------EKGLT  243 (444)
T ss_pred             HHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHcCCCchhcceEEeccccchhh------------hcCCC
Confidence            9999999999999  66665   3456699999999999999999999999999999999999            99999


Q ss_pred             cHHHHHHhhc-c-------------cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191        170 PVELLMKLAA-E-------------SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR  234 (377)
Q Consensus       170 P~eF~~k~v~-D-------------~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~  234 (377)
                      |++||++||+ |             ++||++.|++++.+||+ |+.+.|+|||++.|+.++++++++|+||||||||++.
T Consensus       244 P~eFy~kyv~ldl~~yVslInaPtadkPygk~ytV~~LGnVvgg~~v~ylNv~me~lkkl~~~q~qagetVwFG~dvgq~  323 (444)
T COG3579         244 PQEFYKKYVGLDLKDYVSLINAPTADKPYGKSYTVEFLGNVVGGRAVKYLNVDMERLKKLAIKQMQAGETVWFGCDVGQL  323 (444)
T ss_pred             HHHHHHHhcCCCcccceeeccCCcCCCCCcceeehhhhccccCCceeEEecCcHHHHHHHHHHHHhcCCcEEeecCchhh
Confidence            9999999998 1             69999999999999999 9999999999999999999999999999999999999


Q ss_pred             CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191        235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT  314 (377)
Q Consensus       235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms  314 (377)
                      ++++.||++.+.++|++.+|+.+.  ++||+|++|+||.|+|||+|||++.|++|+ |.+|+||||||++.|.+|||.||
T Consensus       324 s~rk~Gimdtd~~~~~s~~g~~~~--q~KA~RldY~eSLmTHAMvlTGvd~d~~g~-p~rwkVENSWG~d~G~~GyfvaS  400 (444)
T COG3579         324 SDRKTGIMDTDIYDYESSLGINLT--QDKAGRLDYGESLMTHAMVLTGVDLDETGN-PLRWKVENSWGKDVGKKGYFVAS  400 (444)
T ss_pred             cccccceeeehhccchhhhCCCcc--cchhhccccchHHHHHHHHhhccccccCCC-ceeeEeecccccccCCCceEeeh
Confidence            999999999999999999999976  999999999999999999999999999998 99999999999999999999999


Q ss_pred             hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccCccCC
Q psy17191        315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGTLAQ  377 (377)
Q Consensus       315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~~l~~  377 (377)
                      +.||+||+|||||+|                   +|||+|++++++.|||+|+||||||+||+
T Consensus       401 d~wmdEytyQIvV~k-------------------~~l~~e~~~a~~~epivL~pWDPMGALA~  444 (444)
T COG3579         401 DAWMDEYTYQIVVDK-------------------KFLPKEELAAYEEEPIVLAPWDPMGALAK  444 (444)
T ss_pred             HhHhhhheeEEEEeh-------------------hhCCHHHHHhhcCCCeecCCCCccccccC
Confidence            999999999999999                   99999999999999999999999999985


No 3  
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=100.00  E-value=5.1e-101  Score=780.85  Aligned_cols=316  Identities=40%  Similarity=0.709  Sum_probs=303.7

Q ss_pred             hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191         22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY   94 (377)
Q Consensus        22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L   94 (377)
                      ++|||||||||+|||+||+|++  +|+++|++++++..|++|||+|+|++++|+||||||+++||       |++||.+|
T Consensus        95 l~f~dklEkaN~fle~ii~~~~--~~~~~R~v~~ll~~~~~DGGqw~m~~~li~KYGvVPk~~~pet~~s~~t~~~n~~L  172 (437)
T cd00585          95 LFFWDKLEKANYFLENIIETAD--EPLDDRLVQFLLANPQNDGGQWDMLVNLIEKYGLVPKSVMPESFNSENSRRLNYLL  172 (437)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhc--CCCccHHHHHHHhCCcCCCCchHHHHHHHHHcCCCcccccCCCcCccchHHHHHHH
Confidence            6999999999999999999996  99999999999999999999999999999999999999999       88999999


Q ss_pred             hhhHHHHHHHHHH--hhccChhhh---hHHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191         95 NNQPVELLMKLAA--ESIKNNEAV---WFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ  169 (377)
Q Consensus        95 ~~~Lr~~a~~lr~--~~~~~~~~l---~~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T  169 (377)
                      +++||++|++||+  +++++.++|   .++|+++||+||++|||+||++|+|+|+||+++||+            .+++|
T Consensus       173 ~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~~~lG~pP~~F~~~y~dkd~~~~~------------~~~~T  240 (437)
T cd00585         173 NRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILAIALGEPPEKFDWEYRDKDKKYHE------------IKELT  240 (437)
T ss_pred             HHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHcCCCCceEEEEEEeCCCCeee------------CCCcC
Confidence            9999999999999  555555556   499999999999999999999999999999999999            88999


Q ss_pred             cHHHHHHhhc-----------c-c--CCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCC
Q psy17191        170 PVELLMKLAA-----------E-S--IKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKR  234 (377)
Q Consensus       170 P~eF~~k~v~-----------D-~--~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~  234 (377)
                      |++||++||+           | +  |||++.|++++++||+ |+++.|+|||+++|+++++++|++|+||||||||+++
T Consensus       241 P~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~~~  320 (437)
T cd00585         241 PLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNVPMDVLKKAAIAQLKDGEPVWFGCDVGKF  320 (437)
T ss_pred             HHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEecCHHHHHHHHHHHHhcCCCEEEEEEcChh
Confidence            9999999985           2 5  9999999999999999 9999999999999999999999999999999999988


Q ss_pred             CcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEe
Q psy17191        235 FANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMT  314 (377)
Q Consensus       235 f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms  314 (377)
                      +++++||++..+++++.+||+++.  |||++|+++++|.+||||+||||+.|++|+ |.||+||||||++||++||++||
T Consensus       321 ~~~k~GI~d~~~~~~~~~f~~~~~--~~KaeRl~~~es~~tHAM~ivGv~~D~~g~-p~yw~VkNSWG~~~G~~Gy~~ms  397 (437)
T cd00585         321 SDRKSGILDTDLFDYELLFGIDFG--LNKAERLDYGESLMTHAMVLTGVDLDEDGK-PVKWKVENSWGEKVGKKGYFVMS  397 (437)
T ss_pred             hccCCccccCcccchhhhcCcccc--CCHHHHHhhcCCcCCeEEEEEEEEecCCCC-cceEEEEcccCCCCCCCcceehh
Confidence            899999999999999999999976  999999999999999999999999998898 88999999999999999999999


Q ss_pred             hhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhcCCCeecCCCCccC
Q psy17191        315 SPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG  373 (377)
Q Consensus       315 ~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~wdpm~  373 (377)
                      ++||++|||+|||||                   +|||+++++++++|||+||||||||
T Consensus       398 ~~wf~~y~~~ivV~k-------------------k~l~~~~~~~l~~~p~~L~~WDpmg  437 (437)
T cd00585         398 DDWFDEYVYQVVVDK-------------------KYLPEEVLDLLKQEPIVLPPWDPMG  437 (437)
T ss_pred             HHHHHhhheEEEeec-------------------ccCCHHHHHHhcCCCEECCCCCCCC
Confidence            999999999999999                   9999999999999999999999998


No 4  
>KOG4128|consensus
Probab=100.00  E-value=1.5e-93  Score=687.25  Aligned_cols=323  Identities=38%  Similarity=0.639  Sum_probs=306.8

Q ss_pred             hhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcceeeCcccc-------hHHHHHHH
Q psy17191         22 AVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHGLMPKNCLV-------WIRIRIIY   94 (377)
Q Consensus        22 ~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYGvVPk~~~p-------t~~m~~~L   94 (377)
                      +|||||||||||||.+|+++|+|.+|+|+|+|++++.+|..|||+|.|++|+++||||+||++|+       |+.||.+|
T Consensus       103 LFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~~DGGqw~MfvNlVkKYGviPKkcy~~sysT~atrkmN~iL  182 (457)
T KOG4128|consen  103 LFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPVPDGGQWQMFVNLVKKYGVIPKKCYLHSYSTQATRKMNLIL  182 (457)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCCCCCchHHHHHHHHHHhCCCcHHhccccchhhHHHHHHHHH
Confidence            79999999999999999999999999999999999999999999999999999999999999999       78999999


Q ss_pred             hhhHHHHHHHHHH--hhccChhhhh---HHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCC
Q psy17191         95 NNQPVELLMKLAA--ESIKNNEAVW---FGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQ  169 (377)
Q Consensus        95 ~~~Lr~~a~~lr~--~~~~~~~~l~---~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~T  169 (377)
                      ++|||++|..||+  ..++...+|.   .+|+++|+|+++||||+||+.|||+|+||+|+|++            +...|
T Consensus       183 ~~KlREfa~~Lr~~~~~~~~~~~i~~t~~emm~eiFrviciclg~PPe~FTWey~DkdK~yq~------------i~~~T  250 (457)
T KOG4128|consen  183 KSKLREFASMLRAQFTFNGNGCRIPDTIQEMMPEIFRVICICLGEPPEVFTWEYYDKDKRYQC------------INDMT  250 (457)
T ss_pred             HHHHHHHHHHHHHHHhcCCCCCcchHHHHHHHHHHHHHHhhhcCCCcceeeEEEeecccceee------------cCCCc
Confidence            9999999999999  4443333444   89999999999999999999999999999999999            99999


Q ss_pred             cHHHHHHhhc-------------c---cCCCCcceeeccCCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC
Q psy17191        170 PVELLMKLAA-------------E---SIKNNEAVWFGCENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS  232 (377)
Q Consensus       170 P~eF~~k~v~-------------D---~~py~~~y~~~~~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~  232 (377)
                      |+|||+++++             |   +|||++.|.++..+|+. |++..|+|+|+|.+++++.++|+++.|||||||+.
T Consensus       251 pleFy~~~vk~~~n~~~~vcL~nDPR~~h~Y~kly~v~~lsNm~GG~~~~YnNq~~d~l~k~vv~sl~~~kaVwfgcd~~  330 (457)
T KOG4128|consen  251 PLEFYEVMVKCTVNLDAFVCLGNDPRISHSYHKLYQVANLSNMMGGKPHRYNNQSMDILMKIVVTSLEGDKAVWFGCDIR  330 (457)
T ss_pred             hHHHHHHHhccccCCCcceeecCCCCccccccceeeehhhhcccCCCceeecCCCHHHHHHHHHHHhcCCcceEEecccH
Confidence            9999999987             3   59999999999999999 99999999999999999999999999999999999


Q ss_pred             CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe-c-CCCCCCCeEEEEcccCCCCCCCeE
Q psy17191        233 KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI-D-KETEEPTKWRVENSWGEEQNHKGY  310 (377)
Q Consensus       233 ~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~-d-~~G~~~~yW~VkNSWG~~~G~kGY  310 (377)
                      +.++.|+|++|..+++++.+||.++. .++||||++|++|.+||||+|||++. | ++|- +..|+||||||.+.|.+||
T Consensus       331 k~~~~K~G~~dl~l~~~~l~fG~~l~-~~~KAeRl~y~eSlmthAml~T~v~~kd~~~g~-~~~~rVenswgkd~gkkg~  408 (457)
T KOG4128|consen  331 KAISLKSGPLDLRLHQFDLLFGFKLG-ESTKAERLDYRESLMTHAMLLTSVGLKDPATGG-LNEHRVENSWGKDLGKKGV  408 (457)
T ss_pred             hhhhcccCccchhhccCceeeeeecc-ccchhhhhhHHHHHHHHHHHhhhccccCcccCC-chhhhhhchhhhhccccch
Confidence            99999999999999999999998875 78999999999999999999999994 3 3566 7999999999999999999


Q ss_pred             EEEehhhhhcccEEEEEeCCCCCcccccchhhhhccccCCCCHHHHhhhc-CCCeecCCCCccCccCC
Q psy17191        311 ILMTSPWFKEYVFEVVVDKKYVPASVLDEYVFEVVVDKKYVPASVLDVFN-QEPTILPAWDPMGTLAQ  377 (377)
Q Consensus       311 ~~ms~~wf~~y~~~vvV~K~a~p~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~l~~wdpm~~l~~  377 (377)
                      ..|+++||+||+|+||||.                   ++||+|++.++. +|||+||+||||||||.
T Consensus       409 ~~mt~ewf~EY~feiVVd~-------------------k~vP~evl~~~~~~epi~LPaWDPMGALa~  457 (457)
T KOG4128|consen  409 NKMTAEWFREYAFEIVVDE-------------------KFVPSEVLHAARIHEPIELPAWDPMGALAN  457 (457)
T ss_pred             hhhhHHHHHhhheeEEeec-------------------ccCcHHHHHHhhhcCCeeccccCchhhhcC
Confidence            9999999999999999999                   999999999998 99999999999999984


No 5  
>KOG1543|consensus
Probab=99.84  E-value=1.4e-20  Score=186.38  Aligned_cols=165  Identities=17%  Similarity=0.179  Sum_probs=121.7

Q ss_pred             HHHHHHHHhHHHHhcC-CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHhhc----ccCCCCcce-eec
Q psy17191        118 FGCENRIIRIRIIYNN-QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKLAA----ESIKNNEAV-WFG  191 (377)
Q Consensus       118 ~~~l~~v~~il~i~lG-~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~----D~~py~~~y-~~~  191 (377)
                      -.....|..+.+|..| ..+.....++.||......   ||++|.     ...-.+|.+++-.    +.+||.-.. ...
T Consensus       137 Faa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~---GC~GG~-----~~~A~~yi~~~G~~t~~~~Ypy~~~~~~C~  208 (325)
T KOG1543|consen  137 FAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGD---GCNGGE-----PKNAFKYIKKNGGVTECENYPYIGKDGTCK  208 (325)
T ss_pred             HHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCC---CcCCCC-----HHHHHHHHHHhCCCCCCcCCCCcCCCCCcc
Confidence            4566778888888999 8999999999999886444   999986     2233344444322    268873222 222


Q ss_pred             cCC-ceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191        192 CEN-RIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY  269 (377)
Q Consensus       192 ~~~-Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~  269 (377)
                      ... +-......+.++|.++  ..++.++...+||.+++|++. +..|++||+..+.++                     
T Consensus       209 ~~~~~~~~~~~~~~~~~~~e--~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~~~~---------------------  265 (325)
T KOG1543|consen  209 SNKKDKTVTIKGFYNVPANE--EAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEEKGD---------------------  265 (325)
T ss_pred             CCCccceeEeeeeeecCcCH--HHHHHHHHhcCCeEEEEeehhhhhhccCceEeCCCCC---------------------
Confidence            111 2224445777899883  556667777779999999996 447999999887642                     


Q ss_pred             CCC-CCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhh
Q psy17191        270 GES-SMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWF  318 (377)
Q Consensus       270 ~es-~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf  318 (377)
                        + ..+|||+||||+. .+|.  +||+|+||||++||++|||+|.++=-
T Consensus       266 --~~~~~Hav~iVGyG~-~~~~--~YWivkNSWG~~WGe~Gy~ri~r~~~  310 (325)
T KOG1543|consen  266 --DKEGDHAVLIVGYGT-GDGV--DYWIVKNSWGTDWGEKGYFRIARGVN  310 (325)
T ss_pred             --CCCCCceEEEEEEcC-CCCc--eeEEEEcCCCCCcccCceEEEecCCC
Confidence              2 4799999999999 6787  99999999999999999999988643


No 6  
>KOG1542|consensus
Probab=99.84  E-value=1.1e-20  Score=184.70  Aligned_cols=157  Identities=14%  Similarity=0.137  Sum_probs=119.0

Q ss_pred             HHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHhhc--------ccCCC---Cc-c
Q psy17191        120 CENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKLAA--------ESIKN---NE-A  187 (377)
Q Consensus       120 ~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~--------D~~py---~~-~  187 (377)
                      .-..|...-.|..|+.....++|+.|||..=    =|||||.       ++ . +-+|+.        +.|||   .+ .
T Consensus       186 ~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d----~gC~GGl-------~~-n-A~~~~~~~gGL~~E~dYPY~g~~~~~  252 (372)
T KOG1542|consen  186 TTGAVEGAWAIATGKLVSLSEQELVDCDSCD----NGCNGGL-------MD-N-AFKYIKKAGGLEKEKDYPYTGKKGNQ  252 (372)
T ss_pred             hhhhhhhHHHhhcCcccccchhhhhcccCcC----CcCCCCC-------hh-H-HHHHHHHhCCccccccCCccccCCCc
Confidence            4567888889999999999999999999642    2999996       32 2 224432        38999   23 6


Q ss_pred             eeeccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeec--ccccccccccCCCCCcCCHHH
Q psy17191        188 VWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDL--EIHNFKAVFDSDVSLPMSKAE  265 (377)
Q Consensus       188 y~~~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~--~~~d~~~~fg~~~~~~~~kae  265 (377)
                      |.+....++ ..+..+.-+|.++  +.|++.|.+.+||.+|+++.....|..||..+  ..|                  
T Consensus       253 C~~~~~~~~-v~I~~f~~l~~nE--~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~C------------------  311 (372)
T KOG1542|consen  253 CHFDKSKIV-VSIKDFSMLSNNE--DQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYIC------------------  311 (372)
T ss_pred             cccchhhce-EEEeccEecCCCH--HHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccC------------------
Confidence            666554444 4555787888877  44666777778999999976777899999865  232                  


Q ss_pred             HhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        266 RMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       266 R~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                          .....|||++||||+...-++  +|||||||||+.||++|||++.++
T Consensus       312 ----s~~~~~HaVLlvGyG~~g~~~--PYWIVKNSWG~~WGE~GY~~l~RG  356 (372)
T KOG1542|consen  312 ----SPKLLNHAVLLVGYGSSGYEK--PYWIVKNSWGTSWGEKGYYKLCRG  356 (372)
T ss_pred             ----CccccCceEEEEeecCCCCCC--ceEEEECCccccccccceEEEecc
Confidence                122389999999999852245  899999999999999999999875


No 7  
>PTZ00203 cathepsin L protease; Provisional
Probab=99.83  E-value=4.5e-20  Score=184.28  Aligned_cols=160  Identities=16%  Similarity=0.191  Sum_probs=111.5

Q ss_pred             HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh----hc--ccCCCC-ccee-e
Q psy17191        119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL----AA--ESIKNN-EAVW-F  190 (377)
Q Consensus       119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~----v~--D~~py~-~~y~-~  190 (377)
                      +....|...+.+.-|..+..+.+++.||+.. .   -||++|.     .....+|.++.    +.  +.+||- +.-. -
T Consensus       154 a~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~-~---~GC~GG~-----~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~  224 (348)
T PTZ00203        154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHV-D---NGCGGGL-----MLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVP  224 (348)
T ss_pred             hhHHHHHHHHHHhcCCCccCCHHHHHhccCC-C---CCCCCCC-----HHHHHHHHHHhcCCCCCccccCCCccCCCCCC
Confidence            3445566666777899889999999999753 2   3999996     12234455432    22  378983 1110 0


Q ss_pred             ccC--Cce-E-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHH
Q psy17191        191 GCE--NRI-I-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAER  266 (377)
Q Consensus       191 ~~~--~Nv-~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR  266 (377)
                      .|.  ... . .....|..++.++  +.++.+|..++||++++|+..+..|++||++.  |.                  
T Consensus       225 ~C~~~~~~~~~~~i~~~~~i~~~e--~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy~~--c~------------------  282 (348)
T PTZ00203        225 ECSNSSELAPGARIDGYVSMESSE--RVMAAWLAKNGPISIAVDASSFMSYHSGVLTS--CI------------------  282 (348)
T ss_pred             cCCCCcccccceEecceeecCcCH--HHHHHHHHhCCCEEEEEEhhhhcCccCceeec--cC------------------
Confidence            111  111 2 3344676676543  45666777778999999997666899999863  31                  


Q ss_pred             hhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        267 MMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       267 ~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                          .+..||||+||||+.+ +|.  +|||||||||++||++|||+|+++
T Consensus       283 ----~~~~nHaVliVGYG~~-~g~--~YWiikNSWG~~WGe~GY~ri~rg  325 (348)
T PTZ00203        283 ----GEQLNHGVLLVGYNMT-GEV--PYWVIKNSWGEDWGEKGYVRVTMG  325 (348)
T ss_pred             ----CCCCCeEEEEEEEecC-CCc--eEEEEEcCCCCCcCcCceEEEEcC
Confidence                1246999999999975 787  899999999999999999999875


No 8  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.79  E-value=1e-18  Score=165.36  Aligned_cols=159  Identities=16%  Similarity=0.150  Sum_probs=102.2

Q ss_pred             HHHHhHHHHhcC--CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCc-ceee-----
Q psy17191        122 NRIIRIRIIYNN--QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNE-AVWF-----  190 (377)
Q Consensus       122 ~~v~~il~i~lG--~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~-~y~~-----  190 (377)
                      ..+...+.+..+  ..+..+.+++.||...-   ..||++|.     ....+++.++. +.  +.+||.. ....     
T Consensus        35 ~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~---~~gC~GG~-----~~~a~~~i~~~G~~~e~~yPY~~~~~~~~~~~~  106 (236)
T cd02620          35 EAFSDRLCIQSNGKENVLLSAQDLLSCCSGC---GDGCNGGY-----PDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGP  106 (236)
T ss_pred             HHHhhHHHHhcCCCCccccCHHHHHhhcCCC---CCCCCCCC-----HHHHHHHHHhcCCCcCCEecCcCCCCccCCCCC
Confidence            344455555556  66778899999997631   23999886     12334444332 11  3678832 1110     


Q ss_pred             -----------ccCCce----E---EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccc
Q psy17191        191 -----------GCENRI----I---RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKA  251 (377)
Q Consensus       191 -----------~~~~Nv----~---g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~  251 (377)
                                 .+....    .   .....+..++.++  ..++.+|.+.+||.++++++ .++.|++||++.+.     
T Consensus       107 ~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~--~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~~-----  179 (236)
T cd02620         107 PPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVPSDE--TDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTS-----  179 (236)
T ss_pred             CCCCCCCCCCCCCCcCCccccceeeeeecceeeeCCHH--HHHHHHHHHCCCeEEEEEechhhhhcCCcEEeecC-----
Confidence                       011111    1   1222333444322  34666777777999999997 45579999986431     


Q ss_pred             cccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        252 VFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       252 ~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                                        .....+|||+||||+.+ +|+  +||+||||||++||++|||+|+++
T Consensus       180 ------------------~~~~~~HaV~iVGyg~~-~g~--~YWivrNSWG~~WGe~Gy~ri~~~  223 (236)
T cd02620         180 ------------------GKQLGGHAVKIIGWGVE-NGV--PYWLAANSWGTDWGENGYFRILRG  223 (236)
T ss_pred             ------------------CCCcCCeEEEEEEEecc-CCe--eEEEEEeCCCCCCCCCcEEEEEcc
Confidence                              12235899999999976 787  899999999999999999999875


No 9  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.79  E-value=1.3e-18  Score=164.95  Aligned_cols=149  Identities=14%  Similarity=0.194  Sum_probs=100.0

Q ss_pred             hhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCcc-eeec-------cC--------Cce
Q psy17191        136 VELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNEA-VWFG-------CE--------NRI  196 (377)
Q Consensus       136 P~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~~-y~~~-------~~--------~Nv  196 (377)
                      ...+.++..||..+     -||++|.     .....+|+++. +.  ..+||... ....       +.        .+.
T Consensus        55 ~~lS~Q~lldC~~~-----~gC~GG~-----~~~a~~~~~~~Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~  124 (239)
T cd02698          55 VYLSVQVVIDCAGG-----GSCHGGD-----PGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNY  124 (239)
T ss_pred             cccCHHHHHhCCCC-----CCccCcC-----HHHHHHHHHHcCcCCCCeeCCcCCCCCCcCCCCCCCcccCccccccccc
Confidence            35778888999763     2999996     22334555443 11  26788321 1100       00        111


Q ss_pred             E-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCC
Q psy17191        197 I-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSM  274 (377)
Q Consensus       197 ~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~  274 (377)
                      . .....|.+++.+   +.++.+|.+.+||.+++++.. ++.|++||++...+                       ....
T Consensus       125 ~~~~i~~~~~~~~~---~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~~~-----------------------~~~~  178 (239)
T cd02698         125 TLYFVSDYGSVSGR---DKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQ-----------------------DPLI  178 (239)
T ss_pred             ceEEeeeceecCCH---HHHHHHHHHcCCEEEEEEecccccccCCeEEccCCC-----------------------CCcC
Confidence            1 233456566532   335555656679999999985 55799999976532                       2346


Q ss_pred             CeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhhhccc
Q psy17191        275 THAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYV  322 (377)
Q Consensus       275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y~  322 (377)
                      +|||+||||+.+.+|+  +||+||||||++||++|||+|+++.+++--
T Consensus       179 ~HaV~IVGyG~~~~g~--~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~  224 (239)
T cd02698         179 NHIISVAGWGVDENGV--EYWIVRNSWGEPWGERGWFRIVTSSYKGAR  224 (239)
T ss_pred             CeEEEEEEEEecCCCC--EEEEEEcCCCcccCcCceEEEEccCCcccc
Confidence            8999999999874487  899999999999999999999998866443


No 10 
>PTZ00021 falcipain-2; Provisional
Probab=99.79  E-value=1e-18  Score=180.64  Aligned_cols=160  Identities=19%  Similarity=0.177  Sum_probs=109.0

Q ss_pred             HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc--ccCCCCc----ceee
Q psy17191        119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA--ESIKNNE----AVWF  190 (377)
Q Consensus       119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~--D~~py~~----~y~~  190 (377)
                      +....+...+.|.-|.......+++.||+..    .-||++|..     ....++..+.  +.  +.+||..    .|..
T Consensus       294 Aa~~alEs~~~I~~g~~v~LSeQqLVDCs~~----n~GC~GG~~-----~~Af~yi~~~gGl~tE~~YPY~~~~~~~C~~  364 (489)
T PTZ00021        294 STVGVVESQYAIRKNELVSLSEQELVDCSFK----NNGCYGGLI-----PNAFEDMIELGGLCSEDDYPYVSDTPELCNI  364 (489)
T ss_pred             HHHHHHHHHHHHHcCCCcccCHHHHhhhccC----CCCCCCcch-----HhhhhhhhhccccCcccccCccCCCCCcccc
Confidence            4556666677788899999999999999853    249999961     1122233222  11  3789932    2322


Q ss_pred             ccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191        191 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY  269 (377)
Q Consensus       191 ~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~  269 (377)
                      ....+. ..+..|..+|.+.    +..+|...+||++++|++. +..|++||++.+ |                      
T Consensus       365 ~~~~~~-~~i~~y~~i~~~~----lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~-C----------------------  416 (489)
T PTZ00021        365 DRCKEK-YKIKSYVSIPEDK----FKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE-C----------------------  416 (489)
T ss_pred             cccccc-ceeeeEEEecHHH----HHHHHHhcCCeEEEEEeecccccCCCCcCCCC-C----------------------
Confidence            222222 3445777887544    4455666679999999984 557999998753 3                      


Q ss_pred             CCCCCCeEEEEEEEEecCC-------CCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        270 GESSMTHAMVISAVSIDKE-------TEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       270 ~es~~~HAM~ivG~~~d~~-------G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                       ....+|||+||||+.+..       |+...|||||||||++||++|||+|+++
T Consensus       417 -~~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~  469 (489)
T PTZ00021        417 -GEEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETD  469 (489)
T ss_pred             -CCccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcC
Confidence             123699999999997521       1112699999999999999999999876


No 11 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.78  E-value=2.6e-18  Score=157.76  Aligned_cols=157  Identities=17%  Similarity=0.195  Sum_probs=104.6

Q ss_pred             hHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-h-c-ccCCCCcc-eeeccCCceE-EEe
Q psy17191        126 RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-A-A-ESIKNNEA-VWFGCENRII-RIR  200 (377)
Q Consensus       126 ~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v-~-D~~py~~~-y~~~~~~Nv~-g~~  200 (377)
                      ..+.+..|.....+..++.++.....   -||++|..     ...++|.+++ + . +.+||... -.......-. .+.
T Consensus        35 ~~~~i~~~~~~~lS~q~l~~c~~~~~---~gC~GG~~-----~~a~~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i  106 (210)
T cd02248          35 GAYAIKTGKLVSLSEQQLVDCSTSGN---NGCNGGNP-----DNAFEYVKNGGLASESDYPYTGKDGTCKYNSSKVGAKI  106 (210)
T ss_pred             HHHHHHcCCCcccCHHHHhccCCCCC---CCCCCCCH-----HHhHHHHHHCCcCccccCCccCCCCCccCCCCcccEEE
Confidence            33344455666667777788765412   38998862     1234444443 1 1 37888531 1111111112 445


Q ss_pred             eEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEE
Q psy17191        201 IIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMV  279 (377)
Q Consensus       201 ~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~  279 (377)
                      ..|..++..+. +.++.+|..++||.+++++.. +..++.||++.+.++                      ....+|||+
T Consensus       107 ~~~~~i~~~~~-~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~----------------------~~~~~Hav~  163 (210)
T cd02248         107 TGYSNVPPGDE-EALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCS----------------------NTNLNHAVL  163 (210)
T ss_pred             eeEEEcCCCcH-HHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCC----------------------CCcCCEEEE
Confidence            56777776433 446777777889999999985 447899999776431                      234689999


Q ss_pred             EEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        280 ISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       280 ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      ||||+.+ .|.  .||+||||||++||++||++|+.+
T Consensus       164 iVGy~~~-~~~--~ywiv~NSWG~~WG~~Gy~~i~~~  197 (210)
T cd02248         164 LVGYGTE-NGV--DYWIVKNSWGTSWGEKGYIRIARG  197 (210)
T ss_pred             EEEEeec-CCc--eEEEEEcCCCCccccCcEEEEEcC
Confidence            9999976 577  899999999999999999999863


No 12 
>PTZ00200 cysteine proteinase; Provisional
Probab=99.77  E-value=2.2e-18  Score=177.09  Aligned_cols=157  Identities=17%  Similarity=0.232  Sum_probs=105.5

Q ss_pred             HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccCCCCc---ceeecc
Q psy17191        119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESIKNNE---AVWFGC  192 (377)
Q Consensus       119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~py~~---~y~~~~  192 (377)
                      +....|...+.+.-|.......+++.||+..    .-||++|.     .....++.++. +.  +.+||..   .|... 
T Consensus       263 at~~aiEs~~~i~~~~~~~LSeQqLvDC~~~----~~GC~GG~-----~~~A~~yi~~~Gi~~e~~YPY~~~~~~C~~~-  332 (448)
T PTZ00200        263 SSVGSVESLYKIYRDKSVDLSEQELVNCDTK----SQGCSGGY-----PDTALEYVKNKGLSSSSDVPYLAKDGKCVVS-  332 (448)
T ss_pred             hHHHHHHHHHHHhcCCCeecCHHHHhhccCc----cCCCCCCc-----HHHHHHHHhhcCccccccCCCCCCCCCCcCC-
Confidence            3445566666677788888999999999852    24999996     11233444332 11  3789932   22211 


Q ss_pred             CCceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCC
Q psy17191        193 ENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGE  271 (377)
Q Consensus       193 ~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~e  271 (377)
                       ......+..|..++.+   +++.+++. .+||.++++++ .++.|++||++.+ |                       .
T Consensus       333 -~~~~~~i~~y~~~~~~---~~l~~~l~-~GPV~v~i~~~~~f~~Yk~GIy~~~-C-----------------------~  383 (448)
T PTZ00200        333 -STKKVYIDSYLVAKGK---DVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNGE-C-----------------------G  383 (448)
T ss_pred             -CCCeeEecceEecCHH---HHHHHHHh-cCCEEEEeecccccccCCCCccccc-c-----------------------C
Confidence             1111233355544433   33445554 46999999998 4558999998653 2                       1


Q ss_pred             CCCCeEEEEEEEEec-CCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        272 SSMTHAMVISAVSID-KETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       272 s~~~HAM~ivG~~~d-~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      +..+|||+||||+.| ++|.  +||+||||||++||++|||+|+++
T Consensus       384 ~~~nHaV~lVGyG~d~~~g~--~YWIIkNSWG~~WGe~GY~ri~r~  427 (448)
T PTZ00200        384 KSLNHAVLLVGEGYDEKTKK--RYWIIKNSWGTDWGENGYMRLERT  427 (448)
T ss_pred             CCCcEEEEEEEecccCCCCC--ceEEEEcCCCCCcccCeeEEEEeC
Confidence            236999999999875 3677  899999999999999999999874


No 13 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.76  E-value=5.6e-18  Score=160.70  Aligned_cols=156  Identities=17%  Similarity=0.150  Sum_probs=94.0

Q ss_pred             CChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc-ccCCCCc-ceeeccCCce---E-EEeeEEEe
Q psy17191        134 QPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA-ESIKNNE-AVWFGCENRI---I-RIRIIYNN  205 (377)
Q Consensus       134 ~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~-D~~py~~-~y~~~~~~Nv---~-g~~~~y~N  205 (377)
                      ..+....+++.||.. +.   -||++|.     ...-.++.+++  +. +.+||.. .-. .+..+.   . .....|.-
T Consensus        54 ~~~~lS~q~l~dC~~-~~---~GC~GG~-----~~~a~~~~~~~Gi~~e~~yPY~~~~~~-~C~~~~~~~~~~~~~~~~~  123 (243)
T cd02621          54 QQPILSPQHVLSCSQ-YS---QGCDGGF-----PFLVGKFAEDFGIVTEDYFPYTADDDR-PCKASPSECRRYYFSDYNY  123 (243)
T ss_pred             cCcccCHHHhhhhcC-CC---CCCCCCC-----HHHHHHHHHhcCcCCCceeCCCCCCCC-CCCCCccccccccccceeE
Confidence            366777888889874 22   3999996     11223344332  11 3688843 111 111111   0 11111211


Q ss_pred             cC----hhHHHHHHHHHhccCCeEEEEEEcCCCC-cCCCCeeeccc----ccccccccCCCCCcCCHHHHhhhCCCCCCe
Q psy17191        206 QP----VELLMKLAAESIKNNEAVWFGCEVSKRF-ANKLGLNDLEI----HNFKAVFDSDVSLPMSKAERMMYGESSMTH  276 (377)
Q Consensus       206 vp----~~~l~~~~~~~L~~g~PV~~g~Dv~~~f-~~k~GI~d~~~----~d~~~~fg~~~~~~~~kaeR~~~~es~~~H  276 (377)
                      +.    ..+. +.++.+|.+.+||.+++++...| .|+.||++.+.    |+-..   ...           ......+|
T Consensus       124 i~~~~~~~~~-~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~---~~~-----------~~~~~~~H  188 (243)
T cd02621         124 VGGCYGCTNE-DEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDN---DNF-----------NPFELTNH  188 (243)
T ss_pred             cccccccCCH-HHHHHHHHHcCCEEEEEEecccccccCCeEECcCCccccccccc---ccc-----------cCcccCCe
Confidence            10    1111 34556676678999999998544 79999998753    21100   000           00124689


Q ss_pred             EEEEEEEEecC-CCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        277 AMVISAVSIDK-ETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       277 AM~ivG~~~d~-~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      ||+||||+.+. +|.  +||+||||||++||++|||+|+++
T Consensus       189 aV~iVGyg~~~~~g~--~YWiirNSWG~~WGe~Gy~~i~~~  227 (243)
T cd02621         189 AVLLVGWGEDEIKGE--KYWIVKNSWGSSWGEKGYFKIRRG  227 (243)
T ss_pred             EEEEEEeeccCCCCC--cEEEEEcCCCCCCCcCCeEEEecC
Confidence            99999999864 377  899999999999999999999874


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.74  E-value=3.3e-17  Score=150.25  Aligned_cols=179  Identities=18%  Similarity=0.093  Sum_probs=113.2

Q ss_pred             HHHHHHhHHHHhcC--CChhHHHHHHHhhCCCcce-eeeeccCCccccccCCCcHHHHHHhhc-------ccCCCCccee
Q psy17191        120 CENRIIRIRIIYNN--QPVELLMKLAAESIKNNEA-VWFGCENSRIRIIYNNQPVELLMKLAA-------ESIKNNEAVW  189 (377)
Q Consensus       120 ~l~~v~~il~i~lG--~pP~~F~~~~~d~~k~y~~-~~~Gc~~~~~~~~~~~TP~eF~~k~v~-------D~~py~~~y~  189 (377)
                      ....+...+.+..+  ++......++.|+...... ..-||++|.        |...+..++.       +.+||...-.
T Consensus        26 ~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~--------~~~~~~~~~~~~Gi~~e~~~Py~~~~~   97 (223)
T cd02619          26 SAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGG--------PLSALLKLVALKGIPPEEDYPYGAESD   97 (223)
T ss_pred             HHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCc--------HHHHHHHHHHHcCCCccccCCCCCCCC
Confidence            34444455555555  6667777788888765420 013888885        4444443332       3688843322


Q ss_pred             ec-c----CCceE-EEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CCcCCCCeeecccccccccccCCCCCcCC
Q psy17191        190 FG-C----ENRII-RIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RFANKLGLNDLEIHNFKAVFDSDVSLPMS  262 (377)
Q Consensus       190 ~~-~----~~Nv~-g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f~~k~GI~d~~~~d~~~~fg~~~~~~~~  262 (377)
                      .. .    ..+.. .+...|..+...++ +.++.+|.+++||.+++.+.. ++.++.|+...+...     . .      
T Consensus        98 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~-~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~-----~-~------  164 (223)
T cd02619          98 GEEPKSEAALNAAKVKLKDYRRVLKNNI-EDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVY-----L-L------  164 (223)
T ss_pred             CCCCCCccchhhcceeecceeEeCchhH-HHHHHHHHHCCCEEEEEEcccchhcccCccccccccc-----c-c------
Confidence            11 1    11112 33345555555433 457777888889999999985 446788875310000     0 0      


Q ss_pred             HHHHhhhCCCCCCeEEEEEEEEecC-CCCCCCeEEEEcccCCCCCCCeEEEEehhhhhcccEEEE
Q psy17191        263 KAERMMYGESSMTHAMVISAVSIDK-ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVV  326 (377)
Q Consensus       263 kaeR~~~~es~~~HAM~ivG~~~d~-~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y~~~vv  326 (377)
                           .......+|||+||||+.+. .|+  .||+|+||||++||++||++|+.+-+.++++...
T Consensus       165 -----~~~~~~~~Hav~ivGy~~~~~~~~--~~~i~~NSwG~~wg~~Gy~~i~~~~~~~~~~~~~  222 (223)
T cd02619         165 -----YEDGDLGGHAVVIVGYDDNYVEGK--GAFIVKNSWGTDWGDNGYGRISYEDVYEMTFGAN  222 (223)
T ss_pred             -----cCCCccCCeEEEEEeecCCCCCCC--CEEEEEeCCCCccccCCEEEEehhhhhhhhhccc
Confidence                 01234568999999999752 166  8999999999999999999999999998877653


No 15 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.69  E-value=2.1e-16  Score=172.30  Aligned_cols=165  Identities=18%  Similarity=0.243  Sum_probs=111.1

Q ss_pred             HHHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHH---Hh--hc--ccCCCCc----
Q psy17191        118 FGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLM---KL--AA--ESIKNNE----  186 (377)
Q Consensus       118 ~~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~---k~--v~--D~~py~~----  186 (377)
                      -+....+..+.++.-|.......+++.||...+..  -||++|.       .|.+|+.   +.  +.  ..+||..    
T Consensus       559 FASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn--~GC~GG~-------~~~efl~yI~e~GgLptESdYPYt~k~~~  629 (1004)
T PTZ00462        559 FASKYHLETIKCMKGYEPHAISALYIANCSKGEHK--DRCDEGS-------NPLEFLQIIEDNGFLPADSNYLYNYTKVG  629 (1004)
T ss_pred             HHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCC--CCCCCCC-------cHHHHHHHHHHcCCCcccccCCCccCCCC
Confidence            34566677777788788888999999999875532  4999997       6777752   22  22  2788831    


Q ss_pred             -ceeeccC--CceE---------------EEeeEEEecCh-------hHHHHHHHHHhccCCeEEEEEEcCCCCcC-CCC
Q psy17191        187 -AVWFGCE--NRII---------------RIRIIYNNQPV-------ELLMKLAAESIKNNEAVWFGCEVSKRFAN-KLG  240 (377)
Q Consensus       187 -~y~~~~~--~Nv~---------------g~~~~y~Nvp~-------~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~-k~G  240 (377)
                       .|.....  .|.+               .....|..++.       +.+.+++..+|.+.+||.++++++.++.| .+|
T Consensus       630 g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsdf~~Y~~sG  709 (1004)
T PTZ00462        630 EDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVLGYEFNG  709 (1004)
T ss_pred             CCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeehHHhhhcCC
Confidence             2211100  0111               01123433332       33345677788877799999999766677 489


Q ss_pred             eeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecC----CCCCCCeEEEEcccCCCCCCCeEEEEeh
Q psy17191        241 LNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDK----ETEEPTKWRVENSWGEEQNHKGYILMTS  315 (377)
Q Consensus       241 I~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~----~G~~~~yW~VkNSWG~~~G~kGY~~ms~  315 (377)
                      |+..+.|                      +....+|||+||||+.+.    +|+  .||+||||||+.||++|||+|.+
T Consensus       710 Iyv~~~C----------------------gs~~~nHAVlIVGYGt~in~eg~gk--~YWIVRNSWGt~WGEnGYFKI~r  764 (1004)
T PTZ00462        710 KKVQNLC----------------------GDDTADHAVNIVGYGNYINDEDEKK--SYWIVRNSWGKYWGDEGYFKVDM  764 (1004)
T ss_pred             ccccCCC----------------------CCCcCCceEEEEEecccccccCCCC--ceEEEEcCCCCCcCCCeEEEEEe
Confidence            8755544                      123468999999999742    255  89999999999999999999987


No 16 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.69  E-value=7.1e-17  Score=147.87  Aligned_cols=162  Identities=17%  Similarity=0.212  Sum_probs=103.7

Q ss_pred             HHHHhHHHHhc-CCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHH-h-hc--ccCCCCcc----eeecc
Q psy17191        122 NRIIRIRIIYN-NQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMK-L-AA--ESIKNNEA----VWFGC  192 (377)
Q Consensus       122 ~~v~~il~i~l-G~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k-~-v~--D~~py~~~----y~~~~  192 (377)
                      ..+...+.+.. +..+.....++.|+...+..   ||+++.     ...-.++.++ . +.  +.+||...    +... 
T Consensus        33 ~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~---~c~gg~-----~~~a~~~~~~~~Gi~~e~~~pY~~~~~~~c~~~-  103 (219)
T PF00112_consen   33 AALESRLAIQNNGKNVDLSEQYLIDCSNKYNK---GCDGGS-----PFDALKYIKNNNGIVTEEDYPYNGNENPTCKSK-  103 (219)
T ss_dssp             HHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSS---TTBBBE-----HHHHHHHHHHHTSBEBTTTS--SSSSSCSSCHS-
T ss_pred             cceecccccccccccccccccccccccccccc---ccccCc-----ccccceeecccCccccccccccccccccccccc-
Confidence            33444444444 67777777778888763333   898885     1233445544 2 11  36888532    2221 


Q ss_pred             CCce-EEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCC-CC-cCCCCeeecccccccccccCCCCCcCCHHHHhhh
Q psy17191        193 ENRI-IRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSK-RF-ANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMY  269 (377)
Q Consensus       193 ~~Nv-~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~-~f-~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~  269 (377)
                      ..+. ..+...|..++...+ +.++++|..++||.+++++.. .| .+++||++.+.++                     
T Consensus       104 ~~~~~~~~i~~~~~~~~~~~-~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~~~~~---------------------  161 (219)
T PF00112_consen  104 KSNSYYVKIKGYGKVKDNDI-EDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDPPDCS---------------------  161 (219)
T ss_dssp             GGGEEEBEESEEEEEESTCH-HHHHHHHHHHSSEEEEEEEESHHHHTEESSEECSTSSS---------------------
T ss_pred             ccccccccccccccccccch-hHHHHHHhhCceeeeeeeccccccccccceeeeccccc---------------------
Confidence            1111 123335555554332 446677878789999999996 35 6899999876331                     


Q ss_pred             CCCCCCeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehhhh
Q psy17191        270 GESSMTHAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSPWF  318 (377)
Q Consensus       270 ~es~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf  318 (377)
                       ....+|||+||||+.+ .|+  .||+|+||||++||++||++|+++-.
T Consensus       162 -~~~~~Hav~iVGy~~~-~~~--~~wiv~NSWG~~WG~~Gy~~i~~~~~  206 (219)
T PF00112_consen  162 -NESGGHAVLIVGYDDE-NGK--GYWIVKNSWGTDWGDNGYFRISYDYN  206 (219)
T ss_dssp             -SSSEEEEEEEEEEEEE-TTE--EEEEEE-SBTTTSTBTTEEEEESSSS
T ss_pred             -cccccccccccccccc-cce--eeEeeehhhCCccCCCeEEEEeeCCC
Confidence             2346899999999987 677  89999999999999999999998754


No 17 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.68  E-value=2.7e-16  Score=164.55  Aligned_cols=167  Identities=12%  Similarity=0.063  Sum_probs=98.8

Q ss_pred             CCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh-hc--ccC--CCCcc------eeeccCCceE-E--
Q psy17191        133 NQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL-AA--ESI--KNNEA------VWFGCENRII-R--  198 (377)
Q Consensus       133 G~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~-v~--D~~--py~~~------y~~~~~~Nv~-g--  198 (377)
                      |..+..+.++..||+. +.   -||++|.     .+.-.+|.+++ +.  +.+  ||...      |......... .  
T Consensus       259 g~~~~LS~QqLVDCs~-~n---~GCdGG~-----p~~A~~yi~~~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~  329 (548)
T PTZ00364        259 GQQTFLSARHVLDCSQ-YG---QGCAGGF-----PEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTN  329 (548)
T ss_pred             CcccCcCHHHHhcccC-CC---CCCCCCc-----HHHHHHHHHhCCcccccccCCCCCCCCCCCCCCCCCcccceeeeee
Confidence            6677889999999974 33   3999996     12223444333 11  367  98321      2111111111 1  


Q ss_pred             --EeeEEEecChhHHHHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCC
Q psy17191        199 --IRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMT  275 (377)
Q Consensus       199 --~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~  275 (377)
                        ....|..++.++  +.++.+|...+||.+++|+. .++.|++||+..-.+.+..-....   .-+...| .++....|
T Consensus       330 ~~~I~gyy~~~~~e--~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~---~~~~~~~-~~~~~~~n  403 (548)
T PTZ00364        330 YGPLGGYYGAVTDP--DEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTA---SADRPLR-HYFASNVN  403 (548)
T ss_pred             eEEecceeecCCcH--HHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeecccccccccc---ccCCccc-ccccccCC
Confidence              111343444333  23455565667999999998 566899998753200000000000   0000000 01123469


Q ss_pred             eEEEEEEEEecCCCCCCCeEEEEcccCC--CCCCCeEEEEehh
Q psy17191        276 HAMVISAVSIDKETEEPTKWRVENSWGE--EQNHKGYILMTSP  316 (377)
Q Consensus       276 HAM~ivG~~~d~~G~~~~yW~VkNSWG~--~~G~kGY~~ms~~  316 (377)
                      |||+||||+.+++|.  +||+||||||+  +||++|||+|.++
T Consensus       404 HAVlIVGYG~de~G~--~YWIVKNSWGt~~~WGE~GYfRI~RG  444 (548)
T PTZ00364        404 HTVLIIGWGTDENGG--DYWLVLDPWGSRRSWCDGGTRKIARG  444 (548)
T ss_pred             eEEEEEEecccCCCc--eEEEEECCCCCCCCcccCCeEEEEcC
Confidence            999999999876787  89999999999  9999999999986


No 18 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.65  E-value=8.5e-16  Score=163.16  Aligned_cols=97  Identities=20%  Similarity=0.263  Sum_probs=64.8

Q ss_pred             HHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHh---hhCCCCCCeEEEEEEEEecC-CC
Q psy17191        215 AAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERM---MYGESSMTHAMVISAVSIDK-ET  289 (377)
Q Consensus       215 ~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~---~~~es~~~HAM~ivG~~~d~-~G  289 (377)
                      ++..|...+||.+++|+. .+..|++||++.+.++.....+.+.    ++. .-   ..+....+|||+||||+.++ +|
T Consensus       560 Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~----~~~-~~~~~~~G~e~~NHAVlIVGwG~d~enG  634 (693)
T PTZ00049        560 MMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDL----PKH-NGVYNITGWEKVNHAIVLVGWGEEEING  634 (693)
T ss_pred             HHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCcc----ccc-cccccccccccCceEEEEEEeccccCCC
Confidence            445565667999999998 4558999999864321111111110    000 00   01222469999999999853 56


Q ss_pred             CCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        290 EEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       290 ~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      ...+||||+||||+.||++|||+|.++
T Consensus       635 ~~~~YWIVRNSWGt~WGenGYfKI~RG  661 (693)
T PTZ00049        635 KLYKYWIGRNSWGKNWGKEGYFKIIRG  661 (693)
T ss_pred             cccCEEEEECCCCCCcccCceEEEEcC
Confidence            412799999999999999999999886


No 19 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.64  E-value=1.8e-15  Score=137.07  Aligned_cols=127  Identities=17%  Similarity=0.261  Sum_probs=86.7

Q ss_pred             HHHHHHHhHHHHhcCCChhHHHHHHHhhCCCcceeeeeccCCccccccCCCcHHHHHHh--hc--ccCCCCcceeeccCC
Q psy17191        119 GCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCENSRIRIIYNNQPVELLMKL--AA--ESIKNNEAVWFGCEN  194 (377)
Q Consensus       119 ~~l~~v~~il~i~lG~pP~~F~~~~~d~~k~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~--v~--D~~py~~~y~~~~~~  194 (377)
                      +....+...+.+.-|..+.....+..||.+...   -||+++.     ..+..+|.++.  +.  +.+||..        
T Consensus        29 a~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~---~gC~GG~-----~~~a~~~~~~~~Gi~~e~~~PY~~--------   92 (174)
T smart00645       29 SATGALEGRYCIKTGKLVSLSEQQLVDCSTGGN---NGCNGGL-----PDNAFEYIKKNGGLETESCYPYTG--------   92 (174)
T ss_pred             HHHHHHHHHHHHhcCCccccCHHHHhhhcCCCC---CCCCCcC-----HHHHHHHHHHcCCcccccccCccc--------
Confidence            334455555666667777788888888876422   3899886     22334555443  11  2445422        


Q ss_pred             ceEEEeeEEEecChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCC
Q psy17191        195 RIIRIRIIYNNQPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSM  274 (377)
Q Consensus       195 Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~  274 (377)
                                                     .+..++..+..|+.||++.+.|.                      ....
T Consensus        93 -------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~----------------------~~~~  119 (174)
T smart00645       93 -------------------------------SVAIDASDFQFYKSGIYDHPGCG----------------------SGTL  119 (174)
T ss_pred             -------------------------------EEEEEcccccCCcCeEECCCCCC----------------------CCcc
Confidence                                           33345555568999999765331                      1225


Q ss_pred             CeEEEEEEEEecCCCCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        275 THAMVISAVSIDKETEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      +|||+||||+.+.+|+  .||+|+||||+.||++|||+|+++
T Consensus       120 ~Hav~ivGyg~~~~g~--~yWii~NSwG~~WG~~G~~~i~~~  159 (174)
T smart00645      120 DHAVLIVGYGTEENGK--DYWIVKNSWGTDWGENGYFRIARG  159 (174)
T ss_pred             cEEEEEEEEeecCCCe--eEEEEECCCCCCcccCeEEEEEcC
Confidence            8999999999865788  799999999999999999999876


No 20 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=99.33  E-value=4.7e-13  Score=130.96  Aligned_cols=74  Identities=30%  Similarity=0.337  Sum_probs=67.0

Q ss_pred             CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhc---
Q psy17191          1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNH---   77 (377)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kY---   77 (377)
                      ++||+|++||||.||||||||++..|                +.|.|++|| |+++|+...|+.|+|.|++.|+++|   
T Consensus       348 q~KA~RldY~eSLmTHAMvlTGvd~d----------------~~g~p~rwk-VENSWG~d~G~~GyfvaSd~wmdEytyQ  410 (444)
T COG3579         348 QDKAGRLDYGESLMTHAMVLTGVDLD----------------ETGNPLRWK-VENSWGKDVGKKGYFVASDAWMDEYTYQ  410 (444)
T ss_pred             cchhhccccchHHHHHHHHhhccccc----------------cCCCceeeE-eecccccccCCCceEeehHhHhhhheeE
Confidence            58999999999999999999999999                567999999 8999999999999999999999999   


Q ss_pred             ceeeCcccchHHHHH
Q psy17191         78 GLMPKNCLVWIRIRI   92 (377)
Q Consensus        78 GvVPk~~~pt~~m~~   92 (377)
                      -+|.++..| .+.-+
T Consensus       411 IvV~k~~l~-~e~~~  424 (444)
T COG3579         411 IVVDKKFLP-KEELA  424 (444)
T ss_pred             EEEehhhCC-HHHHH
Confidence            588888888 44333


No 21 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.13  E-value=1.2e-11  Score=127.11  Aligned_cols=78  Identities=38%  Similarity=0.493  Sum_probs=64.2

Q ss_pred             CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhc---
Q psy17191          1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNH---   77 (377)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kY---   77 (377)
                      |+|++|+.|++|.|||||+|+++..|                ++|+|..+| |+++|+...|.+|++.|+.+||++|   
T Consensus       346 ~~K~~Rl~~~eS~~tHAM~itGv~~D----------------~~g~p~~wk-VeNSWG~~~g~kGy~~msd~wf~e~v~~  408 (438)
T PF03051_consen  346 MSKAERLDYGESTMTHAMVITGVDLD----------------EDGKPVRWK-VENSWGTDNGDKGYFYMSDDWFDEYVYQ  408 (438)
T ss_dssp             S-HHHHHHTTSS--EEEEEEEEEEE-----------------TTSSEEEEE-EE-SBTTTSTBTTEEEEEHHHHHHHEEE
T ss_pred             cCHHHHHHhCCCCCceeEEEEEEEec----------------cCCCeeEEE-EEcCCCCCCCCCcEEEECHHHHHhhhhe
Confidence            68999999999999999999999998                668888999 8999999999999999999999999   


Q ss_pred             ceeeCcccchHHHHHHHhh
Q psy17191         78 GLMPKNCLVWIRIRIIYNN   96 (377)
Q Consensus        78 GvVPk~~~pt~~m~~~L~~   96 (377)
                      -||+|++.| .++..+|..
T Consensus       409 vvV~K~~lp-~~~~~~~~~  426 (438)
T PF03051_consen  409 VVVHKKYLP-KELLKKLEK  426 (438)
T ss_dssp             EEEEGGGS--HHHHHHHHS
T ss_pred             EEechhhCC-HHHHHHhcC
Confidence            499999999 665555543


No 22 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=98.97  E-value=2.1e-10  Score=117.99  Aligned_cols=77  Identities=32%  Similarity=0.431  Sum_probs=68.0

Q ss_pred             CChhhhhhcCCcccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcc--
Q psy17191          1 MSKAERMMYGESSMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHG--   78 (377)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYG--   78 (377)
                      ||||+|+.+++|.|||||+|+++..|                +.|+|..|+ |+++|+..+|++|++.|..+||++|.  
T Consensus       345 ~~KaeRl~~~es~~tHAM~ivGv~~D----------------~~g~p~yw~-VkNSWG~~~G~~Gy~~ms~~wf~~y~~~  407 (437)
T cd00585         345 LNKAERLDYGESLMTHAMVLTGVDLD----------------EDGKPVKWK-VENSWGEKVGKKGYFVMSDDWFDEYVYQ  407 (437)
T ss_pred             CCHHHHHhhcCCcCCeEEEEEEEEec----------------CCCCcceEE-EEcccCCCCCCCcceehhHHHHHhhheE
Confidence            79999999999999999999998887                346778888 79999999999999999999999994  


Q ss_pred             -eeeCcccchHHHHHHHh
Q psy17191         79 -LMPKNCLVWIRIRIIYN   95 (377)
Q Consensus        79 -vVPk~~~pt~~m~~~L~   95 (377)
                       ||+|+..| .++..+|.
T Consensus       408 ivV~kk~l~-~~~~~~l~  424 (437)
T cd00585         408 VVVDKKYLP-EEVLDLLK  424 (437)
T ss_pred             EEeecccCC-HHHHHHhc
Confidence             99999999 55555553


No 23 
>KOG1544|consensus
Probab=98.80  E-value=7.1e-09  Score=101.48  Aligned_cols=89  Identities=22%  Similarity=0.318  Sum_probs=65.8

Q ss_pred             HHHHHHHhccCCeEEEEEEcC-CCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecC--C
Q psy17191        212 MKLAAESIKNNEAVWFGCEVS-KRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDK--E  288 (377)
Q Consensus       212 ~~~~~~~L~~g~PV~~g~Dv~-~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~--~  288 (377)
                      ++++...+++ +||-.-..|. .+|.|+.||+.....            ..-+.+|-+.   .-+|+|-|+|.+.+.  +
T Consensus       354 ~eImkElM~N-GPVQA~m~VHEDFF~YkgGiY~H~~~------------~~~~~e~yr~---~gtHsVk~tGWG~~~~~~  417 (470)
T KOG1544|consen  354 KEIMKELMEN-GPVQALMEVHEDFFLYKGGIYSHTPV------------SLGRPERYRR---HGTHSVKITGWGEETLPD  417 (470)
T ss_pred             HHHHHHHHhC-CChhhhhhhhhhhhhhccceeecccc------------ccCCchhhhh---cccceEEEeecccccCCC
Confidence            4555555555 5999999999 678999999964321            1223333222   358999999999863  6


Q ss_pred             CCCCCeEEEEcccCCCCCCCeEEEEehh
Q psy17191        289 TEEPTKWRVENSWGEEQNHKGYILMTSP  316 (377)
Q Consensus       289 G~~~~yW~VkNSWG~~~G~kGY~~ms~~  316 (377)
                      |+..+||+.-||||+.||++|||.+=++
T Consensus       418 G~~~KyW~aANSWG~~WGE~GYFriLRG  445 (470)
T KOG1544|consen  418 GRTLKYWTAANSWGPAWGERGYFRILRG  445 (470)
T ss_pred             CCeeEEEEeecccccccccCceEEEecc
Confidence            7756899999999999999999998654


No 24 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.71  E-value=1e-08  Score=101.93  Aligned_cols=47  Identities=19%  Similarity=0.370  Sum_probs=37.7

Q ss_pred             CCCeEEEEEEEEecCC---------CCCCCeEEEEcccCCCCCCCeEEEEehhhhhcc
Q psy17191        273 SMTHAMVISAVSIDKE---------TEEPTKWRVENSWGEEQNHKGYILMTSPWFKEY  321 (377)
Q Consensus       273 ~~~HAM~ivG~~~d~~---------G~~~~yW~VkNSWG~~~G~kGY~~ms~~wf~~y  321 (377)
                      .-+||++||||+..-+         |.  .-||||||||+.||++|||+|+..+-..-
T Consensus       263 ~~gHAv~iVGyDDs~~~n~~~~~~~g~--GAfiikNSWGt~wG~~GYfwisY~ya~~g  318 (372)
T COG4870         263 NWGHAVLIVGYDDSFDINNFKYGPPGD--GAFIIKNSWGTNWGENGYFWISYYYALNG  318 (372)
T ss_pred             cccceEEEEeccccccccccccCCCCC--ceEEEECccccccccCceEEEEeeecccc
Confidence            4589999999985321         22  47999999999999999999998765543


No 25 
>KOG4128|consensus
Probab=98.59  E-value=2.2e-08  Score=98.40  Aligned_cols=70  Identities=30%  Similarity=0.350  Sum_probs=60.6

Q ss_pred             CChhhhhhcCCcccchhhhhhhhh-hhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHHhhhcc-
Q psy17191          1 MSKAERMMYGESSMTHAMVISAVS-IDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNLIVNHG-   78 (377)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~f-~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~li~kYG-   78 (377)
                      |+||||++|+||.|||||+||.+- -|              -+ .|.+..+| |++.|+...|..|+..|...||++|- 
T Consensus       358 ~~KAeRl~y~eSlmthAml~T~v~~kd--------------~~-~g~~~~~r-Venswgkd~gkkg~~~mt~ewf~EY~f  421 (457)
T KOG4128|consen  358 STKAERLDYRESLMTHAMLLTSVGLKD--------------PA-TGGLNEHR-VENSWGKDLGKKGVNKMTAEWFREYAF  421 (457)
T ss_pred             cchhhhhhHHHHHHHHHHHhhhccccC--------------cc-cCCchhhh-hhchhhhhccccchhhhhHHHHHhhhe
Confidence            589999999999999999999986 33              11 35677888 89999999999999999999999994 


Q ss_pred             --eeeCcccc
Q psy17191         79 --LMPKNCLV   86 (377)
Q Consensus        79 --vVPk~~~p   86 (377)
                        ||...-.|
T Consensus       422 eiVVd~k~vP  431 (457)
T KOG4128|consen  422 EIVVDEKFVP  431 (457)
T ss_pred             eEEeecccCc
Confidence              88888888


No 26 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=77.76  E-value=7.5  Score=37.77  Aligned_cols=61  Identities=8%  Similarity=0.074  Sum_probs=40.0

Q ss_pred             ChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEec
Q psy17191        207 PVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSID  286 (377)
Q Consensus       207 p~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d  286 (377)
                      ..++..+.++++|.+|.||.+.+|+.--. |...-                           +.....+|.++|+||+.+
T Consensus        73 ~~~~~~~~l~~~l~~g~pv~~~~D~~~lp-y~~~~---------------------------~~~~~~~H~i~v~G~d~~  124 (317)
T PF14399_consen   73 SPDEAWEELKEALDAGRPVIVWVDMYYLP-YRPNY---------------------------YKKHHADHYIVVYGYDEE  124 (317)
T ss_pred             CHHHHHHHHHHHHhCCCceEEEeccccCC-CCccc---------------------------cccccCCcEEEEEEEeCC
Confidence            35667788999999999999999975211 11000                           011235899999999853


Q ss_pred             CCCCCCCeEEEEc
Q psy17191        287 KETEEPTKWRVEN  299 (377)
Q Consensus       287 ~~G~~~~yW~VkN  299 (377)
                        ++  .++++.+
T Consensus       125 --~~--~~~v~D~  133 (317)
T PF14399_consen  125 --ED--VFYVSDP  133 (317)
T ss_pred             --CC--EEEEEcC
Confidence              33  4666655


No 27 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=77.22  E-value=7.3  Score=32.25  Aligned_cols=19  Identities=11%  Similarity=0.081  Sum_probs=13.9

Q ss_pred             HHHHHHhccCCeEEEEEEc
Q psy17191        213 KLAAESIKNNEAVWFGCEV  231 (377)
Q Consensus       213 ~~~~~~L~~g~PV~~g~Dv  231 (377)
                      +.+++.|++|.||.+.++.
T Consensus        90 ~~i~~~i~~G~Pvi~~~~~  108 (144)
T PF13529_consen   90 DDIKQEIDAGRPVIVSVNS  108 (144)
T ss_dssp             HHHHHHHHTT--EEEEEET
T ss_pred             HHHHHHHHCCCcEEEEEEc
Confidence            4467889999999999974


No 28 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=70.76  E-value=21  Score=32.95  Aligned_cols=30  Identities=13%  Similarity=0.309  Sum_probs=22.7

Q ss_pred             CCCeEEEEEEEEecCCCCCCCeEEEEcccCCC
Q psy17191        273 SMTHAMVISAVSIDKETEEPTKWRVENSWGEE  304 (377)
Q Consensus       273 ~~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~  304 (377)
                      ...|||+++|+..-.+|+  .+..|=|-|-.+
T Consensus       118 ~~gHAlavvGya~~~~g~--~~y~~WNPW~~~  147 (175)
T PF05543_consen  118 HAGHALAVVGYAKPNNGQ--KTYYFWNPWWND  147 (175)
T ss_dssp             --EEEEEEEEEEEETTSE--EEEEEE-TT-SS
T ss_pred             ccceeEEEEeeeecCCCC--eEEEEeCCccCC
Confidence            357999999999865788  789999999765


No 29 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=65.58  E-value=11  Score=35.67  Aligned_cols=116  Identities=10%  Similarity=0.024  Sum_probs=61.1

Q ss_pred             CcceeeeeccCCccccccCCCcHHHHHHhhc-ccCCCCcceeeccCCceEEEeeEEEecChhHHHHHHHHHhccCCeEEE
Q psy17191        149 NNEAVWFGCENSRIRIIYNNQPVELLMKLAA-ESIKNNEAVWFGCENRIIRIRIIYNNQPVELLMKLAAESIKNNEAVWF  227 (377)
Q Consensus       149 ~y~~~~~Gc~~~~~~~~~~~TP~eF~~k~v~-D~~py~~~y~~~~~~Nv~g~~~~y~Nvp~~~l~~~~~~~L~~g~PV~~  227 (377)
                      .|..+++|.||+-       .-.+||++... |..--++.|.-....++.   +.--.|+++++    +.-|..|+|+-+
T Consensus        63 ~f~T~TlGvnp~y-------~~~~FY~~~~~~D~~RV~~lF~~A~~~gi~---V~~rsvs~~ei----~~hl~~g~~aIv  128 (212)
T PF09778_consen   63 SFYTVTLGVNPNY-------SVESFYKKNFDEDENRVNRLFQKAKAAGIN---VEKRSVSIQEI----IEHLSSGGPAIV  128 (212)
T ss_pred             eEecCccccCcCc-------cccchHHHhhhhHHHHHHHHHHHHHHcCCc---eEEeeccHHHH----HHHHhCCCcEEE
Confidence            3445668888874       66689998766 422223344333333332   11224777764    444667778888


Q ss_pred             EEEcCCCC-c-CCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCeEEEEcc
Q psy17191        228 GCEVSKRF-A-NKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTKWRVENS  300 (377)
Q Consensus       228 g~Dv~~~f-~-~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~yW~VkNS  300 (377)
                      =+|.+.-. + -+.-..+..   ....|+.+              .....|-++|+||+.+ .    +.+.++|-
T Consensus       129 LVd~~~L~C~~Ck~~~~~~~---~~~~~~~~--------------~~Y~GHYVVlcGyd~~-~----~~~~yrdP  181 (212)
T PF09778_consen  129 LVDASLLHCDLCKSNCFDPI---GSKCFGRS--------------PDYQGHYVVLCGYDAA-T----KEFEYRDP  181 (212)
T ss_pred             EEccccccChhhcccccccc---cccccCCC--------------CCccEEEEEEEeecCC-C----CeEEEeCC
Confidence            88865211 1 121111110   11122222              2346899999999875 2    34566654


No 30 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=60.51  E-value=52  Score=32.42  Aligned_cols=51  Identities=22%  Similarity=0.200  Sum_probs=36.2

Q ss_pred             CCeEEEEEEEEecCC-CCCCCeEEEEcccCCC------------C--------------CCCeEEEEehhhhhcccEEEE
Q psy17191        274 MTHAMVISAVSIDKE-TEEPTKWRVENSWGEE------------Q--------------NHKGYILMTSPWFKEYVFEVV  326 (377)
Q Consensus       274 ~~HAM~ivG~~~d~~-G~~~~yW~VkNSWG~~------------~--------------G~kGY~~ms~~wf~~y~~~vv  326 (377)
                      .+||=.|+++..-+. |.  +.-+++|.||..            |              .++|-|.|+-+=|-.+--.+.
T Consensus       235 ~~HaY~Vl~~~~~~~~~~--~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~~~df~~~F~~~~  312 (315)
T cd00044         235 KGHAYSVLDVREVQEEGL--RLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMSFEDFLRNFDGLY  312 (315)
T ss_pred             cCcceEEeEEEEEccCce--EEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEEhHHhheeeCeEE
Confidence            489999999986421 66  789999999863            1              356888888766655544443


No 31 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=48.37  E-value=28  Score=31.72  Aligned_cols=19  Identities=16%  Similarity=0.363  Sum_probs=14.2

Q ss_pred             HHHHHHhccCCeEEEEEEc
Q psy17191        213 KLAAESIKNNEAVWFGCEV  231 (377)
Q Consensus       213 ~~~~~~L~~g~PV~~g~Dv  231 (377)
                      +.+.+.|++.+|+|++.+.
T Consensus        99 e~~~~LL~~yGPLwv~~~~  117 (166)
T PF12385_consen   99 EGLANLLREYGPLWVAWEA  117 (166)
T ss_pred             HHHHHHHHHcCCeEEEecC
Confidence            3456678888999999653


No 32 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.21  E-value=26  Score=32.62  Aligned_cols=24  Identities=13%  Similarity=0.290  Sum_probs=18.1

Q ss_pred             CeEEEEEEEEecCCCCCCCeEEEEcccCCCC
Q psy17191        275 THAMVISAVSIDKETEEPTKWRVENSWGEEQ  305 (377)
Q Consensus       275 ~HAM~ivG~~~d~~G~~~~yW~VkNSWG~~~  305 (377)
                      -|+++||||+.       .++-.-++||-+.
T Consensus       148 ~H~v~itgyDk-------~n~yynDpyG~kn  171 (195)
T COG4990         148 IHSVLITGYDK-------YNIYYNDPYGYKN  171 (195)
T ss_pred             eeeeEeecccc-------cceEeccccccch
Confidence            69999999974       3566667887654


No 33 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=45.22  E-value=1e+02  Score=25.71  Aligned_cols=59  Identities=14%  Similarity=0.165  Sum_probs=37.2

Q ss_pred             HHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEecCCCCCCCe
Q psy17191        215 AAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSIDKETEEPTK  294 (377)
Q Consensus       215 ~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~d~~G~~~~y  294 (377)
                      +...|+.|.||.+.++.+       +                             ......|.++|+||+ + +    ..
T Consensus        70 ~~~~l~~~~Pvi~~~~~~-------~-----------------------------~~~~~gH~vVv~g~~-~-~----~~  107 (141)
T cd02549          70 LLRQLAAGHPVIVSVNLG-------V-----------------------------SITPSGHAMVVIGYD-R-K----GN  107 (141)
T ss_pred             HHHHHHCCCeEEEEEecC-------c-----------------------------ccCCCCeEEEEEEEc-C-C----CC
Confidence            556788899999988751       0                             012358999999997 1 1    23


Q ss_pred             EEEEcccCCCCCCCeEEEEehhhhhc
Q psy17191        295 WRVENSWGEEQNHKGYILMTSPWFKE  320 (377)
Q Consensus       295 W~VkNSWG~~~G~kGY~~ms~~wf~~  320 (377)
                      ..|-+.|.     .....|+.+.|.+
T Consensus       108 ~~i~DP~~-----~~~~~~~~~~f~~  128 (141)
T cd02549         108 VYVNDPGG-----GRRLVVSFDEFEK  128 (141)
T ss_pred             EEEECCCC-----CcCEEEeHHHHHH
Confidence            55666653     2344666665543


No 34 
>COG5572 Predicted integral membrane protein [Function unknown]
Probab=42.79  E-value=8.3  Score=31.80  Aligned_cols=18  Identities=44%  Similarity=0.407  Sum_probs=15.5

Q ss_pred             hhhhcCCcccchhhhhhh
Q psy17191          5 ERMMYGESSMTHAMVISA   22 (377)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~   22 (377)
                      -|++..||+|||||+-..
T Consensus         3 ~rl~a~~s~~~~am~aaA   20 (104)
T COG5572           3 GRLGANESLMTRAMTAAA   20 (104)
T ss_pred             ccccccHHHHHHHHHHHH
Confidence            588999999999998765


No 35 
>PF09713 A_thal_3526:  Plant protein 1589 of unknown function (A_thal_3526);  InterPro: IPR006476 This plant-specific family of proteins are defined by an uncharacterised region 57 residues in length. It is found toward the N terminus of most proteins that contain it. Examples include at least several proteins from Arabidopsis thaliana (Mouse-ear cress) and Oryza sativa (Rice). The function of the proteins are unknown.
Probab=35.82  E-value=25  Score=26.31  Aligned_cols=30  Identities=23%  Similarity=0.266  Sum_probs=19.4

Q ss_pred             CChhhhhhc--CCcccchhhhhhhhhhhhhhhhh
Q psy17191          1 MSKAERMMY--GESSMTHAMVISAVSIDKIERCN   32 (377)
Q Consensus         1 ~~~~~~~~~--~~~~~~~~~~~~~~f~DklEkan   32 (377)
                      |||+|-+.+  ..+...+.++.  .-|.+|||.|
T Consensus        13 Msk~E~v~~L~~~a~I~P~~T~--~VW~~Le~eN   44 (54)
T PF09713_consen   13 MSKEECVRALQKQANIEPVFTS--TVWQKLEKEN   44 (54)
T ss_pred             CCHHHHHHHHHHHcCCChHHHH--HHHHHHHHHC
Confidence            677776554  24445555444  4599999988


No 36 
>PF01640 Peptidase_C10:  Peptidase C10 family classification.;  InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1) [, ]. The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site [, ]. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=35.53  E-value=1.5e+02  Score=27.15  Aligned_cols=59  Identities=22%  Similarity=0.372  Sum_probs=38.4

Q ss_pred             cChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe
Q psy17191        206 QPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI  285 (377)
Q Consensus       206 vp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~  285 (377)
                      ...++..+.+...|.+|.||.+..--                                       .+. .||.+|=||+.
T Consensus       134 y~~~~~~~~i~~el~~~rPV~~~g~~---------------------------------------~~~-GHawViDGy~~  173 (192)
T PF01640_consen  134 YTDSEWMDMIRNELDNGRPVLYSGNS---------------------------------------KSG-GHAWVIDGYDS  173 (192)
T ss_dssp             S-HHHHHHHHHHHHHTT--EEEEEEE---------------------------------------TTE-EEEEEEEEEES
T ss_pred             cCHHHHHHHHHHHHHcCCCEEEEEec---------------------------------------CCC-CeEEEEcCccC
Confidence            34456667889999999999987631                                       011 79999999964


Q ss_pred             cCCCCCCCeEEEEcccCCCCCCCeEEE
Q psy17191        286 DKETEEPTKWRVENSWGEEQNHKGYIL  312 (377)
Q Consensus       286 d~~G~~~~yW~VkNSWG~~~G~kGY~~  312 (377)
                      +      .|+-+==.||-.  .+|||.
T Consensus       174 ~------~~~H~NwGW~G~--~nGyy~  192 (192)
T PF01640_consen  174 D------GYFHCNWGWGGS--SNGYYR  192 (192)
T ss_dssp             S------SEEEEE-SSTTT--T-EEEE
T ss_pred             C------CeEEEeeCccCC--CCCccC
Confidence            3      678776666654  568873


No 37 
>PF09028 Mac-1:  Mac 1;  InterPro: IPR015117 The bacterial protein Mac 1 adopts an alpha/beta fold, with 14 beta strands and 9 alpha helices. The N-terminal domain is made up predominantly of alpha helices, whereas the C-terminal domain consists predominantly of beta sheets. Mac 1 blocks polymorphonuclear opsonophagocytosis, inhibits the production of reactive oxygen species and contains IgG endopeptidase activity. ; PDB: 1Y08_A 2AVW_C 2AU1_A.
Probab=33.15  E-value=1.5e+02  Score=30.13  Aligned_cols=47  Identities=19%  Similarity=0.258  Sum_probs=32.4

Q ss_pred             cChhHHHHHHHHHhccCCeEEEEEEcCCCCcCCCCeeecccccccccccCCCCCcCCHHHHhhhCCCCCCeEEEEEEEEe
Q psy17191        206 QPVELLMKLAAESIKNNEAVWFGCEVSKRFANKLGLNDLEIHNFKAVFDSDVSLPMSKAERMMYGESSMTHAMVISAVSI  285 (377)
Q Consensus       206 vp~~~l~~~~~~~L~~g~PV~~g~Dv~~~f~~k~GI~d~~~~d~~~~fg~~~~~~~~kaeR~~~~es~~~HAM~ivG~~~  285 (377)
                      -..+++.+.+..+|.+|+.|.+....                                      +.+..+|+|++=|+..
T Consensus       227 ~~~~~fs~~ik~~L~~g~aigLs~~~--------------------------------------~n~~~~HiItlWGae~  268 (333)
T PF09028_consen  227 RSYEDFSEKIKEALDNGKAIGLSYSP--------------------------------------GNSSGNHIITLWGAEY  268 (333)
T ss_dssp             --HHHHHHHHHHHHHTT-EEEEEE----------------------------------------SSTTS-EEEEEEEEEE
T ss_pred             CCHHHHHHHHHHHHhcCCEEEEEeec--------------------------------------CCCCCCcEEEEeeeEE
Confidence            34678889999999999888777632                                      1233689999999999


Q ss_pred             cCCCC
Q psy17191        286 DKETE  290 (377)
Q Consensus       286 d~~G~  290 (377)
                      |++|+
T Consensus       269 D~~G~  273 (333)
T PF09028_consen  269 DDNGN  273 (333)
T ss_dssp             -TTS-
T ss_pred             cCCCC
Confidence            98887


No 38 
>PF04046 PSP:  PSP;  InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=27.05  E-value=38  Score=24.78  Aligned_cols=22  Identities=9%  Similarity=0.302  Sum_probs=19.3

Q ss_pred             cCCCCHHHHhhhcCCCeecCCC
Q psy17191        348 KKYVPASVLDVFNQEPTILPAW  369 (377)
Q Consensus       348 ~~~~~~~~~~~~~~~p~~l~~w  369 (377)
                      |-.+|+++++++.-.+-.+|||
T Consensus         2 PG~lS~~LR~ALg~~~~~~PPw   23 (48)
T PF04046_consen    2 PGKLSDELREALGMQENDPPPW   23 (48)
T ss_pred             CcccCHHHHHHcCCCCCCCChH
Confidence            5578999999999888889998


No 39 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=26.41  E-value=18  Score=34.11  Aligned_cols=39  Identities=21%  Similarity=0.207  Sum_probs=27.9

Q ss_pred             cchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhh
Q psy17191         14 MTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLV   71 (377)
Q Consensus        14 ~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~   71 (377)
                      .+||+++.++--+                   ...+--+++++|+..+|++|++-+..
T Consensus       184 ~~HaV~iVGyg~~-------------------~g~~YWivrNSWG~~WGe~Gy~ri~~  222 (236)
T cd02620         184 GGHAVKIIGWGVE-------------------NGVPYWLAANSWGTDWGENGYFRILR  222 (236)
T ss_pred             CCeEEEEEEEecc-------------------CCeeEEEEEeCCCCCCCCCcEEEEEc
Confidence            4699999994322                   12333468899999999999996543


No 40 
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=25.93  E-value=42  Score=25.16  Aligned_cols=23  Identities=13%  Similarity=0.400  Sum_probs=20.6

Q ss_pred             ccCCCCHHHHhhhcCCCeecCCC
Q psy17191        347 DKKYVPASVLDVFNQEPTILPAW  369 (377)
Q Consensus       347 ~~~~~~~~~~~~~~~~p~~l~~w  369 (377)
                      +|-.+|+++++||.-.+-.+|||
T Consensus         5 kPG~lS~~LR~ALG~~~~~pPPW   27 (54)
T smart00581        5 KPGRISDELREALGLPPGQPPPW   27 (54)
T ss_pred             cCCcCCHHHHHHcCCCCCCCChH
Confidence            47789999999999888889998


No 41 
>PF07910 Peptidase_C78:  Peptidase family C78;  InterPro: IPR012462 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This entry contains UfSP1 and UfSP2, which are cysteine peptidases required for the processing and activation of Ubiquitin fold modifier 1 (Ufm1, IPR005375 from INTERPRO) and for its release from conjugated cellular proteins. UfSP1 and UfSP2 are 217 aa and 461 aa respectively [, ]. The peptidases belong to MEROPS peptidase family C78, clan CA. The UfSP2 family have an N-terminal extension with one or more zinc finger domains of the C2H2 type (IPR007087 from INTERPRO), which have been shown to be involved in protein:protein interaction. UfSP2 is present in most, if not all, multi-cellular organisms including plants, nematodes, flies, and mammals, whereas UfSP1 is not present in plants and nematodes []. ; PDB: 3OQC_B 2Z84_A.
Probab=25.49  E-value=72  Score=30.29  Aligned_cols=22  Identities=23%  Similarity=0.268  Sum_probs=15.0

Q ss_pred             CCeEEEEEEEEecCCCCCCCeEEE
Q psy17191        274 MTHAMVISAVSIDKETEEPTKWRV  297 (377)
Q Consensus       274 ~~HAM~ivG~~~d~~G~~~~yW~V  297 (377)
                      ..|+++|||+....+|.  .+.+|
T Consensus       155 ~ghS~TIvGie~~~~g~--~~LLV  176 (218)
T PF07910_consen  155 DGHSRTIVGIERNKDGE--VNLLV  176 (218)
T ss_dssp             TTEEEEEEEEEE-TT----EEEEE
T ss_pred             cccceEEEEEEECCCCC--EEEEE
Confidence            58999999999976776  45444


No 42 
>PTZ00200 cysteine proteinase; Provisional
Probab=24.32  E-value=16  Score=38.39  Aligned_cols=57  Identities=12%  Similarity=0.121  Sum_probs=37.9

Q ss_pred             ccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhhHH--hhhcceeeCcccc
Q psy17191         13 SMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLVNL--IVNHGLMPKNCLV   86 (377)
Q Consensus        13 ~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~~l--i~kYGvVPk~~~p   86 (377)
                      ..+||+++.+.=+|                + ....+--+++++|+..+|+.|++-+..+-  ..+-||.-...+|
T Consensus       385 ~~nHaV~lVGyG~d----------------~-~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~~~P  443 (448)
T PTZ00200        385 SLNHAVLLVGEGYD----------------E-KTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVGLTP  443 (448)
T ss_pred             CCcEEEEEEEeccc----------------C-CCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccceee
Confidence            36899999996555                1 11244557899999999999999776431  2345655444444


No 43 
>PF00648 Peptidase_C2:  Calpain family cysteine protease This is family C2 in the peptidase classification. ;  InterPro: IPR001300 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only [].  All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit:    A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb.  Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related [].  Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad [].   Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma [].  Calpains are a family of cytosolic cysteine proteinases (see PDOC00126 from PROSITEDOC). Members of the calpain family are believed to function in various biological processes, including integrin-mediated cell migration, cytoskeletal remodeling, cell differentiation and apoptosis [, ]. The calpain family includes numerous members from C. elegans to mammals and with homologues in yeast and bacteria. The best characterised members are the m- and mu-calpains, both proteins are heterodimer composed of a large catalytic subunit and a small regulatory subunit. The large subunit comprises four domains (dI-dIV) while the small subunit has two domains (dV-dVI). Domain dI is a short region cleaved by autolysis, dII is the catalytic core, dIII is a C2-like domain, dIV consists of five calcium binding EF-hand motifs []. The crystal structure of calpain has been solved [, ]. The catalytic region consists of two distinct structural domains (dIIa and dIIb). dIIa contains a central helix flanked on three faces by a cluster of alpha-helices and is entirely unrelated to the corresponding domain in the typical thiol proteinases. The fold of dIIb is similar to the corresponding domain in other cysteine proteinases and contains two three-stranded anti-parallel beta-sheets. The catalytic triad residues (C,H,N) are located in dIIa and dIIb. The activation of the domain is dependent on the binding of two calcium atoms in two non EF-hand calcium binding sites located in the catalytic core, one close to the Cys active site in dIIa and one at the end of dIIb. Calcium-binding induced conformational changes in the catalytic domain which align the active site [][]. The profile covers the whole catalytic domain.; GO: 0004198 calcium-dependent cysteine-type endopeptidase activity, 0006508 proteolysis, 0005622 intracellular; PDB: 2NQA_A 1KFU_L 1KFX_L 1QXP_B 2R9C_A 1TL9_A 2G8E_A 1KXR_B 2G8J_A 2NQG_A ....
Probab=23.74  E-value=4.8e+02  Score=25.18  Aligned_cols=30  Identities=23%  Similarity=0.341  Sum_probs=19.3

Q ss_pred             CCeEEEEEEEEecC-CCCCCCeEEEEcccCC
Q psy17191        274 MTHAMVISAVSIDK-ETEEPTKWRVENSWGE  303 (377)
Q Consensus       274 ~~HAM~ivG~~~d~-~G~~~~yW~VkNSWG~  303 (377)
                      .+||-.|+++.... +|..-+.-+++|.||.
T Consensus       213 ~~HaY~Vl~~~~~~~~~~~~~lv~LrNPwg~  243 (298)
T PF00648_consen  213 PGHAYAVLDVREVNGNGEGHRLVKLRNPWGS  243 (298)
T ss_dssp             TTS-EEEEEEEEEEETTEEEEEEEEE-TTSS
T ss_pred             cceeEEEEEEEeeccccceeEEEEEcCCCcc
Confidence            48999999997631 2321256789999985


No 44 
>smart00230 CysPc Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
Probab=23.19  E-value=4.8e+02  Score=25.84  Aligned_cols=54  Identities=22%  Similarity=0.281  Sum_probs=37.1

Q ss_pred             CCeEEEEEEEEecCCCCCCCeEEEEcccCC------------CC---------------CCCeEEEEehhhhhcccEEEE
Q psy17191        274 MTHAMVISAVSIDKETEEPTKWRVENSWGE------------EQ---------------NHKGYILMTSPWFKEYVFEVV  326 (377)
Q Consensus       274 ~~HAM~ivG~~~d~~G~~~~yW~VkNSWG~------------~~---------------G~kGY~~ms~~wf~~y~~~vv  326 (377)
                      .+||=.|+++..- +|.+..--+++|-||.            .|               .++|-|.|+-+=|-.+--.|.
T Consensus       227 ~~HaYsVl~v~~~-~~~~~~Ll~lrNPWg~~eW~G~wsd~s~~W~~~~~~~~~~l~~~~~~dG~FWM~~~df~~~F~~~~  305 (318)
T smart00230      227 KGHAYSVTDVREV-QGRRQELLRLRNPWGQVEWNGPWSDDSPEWRSVSASEKKNLGLTFDDDGEFWMSFEDFLRHFDKVE  305 (318)
T ss_pred             cCccEEEEEEEEE-ecCCeEEEEEECCCCCCCcCCCCCCCCccccccCHHHHHHhCCCCCCCCEEEEEhHHHHhhCCeEE
Confidence            4899999999864 4541137999999984            12               257888888766665555555


Q ss_pred             Ee
Q psy17191        327 VD  328 (377)
Q Consensus       327 V~  328 (377)
                      |=
T Consensus       306 vc  307 (318)
T smart00230      306 IC  307 (318)
T ss_pred             Ee
Confidence            53


No 45 
>COG3423 Nlp Predicted transcriptional regulator [Transcription]
Probab=23.18  E-value=71  Score=25.73  Aligned_cols=44  Identities=18%  Similarity=-0.119  Sum_probs=25.5

Q ss_pred             hhHHHHHHHHHH---hhccChhhhh---HHHHHHHHhHHHHhcCCChhHH
Q psy17191         96 NQPVELLMKLAA---ESIKNNEAVW---FGCENRIIRIRIIYNNQPVELL  139 (377)
Q Consensus        96 ~~Lr~~a~~lr~---~~~~~~~~l~---~~~l~~v~~il~i~lG~pP~~F  139 (377)
                      .-|++-...||.   +.|-+...|.   ..--..-.+|++.+||.||++.
T Consensus        15 A~Lkk~G~Sl~~LS~~agls~~tL~n~L~rp~pkgEriIA~algv~P~eI   64 (82)
T COG3423          15 AALKKKGTSLAALSREAGLSSSTLANALDRPWPKGERIIADALGVPPEEI   64 (82)
T ss_pred             HHHHHccccHHHHHHHcCCCHHHHHHHHcCCCchHHHHHHHHhCCCHHHh
Confidence            445555555665   3454443332   1111235789999999999763


No 46 
>PTZ00203 cathepsin L protease; Provisional
Probab=20.95  E-value=27  Score=35.43  Aligned_cols=40  Identities=10%  Similarity=0.136  Sum_probs=29.7

Q ss_pred             ccchhhhhhhhhhhhhhhhhHHHHHHHHHhccCCCCCcchheecccCCCCCCCchhhhh
Q psy17191         13 SMTHAMVISAVSIDKIERCNFYLNTVVETARRGEPVDGRLVSFLMQSPNGDGGQWDMLV   71 (377)
Q Consensus        13 ~~~~~~~~~~~f~DklEkan~fl~~~i~~~~~~~~~~~R~v~~~l~~~~gdgG~~~m~~   71 (377)
                      .++||+++.+  |+                . +...+--+++++|+..+|+.|++-+..
T Consensus       285 ~~nHaVliVG--YG----------------~-~~g~~YWiikNSWG~~WGe~GY~ri~r  324 (348)
T PTZ00203        285 QLNHGVLLVG--YN----------------M-TGEVPYWVIKNSWGEDWGEKGYVRVTM  324 (348)
T ss_pred             CCCeEEEEEE--Ee----------------c-CCCceEEEEEcCCCCCcCcCceEEEEc
Confidence            4689999999  54                0 112445568999999999999997754


Done!