Query         048002
Match_columns 351
No_of_seqs    276 out of 1793
Neff          8.2 
Searched_HMMs 46136
Date          Fri Mar 29 05:19:37 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/048002.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/048002hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542 Cysteine proteinase Ca 100.0 5.9E-78 1.3E-82  548.2  21.8  280   30-337    65-371 (372)
  2 PTZ00203 cathepsin L protease; 100.0 5.1E-74 1.1E-78  545.7  32.5  309    5-346    10-346 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0   4E-71 8.7E-76  540.5  28.8  288   29-338   162-489 (489)
  4 PTZ00200 cysteine proteinase;  100.0 8.4E-70 1.8E-74  529.6  30.2  291   24-339   114-447 (448)
  5 KOG1543 Cysteine proteinase Ca 100.0   2E-63 4.4E-68  469.8  26.6  266   40-336    30-323 (325)
  6 cd02621 Peptidase_C1A_Cathepsi 100.0 2.2E-51 4.7E-56  375.8  19.4  190  122-334     1-239 (243)
  7 cd02698 Peptidase_C1A_Cathepsi 100.0 4.9E-51 1.1E-55  372.2  19.3  207  122-336     1-237 (239)
  8 cd02248 Peptidase_C1A Peptidas 100.0 1.4E-50 3.1E-55  362.2  19.5  188  123-335     1-210 (210)
  9 cd02620 Peptidase_C1A_Cathepsi 100.0 1.8E-50 3.8E-55  367.9  18.1  203  123-333     1-234 (236)
 10 PTZ00364 dipeptidyl-peptidase  100.0 1.1E-48 2.4E-53  386.8  20.7  197  120-335   203-459 (548)
 11 PF00112 Peptidase_C1:  Papain  100.0 7.4E-49 1.6E-53  352.3  14.0  194  122-336     1-219 (219)
 12 PTZ00049 cathepsin C-like prot 100.0   1E-47 2.2E-52  383.6  21.1  212  119-336   378-675 (693)
 13 smart00645 Pept_C1 Papain fami 100.0 2.2E-43 4.8E-48  306.7  14.7  168  122-331     1-169 (174)
 14 cd02619 Peptidase_C1 C1 Peptid 100.0 5.5E-41 1.2E-45  301.9  18.0  177  125-319     1-213 (223)
 15 KOG1544 Predicted cysteine pro 100.0 3.1E-42 6.7E-47  310.3   6.7  263   64-333   151-456 (470)
 16 PTZ00462 Serine-repeat antigen 100.0 2.5E-40 5.5E-45  339.1  20.5  206  134-345   544-789 (1004)
 17 COG4870 Cysteine protease [Pos  99.9 5.6E-25 1.2E-29  203.6   5.7  184  120-321    97-316 (372)
 18 cd00585 Peptidase_C1B Peptidas  99.8 6.4E-20 1.4E-24  178.5   9.7   76  135-211    55-159 (437)
 19 PF08246 Inhibitor_I29:  Cathep  99.7 2.5E-16 5.4E-21  111.6   7.0   56   36-91      1-58  (58)
 20 PF03051 Peptidase_C1_2:  Pepti  99.6 4.6E-15 9.9E-20  144.8   8.2   76  135-211    56-160 (438)
 21 smart00848 Inhibitor_I29 Cathe  99.5 3.7E-14 8.1E-19   99.9   5.5   55   36-90      1-57  (57)
 22 COG3579 PepC Aminopeptidase C   98.8   3E-08 6.5E-13   91.5  10.7   75  136-211    59-162 (444)
 23 PF08127 Propeptide_C1:  Peptid  96.7  0.0013 2.8E-08   42.6   2.2   35   63-99      3-37  (41)
 24 KOG4128 Bleomycin hydrolases a  96.6  0.0012 2.5E-08   61.5   2.4   75  135-210    63-168 (457)
 25 PF13529 Peptidase_C39_2:  Pept  75.0      24 0.00051   28.1   8.5   21  255-275    87-108 (144)
 26 PF07172 GRP:  Glycine rich pro  54.0     8.1 0.00017   29.9   1.5   19    1-19      5-23  (95)
 27 KOG4128 Bleomycin hydrolases a  47.0     4.3 9.4E-05   38.4  -1.1   24  293-316   389-412 (457)
 28 KOG4654 Uncharacterized conser  28.7   2E+02  0.0043   25.2   6.1   75   19-94    112-201 (252)
 29 TIGR02744 TrbI_Ftype type-F co  28.1 1.8E+02  0.0039   23.2   5.4   46   28-73     35-82  (112)
 30 COG4871 Uncharacterized protei  24.2      49  0.0011   28.1   1.6   17  136-152   135-153 (193)
 31 PF05543 Peptidase_C47:  Stapho  23.9 4.8E+02    0.01   22.6   8.4  119  139-317    18-153 (175)

No 1  
>KOG1542 consensus Cysteine proteinase Cathepsin F [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=5.9e-78  Score=548.20  Aligned_cols=280  Identities=39%  Similarity=0.700  Sum_probs=246.0

Q ss_pred             HHHHHHHHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHccCC-CCeEEEcccCCCCChhhhhhhcCCcccc-CccCC
Q 048002           30 ECLWDLYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMD-KPYKLRLNRFADMTNHEFMSSRSSKVSH-HRMLH  106 (351)
Q Consensus        30 ~~~~~~f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~~-~s~~~g~N~FsD~t~eEf~~~~~~~~~~-~~~~~  106 (351)
                      ..+.+.|..|+.+| |+|.+.+|..+|+.+|++|+..++++++.+ .|.+.|+|+|||||+|||++++++.+.. .+...
T Consensus        65 l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~  144 (372)
T KOG1542|consen   65 LGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG  144 (372)
T ss_pred             cchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence            34478999999999 999999999999999999999999998886 4999999999999999999999876541 11111


Q ss_pred             CCCCccccccCCCCCCCCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHHhhhcCCCCCCCCC
Q 048002          107 GPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDG  186 (351)
Q Consensus       107 ~~~~~~~~~~~~~~~lP~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~l~dc~~~~~gC~G  186 (351)
                      ....   ........||++||||++|.||||||||+||||||||+++++|++++|++|++++||||+|+||+..++||+|
T Consensus       145 ~~~~---~~~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~G  221 (372)
T KOG1542|consen  145 DAAE---APIEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCNG  221 (372)
T ss_pred             cccc---CcCCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCCC
Confidence            1111   1113346899999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CchHHHHHHHHHcCCCCCCCCCccccCCC-CCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHH-
Q 048002          187 GLMEQALNFIAKSEGLTTEKSYPYTAKDG-SCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAV-  264 (351)
Q Consensus       187 G~~~~a~~~~~~~~Gi~~e~~yPY~~~~~-~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l-  264 (351)
                      |.+..|++|+.+.+|+..|.+|||++..+ .|...+                  ....+.|+++..++. ||++|.+.| 
T Consensus       222 Gl~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C~~~~------------------~~~~v~I~~f~~l~~-nE~~ia~wLv  282 (372)
T KOG1542|consen  222 GLMDNAFKYIKKAGGLEKEKDYPYTGKKGNQCHFDK------------------SKIVVSIKDFSMLSN-NEDQIAAWLV  282 (372)
T ss_pred             CChhHHHHHHHHhCCccccccCCccccCCCccccch------------------hhceEEEeccEecCC-CHHHHHHHHH
Confidence            99999999988878999999999999887 899876                  567889999999976 888888887 


Q ss_pred             hcCCEEEEEecCCCCccCCCC----------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCC
Q 048002          265 ANQPVAVAIDAGGKDFQFYSE----------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAE  322 (351)
Q Consensus       265 ~~gPV~v~~~~~~~~f~~Y~~----------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~  322 (351)
                      ++|||+|+|++.  .+++|++                      |||.+.-.++|||||||||++|||+||+||.||.   
T Consensus       283 ~~GPi~vgiNa~--~mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~---  357 (372)
T KOG1542|consen  283 TFGPLSVGINAK--PMQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGS---  357 (372)
T ss_pred             hcCCeEEEEchH--HHHHhcccccCCCcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccc---
Confidence            789999999975  7999988                      9999722899999999999999999999999995   


Q ss_pred             CCCcccccccceeee
Q 048002          323 EGLCGITLEASYPVK  337 (351)
Q Consensus       323 ~~~Cgi~~~~~yp~~  337 (351)
                       |.|||+++++-+.+
T Consensus       358 -N~CGi~~mvss~~v  371 (372)
T KOG1542|consen  358 -NACGIADMVSSAAV  371 (372)
T ss_pred             -cccccccchhhhhc
Confidence             56999999876654


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=5.1e-74  Score=545.70  Aligned_cols=309  Identities=34%  Similarity=0.633  Sum_probs=248.1

Q ss_pred             HHHHHHHHHHhhccccCCccccCCHHHHHHHHHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHccCCCCeEEEcccC
Q 048002            5 VGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRF   83 (351)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~~~s~~~g~N~F   83 (351)
                      .+...|++++++++....   +.....+..+|++||++| |.|.+.+|+.+|+.||++|+++|++||+++.+|++|+|+|
T Consensus        10 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~F   86 (348)
T PTZ00203         10 AVAVVCVVLAAACAPARA---IYVGTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKF   86 (348)
T ss_pred             HHHHHHHHHHHhhccchh---cccccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEecccc
Confidence            445566667776654321   223456777899999999 9999888999999999999999999998888999999999


Q ss_pred             CCCChhhhhhhcCCccc-cCccCCCCCCcccccc--CCCCCCCCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHH
Q 048002           84 ADMTNHEFMSSRSSKVS-HHRMLHGPRRQTGFMH--GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINK  160 (351)
Q Consensus        84 sD~t~eEf~~~~~~~~~-~~~~~~~~~~~~~~~~--~~~~~lP~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~  160 (351)
                      +|||.|||++++++... .... ...... .+..  ....++|++||||++|+|+||||||.||||||||+++++|++++
T Consensus        87 aDlT~eEf~~~~l~~~~~~~~~-~~~~~~-~~~~~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~  164 (348)
T PTZ00203         87 FDLSEAEFAARYLNGAAYFAAA-KQHAGQ-HYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWA  164 (348)
T ss_pred             ccCCHHHHHHHhcCCCcccccc-cccccc-cccccccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHH
Confidence            99999999988764211 1100 000000 0111  12246899999999999999999999999999999999999999


Q ss_pred             HhhCCCccCCHHHhhhcCCCCCCCCCCchHHHHHHHHHc--CCCCCCCCCccccCCC---CCCCCCccchhhhhhccccc
Q 048002          161 IKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKS--EGLTTEKSYPYTAKDG---SCELPTSMVSIIYRVHICSW  235 (351)
Q Consensus       161 i~~~~~~~lS~q~l~dc~~~~~gC~GG~~~~a~~~~~~~--~Gi~~e~~yPY~~~~~---~c~~~~~~~~~~~~~~~~~~  235 (351)
                      |++++.+.||+|+|+||+..+.||+||++..|++|+.++  +|+++|++|||.+.++   .|....              
T Consensus       165 i~~~~~~~LSeQqLvdC~~~~~GC~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~--------------  230 (348)
T PTZ00203        165 VAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSS--------------  230 (348)
T ss_pred             HhcCCCccCCHHHHHhccCCCCCCCCCCHHHHHHHHHHhcCCCCCccccCCCccCCCCCCcCCCCc--------------
Confidence            999999999999999999878899999999999999764  5789999999988765   465322              


Q ss_pred             CCCCCCCcEEecceEEcCCChHHHHHHHHh-cCCEEEEEecCCCCccCCCC------------------ccccCCCCccE
Q 048002          236 NGDKNAPEVILDGYEMVPESDENALMKAVA-NQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKY  296 (351)
Q Consensus       236 ~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~-~gPV~v~~~~~~~~f~~Y~~------------------Gyg~~~~g~~y  296 (351)
                         ......++.+|..++. +++.|+.+|. +|||+|+|++.  +|++|++                  |||.+ +|++|
T Consensus       231 ---~~~~~~~i~~~~~i~~-~e~~~~~~l~~~GPv~v~i~a~--~f~~Y~~GIy~~c~~~~~nHaVliVGYG~~-~g~~Y  303 (348)
T PTZ00203        231 ---ELAPGARIDGYVSMES-SERVMAAWLAKNGPISIAVDAS--SFMSYHSGVLTSCIGEQLNHGVLLVGYNMT-GEVPY  303 (348)
T ss_pred             ---ccccceEecceeecCc-CHHHHHHHHHhCCCEEEEEEhh--hhcCccCceeeccCCCCCCeEEEEEEEecC-CCceE
Confidence               1123467889988865 7888999885 69999999985  7999998                  99986 78999


Q ss_pred             EEEEcCCCCCccCCceEEEEecCCCCCCCcccccccceeeecCCCCCCCC
Q 048002          297 WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSRHP  346 (351)
Q Consensus       297 WivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~yp~~~~~~~~~~~  346 (351)
                      ||||||||++|||+|||||+|+.    |.|||++   ||+......+.+|
T Consensus       304 WiikNSWG~~WGe~GY~ri~rg~----n~Cgi~~---~~~~~~~~~~~~~  346 (348)
T PTZ00203        304 WVIKNSWGEDWGEKGYVRVTMGV----NACLLTG---YPVSVHVSQSPTP  346 (348)
T ss_pred             EEEEcCCCCCcCcCceEEEEcCC----Ccccccc---eEEEEecCCCCCC
Confidence            99999999999999999999984    4699995   4554566666665


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=4e-71  Score=540.51  Aligned_cols=288  Identities=34%  Similarity=0.569  Sum_probs=238.0

Q ss_pred             HHHHHHHHHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHccCC-CCeEEEcccCCCCChhhhhhhcCCcccc--Ccc
Q 048002           29 EECLWDLYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMD-KPYKLRLNRFADMTNHEFMSSRSSKVSH--HRM  104 (351)
Q Consensus        29 ~~~~~~~f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~~-~s~~~g~N~FsD~t~eEf~~~~~~~~~~--~~~  104 (351)
                      ..+....|++|+.+| |+|.+.+|+..|+.+|++|+++|++||+++ .+|++|+|+|+|||.|||++++++....  ...
T Consensus       162 n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~  241 (489)
T PTZ00021        162 NLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSN  241 (489)
T ss_pred             ChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccccc
Confidence            345557899999999 999999999999999999999999999864 7999999999999999999888764321  000


Q ss_pred             CCCCCCccc-------cccCCCCCCCCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHHhhhc
Q 048002          105 LHGPRRQTG-------FMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDC  177 (351)
Q Consensus       105 ~~~~~~~~~-------~~~~~~~~lP~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~l~dc  177 (351)
                      .........       +........|.+||||++|.|+||||||.||||||||+++++|++++|++++.+.||+|+|+||
T Consensus       242 ~~~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVDC  321 (489)
T PTZ00021        242 GKKSPRVINYDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDC  321 (489)
T ss_pred             cccccccccccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhhh
Confidence            000000000       0011111249999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCCCCCCCCchHHHHHHHHHcCCCCCCCCCccccC-CCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCCh
Q 048002          178 DKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK-DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESD  256 (351)
Q Consensus       178 ~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~-~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~  256 (351)
                      +..+.||+||++..|+.|+.+++||++|++|||.+. .+.|....                  ....++|.+|..++   
T Consensus       322 s~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~~~~~~C~~~~------------------~~~~~~i~~y~~i~---  380 (489)
T PTZ00021        322 SFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVSDTPELCNIDR------------------CKEKYKIKSYVSIP---  380 (489)
T ss_pred             ccCCCCCCCcchHhhhhhhhhccccCcccccCccCCCCCcccccc------------------ccccceeeeEEEec---
Confidence            988899999999999999987779999999999986 36786543                  23456888998885   


Q ss_pred             HHHHHHHHh-cCCEEEEEecCCCCccCCCC------------------ccccCC---------CCccEEEEEcCCCCCcc
Q 048002          257 ENALMKAVA-NQPVAVAIDAGGKDFQFYSE------------------GYGATQ---------DGTKYWIVKNSWGTDWE  308 (351)
Q Consensus       257 ~~~i~~~l~-~gPV~v~~~~~~~~f~~Y~~------------------Gyg~~~---------~g~~yWivkNSWG~~WG  308 (351)
                      +++|+++|. .|||+|+|++. .+|++|++                  |||+++         .+.+|||||||||++||
T Consensus       381 ~~~lk~al~~~GPVsv~i~a~-~~f~~YkgGIy~~~C~~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WG  459 (489)
T PTZ00021        381 EDKFKEAIRFLGPISVSIAVS-DDFAFYKGGIFDGECGEEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWG  459 (489)
T ss_pred             HHHHHHHHHhcCCeEEEEEee-cccccCCCCcCCCCCCCccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcc
Confidence            468999995 69999999997 68999987                  999752         12579999999999999


Q ss_pred             CCceEEEEecCCCCCCCcccccccceeeec
Q 048002          309 EKGYIRMLRGIDAEEGLCGITLEASYPVKL  338 (351)
Q Consensus       309 e~Gy~~i~r~~~~~~~~Cgi~~~~~yp~~~  338 (351)
                      |+|||||+|+.+...|+|||++.+.||+++
T Consensus       460 E~GY~rI~r~~~g~~n~CGI~t~a~yP~~~  489 (489)
T PTZ00021        460 EKGFIRIETDENGLMKTCSLGTEAYVPLIE  489 (489)
T ss_pred             cCeEEEEEcCCCCCCCCCCCcccceeEecC
Confidence            999999999975445789999999999863


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=8.4e-70  Score=529.64  Aligned_cols=291  Identities=34%  Similarity=0.623  Sum_probs=239.8

Q ss_pred             cccCCHHHHHHHHHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHccCCCCeEEEcccCCCCChhhhhhhcCCccccC
Q 048002           24 SDLASEECLWDLYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHH  102 (351)
Q Consensus        24 ~~~~~~~~~~~~f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~~~s~~~g~N~FsD~t~eEf~~~~~~~~~~~  102 (351)
                      .+...+.++...|++|+++| |.|.+.+|+.+|+.+|++|+++|++||. +.+|++|+|+|+|||.|||.+++++...+.
T Consensus       114 ~~~~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~-~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~  192 (448)
T PTZ00200        114 DDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKG-DEPYSKEINKFSDLTEEEFRKLFPVIKVPP  192 (448)
T ss_pred             CCccchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcC-cCCeEEeccccccCCHHHHHHHhccCCCcc
Confidence            34456677888999999999 9999999999999999999999999996 468999999999999999998876533211


Q ss_pred             ccC---CC-------CCCcc---cccc-----C---C-CCCCCCceeccCCCCCCccCCCC-CCchHHHHHHHHHhhhhH
Q 048002          103 RML---HG-------PRRQT---GFMH-----G---K-TQDLPPSVDWRKQGAVTGVKDQG-RCGSCWAFSTVVSVEGIN  159 (351)
Q Consensus       103 ~~~---~~-------~~~~~---~~~~-----~---~-~~~lP~~~Dwr~~g~v~pVknQg-~cGsCwAfA~~~~~e~~~  159 (351)
                      ...   ..       .....   ....     .   . ...+|++||||+.|.|+|||||| .||||||||+++++|+++
T Consensus       193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~  272 (448)
T PTZ00200        193 KSNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLY  272 (448)
T ss_pred             cccccccccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHH
Confidence            000   00       00000   0000     0   0 12369999999999999999999 999999999999999999


Q ss_pred             HHhhCCCccCCHHHhhhcCCCCCCCCCCchHHHHHHHHHcCCCCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCC
Q 048002          160 KIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDK  239 (351)
Q Consensus       160 ~i~~~~~~~lS~q~l~dc~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~  239 (351)
                      +|+++..+.||+|+|+||+..+.||+||++..|++|+.++ ||++|++|||.+..+.|....                  
T Consensus       273 ~i~~~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~-Gi~~e~~YPY~~~~~~C~~~~------------------  333 (448)
T PTZ00200        273 KIYRDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNK-GLSSSSDVPYLAKDGKCVVSS------------------  333 (448)
T ss_pred             HHhcCCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhc-CccccccCCCCCCCCCCcCCC------------------
Confidence            9999999999999999999878899999999999999887 999999999999888997543                  


Q ss_pred             CCCcEEecceEEcCCChHHHHHHHHhcCCEEEEEecCCCCccCCCC------------------ccccC-CCCccEEEEE
Q 048002          240 NAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGAT-QDGTKYWIVK  300 (351)
Q Consensus       240 ~~~~~~i~~~~~v~~~~~~~i~~~l~~gPV~v~~~~~~~~f~~Y~~------------------Gyg~~-~~g~~yWivk  300 (351)
                       .....|.+|..++  ..+.|++++..|||+|+|+++ .+|+.|++                  |||.+ .+|.+|||||
T Consensus       334 -~~~~~i~~y~~~~--~~~~l~~~l~~GPV~v~i~~~-~~f~~Yk~GIy~~~C~~~~nHaV~lVGyG~d~~~g~~YWIIk  409 (448)
T PTZ00200        334 -TKKVYIDSYLVAK--GKDVLNKSLVISPTVVYIAVS-RELLKYKSGVYNGECGKSLNHAVLLVGEGYDEKTKKRYWIIK  409 (448)
T ss_pred             -CCeeEecceEecC--HHHHHHHHHhcCCEEEEeecc-cccccCCCCccccccCCCCcEEEEEEEecccCCCCCceEEEE
Confidence             2335688887663  356778788889999999997 79999988                  88753 3688999999


Q ss_pred             cCCCCCccCCceEEEEecCCCCCCCcccccccceeeecC
Q 048002          301 NSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLH  339 (351)
Q Consensus       301 NSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~yp~~~~  339 (351)
                      ||||++|||+|||||+|+.. ..|.|||++.+.||++..
T Consensus       410 NSWG~~WGe~GY~ri~r~~~-g~n~CGI~~~~~~P~~~~  447 (448)
T PTZ00200        410 NSWGTDWGENGYMRLERTNE-GTDKCGILTVGLTPVFYS  447 (448)
T ss_pred             cCCCCCcccCeeEEEEeCCC-CCCcCCccccceeeEEec
Confidence            99999999999999999742 246899999999999843


No 5  
>KOG1543 consensus Cysteine proteinase Cathepsin L [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=2e-63  Score=469.82  Aligned_cols=266  Identities=45%  Similarity=0.808  Sum_probs=230.8

Q ss_pred             HHhc-cccCChHHHHHHHHHHHHHHHHHHHHccC-CCCeEEEcccCCCCChhhhhhhcCCccccCccCCCCCCccccccC
Q 048002           40 RSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHG  117 (351)
Q Consensus        40 ~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~-~~s~~~g~N~FsD~t~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~  117 (351)
                      +.+| +.|.+..|...|+.+|.+|++.|+.||.. ..+|++|+|+|+|++.+|++..+.+.+.+..  .. ...  ....
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~--~~-~~~--~~~~  104 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEI--KR-DKF--TEKL  104 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccc--cc-ccc--cccc
Confidence            7778 99987789999999999999999999998 6899999999999999999988876554221  00 000  1112


Q ss_pred             CCCCCCCceeccCCC-CCCccCCCCCCchHHHHHHHHHhhhhHHHhhC-CCccCCHHHhhhcCC-CCCCCCCCchHHHHH
Q 048002          118 KTQDLPPSVDWRKQG-AVTGVKDQGRCGSCWAFSTVVSVEGINKIKTG-ELWSLSEQELVDCDK-DNHGCDGGLMEQALN  194 (351)
Q Consensus       118 ~~~~lP~~~Dwr~~g-~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~-~~~~lS~q~l~dc~~-~~~gC~GG~~~~a~~  194 (351)
                      ...++|++||||+++ .++||||||.||||||||++++||++++|+++ .++.||+|+|+||+. .++||+||.+..|++
T Consensus       105 ~~~~~p~s~DwR~~~~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~~~~A~~  184 (325)
T KOG1543|consen  105 DGDDLPDSFDWRDKGAVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGEPKNAFK  184 (325)
T ss_pred             chhhCCCCccccccCCcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCCHHHHHH
Confidence            245799999999996 55669999999999999999999999999999 899999999999998 588999999999999


Q ss_pred             HHHHcCCCCC-CCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHh-cCCEEEE
Q 048002          195 FIAKSEGLTT-EKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVA-NQPVAVA  272 (351)
Q Consensus       195 ~~~~~~Gi~~-e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~-~gPV~v~  272 (351)
                      |+.++ |+++ +.+|||.+..+.|....                  ....+.+.++..++.. +++|+.+|+ +|||+|+
T Consensus       185 yi~~~-G~~t~~~~Ypy~~~~~~C~~~~------------------~~~~~~~~~~~~~~~~-e~~i~~~v~~~GPv~v~  244 (325)
T KOG1543|consen  185 YIKKN-GGVTECENYPYIGKDGTCKSNK------------------KDKTVTIKGFYNVPAN-EEAIAEAVAKNGPVSVA  244 (325)
T ss_pred             HHHHh-CCCCCCcCCCCcCCCCCccCCC------------------ccceeEeeeeeecCcC-HHHHHHHHHhcCCeEEE
Confidence            99998 6666 99999999999998876                  2456778888888775 999999995 5899999


Q ss_pred             EecCCCCccCCCC--------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCccccccc
Q 048002          273 IDAGGKDFQFYSE--------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEA  332 (351)
Q Consensus       273 ~~~~~~~f~~Y~~--------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~  332 (351)
                      |++. .+|++|++                    |||+ .++.+|||||||||++|||+|||||.|+++.    |+|+..+
T Consensus       245 ~~a~-~~F~~Y~~GVy~~~~~~~~~~~Hav~iVGyG~-~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~----~~I~~~~  318 (325)
T KOG1543|consen  245 IDAY-EDFSLYKGGVYAEEKGDDKEGDHAVLIVGYGT-GDGVDYWIVKNSWGTDWGEKGYFRIARGVNK----CGIASEA  318 (325)
T ss_pred             Eeeh-hhhhhccCceEeCCCCCCCCCCceEEEEEEcC-CCCceeEEEEcCCCCCcccCceEEEecCCCc----hhhhccc
Confidence            9999 59999998                    9999 5889999999999999999999999999765    9999999


Q ss_pred             ce-ee
Q 048002          333 SY-PV  336 (351)
Q Consensus       333 ~y-p~  336 (351)
                      .| |+
T Consensus       319 ~~~p~  323 (325)
T KOG1543|consen  319 SYGPI  323 (325)
T ss_pred             ccCCC
Confidence            88 65


No 6  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopt the papain fold and form the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=2.2e-51  Score=375.75  Aligned_cols=190  Identities=39%  Similarity=0.783  Sum_probs=159.4

Q ss_pred             CCCceeccCCC----CCCccCCCCCCchHHHHHHHHHhhhhHHHhhCC------CccCCHHHhhhcCCCCCCCCCCchHH
Q 048002          122 LPPSVDWRKQG----AVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGE------LWSLSEQELVDCDKDNHGCDGGLMEQ  191 (351)
Q Consensus       122 lP~~~Dwr~~g----~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~------~~~lS~q~l~dc~~~~~gC~GG~~~~  191 (351)
                      ||++||||+.+    +|+||||||.||||||||++++||++++|++++      .+.||+|+|+||+..+.||+||++..
T Consensus         1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~   80 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL   80 (243)
T ss_pred             CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence            79999999998    999999999999999999999999999998876      68999999999998778999999999


Q ss_pred             HHHHHHHcCCCCCCCCCcccc-CCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEc----CCChHHHHHHHH-h
Q 048002          192 ALNFIAKSEGLTTEKSYPYTA-KDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV----PESDENALMKAV-A  265 (351)
Q Consensus       192 a~~~~~~~~Gi~~e~~yPY~~-~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v----~~~~~~~i~~~l-~  265 (351)
                      ++.|+.++ |+++|.+|||.. ..+.|.....                 ....+++..|..+    ...++++|+++| .
T Consensus        81 a~~~~~~~-Gi~~e~~yPY~~~~~~~C~~~~~-----------------~~~~~~~~~~~~i~~~~~~~~~~~ik~~i~~  142 (243)
T cd02621          81 VGKFAEDF-GIVTEDYFPYTADDDRPCKASPS-----------------ECRRYYFSDYNYVGGCYGCTNEDEMKWEIYR  142 (243)
T ss_pred             HHHHHHhc-CcCCCceeCCCCCCCCCCCCCcc-----------------ccccccccceeEcccccccCCHHHHHHHHHH
Confidence            99999887 999999999998 6777875430                 1111222333322    124788999998 5


Q ss_pred             cCCEEEEEecCCCCccCCCC--------------------------------ccccCC-CCccEEEEEcCCCCCccCCce
Q 048002          266 NQPVAVAIDAGGKDFQFYSE--------------------------------GYGATQ-DGTKYWIVKNSWGTDWEEKGY  312 (351)
Q Consensus       266 ~gPV~v~~~~~~~~f~~Y~~--------------------------------Gyg~~~-~g~~yWivkNSWG~~WGe~Gy  312 (351)
                      +|||+|+|++. ++|++|++                                |||++. +|.+|||||||||++|||+||
T Consensus       143 ~GPv~v~~~~~-~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy  221 (243)
T cd02621         143 NGPIVVAFEVY-SDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGY  221 (243)
T ss_pred             cCCEEEEEEec-ccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCe
Confidence            79999999997 78987742                                788763 388999999999999999999


Q ss_pred             EEEEecCCCCCCCcccccccce
Q 048002          313 IRMLRGIDAEEGLCGITLEASY  334 (351)
Q Consensus       313 ~~i~r~~~~~~~~Cgi~~~~~y  334 (351)
                      |||+|+.    +.|||++.+.+
T Consensus       222 ~~i~~~~----~~cgi~~~~~~  239 (243)
T cd02621         222 FKIRRGT----NECGIESQAVF  239 (243)
T ss_pred             EEEecCC----cccCcccceEe
Confidence            9999985    46999998865


No 7  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=4.9e-51  Score=372.24  Aligned_cols=207  Identities=29%  Similarity=0.567  Sum_probs=169.2

Q ss_pred             CCCceeccCCC---CCCccCCCC---CCchHHHHHHHHHhhhhHHHhhC---CCccCCHHHhhhcCCCCCCCCCCchHHH
Q 048002          122 LPPSVDWRKQG---AVTGVKDQG---RCGSCWAFSTVVSVEGINKIKTG---ELWSLSEQELVDCDKDNHGCDGGLMEQA  192 (351)
Q Consensus       122 lP~~~Dwr~~g---~v~pVknQg---~cGsCwAfA~~~~~e~~~~i~~~---~~~~lS~q~l~dc~~~~~gC~GG~~~~a  192 (351)
                      ||++||||+.+   +|+||||||   .||||||||++++||++++|+++   ..+.||+|+|+||+. +.||+||++..+
T Consensus         1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~-~~gC~GG~~~~a   79 (239)
T cd02698           1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAG-GGSCHGGDPGGV   79 (239)
T ss_pred             CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCC-CCCccCcCHHHH
Confidence            69999999987   999999998   89999999999999999999875   357899999999997 789999999999


Q ss_pred             HHHHHHcCCCCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHH-hcCCEEE
Q 048002          193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAV-ANQPVAV  271 (351)
Q Consensus       193 ~~~~~~~~Gi~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l-~~gPV~v  271 (351)
                      ++|+.++ |+++|.+|||....+.|..... ...+.....|...  .....+++++|..++  +++.|+++| .+|||+|
T Consensus        80 ~~~~~~~-Gl~~e~~yPY~~~~~~C~~~~~-~~~c~~~~~c~~~--~~~~~~~i~~~~~~~--~~~~i~~~l~~~GPV~v  153 (239)
T cd02698          80 YEYAHKH-GIPDETCNPYQAKDGECNPFNR-CGTCNPFGECFAI--KNYTLYFVSDYGSVS--GRDKMMAEIYARGPISC  153 (239)
T ss_pred             HHHHHHc-CcCCCCeeCCcCCCCCCcCCCC-CCCcccCcccccc--cccceEEeeeceecC--CHHHHHHHHHHcCCEEE
Confidence            9999987 9999999999988777864221 0011111122111  123456788888774  467888887 6799999


Q ss_pred             EEecCCCCccCCCC-------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCC-CCCCCcccccc
Q 048002          272 AIDAGGKDFQFYSE-------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGID-AEEGLCGITLE  331 (351)
Q Consensus       272 ~~~~~~~~f~~Y~~-------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~-~~~~~Cgi~~~  331 (351)
                      +|.+. .+|+.|++                   |||++.+|++|||||||||++|||+|||||+|+.. +..++|||++.
T Consensus       154 ~i~~~-~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~  232 (239)
T cd02698         154 GIMAT-EALENYTGGVYKEYVQDPLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEED  232 (239)
T ss_pred             EEEec-ccccccCCeEEccCCCCCcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCcccccccccccc
Confidence            99998 68999988                   99987348999999999999999999999999961 22357999999


Q ss_pred             cceee
Q 048002          332 ASYPV  336 (351)
Q Consensus       332 ~~yp~  336 (351)
                      +.|+.
T Consensus       233 ~~~~~  237 (239)
T cd02698         233 CAWAD  237 (239)
T ss_pred             eEEEe
Confidence            98874


No 8  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=1.4e-50  Score=362.20  Aligned_cols=188  Identities=58%  Similarity=1.076  Sum_probs=170.5

Q ss_pred             CCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHHhhhcCCC-CCCCCCCchHHHHHHHHHcCC
Q 048002          123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSEG  201 (351)
Q Consensus       123 P~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~~~~~~G  201 (351)
                      |++||||+.+.++||+|||.||+|||||++++||++++++++..++||+|+|++|... +.+|.||.+..|++++.+. |
T Consensus         1 P~~~d~r~~~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~-G   79 (210)
T cd02248           1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG-G   79 (210)
T ss_pred             CCcccCCcCCCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC-C
Confidence            8899999999999999999999999999999999999999998899999999999974 7899999999999999886 9


Q ss_pred             CCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHH-hcCCEEEEEecCCCCc
Q 048002          202 LTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAV-ANQPVAVAIDAGGKDF  280 (351)
Q Consensus       202 i~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l-~~gPV~v~~~~~~~~f  280 (351)
                      +++|++|||......|....                  .....++.+|..++..++++||++| .+|||+++|.+. ++|
T Consensus        80 i~~e~~yPY~~~~~~C~~~~------------------~~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~-~~f  140 (210)
T cd02248          80 LASESDYPYTGKDGTCKYNS------------------SKVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDAS-SSF  140 (210)
T ss_pred             cCccccCCccCCCCCccCCC------------------CcccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecC-ccc
Confidence            99999999998777887654                  3456889999999876789999999 558999999997 789


Q ss_pred             cCCCC--------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCccccccccee
Q 048002          281 QFYSE--------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP  335 (351)
Q Consensus       281 ~~Y~~--------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~yp  335 (351)
                      +.|++                    |||++ .+.+|||||||||++||++|||||+|+.    +.|||+..+.||
T Consensus       141 ~~y~~Giy~~~~~~~~~~~Hav~iVGy~~~-~~~~ywiv~NSWG~~WG~~Gy~~i~~~~----~~cgi~~~~~~~  210 (210)
T cd02248         141 QFYKGGIYSGPCCSNTNLNHAVLLVGYGTE-NGVDYWIVKNSWGTSWGEKGYIRIARGS----NLCGIASYASYP  210 (210)
T ss_pred             ccCCCCceeCCCCCCCcCCEEEEEEEEeec-CCceEEEEEcCCCCccccCcEEEEEcCC----CccCceeeeecC
Confidence            98876                    99987 6899999999999999999999999985    469999988876


No 9  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=1.8e-50  Score=367.92  Aligned_cols=203  Identities=35%  Similarity=0.671  Sum_probs=160.6

Q ss_pred             CCceeccCC--CC--CCccCCCCCCchHHHHHHHHHhhhhHHHhhC--CCccCCHHHhhhcCCC-CCCCCCCchHHHHHH
Q 048002          123 PPSVDWRKQ--GA--VTGVKDQGRCGSCWAFSTVVSVEGINKIKTG--ELWSLSEQELVDCDKD-NHGCDGGLMEQALNF  195 (351)
Q Consensus       123 P~~~Dwr~~--g~--v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~  195 (351)
                      |++||||++  ++  |+||+|||.||||||||++++||++++|+++  +.+.||+|+|+||+.. +.||+||++..|++|
T Consensus         1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~   80 (236)
T cd02620           1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY   80 (236)
T ss_pred             CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence            889999997  45  4599999999999999999999999999888  7789999999999875 789999999999999


Q ss_pred             HHHcCCCCCCCCCccccCCCCCCCCCcc--chhhhhhcccccCCC--CCCCcEEecceEEcCCChHHHHHHHH-hcCCEE
Q 048002          196 IAKSEGLTTEKSYPYTAKDGSCELPTSM--VSIIYRVHICSWNGD--KNAPEVILDGYEMVPESDENALMKAV-ANQPVA  270 (351)
Q Consensus       196 ~~~~~Gi~~e~~yPY~~~~~~c~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~i~~~~~v~~~~~~~i~~~l-~~gPV~  270 (351)
                      ++++ |+++|++|||......|......  .........|.....  .....+++.++..+. .++++||.+| .+|||+
T Consensus        81 i~~~-G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~-~~~~~ik~~l~~~GPv~  158 (236)
T cd02620          81 LTTT-GVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKSAYSVP-SDETDIMKEIMTNGPVQ  158 (236)
T ss_pred             HHhc-CCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecceeeeC-CHHHHHHHHHHHCCCeE
Confidence            9987 99999999998876544211000  000000112321110  012234556666665 3788999998 579999


Q ss_pred             EEEecCCCCccCCCC-------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccc
Q 048002          271 VAIDAGGKDFQFYSE-------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLE  331 (351)
Q Consensus       271 v~~~~~~~~f~~Y~~-------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~  331 (351)
                      |+|.+. ++|+.|++                   |||++ +|++|||||||||++|||+|||||+|+.    +.|||++.
T Consensus       159 v~i~~~-~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~-~g~~YWivrNSWG~~WGe~Gy~ri~~~~----~~cgi~~~  232 (236)
T cd02620         159 AAFTVY-EDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVE-NGVPYWLAANSWGTDWGENGYFRILRGS----NECGIESE  232 (236)
T ss_pred             EEEEec-hhhhhcCCcEEeecCCCCcCCeEEEEEEEecc-CCeeEEEEEeCCCCCCCCCcEEEEEccC----cccccccc
Confidence            999996 79999977                   99987 8899999999999999999999999985    46999988


Q ss_pred             cc
Q 048002          332 AS  333 (351)
Q Consensus       332 ~~  333 (351)
                      ++
T Consensus       233 ~~  234 (236)
T cd02620         233 VV  234 (236)
T ss_pred             ee
Confidence            65


No 10 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=1.1e-48  Score=386.76  Aligned_cols=197  Identities=25%  Similarity=0.532  Sum_probs=160.0

Q ss_pred             CCCCCceeccCCC---CCCccCCCCC---CchHHHHHHHHHhhhhHHHhhC------CCccCCHHHhhhcCCCCCCCCCC
Q 048002          120 QDLPPSVDWRKQG---AVTGVKDQGR---CGSCWAFSTVVSVEGINKIKTG------ELWSLSEQELVDCDKDNHGCDGG  187 (351)
Q Consensus       120 ~~lP~~~Dwr~~g---~v~pVknQg~---cGsCwAfA~~~~~e~~~~i~~~------~~~~lS~q~l~dc~~~~~gC~GG  187 (351)
                      .+||++||||++|   +|+||||||.   ||||||||++++||++++|+++      ..+.||+|+|+||+..+.||+||
T Consensus       203 ~~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdGG  282 (548)
T PTZ00364        203 DPPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAGG  282 (548)
T ss_pred             cCCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCCC
Confidence            5799999999987   7999999999   9999999999999999999884      46889999999999878999999


Q ss_pred             chHHHHHHHHHcCCCCCCCCC--ccccCCC---CCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHH
Q 048002          188 LMEQALNFIAKSEGLTTEKSY--PYTAKDG---SCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMK  262 (351)
Q Consensus       188 ~~~~a~~~~~~~~Gi~~e~~y--PY~~~~~---~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~  262 (351)
                      ++..|++|+.++ ||++|++|  ||.+.++   .|.....  ...+          .......+.+|..+. .++++|+.
T Consensus       283 ~p~~A~~yi~~~-GI~tE~dY~~PY~~~dg~~~~Ck~~~~--~~~y----------~~~~~~~I~gyy~~~-~~e~~I~~  348 (548)
T PTZ00364        283 FPEEVGKFAETF-GILTTDSYYIPYDSGDGVERACKTRRP--SRRY----------YFTNYGPLGGYYGAV-TDPDEIIW  348 (548)
T ss_pred             cHHHHHHHHHhC-CcccccccCCCCCCCCCCCCCCCCCcc--ccee----------eeeeeEEecceeecC-CcHHHHHH
Confidence            999999999887 99999999  9987655   4754320  0000          001123455555554 36788988


Q ss_pred             HH-hcCCEEEEEecCCCCccCCCC--------------------------------------ccccCCCCccEEEEEcCC
Q 048002          263 AV-ANQPVAVAIDAGGKDFQFYSE--------------------------------------GYGATQDGTKYWIVKNSW  303 (351)
Q Consensus       263 ~l-~~gPV~v~~~~~~~~f~~Y~~--------------------------------------Gyg~~~~g~~yWivkNSW  303 (351)
                      +| .+|||+|+|+++ .+|..|++                                      |||.+++|.+||||||||
T Consensus       349 eI~~~GPVsVaIda~-~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YWIVKNSW  427 (548)
T PTZ00364        349 EIYRHGPVPASVYAN-SDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPW  427 (548)
T ss_pred             HHHHcCCeEEEEEec-hHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEEEEECCC
Confidence            88 679999999997 56665421                                      788654788999999999


Q ss_pred             CC--CccCCceEEEEecCCCCCCCcccccccc--ee
Q 048002          304 GT--DWEEKGYIRMLRGIDAEEGLCGITLEAS--YP  335 (351)
Q Consensus       304 G~--~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~--yp  335 (351)
                      |+  +|||+|||||+||.    |.|||++.++  +|
T Consensus       428 Gt~~~WGE~GYfRI~RG~----N~CGIes~~v~~~~  459 (548)
T PTZ00364        428 GSRRSWCDGGTRKIARGV----NAYNIESEVVVMYW  459 (548)
T ss_pred             CCCCCcccCCeEEEEcCC----Ccccccceeeeeee
Confidence            99  99999999999995    4599999876  66


No 11 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad. This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity. Members of the papain family are widespread, found in baculovirus, eubacteria, yeast, and practically all protozoa, plants and mammals. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides are their ability to inhibit the activity of their cognate enzymes and the high selectivity that certain propeptides exhibit for inhibition of the peptidases from which they originate.
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=7.4e-49  Score=352.35  Aligned_cols=194  Identities=44%  Similarity=0.883  Sum_probs=164.6

Q ss_pred             CCCceeccCC-CCCCccCCCCCCchHHHHHHHHHhhhhHHHhh-CCCccCCHHHhhhcCC-CCCCCCCCchHHHHHHHHH
Q 048002          122 LPPSVDWRKQ-GAVTGVKDQGRCGSCWAFSTVVSVEGINKIKT-GELWSLSEQELVDCDK-DNHGCDGGLMEQALNFIAK  198 (351)
Q Consensus       122 lP~~~Dwr~~-g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~-~~~~~lS~q~l~dc~~-~~~gC~GG~~~~a~~~~~~  198 (351)
                      ||++||||+. +.++||+|||.||+|||||+++++|++++++. ...++||+|+|++|.. .+.+|+||++..|++++++
T Consensus         1 lP~~~D~r~~~~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~   80 (219)
T PF00112_consen    1 LPKSFDWRDKGGRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKN   80 (219)
T ss_dssp             STSSEEGGGTTTCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHH
T ss_pred             CCCCEecccCCCCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeecc
Confidence            7999999998 48999999999999999999999999999999 7889999999999997 6789999999999999999


Q ss_pred             cCCCCCCCCCccccCC-CCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHh-cCCEEEEEecC
Q 048002          199 SEGLTTEKSYPYTAKD-GSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVA-NQPVAVAIDAG  276 (351)
Q Consensus       199 ~~Gi~~e~~yPY~~~~-~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~-~gPV~v~~~~~  276 (351)
                      +.|+++|++|||.... ..|.....                 .....++.+|..+...++++|+++|. +|||++++.+.
T Consensus        81 ~~Gi~~e~~~pY~~~~~~~c~~~~~-----------------~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~  143 (219)
T PF00112_consen   81 NNGIVTEEDYPYNGNENPTCKSKKS-----------------NSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVS  143 (219)
T ss_dssp             HTSBEBTTTS--SSSSSCSSCHSGG-----------------GEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEE
T ss_pred             cCccccccccccccccccccccccc-----------------ccccccccccccccccchhHHHHHHhhCceeeeeeecc
Confidence            3399999999999876 57775430                 11246888999988777999999995 59999999998


Q ss_pred             CCCccCCCC--------------------ccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccccceee
Q 048002          277 GKDFQFYSE--------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV  336 (351)
Q Consensus       277 ~~~f~~Y~~--------------------Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~yp~  336 (351)
                      ..+|..|++                    |||++ .+++|||||||||++||++|||||+|+.+   ++|||++.++||+
T Consensus       144 ~~~f~~~~~gi~~~~~~~~~~~~Hav~iVGy~~~-~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~---~~c~i~~~~~~~~  219 (219)
T PF00112_consen  144 SEDFQNYKSGIYDPPDCSNESGGHAVLIVGYDDE-NGKGYWIVKNSWGTDWGDNGYFRISYDYN---NECGIESQAVYPI  219 (219)
T ss_dssp             SHHHHTEESSEECSTSSSSSSEEEEEEEEEEEEE-TTEEEEEEE-SBTTTSTBTTEEEEESSSS---SGGGTTSSEEEEE
T ss_pred             ccccccccceeeeccccccccccccccccccccc-cceeeEeeehhhCCccCCCeEEEEeeCCC---CcCccCceeeecC
Confidence            335888876                    99997 79999999999999999999999999975   3699999999996


No 12 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=1e-47  Score=383.60  Aligned_cols=212  Identities=25%  Similarity=0.541  Sum_probs=163.9

Q ss_pred             CCCCCCceeccCC----CCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCC-----C-----ccCCHHHhhhcCCCCCCC
Q 048002          119 TQDLPPSVDWRKQ----GAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGE-----L-----WSLSEQELVDCDKDNHGC  184 (351)
Q Consensus       119 ~~~lP~~~Dwr~~----g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~-----~-----~~lS~q~l~dc~~~~~gC  184 (351)
                      ..+||.+||||+.    +.++||+|||.||||||||++++||++++|++++     .     ..||+|+|+||+..+.||
T Consensus       378 ~~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC  457 (693)
T PTZ00049        378 IDELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGC  457 (693)
T ss_pred             cccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCc
Confidence            4589999999984    6799999999999999999999999999998643     1     279999999999888999


Q ss_pred             CCCchHHHHHHHHHcCCCCCCCCCccccCCCCCCCCCccchh-hh---hh---------ccccc--------CCCCCCCc
Q 048002          185 DGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSI-IY---RV---------HICSW--------NGDKNAPE  243 (351)
Q Consensus       185 ~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~c~~~~~~~~~-~~---~~---------~~~~~--------~~~~~~~~  243 (351)
                      +||++..|++|+.++ ||++|++|||.+..+.|......... ..   ..         ..|..        ........
T Consensus       458 ~GG~~~~A~kya~~~-GI~tEscYPY~a~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r  536 (693)
T PTZ00049        458 NGGFPYLVSKMAKLQ-GIPLDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPAR  536 (693)
T ss_pred             CCCcHHHHHHHHHHC-CCCcCCccCCcCCCCCCCCCCCCccccccccccccccccccccccccccccccccccccccccc
Confidence            999999999999887 99999999999988889654311100 00   00         00000        00011234


Q ss_pred             EEecceEEcC-------CChHHHHHHHH-hcCCEEEEEecCCCCccCCCC------------------------------
Q 048002          244 VILDGYEMVP-------ESDENALMKAV-ANQPVAVAIDAGGKDFQFYSE------------------------------  285 (351)
Q Consensus       244 ~~i~~~~~v~-------~~~~~~i~~~l-~~gPV~v~~~~~~~~f~~Y~~------------------------------  285 (351)
                      +.++.|..+.       ..+++.|+.+| .+|||+|+|++. .+|++|++                              
T Consensus       537 ~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~-~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~  615 (693)
T PTZ00049        537 WYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEAS-PDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGW  615 (693)
T ss_pred             eeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEec-hhhhcCCCccccCcccccccccCCcccccccccccccc
Confidence            4566666653       24688899888 579999999997 67887753                              


Q ss_pred             ----------ccccCC-CCc--cEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccccceee
Q 048002          286 ----------GYGATQ-DGT--KYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV  336 (351)
Q Consensus       286 ----------Gyg~~~-~g~--~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~yp~  336 (351)
                                |||.+. +|.  +|||||||||++||++|||||+|+.    |.|||++.++|+.
T Consensus       616 e~~NHAVlIVGwG~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~----N~CGIEs~a~~~~  675 (693)
T PTZ00049        616 EKVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGK----NFSGIESQSLFIE  675 (693)
T ss_pred             ccCceEEEEEEeccccCCCcccCEEEEECCCCCCcccCceEEEEcCC----CccCCccceeEEe
Confidence                      577542 353  7999999999999999999999995    4699999998865


No 13 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00  E-value=2.2e-43  Score=306.68  Aligned_cols=168  Identities=51%  Similarity=0.934  Sum_probs=125.5

Q ss_pred             CCCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHHhhhcCCC-CCCCCCCchHHHHHHHHHcC
Q 048002          122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSE  200 (351)
Q Consensus       122 lP~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~l~dc~~~-~~gC~GG~~~~a~~~~~~~~  200 (351)
                      ||++||||+.++++||+|||.||+|||||+++++|+++++++++.++||+|+|++|... +.||+||++..|++|+.++.
T Consensus         1 lP~~~D~R~~~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~~   80 (174)
T smart00645        1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKNG   80 (174)
T ss_pred             CCCcCcccccCCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHcC
Confidence            69999999999999999999999999999999999999999998999999999999974 66999999999999998865


Q ss_pred             CCCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHhcCCEEEEEecCCCCc
Q 048002          201 GLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDF  280 (351)
Q Consensus       201 Gi~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~~gPV~v~~~~~~~~f  280 (351)
                      |+++|++|||...   +....                  ..-...-.|....+.....         ....++.+.    
T Consensus        81 Gi~~e~~~PY~~~---~~~~~------------------~~f~~Y~~Gi~~~~~~~~~---------~~~Hav~iv----  126 (174)
T smart00645       81 GLETESCYPYTGS---VAIDA------------------SDFQFYKSGIYDHPGCGSG---------TLDHAVLIV----  126 (174)
T ss_pred             CcccccccCcccE---EEEEc------------------ccccCCcCeEECCCCCCCC---------cccEEEEEE----
Confidence            9999999999650   00000                  0000000011100000000         011122222    


Q ss_pred             cCCCCccccCCCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccc
Q 048002          281 QFYSEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLE  331 (351)
Q Consensus       281 ~~Y~~Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~  331 (351)
                           |||.+.+|++|||||||||+.|||+|||||.|+.   .+.|||+..
T Consensus       127 -----Gyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~---~~~c~i~~~  169 (174)
T smart00645      127 -----GYGTEENGKDYWIVKNSWGTDWGENGYFRIARGK---NNECGIEAS  169 (174)
T ss_pred             -----EEeecCCCeeEEEEECCCCCCcccCeEEEEEcCC---CCccCceee
Confidence                 8886546889999999999999999999999985   146999654


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00  E-value=5.5e-41  Score=301.86  Aligned_cols=177  Identities=40%  Similarity=0.626  Sum_probs=151.3

Q ss_pred             ceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhC--CCccCCHHHhhhcCCC-----CCCCCCCchHHHHH-HH
Q 048002          125 SVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTG--ELWSLSEQELVDCDKD-----NHGCDGGLMEQALN-FI  196 (351)
Q Consensus       125 ~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~~-----~~gC~GG~~~~a~~-~~  196 (351)
                      .+|||+.+ ++||+|||.||+|||||+++++|++++++++  +.+.||+|+|++|...     ..+|.||.+..++. ++
T Consensus         1 ~~d~r~~~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~   79 (223)
T cd02619           1 SVDLRPLR-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLV   79 (223)
T ss_pred             CCcchhcC-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHH
Confidence            48999988 9999999999999999999999999999987  8899999999999873     26999999999998 77


Q ss_pred             HHcCCCCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHh-cCCEEEEEec
Q 048002          197 AKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVA-NQPVAVAIDA  275 (351)
Q Consensus       197 ~~~~Gi~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~-~gPV~v~~~~  275 (351)
                      +.+ |+++|.+|||......|.....              .......+++..|..+...++++||++|. .|||+++|.+
T Consensus        80 ~~~-Gi~~e~~~Py~~~~~~~~~~~~--------------~~~~~~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~  144 (223)
T cd02619          80 ALK-GIPPEEDYPYGAESDGEEPKSE--------------AALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDV  144 (223)
T ss_pred             HHc-CCCccccCCCCCCCCCCCCCCc--------------cchhhcceeecceeEeCchhHHHHHHHHHHCCCEEEEEEc
Confidence            776 9999999999988777654310              00123457888999887777899999995 5899999999


Q ss_pred             CCCCccCCCC-------------------------ccccCCC--CccEEEEEcCCCCCccCCceEEEEecC
Q 048002          276 GGKDFQFYSE-------------------------GYGATQD--GTKYWIVKNSWGTDWEEKGYIRMLRGI  319 (351)
Q Consensus       276 ~~~~f~~Y~~-------------------------Gyg~~~~--g~~yWivkNSWG~~WGe~Gy~~i~r~~  319 (351)
                      . ..|..|++                         |||++ .  +++|||||||||+.||++||+||+++.
T Consensus       145 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~-~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         145 Y-SGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDN-YVEGKGAFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             c-cchhcccCccccccccccccCCCccCCeEEEEEeecCC-CCCCCCEEEEEeCCCCccccCCEEEEehhh
Confidence            7 67776653                         88876 4  789999999999999999999999985


No 15 
>KOG1544 consensus Predicted cysteine proteinase TIN-ag [General function prediction only]
Probab=100.00  E-value=3.1e-42  Score=310.31  Aligned_cols=263  Identities=24%  Similarity=0.461  Sum_probs=203.3

Q ss_pred             HHHHHHccCCCCeEEE-cccCCCCChhhhhhhcCCccccCccCCCCCCccccccCCCCCCCCceeccCC--CCCCccCCC
Q 048002           64 KRIHKVNQMDKPYKLR-LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQ--GAVTGVKDQ  140 (351)
Q Consensus        64 ~~I~~~N~~~~s~~~g-~N~FsD~t~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~--g~v~pVknQ  140 (351)
                      .+|+++|..+.+|+++ +.+|..||.++-.+..||..++++++....+-. .......+||+.||.|++  +++.|+-||
T Consensus       151 d~iE~in~G~YgW~A~NYSaFWGmtL~DGiKyRLGTL~Ps~sv~nMNEi~-~~l~p~~~LPE~F~As~KWp~liH~plDQ  229 (470)
T KOG1544|consen  151 DMIEAINQGNYGWQAGNYSAFWGMTLDDGIKYRLGTLRPSSSVMNMNEIY-TVLNPGEVLPEAFEASEKWPNLIHEPLDQ  229 (470)
T ss_pred             HHHHHHhcCCccccccchhhhhcccccccceeeecccCchhhhhhHHhHh-hccCcccccchhhhhhhcCCccccCcccc
Confidence            3699999988899997 679999999998888888766544332222110 011223689999999997  899999999


Q ss_pred             CCCchHHHHHHHHHhhhhHHHhhCC--CccCCHHHhhhcCC-CCCCCCCCchHHHHHHHHHcCCCCCCCCCccccCC---
Q 048002          141 GRCGSCWAFSTVVSVEGINKIKTGE--LWSLSEQELVDCDK-DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKD---  214 (351)
Q Consensus       141 g~cGsCwAfA~~~~~e~~~~i~~~~--~~~lS~q~l~dc~~-~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~---  214 (351)
                      |+|++.|||+++++...+++|.+..  ...||+|+|++|.. ..+||.||..+.|+=|+.+. |+|...||||...+   
T Consensus       230 gnCa~SWafSTaavasDRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKr-GvVsdhCYP~~~dQ~~~  308 (470)
T KOG1544|consen  230 GNCAGSWAFSTAAVASDRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKR-GVVSDHCYPFSGDQAGP  308 (470)
T ss_pred             CCcccceeeeeehhccceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecc-cccccccccccCCCCCC
Confidence            9999999999999999999998653  35899999999987 67899999999999999987 99999999997633   


Q ss_pred             -CCCCCCCc--cchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHhcCCEEEEEecCCCCccCCCC------
Q 048002          215 -GSCELPTS--MVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------  285 (351)
Q Consensus       215 -~~c~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~~gPV~v~~~~~~~~f~~Y~~------  285 (351)
                       +.|-..+.  .........-|..+...+...++++.-..|.++.++.|++++.+|||.+.|.|- +||.+|++      
T Consensus       309 ~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~iyq~tPPYrVSSnE~eImkElM~NGPVQA~m~VH-EDFF~YkgGiY~H~  387 (470)
T KOG1544|consen  309 APPCMMHSRAMGRGKRQATAHCPNSYVNSNDIYQVTPPYRVSSNEKEIMKELMENGPVQALMEVH-EDFFLYKGGIYSHT  387 (470)
T ss_pred             CCCceeeccccCcccccccCcCCCcccccCceeeecCCeeccCCHHHHHHHHHhCCChhhhhhhh-hhhhhhccceeecc
Confidence             44654331  011111223365554444567777776677665455555555999999999996 99999987      


Q ss_pred             ---------------------ccccCC--CC--ccEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccccc
Q 048002          286 ---------------------GYGATQ--DG--TKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS  333 (351)
Q Consensus       286 ---------------------Gyg~~~--~g--~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~  333 (351)
                                           |||++.  .|  .+|||..||||+.|||+|||||.||+|+    |-|+++.+
T Consensus       388 ~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNe----cdIEsfvI  456 (470)
T KOG1544|consen  388 PVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNE----CDIESFVI  456 (470)
T ss_pred             ccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccc----hhhhHhhh
Confidence                                 999862  23  5799999999999999999999999865    99999864


No 16 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=2.5e-40  Score=339.10  Aligned_cols=206  Identities=22%  Similarity=0.375  Sum_probs=152.4

Q ss_pred             CCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHHhhhcCC--CCCCCCCCc-hHHHHHHHHHcCCCCCCCCCcc
Q 048002          134 VTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK--DNHGCDGGL-MEQALNFIAKSEGLTTEKSYPY  210 (351)
Q Consensus       134 v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~l~dc~~--~~~gC~GG~-~~~a~~~~~~~~Gi~~e~~yPY  210 (351)
                      ..||||||.||+|||||+++++|++++|+++..+.||+|+|+||+.  .+.||.||. +..++.|+.+++|+++|++|||
T Consensus       544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPY  623 (1004)
T PTZ00462        544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLY  623 (1004)
T ss_pred             CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCC
Confidence            5789999999999999999999999999999999999999999986  368999997 4456689988767899999999


Q ss_pred             cc--CCCCCCCCCccchhhhhh-cccccCCCCCCCcEEecceEEcCCC----h----HHHHHHHHh-cCCEEEEEecCCC
Q 048002          211 TA--KDGSCELPTSMVSIIYRV-HICSWNGDKNAPEVILDGYEMVPES----D----ENALMKAVA-NQPVAVAIDAGGK  278 (351)
Q Consensus       211 ~~--~~~~c~~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~v~~~----~----~~~i~~~l~-~gPV~v~~~~~~~  278 (351)
                      ..  ..+.|+.....+..++.. ..+.... .....+.+.+|..+...    +    +++|+.+|. +|||+|+|++.  
T Consensus       624 t~k~~~g~Cp~~~~~w~n~~~~~kll~~~~-~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAs--  700 (1004)
T PTZ00462        624 NYTKVGEDCPDEEDHWMNLLDHGKILNHNK-KEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAE--  700 (1004)
T ss_pred             ccCCCCCCCCCCcccccccccccccccccc-cccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEee--
Confidence            75  456787543211111110 0000000 01123455677666432    1    468888885 59999999985  


Q ss_pred             CccCCC-C--------------------ccccC----CCCccEEEEEcCCCCCccCCceEEEEecCCCCCCCcccccccc
Q 048002          279 DFQFYS-E--------------------GYGAT----QDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS  333 (351)
Q Consensus       279 ~f~~Y~-~--------------------Gyg~~----~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~~~~~Cgi~~~~~  333 (351)
                      +|+.|. +                    |||.+    ..+++|||||||||+.|||+|||||.|..   .+.|||.....
T Consensus       701 df~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g---~n~CGin~i~t  777 (1004)
T PTZ00462        701 NVLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYG---PSHCEDNFIHS  777 (1004)
T ss_pred             hHHhhhcCCccccCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCC---CCCCccchhee
Confidence            687774 2                    99974    13679999999999999999999999943   35699998888


Q ss_pred             eeeecCCCCCCC
Q 048002          334 YPVKLHPENSRH  345 (351)
Q Consensus       334 yp~~~~~~~~~~  345 (351)
                      +|+++-.-+..+
T Consensus       778 ~~~fn~d~~~~~  789 (1004)
T PTZ00462        778 VVIFNIDLPKNK  789 (1004)
T ss_pred             eeeEeecccccc
Confidence            888764444443


No 17 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.91  E-value=5.6e-25  Score=203.60  Aligned_cols=184  Identities=29%  Similarity=0.493  Sum_probs=119.4

Q ss_pred             CCCCCceeccCCCCCCccCCCCCCchHHHHHHHHHhhhhHHHhhCCCccCCHHH-----hhhcCC-CCCC-CCCCchHHH
Q 048002          120 QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQE-----LVDCDK-DNHG-CDGGLMEQA  192 (351)
Q Consensus       120 ~~lP~~~Dwr~~g~v~pVknQg~cGsCwAfA~~~~~e~~~~i~~~~~~~lS~q~-----l~dc~~-~~~g-C~GG~~~~a  192 (351)
                      ..+|+.||||+.|.|+||||||.||+|||||+++++|+.+.-..  ..++|+-.     .+.|.. ...+ -+||....+
T Consensus        97 ~s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~--~w~~s~~nm~~ll~~~ye~~fd~~~~d~g~~~m~  174 (372)
T COG4870          97 ASLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPES--AWDFSENNMKNLLGVPYEKGFDYTSNDGGNADMS  174 (372)
T ss_pred             ccchhheeeeccCCcccccccCcccceEeeeehhhhhheecccc--cccccccchhhhcCCCccccCCCccccCCccccc
Confidence            35899999999999999999999999999999999999765443  23334333     233322 1111 248888888


Q ss_pred             HHHHHHcCCCCCCCCCccccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHH-hcCCEE-
Q 048002          193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAV-ANQPVA-  270 (351)
Q Consensus       193 ~~~~~~~~Gi~~e~~yPY~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l-~~gPV~-  270 (351)
                      ..|+.+..|.+.+.+-||......|.....   ...+.+.|.         +.-.....+   +...|+.++ ..|-+. 
T Consensus       175 ~a~l~e~sgpv~et~d~y~~~s~~~~~~~p---~~k~~~~~~---------~i~~~~~~L---dnG~i~~~~~~yg~~s~  239 (372)
T COG4870         175 AAYLTEWSGPVYETDDPYSENSYFSPTNLP---VTKHVQEAQ---------IIPSRKKYL---DNGNIKAMFGFYGAVSS  239 (372)
T ss_pred             cccccccCCcchhhcCccccccccCCcCCc---hhhccccce---------ecccchhhh---cccchHHHHhhhccccc
Confidence            888888889999999999887666654320   001111110         011111122   222355555 345333 


Q ss_pred             -EEEecCCCCcc-----CCCC------------ccccC---------CCCccEEEEEcCCCCCccCCceEEEEecCCC
Q 048002          271 -VAIDAGGKDFQ-----FYSE------------GYGAT---------QDGTKYWIVKNSWGTDWEEKGYIRMLRGIDA  321 (351)
Q Consensus       271 -v~~~~~~~~f~-----~Y~~------------Gyg~~---------~~g~~yWivkNSWG~~WGe~Gy~~i~r~~~~  321 (351)
                       +.|++. ..+.     .|..            ||++.         +.|.+.||||||||++||++|||||++....
T Consensus       240 ~~~id~~-~~~~~~~~~~~~~s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~GYfwisY~ya~  316 (372)
T COG4870         240 SMYIDAT-NSLGICIPYPYVDSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYYAL  316 (372)
T ss_pred             eeEEecc-cccccccCCCCCCccccccceEEEEeccccccccccccCCCCCceEEEECccccccccCceEEEEeeecc
Confidence             335554 3333     1111            89875         3467899999999999999999999998643


No 18 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.81  E-value=6.4e-20  Score=178.51  Aligned_cols=76  Identities=28%  Similarity=0.409  Sum_probs=64.0

Q ss_pred             CccCCCCCCchHHHHHHHHHhhhhHHHh-hCCCccCCHHHhhhcCC----------------------------CCCCCC
Q 048002          135 TGVKDQGRCGSCWAFSTVVSVEGINKIK-TGELWSLSEQELVDCDK----------------------------DNHGCD  185 (351)
Q Consensus       135 ~pVknQg~cGsCwAfA~~~~~e~~~~i~-~~~~~~lS~q~l~dc~~----------------------------~~~gC~  185 (351)
                      .||+||+.-|.||.||+.+++++.+..+ +.+.++||+.+++..++                            .....+
T Consensus        55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D  134 (437)
T cd00585          55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND  134 (437)
T ss_pred             CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence            4899999999999999999999987764 45689999998876222                            234579


Q ss_pred             CCchHHHHHHHHHcCCCCCCCCCccc
Q 048002          186 GGLMEQALNFIAKSEGLTTEKSYPYT  211 (351)
Q Consensus       186 GG~~~~a~~~~~~~~Gi~~e~~yPY~  211 (351)
                      ||...++...+.+. |+++.+.||-+
T Consensus       135 GGqw~m~~~li~KY-GvVPk~~~pet  159 (437)
T cd00585         135 GGQWDMLVNLIEKY-GLVPKSVMPES  159 (437)
T ss_pred             CCchHHHHHHHHHc-CCCcccccCCC
Confidence            99999999999997 99999999943


No 19 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism, while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT). It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin.; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.66  E-value=2.5e-16  Score=111.55  Aligned_cols=56  Identities=39%  Similarity=0.714  Sum_probs=49.5

Q ss_pred             HHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHc-cCCCCeEEEcccCCCCChhhh
Q 048002           36 YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVN-QMDKPYKLRLNRFADMTNHEF   91 (351)
Q Consensus        36 f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N-~~~~s~~~g~N~FsD~t~eEf   91 (351)
                      |++|+++| |.|.+.+|+.+|+.+|++|++.|.+|| ..+.+|++|+|+|+|||.+||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence            89999999 999999999999999999999999999 446899999999999999997


No 20 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.57  E-value=4.6e-15  Score=144.75  Aligned_cols=76  Identities=26%  Similarity=0.388  Sum_probs=50.9

Q ss_pred             CccCCCCCCchHHHHHHHHHhhhhHHHhhC-CCccCCHHHhhhc----------------CC------------CCCCCC
Q 048002          135 TGVKDQGRCGSCWAFSTVVSVEGINKIKTG-ELWSLSEQELVDC----------------DK------------DNHGCD  185 (351)
Q Consensus       135 ~pVknQg~cGsCwAfA~~~~~e~~~~i~~~-~~~~lS~q~l~dc----------------~~------------~~~gC~  185 (351)
                      .||+||+.-|.||.||+..+++..+..+.+ +.++||+.+|...                ..            .....+
T Consensus        56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D  135 (438)
T PF03051_consen   56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD  135 (438)
T ss_dssp             -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred             CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence            489999999999999999999999887765 7899999987632                11            123479


Q ss_pred             CCchHHHHHHHHHcCCCCCCCCCccc
Q 048002          186 GGLMEQALNFIAKSEGLTTEKSYPYT  211 (351)
Q Consensus       186 GG~~~~a~~~~~~~~Gi~~e~~yPY~  211 (351)
                      ||...++...+.+. |||+.+.||-+
T Consensus       136 GGqw~~~~nli~KY-GvVPk~~mpet  160 (438)
T PF03051_consen  136 GGQWDMVVNLIKKY-GVVPKSVMPET  160 (438)
T ss_dssp             -B-HHHHHHHHHHH----BGGGSTTG
T ss_pred             CCchHHHHHHHHHc-CcCcHhhCCCC
Confidence            99999999999997 99999999853


No 21 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.49  E-value=3.7e-14  Score=99.93  Aligned_cols=55  Identities=40%  Similarity=0.702  Sum_probs=52.2

Q ss_pred             HHHHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHccCC-CCeEEEcccCCCCChhh
Q 048002           36 YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMD-KPYKLRLNRFADMTNHE   90 (351)
Q Consensus        36 f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~~~I~~~N~~~-~s~~~g~N~FsD~t~eE   90 (351)
                      |++|+.+| |.|.+.+|...|+.+|++|++.|+.||+.+ .+|++|+|+|+|||.+|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence            68999999 999999999999999999999999999876 89999999999999987


No 22 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.82  E-value=3e-08  Score=91.50  Aligned_cols=75  Identities=24%  Similarity=0.392  Sum_probs=59.2

Q ss_pred             ccCCCCCCchHHHHHHHHHhhhhHHHhhC-CCccCCHHHhhhcCC----------------------------CCCCCCC
Q 048002          136 GVKDQGRCGSCWAFSTVVSVEGINKIKTG-ELWSLSEQELVDCDK----------------------------DNHGCDG  186 (351)
Q Consensus       136 pVknQg~cGsCwAfA~~~~~e~~~~i~~~-~~~~lS~q~l~dc~~----------------------------~~~gC~G  186 (351)
                      ||.||...|.||.||+...+...+..+-+ +.+.||..+++..++                            ...--+|
T Consensus        59 ~vtNQk~SGRCWmFAAlNtfRhk~~~el~le~fElSQaytfFwDKlEKaN~FleqIi~tadq~ldsRlv~~LL~~PqqDG  138 (444)
T COG3579          59 KVTNQKQSGRCWMFAALNTFRHKLISELKLEDFELSQAYTFFWDKLEKANWFLEQIIETADQELDSRLVSFLLATPQQDG  138 (444)
T ss_pred             ccccccccceehHHHHHHHHHHHHHHhcCcceeehhhHHHHHHHHHHHhhHHHHHHHhhcccchHHHHHHHHHcCccccC
Confidence            89999999999999999998766554433 567899887765443                            1223689


Q ss_pred             CchHHHHHHHHHcCCCCCCCCCccc
Q 048002          187 GLMEQALNFIAKSEGLTTEKSYPYT  211 (351)
Q Consensus       187 G~~~~a~~~~~~~~Gi~~e~~yPY~  211 (351)
                      |..++....+.+. |+++.++||=.
T Consensus       139 GQwdM~v~l~eKY-GvVpK~~ypes  162 (444)
T COG3579         139 GQWDMFVSLFEKY-GVVPKSVYPES  162 (444)
T ss_pred             chHHHHHHHHHHh-CCCchhhcccc
Confidence            9999999998886 99999999843


No 23 
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminus of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsins B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both endo- and exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide.; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=96.67  E-value=0.0013  Score=42.60  Aligned_cols=35  Identities=14%  Similarity=0.114  Sum_probs=22.5

Q ss_pred             HHHHHHHccCCCCeEEEcccCCCCChhhhhhhcCCcc
Q 048002           63 LKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKV   99 (351)
Q Consensus        63 ~~~I~~~N~~~~s~~~g~N~FsD~t~eEf~~~~~~~~   99 (351)
                      -++|+.+|+.+.+|++|.| |.+.|.++++.+ +|..
T Consensus         3 de~I~~IN~~~~tWkAG~N-F~~~~~~~ik~L-lGv~   37 (41)
T PF08127_consen    3 DEFIDYINSKNTTWKAGRN-FENTSIEYIKRL-LGVL   37 (41)
T ss_dssp             HHHHHHHHHCT-SEEE-----SSB-HHHHHHC-S-B-
T ss_pred             HHHHHHHHcCCCcccCCCC-CCCCCHHHHHHH-cCCC
Confidence            3689999999999999999 899999888665 4544


No 24 
>KOG4128 consensus Bleomycin hydrolases and aminopeptidases of cysteine protease family [Amino acid transport and metabolism]
Probab=96.63  E-value=0.0012  Score=61.53  Aligned_cols=75  Identities=28%  Similarity=0.379  Sum_probs=58.5

Q ss_pred             CccCCCCCCchHHHHHHHHHhhhhHHHhhC-CCccCCHHHhhhcCC------------------------------CCCC
Q 048002          135 TGVKDQGRCGSCWAFSTVVSVEGINKIKTG-ELWSLSEQELVDCDK------------------------------DNHG  183 (351)
Q Consensus       135 ~pVknQg~cGsCwAfA~~~~~e~~~~i~~~-~~~~lS~q~l~dc~~------------------------------~~~g  183 (351)
                      +||.||.+.|-||.|+.+..+.-.+..+-+ ..+.||..+|+..++                              .+..
T Consensus        63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~  142 (457)
T KOG4128|consen   63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPV  142 (457)
T ss_pred             cccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCC
Confidence            699999999999999999987654443322 457899988764221                              2334


Q ss_pred             CCCCchHHHHHHHHHcCCCCCCCCCcc
Q 048002          184 CDGGLMEQALNFIAKSEGLTTEKSYPY  210 (351)
Q Consensus       184 C~GG~~~~a~~~~~~~~Gi~~e~~yPY  210 (351)
                      -+||...+.++.+++. |+.+.+|||-
T Consensus       143 ~DGGqw~MfvNlVkKY-GviPKkcy~~  168 (457)
T KOG4128|consen  143 PDGGQWQMFVNLVKKY-GVIPKKCYLH  168 (457)
T ss_pred             CCCchHHHHHHHHHHh-CCCcHHhccc
Confidence            5899999999999886 9999999984


No 25 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=74.99  E-value=24  Score=28.10  Aligned_cols=21  Identities=14%  Similarity=0.400  Sum_probs=16.0

Q ss_pred             ChHHHHHHHHhcC-CEEEEEec
Q 048002          255 SDENALMKAVANQ-PVAVAIDA  275 (351)
Q Consensus       255 ~~~~~i~~~l~~g-PV~v~~~~  275 (351)
                      .+.+.|++.|..| ||.+.+..
T Consensus        87 ~~~~~i~~~i~~G~Pvi~~~~~  108 (144)
T PF13529_consen   87 ASFDDIKQEIDAGRPVIVSVNS  108 (144)
T ss_dssp             S-HHHHHHHHHTT--EEEEEET
T ss_pred             CcHHHHHHHHHCCCcEEEEEEc
Confidence            4568899999887 99999974


No 26 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=54.00  E-value=8.1  Score=29.91  Aligned_cols=19  Identities=26%  Similarity=0.543  Sum_probs=11.2

Q ss_pred             ChHHHHHHHHHHHHhhccc
Q 048002            1 TFFLVGLSLVLVFGVAESF   19 (351)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~   19 (351)
                      +||||.|+|++++++.+.+
T Consensus         5 ~~llL~l~LA~lLlisSev   23 (95)
T PF07172_consen    5 AFLLLGLLLAALLLISSEV   23 (95)
T ss_pred             HHHHHHHHHHHHHHHHhhh
Confidence            3666666666666665543


No 27 
>KOG4128 consensus Bleomycin hydrolases and aminopeptidases of cysteine protease family [Amino acid transport and metabolism]
Probab=47.03  E-value=4.3  Score=38.40  Aligned_cols=24  Identities=42%  Similarity=0.663  Sum_probs=20.4

Q ss_pred             CccEEEEEcCCCCCccCCceEEEE
Q 048002          293 GTKYWIVKNSWGTDWEEKGYIRML  316 (351)
Q Consensus       293 g~~yWivkNSWG~~WGe~Gy~~i~  316 (351)
                      +-.-|.|.||||++-|-+||..|.
T Consensus       389 ~~~~~rVenswgkd~gkkg~~~mt  412 (457)
T KOG4128|consen  389 GLNEHRVENSWGKDLGKKGVNKMT  412 (457)
T ss_pred             Cchhhhhhchhhhhccccchhhhh
Confidence            445899999999999999996654


No 28 
>KOG4654 consensus Uncharacterized conserved protein [Function unknown]
Probab=28.66  E-value=2e+02  Score=25.20  Aligned_cols=75  Identities=15%  Similarity=0.155  Sum_probs=42.0

Q ss_pred             ccCCccccCCHHHHHHHHHHHHHhccccCCh---------------HHHHHHHHHHHHHHHHHHHHccCCCCeEEEcccC
Q 048002           19 FDYQESDLASEECLWDLYERWRSHHTVSRDL---------------KEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRF   83 (351)
Q Consensus        19 ~~~~~~~~~~~~~~~~~f~~~~~~~k~Y~~~---------------~E~~~R~~~f~~n~~~I~~~N~~~~s~~~g~N~F   83 (351)
                      +++.++-+...+++..+|-+-..-||.+.+-               ++..+-......=+..|+.+|.+-.+| +..|+.
T Consensus       112 Is~GDafl~~pde~ddLfYeii~mhknFdn~~S~vlrlstnagq~kdaaskv~~AL~ni~aiiehfnpKiedy-aavnhi  190 (252)
T KOG4654|consen  112 ISNGDAFLIRPDELDDLFYEIIHMHKNFDNFSSKVLRLSTNAGQIKDAASKVLGALNNILAIIEHFNPKIEDY-AAVNHI  190 (252)
T ss_pred             HhCCCeeeeCchHHHHHHHHHHHHhcchhhHHHHHHHhccccccCchHHHHHHHHHHHHHHHHHhcCchhhhH-HHhccc
Confidence            4444444455566666766655555554321               222222333333344566667776666 458999


Q ss_pred             CCCChhhhhhh
Q 048002           84 ADMTNHEFMSS   94 (351)
Q Consensus        84 sD~t~eEf~~~   94 (351)
                      +.+|.+|....
T Consensus       191 ~qlsadeV~eV  201 (252)
T KOG4654|consen  191 PQLSADEVEEV  201 (252)
T ss_pred             ccccHHHHHHH
Confidence            99999987543


No 29 
>TIGR02744 TrbI_Ftype type-F conjugative transfer system protein TrbI. This protein is an essential component of the F-type conjugative transfer system for plasmid DNA transfer and has been shown to be localized to the periplasm.
Probab=28.07  E-value=1.8e+02  Score=23.20  Aligned_cols=46  Identities=4%  Similarity=-0.013  Sum_probs=35.3

Q ss_pred             CHHHHHHHHHHHHHhc-cccCChHHHHHHHHHHHHHH-HHHHHHccCC
Q 048002           28 SEECLWDLYERWRSHH-TVSRDLKEKQIRFNVFKQNL-KRIHKVNQMD   73 (351)
Q Consensus        28 ~~~~~~~~f~~~~~~~-k~Y~~~~E~~~R~~~f~~n~-~~I~~~N~~~   73 (351)
                      -.-++...-++|...- +.-.+.+|.+.+-..|..-+ +.+.++++++
T Consensus        35 V~fdmk~tld~F~~q~~~~~lte~q~~~~~~rF~~~L~~~L~~yq~~H   82 (112)
T TIGR02744        35 VAFDMKQTLDAFFDSASQKKLSEAQQKALLGRFNALLEAELQAWQAQH   82 (112)
T ss_pred             EEEecHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            3356777788888888 66668888899999999988 4567777665


No 30 
>COG4871 Uncharacterized protein conserved in archaea [Function unknown]
Probab=24.19  E-value=49  Score=28.15  Aligned_cols=17  Identities=35%  Similarity=0.870  Sum_probs=12.5

Q ss_pred             ccCCCCCCc--hHHHHHHH
Q 048002          136 GVKDQGRCG--SCWAFSTV  152 (351)
Q Consensus       136 pVknQg~cG--sCwAfA~~  152 (351)
                      |-.|=|.||  +|.|||.-
T Consensus       135 P~tNCg~CGEqtCmaFAiK  153 (193)
T COG4871         135 PQTNCGKCGEQTCMAFAIK  153 (193)
T ss_pred             CCCccccchhHHHHHHHHH
Confidence            446677776  89999864


No 31 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=23.88  E-value=4.8e+02  Score=22.58  Aligned_cols=119  Identities=17%  Similarity=0.202  Sum_probs=63.2

Q ss_pred             CCCCCchHHHHHHHHHhhhhHHHhh--------CCCccCCHHHhhhcCCCCCCCCCCchHHHHHHHHHcCCCCCCCCCcc
Q 048002          139 DQGRCGSCWAFSTVVSVEGINKIKT--------GELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPY  210 (351)
Q Consensus       139 nQg~cGsCwAfA~~~~~e~~~~i~~--------~~~~~lS~q~l~dc~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY  210 (351)
                      .||.=+=|-+||.+++|-.....+.        .-...+|+++|.+++        -.+.+.++|.+.. |...      
T Consensus        18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~--------~~~~~~i~y~ks~-g~~~------   82 (175)
T PF05543_consen   18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTS--------LTPNQMIKYAKSQ-GRNP------   82 (175)
T ss_dssp             --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH----------B-HHHHHHHHHHT-TEEE------
T ss_pred             ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcC--------CCHHHHHHHHHHc-Ccch------
Confidence            5888899999999998875422110        012356666666653        2345677776554 4221      


Q ss_pred             ccCCCCCCCCCccchhhhhhcccccCCCCCCCcEEecceEEcCCChHHHHHHHHhc-CCEEEEEecCCCCccCCC--C--
Q 048002          211 TAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYS--E--  285 (351)
Q Consensus       211 ~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~v~~~~~~~i~~~l~~-gPV~v~~~~~~~~f~~Y~--~--  285 (351)
                                                           .+.. ...+.+++++.+.+ .|+.+........  .+.  +  
T Consensus        83 -------------------------------------~~~n-~~~s~~eV~~~~~~nk~i~i~~~~v~~~--~~~~~gHA  122 (175)
T PF05543_consen   83 -------------------------------------QYNN-RMPSFDEVKKLIDNNKGIAILADRVEQT--NGPHAGHA  122 (175)
T ss_dssp             -------------------------------------EEEC-S---HHHHHHHHHTT-EEEEEEEETTSC--TTB--EEE
T ss_pred             -------------------------------------hHhc-CCCCHHHHHHHHHcCCCeEEEecccccC--CCCcccee
Confidence                                                 0100 01246778888854 5888766654233  322  2  


Q ss_pred             ----ccccCCCCccEEEEEcCCCCCccCCceEEEEe
Q 048002          286 ----GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLR  317 (351)
Q Consensus       286 ----Gyg~~~~g~~yWivkNSWG~~WGe~Gy~~i~r  317 (351)
                          ||-.-.+|.++.++=|=|     +++++-++.
T Consensus       123 lavvGya~~~~g~~~y~~WNPW-----~~~~~~~sa  153 (175)
T PF05543_consen  123 LAVVGYAKPNNGQKTYYFWNPW-----WNDVMIQSA  153 (175)
T ss_dssp             EEEEEEEEETTSEEEEEEE-TT------SS-EEEET
T ss_pred             EEEEeeeecCCCCeEEEEeCCc-----cCCcEEEec
Confidence                887655779999999999     345555443


Done!