Query         044448
Match_columns 308
No_of_seqs    254 out of 1578
Neff          8.0 
Searched_HMMs 46136
Date          Fri Mar 29 03:23:59 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044448.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044448hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542 Cysteine proteinase Ca 100.0 1.5E-82 3.2E-87  569.2  25.7  284    9-307    65-369 (372)
  2 PTZ00203 cathepsin L protease; 100.0 8.7E-76 1.9E-80  547.1  31.7  285    8-306    31-337 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0   4E-75 8.7E-80  558.0  30.5  292    8-308   162-487 (489)
  4 PTZ00200 cysteine proteinase;  100.0 6.3E-75 1.4E-79  554.8  31.5  292    6-308   117-444 (448)
  5 KOG1543 Cysteine proteinase Ca 100.0 7.9E-67 1.7E-71  483.5  27.9  270   19-308    30-323 (325)
  6 cd02698 Peptidase_C1A_Cathepsi 100.0 1.3E-58 2.8E-63  413.7  22.4  207   92-307     1-236 (239)
  7 cd02621 Peptidase_C1A_Cathepsi 100.0 4.2E-58   9E-63  411.5  22.4  207   92-306     1-239 (243)
  8 cd02248 Peptidase_C1A Peptidas 100.0 1.5E-57 3.3E-62  398.5  22.8  203   93-307     1-210 (210)
  9 cd02620 Peptidase_C1A_Cathepsi 100.0 2.8E-57 6.1E-62  404.3  20.7  201   93-305     1-234 (236)
 10 PF00112 Peptidase_C1:  Papain  100.0 1.6E-55 3.5E-60  386.9  14.9  208   92-308     1-219 (219)
 11 PTZ00049 cathepsin C-like prot 100.0   2E-54 4.4E-59  423.7  22.9  209   90-307   379-674 (693)
 12 PTZ00364 dipeptidyl-peptidase  100.0 1.3E-53 2.8E-58  413.9  22.3  203   91-305   204-455 (548)
 13 smart00645 Pept_C1 Papain fami 100.0 4.5E-49 9.7E-54  335.8  17.6  165   92-304     1-170 (174)
 14 cd02619 Peptidase_C1 C1 Peptid 100.0 1.8E-46 3.8E-51  330.2  20.5  191   95-292     1-213 (223)
 15 PTZ00462 Serine-repeat antigen 100.0 3.8E-45 8.1E-50  367.7  21.2  200  104-308   544-780 (1004)
 16 KOG1544 Predicted cysteine pro 100.0 6.9E-42 1.5E-46  303.5   5.8  247   50-305   170-456 (470)
 17 COG4870 Cysteine protease [Pos 100.0 8.9E-30 1.9E-34  231.5   7.3  194   91-292    98-314 (372)
 18 cd00585 Peptidase_C1B Peptidas  99.9 1.1E-24 2.4E-29  207.7  15.3  180  105-291    55-399 (437)
 19 PF03051 Peptidase_C1_2:  Pepti  99.7 6.9E-17 1.5E-21  154.5  16.6  179  105-290    56-399 (438)
 20 PF08246 Inhibitor_I29:  Cathep  99.4   4E-13 8.8E-18   93.4   5.8   45   15-59      1-58  (58)
 21 smart00848 Inhibitor_I29 Cathe  99.1 6.7E-11 1.4E-15   81.7   3.8   44   15-58      1-57  (57)
 22 COG3579 PepC Aminopeptidase C   99.0 5.1E-09 1.1E-13   95.0  10.8   78  210-289   296-400 (444)
 23 KOG4128 Bleomycin hydrolases a  97.9 1.2E-05 2.7E-10   73.2   4.6   73  105-178    63-168 (457)
 24 PF13529 Peptidase_C39_2:  Pept  96.9   0.011 2.5E-07   47.2   9.8   56  209-276    87-144 (144)
 25 PF05543 Peptidase_C47:  Stapho  94.0    0.34 7.3E-06   40.9   7.9  118  109-277    18-145 (175)
 26 PF14399 Transpep_BrtH:  NlpC/p  91.2    0.57 1.2E-05   43.2   6.5   55  211-274    78-133 (317)
 27 COG4990 Uncharacterized protei  85.0     2.6 5.6E-05   35.8   5.7   52  204-277   116-168 (195)
 28 PF09778 Guanylate_cyc_2:  Guan  83.1       5 0.00011   35.1   7.0   51  210-260   112-172 (212)
 29 cd02549 Peptidase_C39A A sub-f  60.9      20 0.00044   28.2   5.1   33  214-258    70-103 (141)
 30 PF12385 Peptidase_C70:  Papain  42.7      62  0.0013   27.1   5.1   38  210-260    97-135 (166)
 31 PF11567 PfUIS3:  Plasmodium fa  34.3      10 0.00022   28.2  -0.6   31   31-61     19-49  (101)
 32 PF05391 Lsm_interact:  Lsm int  32.4      34 0.00074   18.4   1.4   12   53-64      9-20  (21)
 33 KOG4702 Uncharacterized conser  26.3 1.7E+02  0.0036   20.9   4.3   33   12-45     28-60  (77)
 34 PF01640 Peptidase_C10:  Peptid  25.2 2.7E+02  0.0059   23.7   6.6   49  212-287   141-192 (192)
 35 KOG4621 Uncharacterized conser  25.2 1.7E+02  0.0037   23.6   4.8   51  210-260    58-123 (167)
 36 PF08664 YcbB:  YcbB domain;  I  23.1 1.5E+02  0.0033   24.0   4.3   56    9-65     40-104 (134)
 37 cd00044 CysPc Calpains, domain  21.0 1.1E+02  0.0024   28.2   3.6   40  248-289   235-300 (315)
 38 PF07351 DUF1480:  Protein of u  20.3 1.2E+02  0.0026   22.0   2.7   23  239-261    28-50  (80)

No 1  
>KOG1542 consensus Cysteine proteinase Cathepsin F [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=1.5e-82  Score=569.19  Aligned_cols=284  Identities=36%  Similarity=0.669  Sum_probs=250.3

Q ss_pred             hHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCCCCCCC
Q 044448            9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHPH   75 (308)
Q Consensus         9 ~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~~~~~~   75 (308)
                      ..+.+.|..|+.+|+|+|.+.+|...|+.||++|+..             +|+|+|||||+|||++++++.+......+.
T Consensus        65 l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~~~~  144 (372)
T KOG1542|consen   65 LGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSKLPG  144 (372)
T ss_pred             cchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhccccccccCcc
Confidence            4568899999999999999999999999999999987             899999999999999999876653111110


Q ss_pred             CCCCCccccCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC-CCCCC
Q 044448           76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCA  153 (308)
Q Consensus        76 ~~~~~~~~~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~-~~gC~  153 (308)
                      .. .. ..  ......||++||||++|.||||||||+ |||||||+|+++|+++.|++|++++||||||+||+. ++||+
T Consensus       145 ~~-~~-~~--~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~  220 (372)
T KOG1542|consen  145 DA-AE-AP--IEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCN  220 (372)
T ss_pred             cc-cc-Cc--CCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCC
Confidence            00 00 11  112236999999999999999999999 999999999999999999999999999999999999 99999


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCcCCCCCCCC-CCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecCc
Q 044448          154 KNFLENAFEYIRQYQRLASECVYPYQGRQDY-YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDATW  231 (308)
Q Consensus       154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~-~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~~  231 (308)
                      ||.+.+|++|+++.+|+..|.+|||++ ..+ .|.  .+ .....+.|++|..++ .||++|.+.|. +|||+|+|++..
T Consensus       221 GGl~~nA~~~~~~~gGL~~E~dYPY~g-~~~~~C~--~~-~~~~~v~I~~f~~l~-~nE~~ia~wLv~~GPi~vgiNa~~  295 (372)
T KOG1542|consen  221 GGLMDNAFKYIKKAGGLEKEKDYPYTG-KKGNQCH--FD-KSKIVVSIKDFSMLS-NNEDQIAAWLVTFGPLSVGINAKP  295 (372)
T ss_pred             CCChhHHHHHHHHhCCccccccCCccc-cCCCccc--cc-hhhceEEEeccEecC-CCHHHHHHHHHhcCCeEEEEchHH
Confidence            999999999988888999999999999 777 999  66 567889999999998 69999999888 699999999779


Q ss_pred             ccccCCceEeC--C-CCCC-CCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448          232 FNFYHGGVFTG--P-CGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP  307 (308)
Q Consensus       232 f~~y~~Giy~~--~-c~~~-~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp  307 (308)
                      +|+|++||+.+  . |+.. ++|||+|||||.+.  - .++|||||||||++|||+||+|+.||.   |.|||+++++-+
T Consensus       296 mQ~YrgGV~~P~~~~Cs~~~~~HaVLlvGyG~~g--~-~~PYWIVKNSWG~~WGE~GY~~l~RG~---N~CGi~~mvss~  369 (372)
T KOG1542|consen  296 MQFYRGGVSCPSKYICSPKLLNHAVLLVGYGSSG--Y-EKPYWIVKNSWGTSWGEKGYYKLCRGS---NACGIADMVSSA  369 (372)
T ss_pred             HHHhcccccCCCcccCCccccCceEEEEeecCCC--C-CCceEEEECCccccccccceEEEeccc---cccccccchhhh
Confidence            99999999988  3 9875 99999999999973  2 589999999999999999999999997   789999998754


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=8.7e-76  Score=547.06  Aligned_cols=285  Identities=34%  Similarity=0.635  Sum_probs=238.4

Q ss_pred             hhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH------------hcCCCCCCCCHHHHHhhhcC-CCCCCCCCC
Q 044448            8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG-YKPPPTDHP   74 (308)
Q Consensus         8 ~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~------------~g~N~fsDlt~eEf~~~~~~-~~~~~~~~~   74 (308)
                      +..+..+|++||++|+|.|.+.+|+.+|++||++|+++            ||+|+|+|||+|||.+++++ .........
T Consensus        31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~~~~~~  110 (348)
T PTZ00203         31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAARYLNGAAYFAAAKQ  110 (348)
T ss_pred             ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEeccccccCCHHHHHHHhcCCCcccccccc
Confidence            46788899999999999999988999999999999998            89999999999999987763 221110100


Q ss_pred             CCCCCCccccCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC-CCCC
Q 044448           75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGC  152 (308)
Q Consensus        75 ~~~~~~~~~~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~-~~gC  152 (308)
                      ... .. +........+||++||||++|+|+||||||. |||||||++++||++++|++++++.||+|+|+||+. +.||
T Consensus       111 ~~~-~~-~~~~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~~~GC  188 (348)
T PTZ00203        111 HAG-QH-YRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGC  188 (348)
T ss_pred             ccc-cc-ccccccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccCCCCCC
Confidence            000 00 1111111125899999999999999999999 999999999999999999999999999999999998 7899


Q ss_pred             CCCcHHHHHHHHHHc--CCCCCCCCcCCCCCCCC---CCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEE
Q 044448          153 AKNFLENAFEYIRQY--QRLASECVYPYQGRQDY---YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVA  226 (308)
Q Consensus       153 ~GG~~~~a~~~~~~~--~Gi~~e~~yPY~~~~~~---~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~  226 (308)
                      +||++..||+|+.++  +|+++|++|||.+ .++   .|.  ........+++.+|..++. ++++|+.+|+ +|||+|+
T Consensus       189 ~GG~~~~a~~yi~~~~~ggi~~e~~YPY~~-~~~~~~~C~--~~~~~~~~~~i~~~~~i~~-~e~~~~~~l~~~GPv~v~  264 (348)
T PTZ00203        189 GGGLMLQAFEWVLRNMNGTVFTEKSYPYVS-GNGDVPECS--NSSELAPGARIDGYVSMES-SERVMAAWLAKNGPISIA  264 (348)
T ss_pred             CCCCHHHHHHHHHHhcCCCCCccccCCCcc-CCCCCCcCC--CCcccccceEecceeecCc-CHHHHHHHHHhCCCEEEE
Confidence            999999999999764  5789999999998 655   687  4312234567889998874 7889999998 5999999


Q ss_pred             EecCcccccCCceEeCCCCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccc
Q 044448          227 IDATWFNFYHGGVFTGPCGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAA  305 (308)
Q Consensus       227 i~~~~f~~y~~Giy~~~c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~  305 (308)
                      |++.+|++|++|||+. |.. .++|||+|||||.+   + |++|||||||||++|||+|||||+|+.   |.|||++.++
T Consensus       265 i~a~~f~~Y~~GIy~~-c~~~~~nHaVliVGYG~~---~-g~~YWiikNSWG~~WGe~GY~ri~rg~---n~Cgi~~~~~  336 (348)
T PTZ00203        265 VDASSFMSYHSGVLTS-CIGEQLNHGVLLVGYNMT---G-EVPYWVIKNSWGEDWGEKGYVRVTMGV---NACLLTGYPV  336 (348)
T ss_pred             EEhhhhcCccCceeec-cCCCCCCeEEEEEEEecC---C-CceEEEEEcCCCCCcCcCceEEEEcCC---CcccccceEE
Confidence            9988999999999985 864 58999999999987   5 889999999999999999999999986   7899998775


Q ss_pred             c
Q 044448          306 Y  306 (308)
Q Consensus       306 y  306 (308)
                      .
T Consensus       337 ~  337 (348)
T PTZ00203        337 S  337 (348)
T ss_pred             E
Confidence            4


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=4e-75  Score=558.02  Aligned_cols=292  Identities=33%  Similarity=0.587  Sum_probs=244.7

Q ss_pred             hhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCC--CC
Q 044448            8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPP--TD   72 (308)
Q Consensus         8 ~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~--~~   72 (308)
                      +.+....|++||++|+|+|.+.+|+.+|+.||++|+++             +|+|+|+|||.|||++++++.....  ..
T Consensus       162 n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~~  241 (489)
T PTZ00021        162 NLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKSN  241 (489)
T ss_pred             ChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhccccccccccc
Confidence            35566889999999999999998999999999999998             8999999999999999887654211  00


Q ss_pred             -CCCCCCCCccc-----cCCCCCCCCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhh
Q 044448           73 -HPHSNRSNWFK-----NLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVD  145 (308)
Q Consensus        73 -~~~~~~~~~~~-----~~~~~~~~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~d  145 (308)
                       ........ +.     ..+.....+|++||||+.|.|+||||||. |||||||++++||++++|+++.++.||+|+|+|
T Consensus       242 ~~~~~~~~~-~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVD  320 (489)
T PTZ00021        242 GKKSPRVIN-YDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVD  320 (489)
T ss_pred             ccccccccc-ccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhh
Confidence             00000000 00     00001112499999999999999999999 999999999999999999999999999999999


Q ss_pred             cCC-CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCCC-CCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCC
Q 044448          146 CST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQP  222 (308)
Q Consensus       146 c~~-~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~-~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gP  222 (308)
                      |+. +.||+||++..|+.|+.+++||++|++|||.+ . .+.|.  .. .....++|.+|..++   +++|+++|+ .||
T Consensus       321 Cs~~n~GC~GG~~~~Af~yi~~~gGl~tE~~YPY~~-~~~~~C~--~~-~~~~~~~i~~y~~i~---~~~lk~al~~~GP  393 (489)
T PTZ00021        321 CSFKNNGCYGGLIPNAFEDMIELGGLCSEDDYPYVS-DTPELCN--ID-RCKEKYKIKSYVSIP---EDKFKEAIRFLGP  393 (489)
T ss_pred             hccCCCCCCCcchHhhhhhhhhccccCcccccCccC-CCCCccc--cc-cccccceeeeEEEec---HHHHHHHHHhcCC
Confidence            998 88999999999999998877999999999998 6 47898  44 344568899999987   578999998 599


Q ss_pred             eEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCC-------CCCCCCeEEEecCCCCCcCCCceEEEEeCCCC
Q 044448          223 VSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTE-------AEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG  294 (308)
Q Consensus       223 V~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~-------~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~  294 (308)
                      |+|+|++. +|++|++|||+.+|+..++|||+|||||++..       .. +.+|||||||||++|||+|||||+|+.++
T Consensus       394 Vsv~i~a~~~f~~YkgGIy~~~C~~~~nHAVlIVGYG~e~~~~~~~~~~~-~~~YWIVKNSWGt~WGE~GY~rI~r~~~g  472 (489)
T PTZ00021        394 ISVSIAVSDDFAFYKGGIFDGECGEEPNHAVILVGYGMEEIYNSDTKKME-KRYYYIIKNSWGESWGEKGFIRIETDENG  472 (489)
T ss_pred             eEEEEEeecccccCCCCcCCCCCCCccceEEEEEEecCcCCcccccccCC-CCCEEEEECCCCCCcccCeEEEEEcCCCC
Confidence            99999998 99999999999889888999999999997521       01 35799999999999999999999999754


Q ss_pred             -CCCccccccccccC
Q 044448          295 -SGLCNIAANAAYPL  308 (308)
Q Consensus       295 -~~~Cgi~~~~~yp~  308 (308)
                       .|+|||++.++||+
T Consensus       473 ~~n~CGI~t~a~yP~  487 (489)
T PTZ00021        473 LMKTCSLGTEAYVPL  487 (489)
T ss_pred             CCCCCCCcccceeEe
Confidence             47999999999995


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=6.3e-75  Score=554.77  Aligned_cols=292  Identities=32%  Similarity=0.569  Sum_probs=243.6

Q ss_pred             CChhHHHHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-----------hcCCCCCCCCHHHHHhhhcCCCCCCCCC-
Q 044448            6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------LRLNKFADLTREKFLASYTGYKPPPTDH-   73 (308)
Q Consensus         6 ~~~~~~~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-----------~g~N~fsDlt~eEf~~~~~~~~~~~~~~-   73 (308)
                      ....++..+|++|+++|+|.|.+.+|+.+|+.||++|+++           +|+|+|+|||+|||.+++++.+.+.... 
T Consensus       117 ~~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~  196 (448)
T PTZ00200        117 KLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNS  196 (448)
T ss_pred             cchHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCcCCeEEeccccccCCHHHHHHHhccCCCcccccc
Confidence            3346677899999999999999999999999999999998           8999999999999999877654321100 


Q ss_pred             --CCC-------CCCCcccc---------CCC-C-CCCCCceeecCCCCCCCccCCCC-C-CchHHHHHHHHHHHHHHHh
Q 044448           74 --PHS-------NRSNWFKN---------LNS-S-KMSFYDSIDWNERGAVTPVKDQG-S-YCCWAFTAVATVEGLNKIR  131 (308)
Q Consensus        74 --~~~-------~~~~~~~~---------~~~-~-~~~lP~~~Dwr~~g~v~pv~dQg-~-gsCwAfa~~~~~e~~~~i~  131 (308)
                        ...       .... +..         ..+ . ...+|++||||+.|.|+|||||| . |||||||+++++|++++|+
T Consensus       197 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~  275 (448)
T PTZ00200        197 TSHNNDFKARHVSNPT-YLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIY  275 (448)
T ss_pred             cccccccccccccccc-cccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHHHHh
Confidence              000       0000 100         000 0 01269999999999999999999 9 9999999999999999999


Q ss_pred             cCCcccCCHHHHhhcCC-CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCC
Q 044448          132 TGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT  210 (308)
Q Consensus       132 ~~~~~~lS~q~l~dc~~-~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~  210 (308)
                      ++..+.||+|+|+||+. +.||+||++..|++|++++ ||++|++|||.+ ..+.|.  ..  ....+.|.+|..++  .
T Consensus       276 ~~~~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~~~-Gi~~e~~YPY~~-~~~~C~--~~--~~~~~~i~~y~~~~--~  347 (448)
T PTZ00200        276 RDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNK-GLSSSSDVPYLA-KDGKCV--VS--STKKVYIDSYLVAK--G  347 (448)
T ss_pred             cCCCeecCHHHHhhccCccCCCCCCcHHHHHHHHhhc-CccccccCCCCC-CCCCCc--CC--CCCeeEecceEecC--H
Confidence            99999999999999998 8899999999999999877 999999999999 889998  54  23456788888765  4


Q ss_pred             HHHHHHHHhcCCeEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEE
Q 044448          211 EEGLQDVVSRQPVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF  289 (308)
Q Consensus       211 ~~~lk~~l~~gPV~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~  289 (308)
                      .+.|+++|.+|||+|+|+++ +|+.|++|||+++|+..++|||+|||||.+.+ + |.+|||||||||++|||+|||||+
T Consensus       348 ~~~l~~~l~~GPV~v~i~~~~~f~~Yk~GIy~~~C~~~~nHaV~lVGyG~d~~-~-g~~YWIIkNSWG~~WGe~GY~ri~  425 (448)
T PTZ00200        348 KDVLNKSLVISPTVVYIAVSRELLKYKSGVYNGECGKSLNHAVLLVGEGYDEK-T-KKRYWIIKNSWGTDWGENGYMRLE  425 (448)
T ss_pred             HHHHHHHHhcCCEEEEeecccccccCCCCccccccCCCCcEEEEEEEecccCC-C-CCceEEEEcCCCCCcccCeeEEEE
Confidence            56677777789999999998 99999999999889877999999999996421 5 789999999999999999999999


Q ss_pred             eCCCCCCCccccccccccC
Q 044448          290 RGVGGSGLCNIAANAAYPL  308 (308)
Q Consensus       290 ~~~~~~~~Cgi~~~~~yp~  308 (308)
                      |+..+.|.|||++.+.||+
T Consensus       426 r~~~g~n~CGI~~~~~~P~  444 (448)
T PTZ00200        426 RTNEGTDKCGILTVGLTPV  444 (448)
T ss_pred             eCCCCCCcCCccccceeeE
Confidence            9742248899999999995


No 5  
>KOG1543 consensus Cysteine proteinase Cathepsin L [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=7.9e-67  Score=483.50  Aligned_cols=270  Identities=43%  Similarity=0.771  Sum_probs=234.2

Q ss_pred             HHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHHHhhhcCCCCCCCCCCCCCCCCccccC
Q 044448           19 MVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL   85 (308)
Q Consensus        19 ~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (308)
                      +.+|.+.|.+..|...|+.+|.+|++.             +|+|+|+|++.+||+..+.+++++....     .. +...
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~~~~~~~~~-----~~-~~~~  103 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTGKKPPEIKR-----DK-FTEK  103 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhccccCccccc-----cc-cccc
Confidence            567788887778999999999999766             8999999999999999988776543311     11 1111


Q ss_pred             CCCCCCCCceeecCCCC-CCCccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHhhcCC--CCCCCCCcHHHH
Q 044448           86 NSSKMSFYDSIDWNERG-AVTPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLVDCST--LNGCAKNFLENA  160 (308)
Q Consensus        86 ~~~~~~lP~~~Dwr~~g-~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~dc~~--~~gC~GG~~~~a  160 (308)
                       ....++|++||||++| .++||||||. |||||||++++||++++|+++ .++.||+|+|+||+.  +.||.||.+..|
T Consensus       104 -~~~~~~p~s~DwR~~~~~~~~vkdQg~CgsCWAFaa~~aie~~~~i~~g~~l~sLSeq~lvdC~~~~~~GC~GG~~~~A  182 (325)
T KOG1543|consen  104 -LDGDDLPDSFDWRDKGAVTPPVKDQGSCGSCWAFAATGALEDRYNIKTGGKLLSLSEQDLVDCCGECGDGCNGGEPKNA  182 (325)
T ss_pred             -cchhhCCCCccccccCCcCCCcCCCCcCcchHHHHHHHHHHHHHHHHhCCccCccChhhhhhccCCCCCCcCCCCHHHH
Confidence             1112699999999996 5556999999 999999999999999999999 899999999999998  889999999999


Q ss_pred             HHHHHHcCCCCC-CCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecC-cccccCC
Q 044448          161 FEYIRQYQRLAS-ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDAT-WFNFYHG  237 (308)
Q Consensus       161 ~~~~~~~~Gi~~-e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~  237 (308)
                      ++|+.++ |+++ +++|||.+ ..+.|.  .+ .....+.+.++..++. ++++|+++|+ +|||+|+|++. +|+.|++
T Consensus       183 ~~yi~~~-G~~t~~~~Ypy~~-~~~~C~--~~-~~~~~~~~~~~~~~~~-~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~  256 (325)
T KOG1543|consen  183 FKYIKKN-GGVTECENYPYIG-KDGTCK--SN-KKDKTVTIKGFYNVPA-NEEAIAEAVAKNGPVSVAIDAYEDFSLYKG  256 (325)
T ss_pred             HHHHHHh-CCCCCCcCCCCcC-CCCCcc--CC-CccceeEeeeeeecCc-CHHHHHHHHHhcCCeEEEEeehhhhhhccC
Confidence            9999999 6666 99999999 999999  65 3367788889998885 5999999999 59999999999 9999999


Q ss_pred             ceEeCC-CCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccc-cC
Q 044448          238 GVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAY-PL  308 (308)
Q Consensus       238 Giy~~~-c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~y-p~  308 (308)
                      |||.++ |.. .++|||+|||||+ .  + +.+|||||||||+.|||+|||||.|+.   +.|+|++.++| |+
T Consensus       257 GVy~~~~~~~~~~~Hav~iVGyG~-~--~-~~~YWivkNSWG~~WGe~Gy~ri~r~~---~~~~I~~~~~~~p~  323 (325)
T KOG1543|consen  257 GVYAEEKGDDKEGDHAVLIVGYGT-G--D-GVDYWIVKNSWGTDWGEKGYFRIARGV---NKCGIASEASYGPI  323 (325)
T ss_pred             ceEeCCCCCCCCCCceEEEEEEcC-C--C-CceeEEEEcCCCCCcccCceEEEecCC---CchhhhcccccCCC
Confidence            999998 444 5999999999999 3  5 889999999999999999999999998   67999999998 64


No 6  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=1.3e-58  Score=413.69  Aligned_cols=207  Identities=25%  Similarity=0.448  Sum_probs=180.8

Q ss_pred             CCceeecCCCC---CCCccCCCC---C-CchHHHHHHHHHHHHHHHhcC---CcccCCHHHHhhcCCCCCCCCCcHHHHH
Q 044448           92 FYDSIDWNERG---AVTPVKDQG---S-YCCWAFTAVATVEGLNKIRTG---QLVTRSKHQLVDCSTLNGCAKNFLENAF  161 (308)
Q Consensus        92 lP~~~Dwr~~g---~v~pv~dQg---~-gsCwAfa~~~~~e~~~~i~~~---~~~~lS~q~l~dc~~~~gC~GG~~~~a~  161 (308)
                      ||++||||+.+   +|+||||||   . |||||||++++||++++|+++   ..+.||+|+|+||+.+.||+||++..|+
T Consensus         1 lP~~~Dwr~~~~~~~v~~vk~Qg~~~~CGsCwAfa~~~aies~~~i~~~~~~~~~~lS~Q~lldC~~~~gC~GG~~~~a~   80 (239)
T cd02698           1 LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCAGGGSCHGGDPGGVY   80 (239)
T ss_pred             CCCCcccccCCCCcccCccccCCCCCCCCcchHHHhHHHHHHHHHHHHCCCCCCcccCHHHHHhCCCCCCccCcCHHHHH
Confidence            69999999987   999999998   8 999999999999999999875   3578999999999988899999999999


Q ss_pred             HHHHHcCCCCCCCCcCCCCCCCCCCcccccC--------------CCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEE
Q 044448          162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSS--------------ASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVA  226 (308)
Q Consensus       162 ~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~--------------~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~  226 (308)
                      +|++++ |+++|++|||.. ....|.  ...              .....+++.+|..++  ++++|+++|. +|||+|+
T Consensus        81 ~~~~~~-Gl~~e~~yPY~~-~~~~C~--~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~--~~~~i~~~l~~~GPV~v~  154 (239)
T cd02698          81 EYAHKH-GIPDETCNPYQA-KDGECN--PFNRCGTCNPFGECFAIKNYTLYFVSDYGSVS--GRDKMMAEIYARGPISCG  154 (239)
T ss_pred             HHHHHc-CcCCCCeeCCcC-CCCCCc--CCCCCCCcccCcccccccccceEEeeeceecC--CHHHHHHHHHHcCCEEEE
Confidence            999987 999999999998 666665  210              012346778887775  5788999887 6999999


Q ss_pred             EecC-cccccCCceEeCC-CCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCC--CCCccccc
Q 044448          227 IDAT-WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAA  302 (308)
Q Consensus       227 i~~~-~f~~y~~Giy~~~-c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~--~~~Cgi~~  302 (308)
                      |.+. +|+.|++|||+.+ |...++|||+|||||++.  + +++|||||||||++|||+|||||+|+...  .|+|||++
T Consensus       155 i~~~~~f~~Y~~GIy~~~~~~~~~~HaV~IVGyG~~~--~-g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~  231 (239)
T cd02698         155 IMATEALENYTGGVYKEYVQDPLINHIISVAGWGVDE--N-GVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEE  231 (239)
T ss_pred             EEecccccccCCeEEccCCCCCcCCeEEEEEEEEecC--C-CCEEEEEEcCCCcccCcCceEEEEccCCccccccccccc
Confidence            9999 9999999999887 556689999999999874  5 78999999999999999999999999721  47899999


Q ss_pred             ccccc
Q 044448          303 NAAYP  307 (308)
Q Consensus       303 ~~~yp  307 (308)
                      .+.|+
T Consensus       232 ~~~~~  236 (239)
T cd02698         232 DCAWA  236 (239)
T ss_pred             ceEEE
Confidence            99886


No 7  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopt the papain fold and form the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=4.2e-58  Score=411.53  Aligned_cols=207  Identities=29%  Similarity=0.601  Sum_probs=177.8

Q ss_pred             CCceeecCCCC----CCCccCCCCC-CchHHHHHHHHHHHHHHHhcCC------cccCCHHHHhhcCC-CCCCCCCcHHH
Q 044448           92 FYDSIDWNERG----AVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQ------LVTRSKHQLVDCST-LNGCAKNFLEN  159 (308)
Q Consensus        92 lP~~~Dwr~~g----~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~------~~~lS~q~l~dc~~-~~gC~GG~~~~  159 (308)
                      ||++||||+.+    +|+||||||. |||||||++++||++++|+++.      .+.||+|+|+||+. +.||+||++..
T Consensus         1 lP~~fDwr~~~~~~~~v~~v~dQg~CGsCwAfa~~~~ies~~~i~~~~~~~~~~~~~lS~q~l~dC~~~~~GC~GG~~~~   80 (243)
T cd02621           1 LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCSQYSQGCDGGFPFL   80 (243)
T ss_pred             CCCcccccccCCCCcccccCCCCCcCccHHHHHHHHHHHHHHHHHhCCCCccccCcccCHHHhhhhcCCCCCCCCCCHHH
Confidence            79999999988    9999999999 9999999999999999998876      68999999999998 88999999999


Q ss_pred             HHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcC----CCCHHHHHHHHh-cCCeEEEEecC-ccc
Q 044448          160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ----PATEEGLQDVVS-RQPVSVAIDAT-WFN  233 (308)
Q Consensus       160 a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~----~~~~~~lk~~l~-~gPV~v~i~~~-~f~  233 (308)
                      |++|+.++ ||++|++|||.....+.|.  ........+++..|..+.    ..++++||++|. +|||+|+|++. +|+
T Consensus        81 a~~~~~~~-Gi~~e~~yPY~~~~~~~C~--~~~~~~~~~~~~~~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~  157 (243)
T cd02621          81 VGKFAEDF-GIVTEDYFPYTADDDRPCK--ASPSECRRYYFSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFD  157 (243)
T ss_pred             HHHHHHhc-CcCCCceeCCCCCCCCCCC--CCccccccccccceeEcccccccCCHHHHHHHHHHcCCEEEEEEeccccc
Confidence            99999887 9999999999862356788  442133444555555442    247899999998 59999999999 999


Q ss_pred             ccCCceEeCC-----CCC---------CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcc
Q 044448          234 FYHGGVFTGP-----CGN---------TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCN  299 (308)
Q Consensus       234 ~y~~Giy~~~-----c~~---------~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cg  299 (308)
                      +|++|||+.+     |..         .++|||+|||||++.. + +.+|||||||||++|||+|||||+|+.   |.||
T Consensus       158 ~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~-~-g~~YWiirNSWG~~WGe~Gy~~i~~~~---~~cg  232 (243)
T cd02621         158 FYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI-K-GEKYWIVKNSWGSSWGEKGYFKIRRGT---NECG  232 (243)
T ss_pred             ccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC-C-CCcEEEEEcCCCCCCCcCCeEEEecCC---cccC
Confidence            9999999875     642         4799999999998631 3 789999999999999999999999986   7899


Q ss_pred             ccccccc
Q 044448          300 IAANAAY  306 (308)
Q Consensus       300 i~~~~~y  306 (308)
                      |++.+++
T Consensus       233 i~~~~~~  239 (243)
T cd02621         233 IESQAVF  239 (243)
T ss_pred             cccceEe
Confidence            9999854


No 8  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=1.5e-57  Score=398.55  Aligned_cols=203  Identities=53%  Similarity=0.975  Sum_probs=187.6

Q ss_pred             CceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcCC
Q 044448           93 YDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQR  169 (308)
Q Consensus        93 P~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~G  169 (308)
                      |++||||+.+.++||+|||. |+|||||++++||++++++++..+.||+|+|++|..  +.+|.||.+..|++++.+. |
T Consensus         1 P~~~d~r~~~~~~~v~dQg~cgsCwAfa~~~~le~~~~i~~~~~~~lS~q~l~~c~~~~~~gC~GG~~~~a~~~~~~~-G   79 (210)
T cd02248           1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGNNGCNGGNPDNAFEYVKNG-G   79 (210)
T ss_pred             CCcccCCcCCCCCCCccCCCCcchHHhHHHHHHHHHHHHHcCCCcccCHHHHhccCCCCCCCCCCCCHHHhHHHHHHC-C
Confidence            78999999999999999999 999999999999999999999889999999999997  6899999999999998877 9


Q ss_pred             CCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC-cccccCCceEeCC-C-C
Q 044448          170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-WFNFYHGGVFTGP-C-G  245 (308)
Q Consensus       170 i~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~-~f~~y~~Giy~~~-c-~  245 (308)
                      +++|++|||.. ....|.  .. .....++|++|..++..++++||++|++ |||+++|.+. +|+.|++|||..+ | .
T Consensus        80 i~~e~~yPY~~-~~~~C~--~~-~~~~~~~i~~~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~~~~~  155 (210)
T cd02248          80 LASESDYPYTG-KDGTCK--YN-SSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGPCCSN  155 (210)
T ss_pred             cCccccCCccC-CCCCcc--CC-CCcccEEEeeEEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCCCCCC
Confidence            99999999998 888999  55 4467899999999987678999999995 9999999999 9999999999987 5 3


Q ss_pred             CCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448          246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP  307 (308)
Q Consensus       246 ~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp  307 (308)
                      ..++|||+|||||++   . +.+|||||||||+.||++|||||+++.   +.|||++.+.||
T Consensus       156 ~~~~Hav~iVGy~~~---~-~~~ywiv~NSWG~~WG~~Gy~~i~~~~---~~cgi~~~~~~~  210 (210)
T cd02248         156 TNLNHAVLLVGYGTE---N-GVDYWIVKNSWGTSWGEKGYIRIARGS---NLCGIASYASYP  210 (210)
T ss_pred             CcCCEEEEEEEEeec---C-CceEEEEEcCCCCccccCcEEEEEcCC---CccCceeeeecC
Confidence            568999999999998   4 889999999999999999999999996   689999999887


No 9  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=2.8e-57  Score=404.32  Aligned_cols=201  Identities=28%  Similarity=0.573  Sum_probs=172.5

Q ss_pred             CceeecCCC--CCC--CccCCCCC-CchHHHHHHHHHHHHHHHhcC--CcccCCHHHHhhcCC--CCCCCCCcHHHHHHH
Q 044448           93 YDSIDWNER--GAV--TPVKDQGS-YCCWAFTAVATVEGLNKIRTG--QLVTRSKHQLVDCST--LNGCAKNFLENAFEY  163 (308)
Q Consensus        93 P~~~Dwr~~--g~v--~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~  163 (308)
                      |++||||++  +++  +||+|||. |||||||++++||+++.|+++  +.+.||+|+|+||+.  +.||+||++..|++|
T Consensus         1 p~~~DwR~~~~~~~~v~~v~dQg~CGsCwAfa~~~~le~~~~i~~~~~~~~~LS~Q~lidC~~~~~~gC~GG~~~~a~~~   80 (236)
T cd02620           1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGDGCNGGYPDAAWKY   80 (236)
T ss_pred             CCcccchhhCCCCCCccccCCcccchhHHHHHHHHHHhhHHHHhcCCCCccccCHHHHHhhcCCCCCCCCCCCHHHHHHH
Confidence            899999996  454  59999999 999999999999999999987  778999999999987  689999999999999


Q ss_pred             HHHcCCCCCCCCcCCCCCCCCC------------------CcccccCC---CCccEEEeeeEEcCCCCHHHHHHHHh-cC
Q 044448          164 IRQYQRLASECVYPYQGRQDYY------------------CDWWRSSA---SGKYGAIRGYQYVQPATEEGLQDVVS-RQ  221 (308)
Q Consensus       164 ~~~~~Gi~~e~~yPY~~~~~~~------------------C~~~~~~~---~~~~~~i~~~~~v~~~~~~~lk~~l~-~g  221 (308)
                      ++++ |+++|++|||.+ ....                  |.  ....   ....+++..+..+. .++++||.+|. +|
T Consensus        81 i~~~-G~~~e~~yPY~~-~~~~~~~~~~~~~~~~~~~~~~C~--~~~~~~~~~~~~~~~~~~~~~-~~~~~ik~~l~~~G  155 (236)
T cd02620          81 LTTT-GVVTGGCQPYTI-PPCGHHPEGPPPCCGTPYCTPKCQ--DGCEKTYEEDKHKGKSAYSVP-SDETDIMKEIMTNG  155 (236)
T ss_pred             HHhc-CCCcCCEecCcC-CCCccCCCCCCCCCCCCCCCCCCC--cCCccccceeeeeecceeeeC-CHHHHHHHHHHHCC
Confidence            9987 999999999988 5432                  33  2100   11234555666665 47899999998 59


Q ss_pred             CeEEEEecC-cccccCCceEeCCCCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcc
Q 044448          222 PVSVAIDAT-WFNFYHGGVFTGPCGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCN  299 (308)
Q Consensus       222 PV~v~i~~~-~f~~y~~Giy~~~c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cg  299 (308)
                      ||+|+|.+. +|+.|++|||+.+|+. .++|||+|||||++   + +++|||||||||++|||+|||||+|+.   |+||
T Consensus       156 Pv~v~i~~~~~f~~Y~~Giy~~~~~~~~~~HaV~iVGyg~~---~-g~~YWivrNSWG~~WGe~Gy~ri~~~~---~~cg  228 (236)
T cd02620         156 PVQAAFTVYEDFLYYKSGVYQHTSGKQLGGHAVKIIGWGVE---N-GVPYWLAANSWGTDWGENGYFRILRGS---NECG  228 (236)
T ss_pred             CeEEEEEechhhhhcCCcEEeecCCCCcCCeEEEEEEEecc---C-CeeEEEEEeCCCCCCCCCcEEEEEccC---cccc
Confidence            999999998 9999999999876654 47999999999987   5 889999999999999999999999986   7899


Q ss_pred             cccccc
Q 044448          300 IAANAA  305 (308)
Q Consensus       300 i~~~~~  305 (308)
                      |++.++
T Consensus       229 i~~~~~  234 (236)
T cd02620         229 IESEVV  234 (236)
T ss_pred             ccccee
Confidence            999875


No 10 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of proteins belongs to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity. Members of the papain family are widespread, found in baculovirus, eubacteria, yeast, and practically all protozoa, plants and mammals. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides are their ability to inhibit the activity of their cognate enzymes and the high selectivity that certain propeptides exhibit for inhibition of the peptidases from which they originate.  
The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=1.6e-55  Score=386.92  Aligned_cols=208  Identities=37%  Similarity=0.757  Sum_probs=180.8

Q ss_pred             CCceeecCCC-CCCCccCCCCC-CchHHHHHHHHHHHHHHHhc-CCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHH
Q 044448           92 FYDSIDWNER-GAVTPVKDQGS-YCCWAFTAVATVEGLNKIRT-GQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQ  166 (308)
Q Consensus        92 lP~~~Dwr~~-g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~-~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~  166 (308)
                      ||++||||+. +.++||+||+. |+|||||+++++|++++++. ...+.||+|+|++|..  +.+|+||++..|++++++
T Consensus         1 lP~~~D~r~~~~~~~~v~dQg~~gsCwafa~~~~~e~~~~~~~~~~~~~lS~q~l~~~~~~~~~~c~gg~~~~a~~~~~~   80 (219)
T PF00112_consen    1 LPKSFDWRDKGGRITPVRDQGSCGSCWAFAAAAALESRLAIQNNGKNVDLSEQYLIDCSNKYNKGCDGGSPFDALKYIKN   80 (219)
T ss_dssp             STSSEEGGGTTTCSG---BTTSSBTHHHHHHHHHHHHHHHHHHTSSCEEB-HHHHHHHSTGTSSTTBBBEHHHHHHHHHH
T ss_pred             CCCCEecccCCCCcCccccCCcccccccchhccceeccccccccccccccccccccccccccccccccCcccccceeecc
Confidence            7999999998 48999999999 99999999999999999999 7889999999999997  789999999999999998


Q ss_pred             cCCCCCCCCcCCCCCCC-CCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC--cccccCCceEeC
Q 044448          167 YQRLASECVYPYQGRQD-YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--WFNFYHGGVFTG  242 (308)
Q Consensus       167 ~~Gi~~e~~yPY~~~~~-~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~--~f~~y~~Giy~~  242 (308)
                      +.|+++|++|||.. .. ..|.  ........+++..|..+...++++||++|.+ |||+++|.+.  +|+.|++|||..
T Consensus        81 ~~Gi~~e~~~pY~~-~~~~~c~--~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~  157 (219)
T PF00112_consen   81 NNGIVTEEDYPYNG-NENPTCK--SKKSNSYYVKIKGYGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDP  157 (219)
T ss_dssp             HTSBEBTTTS--SS-SSSCSSC--HSGGGEEEBEESEEEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECS
T ss_pred             cCcccccccccccc-ccccccc--ccccccccccccccccccccchhHHHHHHhhCceeeeeeeccccccccccceeeec
Confidence            34999999999998 66 7898  4411212478999999987779999999996 9999999999  499999999998


Q ss_pred             C-CCC-CCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccccC
Q 044448          243 P-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL  308 (308)
Q Consensus       243 ~-c~~-~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp~  308 (308)
                      + |.. .++|||+|||||++   . +++|||||||||++||++|||||+|+.+  ++|||++.++||+
T Consensus       158 ~~~~~~~~~Hav~iVGy~~~---~-~~~~wiv~NSWG~~WG~~Gy~~i~~~~~--~~c~i~~~~~~~~  219 (219)
T PF00112_consen  158 PDCSNESGGHAVLIVGYDDE---N-GKGYWIVKNSWGTDWGDNGYFRISYDYN--NECGIESQAVYPI  219 (219)
T ss_dssp             TSSSSSSEEEEEEEEEEEEE---T-TEEEEEEE-SBTTTSTBTTEEEEESSSS--SGGGTTSSEEEEE
T ss_pred             cccccccccccccccccccc---c-ceeeEeeehhhCCccCCCeEEEEeeCCC--CcCccCceeeecC
Confidence            6 764 68999999999998   4 8999999999999999999999999974  4899999999995


No 11 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=2e-54  Score=423.72  Aligned_cols=209  Identities=20%  Similarity=0.404  Sum_probs=176.1

Q ss_pred             CCCCceeecCCC----CCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCC-----c-----ccCCHHHHhhcCC-CCCCC
Q 044448           90 MSFYDSIDWNER----GAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQ-----L-----VTRSKHQLVDCST-LNGCA  153 (308)
Q Consensus        90 ~~lP~~~Dwr~~----g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~-----~-----~~lS~q~l~dc~~-~~gC~  153 (308)
                      .+||++||||+.    +.++||+|||. |||||||++++||++++|++++     .     ..||+|+|+||+. +.||+
T Consensus       379 ~~LP~sfDWRd~~~~~~~vtpVkdQG~CGSCWAFAat~alEsR~~Ia~~~~l~~~~~~~~~~~LS~QqLLDCs~~nqGC~  458 (693)
T PTZ00049        379 DELPKNFTWGDPFNNNTREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYDQGCN  458 (693)
T ss_pred             ccCCCCEecCcCCCCCCcccCCCCCccCcHHHHHHHHHHHHHHHHHHhccccccccccccccCcCHHHhcccCCCCCCcC
Confidence            369999999984    67999999999 9999999999999999998643     1     2799999999998 89999


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCcCCCCCCCCCCcccccCCC--------------------------------------C
Q 044448          154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSAS--------------------------------------G  195 (308)
Q Consensus       154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~--------------------------------------~  195 (308)
                      ||++..|++|+.++ ||++|++|||.+ ..+.|+  .....                                      .
T Consensus       459 GG~~~~A~kya~~~-GI~tEscYPY~a-~~g~C~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  534 (693)
T PTZ00049        459 GGFPYLVSKMAKLQ-GIPLDKVFPYTA-TEQTCP--YQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEP  534 (693)
T ss_pred             CCcHHHHHHHHHHC-CCCcCCccCCcC-CCCCCC--CCCCCccccccccccccccccccccccccccccccccccccccc
Confidence            99999999999887 999999999998 778886  32110                                      1


Q ss_pred             ccEEEeeeEEcCC-------CCHHHHHHHHh-cCCeEEEEecC-cccccCCceEeCC-------CCC-------------
Q 044448          196 KYGAIRGYQYVQP-------ATEEGLQDVVS-RQPVSVAIDAT-WFNFYHGGVFTGP-------CGN-------------  246 (308)
Q Consensus       196 ~~~~i~~~~~v~~-------~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~Giy~~~-------c~~-------------  246 (308)
                      .++.+++|..+..       .++++|+.+|. +|||+|+|++. +|++|++|||+.+       |..             
T Consensus       535 ~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G  614 (693)
T PTZ00049        535 ARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITG  614 (693)
T ss_pred             cceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccc
Confidence            1234566666531       46889999998 59999999999 9999999999852       642             


Q ss_pred             --CCCeEEEEEEeCCcCCCCCC--CCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccccc
Q 044448          247 --TPNHGVTIVGYGTTTEAEGQ--QPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP  307 (308)
Q Consensus       247 --~~~Hav~iVGyg~~~~~~~g--~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~yp  307 (308)
                        .++|||+|||||.+.+ + |  .+|||||||||+.||++|||||+|+.   |.|||++.++|+
T Consensus       615 ~e~~NHAVlIVGwG~d~e-n-G~~~~YWIVRNSWGt~WGenGYfKI~RG~---N~CGIEs~a~~~  674 (693)
T PTZ00049        615 WEKVNHAIVLVGWGEEEI-N-GKLYKYWIGRNSWGKNWGKEGYFKIIRGK---NFSGIESQSLFI  674 (693)
T ss_pred             cccCceEEEEEEeccccC-C-CcccCEEEEECCCCCCcccCceEEEEcCC---CccCCccceeEE
Confidence              3699999999998531 3 5  37999999999999999999999997   789999999886


No 12 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=1.3e-53  Score=413.92  Aligned_cols=203  Identities=22%  Similarity=0.405  Sum_probs=172.8

Q ss_pred             CCCceeecCCCC---CCCccCCCCC----CchHHHHHHHHHHHHHHHhcC------CcccCCHHHHhhcCC-CCCCCCCc
Q 044448           91 SFYDSIDWNERG---AVTPVKDQGS----YCCWAFTAVATVEGLNKIRTG------QLVTRSKHQLVDCST-LNGCAKNF  156 (308)
Q Consensus        91 ~lP~~~Dwr~~g---~v~pv~dQg~----gsCwAfa~~~~~e~~~~i~~~------~~~~lS~q~l~dc~~-~~gC~GG~  156 (308)
                      +||++||||+.|   +|+||||||.    |||||||++++||++++|+++      ..+.||+|+|+||+. ++||+||+
T Consensus       204 ~LP~sfDWR~~gg~~~VtpVrdQg~~~~CGSCWAFAav~alEsr~~I~tn~~~~~g~~~~LS~QqLVDCs~~n~GCdGG~  283 (548)
T PTZ00364        204 PPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTFLSARHVLDCSQYGQGCAGGF  283 (548)
T ss_pred             CCCCccccCcCCCCccCCCCcCCCCCCCCcCHHHHHHHHHHHHHHHHHhCCCcccCcccCcCHHHHhcccCCCCCCCCCc
Confidence            699999999987   7999999973    999999999999999999883      468899999999998 89999999


Q ss_pred             HHHHHHHHHHcCCCCCCCCc--CCCCCCCC---CCcccccCCCCccEEEee------eEEcCCCCHHHHHHHHh-cCCeE
Q 044448          157 LENAFEYIRQYQRLASECVY--PYQGRQDY---YCDWWRSSASGKYGAIRG------YQYVQPATEEGLQDVVS-RQPVS  224 (308)
Q Consensus       157 ~~~a~~~~~~~~Gi~~e~~y--PY~~~~~~---~C~~~~~~~~~~~~~i~~------~~~v~~~~~~~lk~~l~-~gPV~  224 (308)
                      +..|++|+.++ ||++|++|  ||.+ .++   .|+  .. .....+.+++      |..+. .++++|+.+|+ +|||+
T Consensus       284 p~~A~~yi~~~-GI~tE~dY~~PY~~-~dg~~~~Ck--~~-~~~~~y~~~~~~~I~gyy~~~-~~e~~I~~eI~~~GPVs  357 (548)
T PTZ00364        284 PEEVGKFAETF-GILTTDSYYIPYDS-GDGVERACK--TR-RPSRRYYFTNYGPLGGYYGAV-TDPDEIIWEIYRHGPVP  357 (548)
T ss_pred             HHHHHHHHHhC-CcccccccCCCCCC-CCCCCCCCC--CC-cccceeeeeeeEEecceeecC-CcHHHHHHHHHHcCCeE
Confidence            99999999877 99999999  9987 555   587  44 2333444444      33333 47889999998 59999


Q ss_pred             EEEecC-cccccCCceEeC---------CC-----------CCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCC--CcC
Q 044448          225 VAIDAT-WFNFYHGGVFTG---------PC-----------GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT--NWD  281 (308)
Q Consensus       225 v~i~~~-~f~~y~~Giy~~---------~c-----------~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~--~WG  281 (308)
                      |+|++. +|+.|++|||.+         .|           ...++|||+|||||.+.  + |.+|||||||||+  +||
T Consensus       358 VaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de--~-G~~YWIVKNSWGt~~~WG  434 (548)
T PTZ00364        358 ASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE--N-GGDYWLVLDPWGSRRSWC  434 (548)
T ss_pred             EEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC--C-CceEEEEECCCCCCCCcc
Confidence            999999 999999999752         12           13479999999999864  5 8899999999999  999


Q ss_pred             CCceEEEEeCCCCCCCcccccccc
Q 044448          282 EGGSMRIFRGVGGSGLCNIAANAA  305 (308)
Q Consensus       282 e~Gy~~i~~~~~~~~~Cgi~~~~~  305 (308)
                      |+|||||+|+.   |.|||++.++
T Consensus       435 E~GYfRI~RG~---N~CGIes~~v  455 (548)
T PTZ00364        435 DGGTRKIARGV---NAYNIESEVV  455 (548)
T ss_pred             cCCeEEEEcCC---Ccccccceee
Confidence            99999999997   7899999987


No 13 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=100.00  E-value=4.5e-49  Score=335.81  Aligned_cols=165  Identities=53%  Similarity=1.004  Sum_probs=146.5

Q ss_pred             CCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcC
Q 044448           92 FYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQ  168 (308)
Q Consensus        92 lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~  168 (308)
                      ||++||||+.++++||+||+. |+|||||++++||+++++++++.+.||+|+|++|..  +.+|+||++..|++|+.++.
T Consensus         1 lP~~~D~R~~~~~~~v~dQg~CGsCwAfa~~~~ie~~~~i~~~~~~~lS~q~l~~C~~~~~~gC~GG~~~~a~~~~~~~~   80 (174)
T smart00645        1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSTGGNNGCNGGLPDNAFEYIKKNG   80 (174)
T ss_pred             CCCcCcccccCCCCccccCcccchHHHHHHHHHHHHHHHHhcCCccccCHHHHhhhcCCCCCCCCCcCHHHHHHHHHHcC
Confidence            699999999999999999999 999999999999999999999899999999999997  56999999999999998766


Q ss_pred             CCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhcCCeEEEEecCcccccCCceEeCC-CCCC
Q 044448          169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGVFTGP-CGNT  247 (308)
Q Consensus       169 Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~gPV~v~i~~~~f~~y~~Giy~~~-c~~~  247 (308)
                      |+++|++|||..                                           ++.+.+.+|+.|++|||+.+ |...
T Consensus        81 Gi~~e~~~PY~~-------------------------------------------~~~~~~~~f~~Y~~Gi~~~~~~~~~  117 (174)
T smart00645       81 GLETESCYPYTG-------------------------------------------SVAIDASDFQFYKSGIYDHPGCGSG  117 (174)
T ss_pred             CcccccccCccc-------------------------------------------EEEEEcccccCCcCeEECCCCCCCC
Confidence            899999999831                                           44555448999999999985 8654


Q ss_pred             -CCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccc
Q 044448          248 -PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANA  304 (308)
Q Consensus       248 -~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~  304 (308)
                       ++|+|+|||||.+.  + +++|||||||||+.|||+|||||+|+..  +.|||+...
T Consensus       118 ~~~Hav~ivGyg~~~--~-g~~yWii~NSwG~~WG~~G~~~i~~~~~--~~c~i~~~~  170 (174)
T smart00645      118 TLDHAVLIVGYGTEE--N-GKDYWIVKNSWGTDWGENGYFRIARGKN--NECGIEASV  170 (174)
T ss_pred             cccEEEEEEEEeecC--C-CeeEEEEECCCCCCcccCeEEEEEcCCC--CccCceeee
Confidence             79999999999862  4 8899999999999999999999999852  679995543


No 14 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=100.00  E-value=1.8e-46  Score=330.15  Aligned_cols=191  Identities=27%  Similarity=0.434  Sum_probs=165.5

Q ss_pred             eeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcC--CcccCCHHHHhhcCC-C-----CCCCCCcHHHHHH-HH
Q 044448           95 SIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTG--QLVTRSKHQLVDCST-L-----NGCAKNFLENAFE-YI  164 (308)
Q Consensus        95 ~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~--~~~~lS~q~l~dc~~-~-----~gC~GG~~~~a~~-~~  164 (308)
                      .+|||+.+ ++||+|||. |+|||||+++++|+++++++.  +.+.||+|+|++|.. .     .+|.||.+..++. ++
T Consensus         1 ~~d~r~~~-~~~v~dQg~~gsCwafa~~~~les~~~~~~~~~~~~~lS~q~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~   79 (223)
T cd02619           1 SVDLRPLR-LTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLSALLKLV   79 (223)
T ss_pred             CCcchhcC-CCCcccCCCCcCcHHHHHHHHHHHHHHHhcCCcccccCCHHHHHHhccccccccCCCCCCCcHHHHHHHHH
Confidence            47999998 999999999 999999999999999999987  889999999999998 2     5899999999998 77


Q ss_pred             HHcCCCCCCCCcCCCCCCCCCCccccc---CCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecC-cccccCCce
Q 044448          165 RQYQRLASECVYPYQGRQDYYCDWWRS---SASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-WFNFYHGGV  239 (308)
Q Consensus       165 ~~~~Gi~~e~~yPY~~~~~~~C~~~~~---~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~-~f~~y~~Gi  239 (308)
                      .++ ||++|++|||.. ....|.  ..   ......+++..|..+...++++||++|.+ |||+++|.+. .|..|++|+
T Consensus        80 ~~~-Gi~~e~~~Py~~-~~~~~~--~~~~~~~~~~~~~~~~y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~  155 (223)
T cd02619          80 ALK-GIPPEEDYPYGA-ESDGEE--PKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGI  155 (223)
T ss_pred             HHc-CCCccccCCCCC-CCCCCC--CCCccchhhcceeecceeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCcc
Confidence            766 999999999998 666665  21   13345688999999987778999999995 9999999999 999999998


Q ss_pred             Ee-----CC-CC-CCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcCCCceEEEEeCC
Q 044448          240 FT-----GP-CG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV  292 (308)
Q Consensus       240 y~-----~~-c~-~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~  292 (308)
                      +.     .. |. ..++|||+|||||++.. . +++|||||||||+.||++||+||+++.
T Consensus       156 ~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~-~-~~~~~i~~NSwG~~wg~~Gy~~i~~~~  213 (223)
T cd02619         156 IYEEIVYLLYEDGDLGGHAVVIVGYDDNYV-E-GKGAFIVKNSWGTDWGDNGYGRISYED  213 (223)
T ss_pred             ccccccccccCCCccCCeEEEEEeecCCCC-C-CCCEEEEEeCCCCccccCCEEEEehhh
Confidence            73     22 33 35899999999998742 3 679999999999999999999999984


No 15 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=3.8e-45  Score=367.70  Aligned_cols=200  Identities=21%  Similarity=0.367  Sum_probs=159.8

Q ss_pred             CCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHHhhcCC---CCCCCCCc-HHHHHHHHHHcCCCCCCCCcCC
Q 044448          104 VTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNF-LENAFEYIRQYQRLASECVYPY  178 (308)
Q Consensus       104 v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l~dc~~---~~gC~GG~-~~~a~~~~~~~~Gi~~e~~yPY  178 (308)
                      ..||||||. |+|||||+++++|++++|+++..+.||+|+|+||+.   +.||.||+ +..++.|+.+++||++|++|||
T Consensus       544 ~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~efl~yI~e~GgLptESdYPY  623 (1004)
T PTZ00462        544 KIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQIIEDNGFLPADSNYLY  623 (1004)
T ss_pred             CCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHHHHHHHHHcCCCcccccCCC
Confidence            579999999 999999999999999999999999999999999986   57999997 5566799988866899999999


Q ss_pred             CCC-CCCCCcccccCC-----------------CCccEEEeeeEEcCCC----C----HHHHHHHHhc-CCeEEEEecCc
Q 044448          179 QGR-QDYYCDWWRSSA-----------------SGKYGAIRGYQYVQPA----T----EEGLQDVVSR-QPVSVAIDATW  231 (308)
Q Consensus       179 ~~~-~~~~C~~~~~~~-----------------~~~~~~i~~~~~v~~~----~----~~~lk~~l~~-gPV~v~i~~~~  231 (308)
                      ... ..+.|+  ....                 ....+.+.+|..+...    +    +++|+++|++ |||+|+|++.+
T Consensus       624 t~k~~~g~Cp--~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~IK~eI~~kGPVaV~IdAsd  701 (1004)
T PTZ00462        624 NYTKVGEDCP--DEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAEN  701 (1004)
T ss_pred             ccCCCCCCCC--CCcccccccccccccccccccccceeeccceEEecccccccchhhHHHHHHHHHHhcCCEEEEEEeeh
Confidence            741 456787  3210                 0113345667666532    1    4688999995 99999999878


Q ss_pred             cccc-CCceEeCC-CCC-CCCeEEEEEEeCCcCC--CCCCCCeEEEecCCCCCcCCCceEEEEeCCCCCCCccccccccc
Q 044448          232 FNFY-HGGVFTGP-CGN-TPNHGVTIVGYGTTTE--AEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAY  306 (308)
Q Consensus       232 f~~y-~~Giy~~~-c~~-~~~Hav~iVGyg~~~~--~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~y  306 (308)
                      |+.| .+|||..+ |+. .++|||+|||||.+..  .. +++|||||||||+.|||+|||||.|..  .++|||+....+
T Consensus       702 f~~Y~~sGIyv~~~Cgs~~~nHAVlIVGYGt~in~eg~-gk~YWIVRNSWGt~WGEnGYFKI~r~g--~n~CGin~i~t~  778 (1004)
T PTZ00462        702 VLGYEFNGKKVQNLCGDDTADHAVNIVGYGNYINDEDE-KKSYWIVRNSWGKYWGDEGYFKVDMYG--PSHCEDNFIHSV  778 (1004)
T ss_pred             HHhhhcCCccccCCCCCCcCCceEEEEEecccccccCC-CCceEEEEcCCCCCcCCCeEEEEEeCC--CCCCccchheee
Confidence            8888 48987665 985 5899999999997521  13 578999999999999999999999953  288999877665


Q ss_pred             cC
Q 044448          307 PL  308 (308)
Q Consensus       307 p~  308 (308)
                      |+
T Consensus       779 ~~  780 (1004)
T PTZ00462        779 VI  780 (1004)
T ss_pred             ee
Confidence            53


No 16 
>KOG1544 consensus Predicted cysteine proteinase TIN-ag [General function prediction only]
Probab=100.00  E-value=6.9e-42  Score=303.55  Aligned_cols=247  Identities=24%  Similarity=0.425  Sum_probs=189.6

Q ss_pred             CCCCCCHHHHHhhhcCCCCCCCC-CCCCCCCCccccCCCCCCCCCceeecCCC--CCCCccCCCCC-CchHHHHHHHHHH
Q 044448           50 KFADLTREKFLASYTGYKPPPTD-HPHSNRSNWFKNLNSSKMSFYDSIDWNER--GAVTPVKDQGS-YCCWAFTAVATVE  125 (308)
Q Consensus        50 ~fsDlt~eEf~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~lP~~~Dwr~~--g~v~pv~dQg~-gsCwAfa~~~~~e  125 (308)
                      +|..||.++=.+..+|..+|... ..|+++.. .. .+.  .+||+.||-|++  +++.|+.|||+ ++.|||+++++..
T Consensus       170 aFWGmtL~DGiKyRLGTL~Ps~sv~nMNEi~~-~l-~p~--~~LPE~F~As~KWp~liH~plDQgnCa~SWafSTaavas  245 (470)
T KOG1544|consen  170 AFWGMTLDDGIKYRLGTLRPSSSVMNMNEIYT-VL-NPG--EVLPEAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVAS  245 (470)
T ss_pred             hhhcccccccceeeecccCchhhhhhHHhHhh-cc-Ccc--cccchhhhhhhcCCccccCccccCCcccceeeeeehhcc
Confidence            78888888766666776665543 33432111 11 111  259999999986  89999999999 9999999999999


Q ss_pred             HHHHHhcCC--cccCCHHHHhhcCC--CCCCCCCcHHHHHHHHHHcCCCCCCCCcCCCCC---CCCCCccc---------
Q 044448          126 GLNKIRTGQ--LVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGR---QDYYCDWW---------  189 (308)
Q Consensus       126 ~~~~i~~~~--~~~lS~q~l~dc~~--~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~---~~~~C~~~---------  189 (308)
                      .+++|.+..  ...||+|+|++|..  .+||.||..+.|+=|+.+. |++...+|||...   ..+.|..-         
T Consensus       246 DRiAI~S~GR~t~~LSpQnLlSC~~h~q~GC~gG~lDRAWWYlRKr-GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkR  324 (470)
T KOG1544|consen  246 DRVAIHSLGRMTPVLSPQNLLSCDTHQQQGCRGGRLDRAWWYLRKR-GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKR  324 (470)
T ss_pred             ceeEEeeccccccccChHHhcchhhhhhccCccCcccchheeeecc-cccccccccccCCCCCCCCCceeeccccCcccc
Confidence            999998643  36799999999998  7899999999999999888 9999999999863   22334310         


Q ss_pred             -------ccC-CCCccEEEeeeEEcCCCCHHHHHHHHh-cCCeEEEEecC-cccccCCceEeCCCC---------CCCCe
Q 044448          190 -------RSS-ASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDAT-WFNFYHGGVFTGPCG---------NTPNH  250 (308)
Q Consensus       190 -------~~~-~~~~~~~i~~~~~v~~~~~~~lk~~l~-~gPV~v~i~~~-~f~~y~~Giy~~~c~---------~~~~H  250 (308)
                             ... .+...++++.=..|. .++++|++.|+ +|||-+.|.+. +|+.|++|||.+...         ..+.|
T Consensus       325 qat~~CPn~~~~Sn~iyq~tPPYrVS-SnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtH  403 (470)
T KOG1544|consen  325 QATAHCPNSYVNSNDIYQVTPPYRVS-SNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTH  403 (470)
T ss_pred             cccCcCCCcccccCceeeecCCeecc-CCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccc
Confidence                   110 122344444444454 46777877777 79999999999 999999999987521         14889


Q ss_pred             EEEEEEeCCcCCCCC-CCCeEEEecCCCCCcCCCceEEEEeCCCCCCCcccccccc
Q 044448          251 GVTIVGYGTTTEAEG-QQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAA  305 (308)
Q Consensus       251 av~iVGyg~~~~~~~-g~~ywivkNSWG~~WGe~Gy~~i~~~~~~~~~Cgi~~~~~  305 (308)
                      +|.|.|||++..++| ..+|||..||||+.|||+|||||-|+.   |.|-|+++.+
T Consensus       404 sVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGv---NecdIEsfvI  456 (470)
T KOG1544|consen  404 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGV---NECDIESFVI  456 (470)
T ss_pred             eEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccc---cchhhhHhhh
Confidence            999999999864341 357999999999999999999999998   7799998754


No 17 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.96  E-value=8.9e-30  Score=231.45  Aligned_cols=194  Identities=24%  Similarity=0.332  Sum_probs=134.6

Q ss_pred             CCCceeecCCCCCCCccCCCCC-CchHHHHHHHHHHHHHHHhcCCcccCCHHHH-----hhcCC--CC-CCCCCcHHHHH
Q 044448           91 SFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQL-----VDCST--LN-GCAKNFLENAF  161 (308)
Q Consensus        91 ~lP~~~Dwr~~g~v~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~~~~~lS~q~l-----~dc~~--~~-gC~GG~~~~a~  161 (308)
                      .+|+.||||+.|.|+||||||. |+||||++++++|+.+.-..  ...+|+-.+     +-|..  .. --+||....+.
T Consensus        98 s~~~~fd~r~~g~vs~v~dQg~~Gscwaf~t~~sles~l~~~~--~w~~s~~nm~~ll~~~ye~~fd~~~~d~g~~~m~~  175 (372)
T COG4870          98 SLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPES--AWDFSENNMKNLLGVPYEKGFDYTSNDGGNADMSA  175 (372)
T ss_pred             cchhheeeeccCCcccccccCcccceEeeeehhhhhheecccc--cccccccchhhhcCCCccccCCCccccCCcccccc
Confidence            4899999999999999999999 99999999999999885443  233444333     22322  11 12377777777


Q ss_pred             HHHHHcCCCCCCCCcCCCCCCCCCCcccccCCCCccEEEeeeEEcCCC----CHHHHHHHHh-cCCeE--EEEecCcccc
Q 044448          162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA----TEEGLQDVVS-RQPVS--VAIDATWFNF  234 (308)
Q Consensus       162 ~~~~~~~Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~----~~~~lk~~l~-~gPV~--v~i~~~~f~~  234 (308)
                      .|+.+..|-+.+.+-||.. ....|.  ..  .....++.....++..    +...|++++. .|-++  +.|++..+..
T Consensus       176 a~l~e~sgpv~et~d~y~~-~s~~~~--~~--~p~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id~~~~~~  250 (372)
T COG4870         176 AYLTEWSGPVYETDDPYSE-NSYFSP--TN--LPVTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLG  250 (372)
T ss_pred             ccccccCCcchhhcCcccc-ccccCC--cC--CchhhccccceecccchhhhcccchHHHHhhhccccceeEEecccccc
Confidence            7888888999999999988 666666  32  1222233444444421    2334666666 36554  3355553333


Q ss_pred             cCCceEeCCCCCCCCeEEEEEEeCCcCC-------CCCCCCeEEEecCCCCCcCCCceEEEEeCC
Q 044448          235 YHGGVFTGPCGNTPNHGVTIVGYGTTTE-------AEGQQPYWLVKNRWGTNWDEGGSMRIFRGV  292 (308)
Q Consensus       235 y~~Giy~~~c~~~~~Hav~iVGyg~~~~-------~~~g~~ywivkNSWG~~WGe~Gy~~i~~~~  292 (308)
                      ..-+.|..+.....+|||+||||++...       +. |.+.||||||||+.||++|||||+...
T Consensus       251 ~~~~~~~~~s~~~~gHAv~iVGyDDs~~~n~~~~~~~-g~GAfiikNSWGt~wG~~GYfwisY~y  314 (372)
T COG4870         251 ICIPYPYVDSGENWGHAVLIVGYDDSFDINNFKYGPP-GDGAFIIKNSWGTNWGENGYFWISYYY  314 (372)
T ss_pred             cccCCCCCCccccccceEEEEeccccccccccccCCC-CCceEEEECccccccccCceEEEEeee
Confidence            3344444333356899999999999653       33 678999999999999999999999875


No 18 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the bacterium Streptomyces verticillus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.92  E-value=1.1e-24  Score=207.74  Aligned_cols=180  Identities=18%  Similarity=0.225  Sum_probs=129.1

Q ss_pred             CccCCCCC-CchHHHHHHHHHHHHHHHh-cCCcccCCHHHHhh----------------cCC-------------CCCCC
Q 044448          105 TPVKDQGS-YCCWAFTAVATVEGLNKIR-TGQLVTRSKHQLVD----------------CST-------------LNGCA  153 (308)
Q Consensus       105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~-~~~~~~lS~q~l~d----------------c~~-------------~~gC~  153 (308)
                      .||+||+. |.||.||+...|++.+..+ +...+.||+.++.-                +..             ....+
T Consensus        55 ~~vtnQ~~SGrCW~FA~Ln~lr~~~~k~~~~~~felSq~Yl~f~dklEkaN~fle~ii~~~~~~~~~R~v~~ll~~~~~D  134 (437)
T cd00585          55 EPVTNQKSSGRCWLFAALNVLRHQFMKKLNLKEFEFSQSYLFFWDKLEKANYFLENIIETADEPLDDRLVQFLLANPQND  134 (437)
T ss_pred             CCcccCCCCchhHHHHCHHHHHHHHHHHcCCCCEEeCcHHHHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHhCCcCC
Confidence            48999999 9999999999999988774 45678999987754                221             34578


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCcCCCCC--------------------------CCC-----------------------
Q 044448          154 KNFLENAFEYIRQYQRLASECVYPYQGR--------------------------QDY-----------------------  184 (308)
Q Consensus       154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~--------------------------~~~-----------------------  184 (308)
                      ||....+...+.+. |+++.+.||-+..                          ..+                       
T Consensus       135 GGqw~m~~~li~KY-GvVPk~~~pet~~s~~t~~~n~~L~~kLr~~a~~lr~~~~~~~~~~~l~~~~~~~~~~iy~il~~  213 (437)
T cd00585         135 GGQWDMLVNLIEKY-GLVPKSVMPESFNSENSRRLNYLLNRKLREDALELRKLVAKGASKEEIEAKKEEMLKEVYRILAI  213 (437)
T ss_pred             CCchHHHHHHHHHc-CCCcccccCCCcCccchHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999887 9999999984321                          000                       


Q ss_pred             ---CCcc---cc--cC---------------------------------CC--C---ccE-----------EEeeeEEcC
Q 044448          185 ---YCDW---WR--SS---------------------------------AS--G---KYG-----------AIRGYQYVQ  207 (308)
Q Consensus       185 ---~C~~---~~--~~---------------------------------~~--~---~~~-----------~i~~~~~v~  207 (308)
                         .++.   |.  ++                                 +.  .   ..+           ....|..+|
T Consensus       214 ~lG~pP~~F~~~y~dkd~~~~~~~~~TP~~F~~~yv~~~~~dyV~l~~~p~~~~p~~~~y~ve~~~Nv~~g~~~~y~Nvp  293 (437)
T cd00585         214 ALGEPPEKFDWEYRDKDKKYHEIKELTPLEFYKKYVKFDLDDYVSLINDPRPDKPYNKLYTVEYLGNVVGGRPILYLNVP  293 (437)
T ss_pred             HcCCCCceEEEEEEeCCCCeeeCCCcCHHHHHHHhcCCCccceEEEEeCCCCCCCCCceEEEecCCcccccccceEEecC
Confidence               0000   00  00                                 00  0   000           111222332


Q ss_pred             CCCHHHHH----HHHhc-CCeEEEEecCcccccCCceEeCC----------------------CCCCCCeEEEEEEeCCc
Q 044448          208 PATEEGLQ----DVVSR-QPVSVAIDATWFNFYHGGVFTGP----------------------CGNTPNHGVTIVGYGTT  260 (308)
Q Consensus       208 ~~~~~~lk----~~l~~-gPV~v~i~~~~f~~y~~Giy~~~----------------------c~~~~~Hav~iVGyg~~  260 (308)
                         ++.|+    ++|.. +||.+++++..|+.|++||++..                      |.+..+|||+|||||.+
T Consensus       294 ---~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D  370 (437)
T cd00585         294 ---MDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLD  370 (437)
T ss_pred             ---HHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEec
Confidence               56665    45565 69999999997779999999653                      23457899999999987


Q ss_pred             CCCCCCC-CeEEEecCCCCCcCCCceEEEEeC
Q 044448          261 TEAEGQQ-PYWLVKNRWGTNWDEGGSMRIFRG  291 (308)
Q Consensus       261 ~~~~~g~-~ywivkNSWG~~WGe~Gy~~i~~~  291 (308)
                      .  + |+ .||+||||||+.||++||++|+++
T Consensus       371 ~--~-g~p~yw~VkNSWG~~~G~~Gy~~ms~~  399 (437)
T cd00585         371 E--D-GKPVKWKVENSWGEKVGKKGYFVMSDD  399 (437)
T ss_pred             C--C-CCcceEEEEcccCCCCCCCcceehhHH
Confidence            5  5 65 699999999999999999999875


No 19 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. 
These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of proteins belongs to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.74  E-value=6.9e-17  Score=154.48  Aligned_cols=179  Identities=21%  Similarity=0.285  Sum_probs=107.9

Q ss_pred             CccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHh----------------hcCC-------------CCCCC
Q 044448          105 TPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLV----------------DCST-------------LNGCA  153 (308)
Q Consensus       105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~----------------dc~~-------------~~gC~  153 (308)
                      .||.||+. |.||.||+..+++..+..+.+ ....||+.+|.                ++..             ....+
T Consensus        56 ~~vtnQk~SGRCW~FA~lN~lR~~~~kk~~l~~felSq~Yl~F~DKlEKaN~fLe~ii~~~~~~~d~R~v~~ll~~~~~D  135 (438)
T PF03051_consen   56 GPVTNQKSSGRCWLFAALNVLRHEIMKKLNLKDFELSQNYLFFWDKLEKANYFLENIIDTADEPLDDRLVRFLLKNPVSD  135 (438)
T ss_dssp             -S--B--BSSTHHHHHHHHHHHHHHHHHCT-SS--B-HHHHHHHHHHHHHHHHHHHHHHCCTS-TTSHHHHHHHHSTT-S
T ss_pred             CCCCCCCCCCCcchhhchHHHHHHHHHHcCCCceEeechHHHHHHHHHHHHHHHHHHHHHhcCCcchHHHHHHHhcCCCC
Confidence            49999999 999999999999999988765 67899998864                3332             34578


Q ss_pred             CCcHHHHHHHHHHcCCCCCCCCcCCCCC--------------------------CC------------------------
Q 044448          154 KNFLENAFEYIRQYQRLASECVYPYQGR--------------------------QD------------------------  183 (308)
Q Consensus       154 GG~~~~a~~~~~~~~Gi~~e~~yPY~~~--------------------------~~------------------------  183 (308)
                      ||....+...|++. ||++.+.||-+..                          ..                        
T Consensus       136 GGqw~~~~nli~KY-GvVPk~~mpet~~s~~t~~~n~~l~~~Lr~~a~~LR~~~~~~~~~~~l~~~k~~~l~~iy~il~~  214 (438)
T PF03051_consen  136 GGQWDMVVNLIKKY-GVVPKSVMPETFSSSNTSEMNEMLNTKLREYALELRKLVKAGKSEEELRKLKEEMLAEIYRILAI  214 (438)
T ss_dssp             -B-HHHHHHHHHHH----BGGGSTTGCGCHBHHHHHHHHHHHHHHHHHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCchHHHHHHHHHc-CcCcHhhCCCCCCCCChHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHH
Confidence            99999999999887 9999999985431                          00                        


Q ss_pred             --CCCcc---cc--cC---------------------------------C-----CCccEEE-----------eeeEEcC
Q 044448          184 --YYCDW---WR--SS---------------------------------A-----SGKYGAI-----------RGYQYVQ  207 (308)
Q Consensus       184 --~~C~~---~~--~~---------------------------------~-----~~~~~~i-----------~~~~~v~  207 (308)
                        |.++.   |.  .+                                 +     -...+.+           ..|..+|
T Consensus       215 ~lG~PP~~F~~ey~dkd~~~~~~~~~TP~eF~~kyv~~~~ddyVsLin~P~~~~py~~~y~ve~~~Nv~~g~~~~ylNvp  294 (438)
T PF03051_consen  215 YLGEPPEKFTWEYRDKDKKYHRGKNYTPLEFYKKYVGFDLDDYVSLINDPRSHHPYNKLYTVEYLGNVVGGRPVRYLNVP  294 (438)
T ss_dssp             HH---SSSEEEEEE-TTS-EEEEEEE-HHHHHHHCTTS-GGGEEEEE--T-TTS-TTCEEEETTTTSSTT-EEEEEEE--
T ss_pred             HcCCCChheeEEEeccccccccccccCchhHHHHHhCCCCcceEEEeeCCCccCccceeEEEccCCCEECCcceeEeccC
Confidence              00000   00  00                                 0     0001110           1122332


Q ss_pred             CCCHHHHHHH----HhcC-CeEEEEecCcccccCCceEeCCCC----------------------CCCCeEEEEEEeCCc
Q 044448          208 PATEEGLQDV----VSRQ-PVSVAIDATWFNFYHGGVFTGPCG----------------------NTPNHGVTIVGYGTT  260 (308)
Q Consensus       208 ~~~~~~lk~~----l~~g-PV~v~i~~~~f~~y~~Giy~~~c~----------------------~~~~Hav~iVGyg~~  260 (308)
                         .+.|+++    |..| ||..+-++..+...+.||.+...-                      +..+|||+|||.+.+
T Consensus       295 ---id~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D  371 (438)
T PF03051_consen  295 ---IDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLD  371 (438)
T ss_dssp             ---HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-
T ss_pred             ---HHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEec
Confidence               5666554    4456 999999999555668898754310                      136899999999987


Q ss_pred             CCCCCCC-CeEEEecCCCCCcCCCceEEEEe
Q 044448          261 TEAEGQQ-PYWLVKNRWGTNWDEGGSMRIFR  290 (308)
Q Consensus       261 ~~~~~g~-~ywivkNSWG~~WGe~Gy~~i~~  290 (308)
                      .  + |+ .+|+|+||||+..|.+||+.|+.
T Consensus       372 ~--~-g~p~~wkVeNSWG~~~g~kGy~~msd  399 (438)
T PF03051_consen  372 E--D-GKPVRWKVENSWGTDNGDKGYFYMSD  399 (438)
T ss_dssp             T--T-SSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred             c--C-CCeeEEEEEcCCCCCCCCCcEEEECH
Confidence            5  5 65 59999999999999999999984


No 20 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT). It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin.; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.41  E-value=4e-13  Score=93.44  Aligned_cols=45  Identities=42%  Similarity=0.735  Sum_probs=39.7

Q ss_pred             HHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHHH
Q 044448           15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKF   59 (308)
Q Consensus        15 f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eEf   59 (308)
                      |++|+++|+|.|.+.+|+.+|+.||++|++.             +|||+|||||++||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence            8999999999999999999999999999998             99999999999998


No 21 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact directly with proteinases using a s
Probab=99.10  E-value=6.7e-11  Score=81.73  Aligned_cols=44  Identities=48%  Similarity=0.872  Sum_probs=41.4

Q ss_pred             HHHHHHHhCCccCCHHHHHHHHHHHHHHHHH-------------hcCCCCCCCCHHH
Q 044448           15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREK   58 (308)
Q Consensus        15 f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~-------------~g~N~fsDlt~eE   58 (308)
                      |++|+++|+|.|.+.+|...|+.+|.+|++.             +|+|+|||||++|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence            6899999999999999999999999999987             8999999999876


No 22 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=98.96  E-value=5.1e-09  Score=95.02  Aligned_cols=78  Identities=19%  Similarity=0.322  Sum_probs=57.5

Q ss_pred             CHHHHHHHHh----cC-CeEEEEecCcccccCCceEeCC------------C---------C-CCCCeEEEEEEeCCcCC
Q 044448          210 TEEGLQDVVS----RQ-PVSVAIDATWFNFYHGGVFTGP------------C---------G-NTPNHGVTIVGYGTTTE  262 (308)
Q Consensus       210 ~~~~lk~~l~----~g-PV~v~i~~~~f~~y~~Giy~~~------------c---------~-~~~~Hav~iVGyg~~~~  262 (308)
                      +.+.+|++..    .| ||-.+-++.-+..-+.||.+-.            .         + +-..|||+|.|.+.+. 
T Consensus       296 ~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~-  374 (444)
T COG3579         296 DMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDE-  374 (444)
T ss_pred             cHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhcccccc-
Confidence            4677777533    35 8888888877777888876421            0         0 0256999999999876 


Q ss_pred             CCCCCCeEEEecCCCCCcCCCceEEEE
Q 044448          263 AEGQQPYWLVKNRWGTNWDEGGSMRIF  289 (308)
Q Consensus       263 ~~~g~~ywivkNSWG~~WGe~Gy~~i~  289 (308)
                       +|..-=|.|.||||..=|.+|||-++
T Consensus       375 -~g~p~rwkVENSWG~d~G~~GyfvaS  400 (444)
T COG3579         375 -TGNPLRWKVENSWGKDVGKKGYFVAS  400 (444)
T ss_pred             -CCCceeeEeecccccccCCCceEeeh
Confidence             42234699999999999999999876


No 23 
>KOG4128 consensus Bleomycin hydrolases and aminopeptidases of cysteine protease family [Amino acid transport and metabolism]
Probab=97.91  E-value=1.2e-05  Score=73.18  Aligned_cols=73  Identities=18%  Similarity=0.216  Sum_probs=52.4

Q ss_pred             CccCCCCC-CchHHHHHHHHHHHHHHHhcC-CcccCCHHHHhh--------------------cCC-----------CCC
Q 044448          105 TPVKDQGS-YCCWAFTAVATVEGLNKIRTG-QLVTRSKHQLVD--------------------CST-----------LNG  151 (308)
Q Consensus       105 ~pv~dQg~-gsCwAfa~~~~~e~~~~i~~~-~~~~lS~q~l~d--------------------c~~-----------~~g  151 (308)
                      +||.||.+ |-||.|+.+..+---...+-+ ....||..+|.-                    |..           +.-
T Consensus        63 ~pvtnqkssGrcWift~ln~lrl~~~~kLnl~eFElSqayLFFwdKlErcnyFL~~vvd~a~r~ep~DgRlvq~Ll~nP~  142 (457)
T KOG4128|consen   63 QPVTNQKSSGRCWIFTGLNLLRLEMDRKLNLPEFELSQAYLFFWDKLERCNYFLWTVVDLAMRCEPLDGRLVQNLLKNPV  142 (457)
T ss_pred             cccccCcCCCceEEEechhHHHHHHHhcCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccHHHHHHHhCCC
Confidence            69999999 999999999987554444332 346788876631                    222           333


Q ss_pred             CCCCcHHHHHHHHHHcCCCCCCCCcCC
Q 044448          152 CAKNFLENAFEYIRQYQRLASECVYPY  178 (308)
Q Consensus       152 C~GG~~~~a~~~~~~~~Gi~~e~~yPY  178 (308)
                      -+||.-..-.+.+++. |+.+..+||-
T Consensus       143 ~DGGqw~MfvNlVkKY-GviPKkcy~~  168 (457)
T KOG4128|consen  143 PDGGQWQMFVNLVKKY-GVIPKKCYLH  168 (457)
T ss_pred             CCCchHHHHHHHHHHh-CCCcHHhccc
Confidence            4688777777787776 9999999964


No 24 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=96.87  E-value=0.011  Score=47.15  Aligned_cols=56  Identities=25%  Similarity=0.509  Sum_probs=34.5

Q ss_pred             CCHHHHHHHHhcC-CeEEEEecC-cccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCC
Q 044448          209 ATEEGLQDVVSRQ-PVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRW  276 (308)
Q Consensus       209 ~~~~~lk~~l~~g-PV~v~i~~~-~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSW  276 (308)
                      .+.+.|++.|.+| ||.+.+... .-.  .+..+..   ....|.|+|+||+.+     +  +++|-.+|
T Consensus        87 ~~~~~i~~~i~~G~Pvi~~~~~~~~~~--~~~~~~~---~~~~H~vvi~Gy~~~-----~--~~~v~DP~  144 (144)
T PF13529_consen   87 ASFDDIKQEIDAGRPVIVSVNSGWRPP--NGDGYDG---TYGGHYVVIIGYDED-----G--YVYVNDPW  144 (144)
T ss_dssp             S-HHHHHHHHHTT--EEEEEETTSS----TTEEEEE----TTEEEEEEEEE-SS-----E---EEEE-TT
T ss_pred             CcHHHHHHHHHCCCcEEEEEEcccccC--CCCCcCC---CcCCEEEEEEEEeCC-----C--EEEEeCCC
Confidence            4679999999985 999998743 111  1112211   357999999999874     4  78887776


No 25 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. 
Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belongs to the peptidase family C47 (staphopain family, clan CA). The type examples are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=93.97  E-value=0.34  Score=40.86  Aligned_cols=118  Identities=17%  Similarity=0.273  Sum_probs=67.2

Q ss_pred             CCCC-CchHHHHHHHHHHHHHH--------HhcCCcccCCHHHHhhcCCCCCCCCCcHHHHHHHHHHcCCCCCCCCcCCC
Q 044448          109 DQGS-YCCWAFTAVATVEGLNK--------IRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ  179 (308)
Q Consensus       109 dQg~-gsCwAfa~~~~~e~~~~--------i~~~~~~~lS~q~l~dc~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~  179 (308)
                      .||. +=|-+||.+++|-....        |.+.-.+.+|+++|.+++.       .+...++|.+.. |....    | 
T Consensus        18 tQg~~pWCa~Ya~aailN~~~~~~~~~A~~iMr~~yPn~s~~~l~~~~~-------~~~~~i~y~ks~-g~~~~----~-   84 (175)
T PF05543_consen   18 TQGYNPWCAGYAMAAILNATTNTKIYNAKDIMRYLYPNVSEEQLKFTSL-------TPNQMIKYAKSQ-GRNPQ----Y-   84 (175)
T ss_dssp             --SSSS-HHHHHHHHHHHHHCT-S---HHHHHHHHSTTS-CCCHHH--B--------HHHHHHHHHHT-TEEEE----E-
T ss_pred             ccCcCcHHHHHHHHHHHHhhhCcCcCCHHHHHHHHCCCCCHHHHhhcCC-------CHHHHHHHHHHc-Ccchh----H-
Confidence            4888 99999999998866421        1122246788888877642       466788887655 43210    0 


Q ss_pred             CCCCCCCcccccCCCCccEEEeeeEEcCCCCHHHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeC
Q 044448          180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYG  258 (308)
Q Consensus       180 ~~~~~~C~~~~~~~~~~~~~i~~~~~v~~~~~~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg  258 (308)
                                .+             ..+  +.+++++.+.+ -|+.+..+...     +     ..+...+|||+||||-
T Consensus        85 ----------~n-------------~~~--s~~eV~~~~~~nk~i~i~~~~v~-----~-----~~~~~~gHAlavvGya  129 (175)
T PF05543_consen   85 ----------NN-------------RMP--SFDEVKKLIDNNKGIAILADRVE-----Q-----TNGPHAGHALAVVGYA  129 (175)
T ss_dssp             ----------EC-------------S-----HHHHHHHHHTT-EEEEEEEETT-----S-----CTTB--EEEEEEEEEE
T ss_pred             ----------hc-------------CCC--CHHHHHHHHHcCCCeEEEecccc-----c-----CCCCccceeEEEEeee
Confidence                      00             011  47889998885 67777655311     1     1223579999999997


Q ss_pred             CcCCCCCCCCeEEEecCCC
Q 044448          259 TTTEAEGQQPYWLVKNRWG  277 (308)
Q Consensus       259 ~~~~~~~g~~ywivkNSWG  277 (308)
                      .-.  + |.++.++=|=|-
T Consensus       130 ~~~--~-g~~~y~~WNPW~  145 (175)
T PF05543_consen  130 KPN--N-GQKTYYFWNPWW  145 (175)
T ss_dssp             EET--T-SEEEEEEE-TT-
T ss_pred             ecC--C-CCeEEEEeCCcc
Confidence            643  4 788999977774


No 26 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=91.19  E-value=0.57  Score=43.17  Aligned_cols=55  Identities=18%  Similarity=0.422  Sum_probs=35.5

Q ss_pred             HHHHHHHHhcC-CeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEec
Q 044448          211 EEGLQDVVSRQ-PVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN  274 (308)
Q Consensus       211 ~~~lk~~l~~g-PV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkN  274 (308)
                      .+.|+++|.+| ||.+.++.+.+ -|...-|.   .....|.|+|+||+++     +..+.++-.
T Consensus        78 ~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~~---~~~~~H~i~v~G~d~~-----~~~~~v~D~  133 (317)
T PF14399_consen   78 WEELKEALDAGRPVIVWVDMYYL-PYRPNYYK---KHHADHYIVVYGYDEE-----EDVFYVSDP  133 (317)
T ss_pred             HHHHHHHHhCCCceEEEeccccC-CCCccccc---cccCCcEEEEEEEeCC-----CCEEEEEcC
Confidence            45778888877 99999887522 22221111   1236899999999976     345666644


No 27 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=84.98  E-value=2.6  Score=35.80  Aligned_cols=52  Identities=17%  Similarity=0.312  Sum_probs=36.7

Q ss_pred             EEcCCCCHHHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCC
Q 044448          204 QYVQPATEEGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG  277 (308)
Q Consensus       204 ~~v~~~~~~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG  277 (308)
                      ..++..++.+|+..|.+ .||.+-.-.  |-.            ..-|+|+|.||++.        |+..-++||
T Consensus       116 ~d~tGksl~~ik~ql~kg~PV~iw~T~--~~~------------~s~H~v~itgyDk~--------n~yynDpyG  168 (195)
T COG4990         116 VDLTGKSLSDIKGQLLKGRPVVIWVTN--FHS------------YSIHSVLITGYDKY--------NIYYNDPYG  168 (195)
T ss_pred             ccCcCCcHHHHHHHHhcCCcEEEEEec--ccc------------cceeeeEeeccccc--------ceEeccccc
Confidence            34566789999999997 599876543  211            34799999999874        355556664


No 28 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=83.12  E-value=5  Score=35.13  Aligned_cols=51  Identities=22%  Similarity=0.452  Sum_probs=32.4

Q ss_pred             CHHHHHHHHhc-CCeEEEEecCccc--ccCCceEeC---CC--C--CCCCeEEEEEEeCCc
Q 044448          210 TEEGLQDVVSR-QPVSVAIDATWFN--FYHGGVFTG---PC--G--NTPNHGVTIVGYGTT  260 (308)
Q Consensus       210 ~~~~lk~~l~~-gPV~v~i~~~~f~--~y~~Giy~~---~c--~--~~~~Hav~iVGyg~~  260 (308)
                      +.++|..+|.. ||+.+-++..-..  .-+.-....   .|  .  .-.+|-|+|+||+.+
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~  172 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA  172 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence            58999999996 6777777776111  112222211   12  1  247899999999986


No 29 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=60.89  E-value=20  Score=28.23  Aligned_cols=33  Identities=21%  Similarity=0.454  Sum_probs=23.6

Q ss_pred             HHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeC
Q 044448          214 LQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYG  258 (308)
Q Consensus       214 lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg  258 (308)
                      +++.|.. -||.+.++..            .-....+|.|+|+||+
T Consensus        70 ~~~~l~~~~Pvi~~~~~~------------~~~~~~gH~vVv~g~~  103 (141)
T cd02549          70 LLRQLAAGHPVIVSVNLG------------VSITPSGHAMVVIGYD  103 (141)
T ss_pred             HHHHHHCCCeEEEEEecC------------cccCCCCeEEEEEEEc
Confidence            7778886 5998877641            0112468999999998


No 30 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, proteobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2.
Probab=42.68  E-value=62  Score=27.06  Aligned_cols=38  Identities=32%  Similarity=0.439  Sum_probs=28.1

Q ss_pred             CHHHHHHHHh-cCCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCc
Q 044448          210 TEEGLQDVVS-RQPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTT  260 (308)
Q Consensus       210 ~~~~lk~~l~-~gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~  260 (308)
                      +.+.+...|. +||+-++...             |-..-..|+++|.|-+.+
T Consensus        97 t~e~~~~LL~~yGPLwv~~~~-------------P~~~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen   97 TAEGLANLLREYGPLWVAWEA-------------PGDSWVAHASVITGIDGD  135 (166)
T ss_pred             CHHHHHHHHHHcCCeEEEecC-------------CCCcceeeEEEEEeecCC
Confidence            5789999999 5999988654             211224699999998765


No 31 
>PF11567 PfUIS3:  Plasmodium falciparum UIS3 membrane protein;  InterPro: IPR021626  UIS3 is a membrane protein essential for sporozoite development in infected hepatocytes. This family is 130-229 of the Plasmodium falciparum UIS3 protein which is compact and has an all alpha-helical structure. PfUIS3(130-229) interacts with lipids, phospholipid lysosomes, the human liver fatty acid-binding protein and with the lipid phosphatidylethanolamine. The interaction with liver fatty acid-binding protein provides the parasite with a method to import essential fatty acids/lipids during rapid growth phases of sporozoites.; PDB: 2VWA_C.
Probab=34.31  E-value=10  Score=28.19  Aligned_cols=31  Identities=26%  Similarity=0.368  Sum_probs=22.2

Q ss_pred             HHHHHHHHHHHHHHHhcCCCCCCCCHHHHHh
Q 044448           31 EKEMRFKIFKKNHEFLRLNKFADLTREKFLA   61 (308)
Q Consensus        31 e~~~R~~iF~~N~~~~g~N~fsDlt~eEf~~   61 (308)
                      --.+||.+|..|.+...--+|++||.+.-.-
T Consensus        19 vpiKrfN~F~Dn~rla~qhHF~~LSn~Qq~y   49 (101)
T PF11567_consen   19 VPIKRFNIFMDNARLAAQHHFSNLSNEQQKY   49 (101)
T ss_dssp             --HHHHHHHHHHHHHHHHHHHHHS-HHHHHH
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHhcCcHHHHH
Confidence            4578999999999984445788888776544


No 32 
>PF05391 Lsm_interact:  Lsm interaction motif;  InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation.
Probab=32.39  E-value=34  Score=18.41  Aligned_cols=12  Identities=8%  Similarity=0.252  Sum_probs=9.6

Q ss_pred             CCCHHHHHhhhc
Q 044448           53 DLTREKFLASYT   64 (308)
Q Consensus        53 Dlt~eEf~~~~~   64 (308)
                      -+++++|+++++
T Consensus         9 p~SNddFrkmfl   20 (21)
T PF05391_consen    9 PKSNDDFRKMFL   20 (21)
T ss_pred             ccchHHHHHHHc
Confidence            478899998876


No 33 
>KOG4702 consensus Uncharacterized conserved protein [Function unknown]
Probab=26.33  E-value=1.7e+02  Score=20.94  Aligned_cols=33  Identities=12%  Similarity=0.149  Sum_probs=24.5

Q ss_pred             HHHHHHHHHHhCCccCCHHHHHHHHHHHHHHHHH
Q 044448           12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF   45 (308)
Q Consensus        12 ~~~f~~~~~~~~k~Y~~~~e~~~R~~iF~~N~~~   45 (308)
                      -..|++|+..|++.-.++ |...|..-|.+-++.
T Consensus        28 pe~Fee~v~~~krel~pp-e~~~~~EE~~~~lRe   60 (77)
T KOG4702|consen   28 PEIFEEFVRGYKRELSPP-EATKRKEEYENFLRE   60 (77)
T ss_pred             hHHHHHHHHhccccCCCh-HHHhhHHHHHHHHHH
Confidence            357999999999987665 666677666665553


No 34 
>PF01640 Peptidase_C10:  Peptidase C10 family classification.;  InterPro: IPR000200 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionarily related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad.  This group of cysteine peptidases belong to MEROPS peptidase family C10 (streptopain family, clan CA). Streptopain is a cysteine protease found in Streptococcus pyogenes that shows some structural and functional similarity to papain (family C1). The order of the catalytic cysteine/histidine dyad is the same and the surrounding sequences are similar. The two proteins also show similar specificities, both preferring a hydrophobic residue at the P2 site. Streptopain shows a high degree of sequence similarity to the S. pyogenes exotoxin B, and strong similarity to the prtT gene product of Porphyromonas gingivalis (Bacteroides gingivalis), both of which have been included in the family.; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 4D8I_A 4D8E_A 4D8B_A 3BBA_B 3BB7_A 2JTC_A 1PVJ_A 1DKI_D 2UZJ_A.
Probab=25.24  E-value=2.7e+02  Score=23.66  Aligned_cols=49  Identities=24%  Similarity=0.620  Sum_probs=28.8

Q ss_pred             HHHHHHHhc-CCeEEEEecCcccccCCceEeCCCCCCCCeEEEEEEeCCcCCCCCCCCeEEEecCCCCCcC--CCceEE
Q 044448          212 EGLQDVVSR-QPVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD--EGGSMR  287 (308)
Q Consensus       212 ~~lk~~l~~-gPV~v~i~~~~f~~y~~Giy~~~c~~~~~Hav~iVGyg~~~~~~~g~~ywivkNSWG~~WG--e~Gy~~  287 (308)
                      +.|+..|.+ .||.+.-..           .     ..+||.+|=||..       ..|+-+  -||  ||  .+||++
T Consensus       141 ~~i~~el~~~rPV~~~g~~-----------~-----~~GHawViDGy~~-------~~~~H~--NwG--W~G~~nGyy~  192 (192)
T PF01640_consen  141 DMIRNELDNGRPVLYSGNS-----------K-----SGGHAWVIDGYDS-------DGYFHC--NWG--WGGSSNGYYR  192 (192)
T ss_dssp             HHHHHHHHTT--EEEEEEE-----------T-----TEEEEEEEEEEES-------SSEEEE--E-S--STTTT-EEEE
T ss_pred             HHHHHHHHcCCCEEEEEec-----------C-----CCCeEEEEcCccC-------CCeEEE--eeC--ccCCCCCccC
Confidence            456777775 699754321           0     1299999999954       357766  455  54  568875


No 35 
>KOG4621 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.19  E-value=1.7e+02  Score=23.57  Aligned_cols=51  Identities=16%  Similarity=0.284  Sum_probs=32.7

Q ss_pred             CHHHHHHHHhcC-CeEEEEecC-----ccc--ccCCceEeCC-----CCC--CCCeEEEEEEeCCc
Q 044448          210 TEEGLQDVVSRQ-PVSVAIDAT-----WFN--FYHGGVFTGP-----CGN--TPNHGVTIVGYGTT  260 (308)
Q Consensus       210 ~~~~lk~~l~~g-PV~v~i~~~-----~f~--~y~~Giy~~~-----c~~--~~~Hav~iVGyg~~  260 (308)
                      ++.+|...|+.| -|++.+.-.     ++-  --+++.+.+.     |.+  ..+|-|+|-||+-.
T Consensus        58 Si~dIqahLaqGnhiAIaLVdq~~Lhcdlceeplk~ccfspnghhcfcrtp~YqGHfiVi~GYd~a  123 (167)
T KOG4621|consen   58 SIHDIQAHLAQGNHIAIALVDQDKLHCDLCEEPLKSCCFSPNGHHCFCRTPCYQGHFIVICGYDAA  123 (167)
T ss_pred             eHHHHHHHHhcCCeEEEEEecCCceehHHHHhHHHHhccCCCCccccccCCcccccEEEEeccccc
Confidence            478899999987 676655432     221  2244555432     333  37899999999875


No 36 
>PF08664 YcbB:  YcbB domain;  InterPro: IPR013972  YcbB is a DNA-binding protein.
Probab=23.10  E-value=1.5e+02  Score=23.98  Aligned_cols=56  Identities=16%  Similarity=0.187  Sum_probs=39.0

Q ss_pred             hHHHHHHHHHHHHh-------CCccCCHHHHHHHHHHHH--HHHHHhcCCCCCCCCHHHHHhhhcC
Q 044448            9 GNIAAKHEQWMVEF-------ARTYKDQAEKEMRFKIFK--KNHEFLRLNKFADLTREKFLASYTG   65 (308)
Q Consensus         9 ~~~~~~f~~~~~~~-------~k~Y~~~~e~~~R~~iF~--~N~~~~g~N~fsDlt~eEf~~~~~~   65 (308)
                      -.+...|.+.....       .|.=+. .|...|+.|++  .|+..||+.-|++-.-+|+...+-.
T Consensus        40 ~~Lk~~f~~~~~~~~~~~~~~~~e~Ka-~EQRIRRai~~al~nlAsLGl~Dy~N~~Fe~YA~~lFd  104 (134)
T PF08664_consen   40 PSLKEIFEELAQKKLASDEEIEKEKKA-IEQRIRRAIKQALTNLASLGLEDYSNPIFEEYASRLFD  104 (134)
T ss_pred             CcHHHHHHHHHHhhccchhhhhHHHHH-HHHHHHHHHHHHHHHHHHhCCcccCChHHHHHHHHcCC
Confidence            34677788777666       333332 36677888884  6777799998888888887766543


No 37 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=20.98  E-value=1.1e+02  Score=28.21  Aligned_cols=40  Identities=20%  Similarity=0.542  Sum_probs=0.0

Q ss_pred             CCeEEEEEEeCCcCCCCCCCCeEEEecCCC------------CCc--------------CCCceEEEE
Q 044448          248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWG------------TNW--------------DEGGSMRIF  289 (308)
Q Consensus       248 ~~Hav~iVGyg~~~~~~~g~~ywivkNSWG------------~~W--------------Ge~Gy~~i~  289 (308)
                      .+||=.|++.-.... . +.+...+||-||            +.|              .++|-|+|+
T Consensus       235 ~~HaY~Vl~~~~~~~-~-~~~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~  300 (315)
T cd00044         235 KGHAYSVLDVREVQE-E-GLRLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMS  300 (315)
T ss_pred             cCcceEEeEEEEEcc-C-ceEEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEE


No 38 
>PF07351 DUF1480:  Protein of unknown function (DUF1480);  InterPro: IPR009950 This family consists of several hypothetical Enterobacterial proteins of around 80 residues in length. The function of this family is unknown.
Probab=20.34  E-value=1.2e+02  Score=22.04  Aligned_cols=23  Identities=22%  Similarity=0.554  Sum_probs=18.1

Q ss_pred             eEeCCCCCCCCeEEEEEEeCCcC
Q 044448          239 VFTGPCGNTPNHGVTIVGYGTTT  261 (308)
Q Consensus       239 iy~~~c~~~~~Hav~iVGyg~~~  261 (308)
                      ....||.++.+-+|-|=||+.+.
T Consensus        28 tlsIPCksdpdlcmQLDgWDe~T   50 (80)
T PF07351_consen   28 TLSIPCKSDPDLCMQLDGWDEHT   50 (80)
T ss_pred             eEEeecCCChhheeEecccccCC
Confidence            34457988888999999998764


Done!