Query         psy1727
Match_columns 240
No_of_seqs    215 out of 1710
Neff          8.4 
Searched_HMMs 46136
Date          Fri Aug 16 17:40:15 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1727.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1727hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1542|consensus              100.0 4.2E-57 9.2E-62  383.6  17.7  210   10-231    62-366 (372)
  2 PTZ00203 cathepsin L protease; 100.0 1.9E-51   4E-56  362.1  19.6  210    7-232    26-336 (348)
  3 PTZ00021 falcipain-2; Provisio 100.0 3.3E-50 7.2E-55  364.5  18.8  212   11-235   161-486 (489)
  4 PTZ00200 cysteine proteinase;  100.0 2.8E-47 6.1E-52  344.1  19.4  206   12-236   119-444 (448)
  5 KOG1543|consensus              100.0 3.7E-44 7.9E-49  313.6  17.7  152   62-228   162-316 (325)
  6 cd02621 Peptidase_C1A_Cathepsi 100.0 1.1E-40 2.4E-45  282.2  14.4  164   62-235    62-242 (243)
  7 cd02698 Peptidase_C1A_Cathepsi 100.0 1.2E-40 2.7E-45  281.1  14.2  157   62-235    61-238 (239)
  8 cd02620 Peptidase_C1A_Cathepsi 100.0 1.2E-39 2.7E-44  274.5  13.4  152   62-230    57-232 (236)
  9 cd02248 Peptidase_C1A Peptidas 100.0 6.3E-38 1.4E-42  259.2  15.5  151   62-228    51-204 (210)
 10 PTZ00049 cathepsin C-like prot 100.0 4.4E-38 9.4E-43  291.8  15.5  166   62-236   446-677 (693)
 11 PTZ00364 dipeptidyl-peptidase  100.0 5.8E-38 1.3E-42  287.8  15.4  164   62-235   268-459 (548)
 12 PF00112 Peptidase_C1:  Papain  100.0 1.1E-36 2.5E-41  252.4   9.8  159   62-235    54-218 (219)
 13 PTZ00462 Serine-repeat antigen 100.0 5.9E-34 1.3E-38  271.2   9.9  190   30-229   526-774 (1004)
 14 smart00645 Pept_C1 Papain fami 100.0 1.2E-31 2.6E-36  215.9  10.9  120   62-233    52-172 (174)
 15 cd02619 Peptidase_C1 C1 Peptid 100.0 3.3E-31 7.1E-36  220.0  12.7  149   62-221    50-213 (223)
 16 KOG1544|consensus              100.0 5.3E-30 1.2E-34  216.2   7.8  162   62-229   264-454 (470)
 17 cd00585 Peptidase_C1B Peptidas  99.7 1.1E-17 2.4E-22  151.0   8.9   95  119-222   286-401 (437)
 18 COG4870 Cysteine protease [Pos  99.6 2.9E-15 6.3E-20  129.6   7.4  136   75-221   168-314 (372)
 19 PF03051 Peptidase_C1_2:  Pepti  99.0 1.3E-09 2.7E-14   99.0   9.6   94  121-221   289-401 (438)
 20 PF08246 Inhibitor_I29:  Cathep  98.8 4.5E-09 9.8E-14   69.1   4.1   36   19-54      1-36  (58)
 21 smart00848 Inhibitor_I29 Cathe  98.5 1.2E-07 2.6E-12   61.8   4.0   37   19-55      1-37  (57)
 22 COG3579 PepC Aminopeptidase C   97.6 7.3E-05 1.6E-09   64.9   4.0   83  133-221   305-403 (444)
 23 PF13529 Peptidase_C39_2:  Pept  92.1    0.28   6E-06   36.8   4.5   57  128-203    88-144 (144)
 24 PF14399 Transpep_BrtH:  NlpC/p  88.9     1.6 3.5E-05   37.9   7.1   47  129-185    78-124 (317)
 25 PF09778 Guanylate_cyc_2:  Guan  88.2     2.2 4.8E-05   35.3   6.9   58  128-185   112-172 (212)
 26 PF05543 Peptidase_C47:  Stapho  82.2       8 0.00017   31.0   7.2   65  128-220    90-154 (175)
 27 KOG4128|consensus               81.7    0.19 4.2E-06   44.0  -2.2   87  129-219   306-413 (457)
 28 PTZ00203 cathepsin L protease;  69.2     4.1 8.9E-05   36.4   2.7   36  195-233   287-322 (348)
 29 KOG3404|consensus               68.7     7.7 0.00017   29.2   3.6   44    6-49     34-77  (145)
 30 KOG1543|consensus               66.9     7.7 0.00017   34.3   3.9   41  193-236   268-308 (325)
 31 cd02698 Peptidase_C1A_Cathepsi  66.4       5 0.00011   33.6   2.6   37  195-233   179-215 (239)
 32 PTZ00200 cysteine proteinase;   65.4     5.9 0.00013   36.7   3.0   39  195-234   387-425 (448)
 33 cd02621 Peptidase_C1A_Cathepsi  64.9     5.9 0.00013   33.2   2.7   39  195-234   187-225 (243)
 34 COG4990 Uncharacterized protei  63.3      16 0.00035   29.4   4.7   47  128-204   122-168 (195)
 35 smart00645 Pept_C1 Papain fami  62.4      12 0.00026   29.7   4.0   39  194-234   119-157 (174)
 36 cd02620 Peptidase_C1A_Cathepsi  59.6      11 0.00023   31.5   3.4   37  196-235   186-222 (236)
 37 KOG1542|consensus               53.8      11 0.00024   33.5   2.5   37  195-234   317-354 (372)
 38 cd02549 Peptidase_C39A A sub-f  52.9      45 0.00097   24.7   5.6   34  132-183    70-103 (141)
 39 PTZ00364 dipeptidyl-peptidase   52.4      13 0.00027   35.4   2.8   37  195-233   403-441 (548)
 40 KOG4702|consensus               49.1      34 0.00074   23.0   3.7   32   17-49     29-60  (77)
 41 PTZ00021 falcipain-2; Provisio  42.9      24 0.00053   33.1   3.1   41  194-234   420-467 (489)
 42 cd02619 Peptidase_C1 C1 Peptid  42.8      25 0.00055   28.3   3.0   29  206-234   179-210 (223)
 43 PTZ00049 cathepsin C-like prot  40.3      27 0.00058   34.1   3.0   37  196-233   620-658 (693)
 44 PF12385 Peptidase_C70:  Papain  38.0      73  0.0016   25.2   4.6   38  129-185    98-135 (166)
 45 cd02248 Peptidase_C1A Peptidas  37.7      26 0.00056   28.2   2.3   37  195-234   159-195 (210)
 46 PTZ00462 Serine-repeat antigen  33.4      40 0.00087   34.3   3.1   39  196-234   723-763 (1004)
 47 cd00044 CysPc Calpains, domain  32.3      75  0.0016   27.7   4.4   30  172-205   234-263 (315)
 48 PF03714 PUD:  Bacterial pullan  30.1      13 0.00028   26.8  -0.7   41  196-238    61-103 (103)
 49 PF00112 Peptidase_C1:  Papain   27.4      32 0.00069   27.6   1.2   36  196-234   167-202 (219)
 50 PF04928 PAP_central:  Poly(A)   22.0 1.6E+02  0.0034   25.2   4.4   39   10-50     17-55  (254)

No 1  
>KOG1542|consensus
Probab=100.00  E-value=4.2e-57  Score=383.62  Aligned_cols=210  Identities=46%  Similarity=0.812  Sum_probs=186.5

Q ss_pred             CChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhc---CCc------------------------------
Q psy1727          10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR---GEG------------------------------   56 (240)
Q Consensus        10 ~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~---n~g------------------------------   56 (240)
                      +..+..+++|..|+.+|+|+|.+.+|..+|+.||++|+..++++   ++|                              
T Consensus        62 ~~~l~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~~~  141 (372)
T KOG1542|consen   62 PRGLGLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRGSK  141 (372)
T ss_pred             CcccchHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhcccccccc
Confidence            33455589999999999999999999999999999999999543   232                              


Q ss_pred             --------------------------------------------ceece-----------------eEecCCCCCCCCCC
Q psy1727          57 --------------------------------------------THLAL-----------------KLVDCDKVDAGCGG   75 (240)
Q Consensus        57 --------------------------------------------~~g~l-----------------qlvDCs~~~~gC~G   75 (240)
                                                                  +||+|                 |||||+.++.||+|
T Consensus       142 ~~~~~~~~~~~~~~~lP~~fDWR~kgaVTpVKnQG~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d~gC~G  221 (372)
T KOG1542|consen  142 LPGDAAEAPIEPGESLPESFDWRDKGAVTPVKNQGMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCDNGCNG  221 (372)
T ss_pred             CccccccCcCCCCCCCCcccchhccCCccccccCCcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcCCcCCC
Confidence                                                        24443                 99999999999999


Q ss_pred             CccchHHHHHHHhcCCCccCCcccccccCCC-cccCCcccceeEEeEeeeechhHHHHHHHHHhCCCeEEEEeccccccc
Q psy1727          76 GLMSNAFETIISKLGGGLEGEKDYPYKGSNR-ACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFY  154 (240)
Q Consensus        76 G~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~-~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~f~~Y  154 (240)
                      |.+..||+|++ +. +||..|.+|||++..+ .|..++....+.|.+|..++.||++|.++|.++|||+|+|++..+|+|
T Consensus       222 Gl~~nA~~~~~-~~-gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~Y  299 (372)
T KOG1542|consen  222 GLMDNAFKYIK-KA-GGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFY  299 (372)
T ss_pred             CChhHHHHHHH-Hh-CCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHh
Confidence            99999999977 66 7999999999999997 999999999999999999999999999999999999999998889999


Q ss_pred             cCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECCCCCccccceeE
Q psy1727         155 FGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTK  231 (240)
Q Consensus       155 ~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~  231 (240)
                      .+||+.+....|+.  ..++|+|||||||...     -.++|||||||||+.|||+|  ||||.||. +.||++..+
T Consensus       300 rgGV~~P~~~~Cs~--~~~~HaVLlvGyG~~g-----~~~PYWIVKNSWG~~WGE~G--Y~~l~RG~-N~CGi~~mv  366 (372)
T KOG1542|consen  300 RGGVSCPSKYICSP--KLLNHAVLLVGYGSSG-----YEKPYWIVKNSWGTSWGEKG--YYKLCRGS-NACGIADMV  366 (372)
T ss_pred             cccccCCCcccCCc--cccCceEEEEeecCCC-----CCCceEEEECCccccccccc--eEEEeccc-cccccccch
Confidence            99999995558996  5599999999999882     27899999999999999999  99999996 567777443


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=100.00  E-value=1.9e-51  Score=362.13  Aligned_cols=210  Identities=37%  Similarity=0.657  Sum_probs=174.0

Q ss_pred             CCCCChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC-------c-----------------------
Q psy1727           7 PHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE-------G-----------------------   56 (240)
Q Consensus         7 ~~~~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~-------g-----------------------   56 (240)
                      +..+-+++++.+|++||++|+|.|.+.+|+.+||+||++|+++|+++|.       |                       
T Consensus        26 ~~~~~~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~~~~lg~N~FaDlT~eEf~~~~l~~~~~~  105 (348)
T PTZ00203         26 RAIYVGTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAARYLNGAAYF  105 (348)
T ss_pred             hhcccccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCCCeEEeccccccCCHHHHHHHhcCCCccc
Confidence            3344467778999999999999999988999999999999999987763       1                       


Q ss_pred             -------------------------------------------------ceece-----------------eEecCCCCC
Q psy1727          57 -------------------------------------------------THLAL-----------------KLVDCDKVD   70 (240)
Q Consensus        57 -------------------------------------------------~~g~l-----------------qlvDCs~~~   70 (240)
                                                                       ++++|                 |||||+..+
T Consensus       106 ~~~~~~~~~~~~~~~~~~~~lP~~~DWR~~g~VtpVkdQg~CGSCWAfa~~~aiEs~~~i~~~~~~~LSeQqLvdC~~~~  185 (348)
T PTZ00203        106 AAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVD  185 (348)
T ss_pred             ccccccccccccccccccccCCCCCcCCcCCCCCCccccCCCccHHHHhhHHHHHHHHHHhcCCCccCCHHHHHhccCCC
Confidence                                                             01111                 999999878


Q ss_pred             CCCCCCccchHHHHHHHhc-CCCccCCcccccccCCC---cccCCcc-cceeEEeEeeeechhHHHHHHHHHhCCCeEEE
Q psy1727          71 AGCGGGLMSNAFETIISKL-GGGLEGEKDYPYKGSNR---ACHLNKE-EIRVKIQSYVNVSSDETEMAKYLVKNGPMAVA  145 (240)
Q Consensus        71 ~gC~GG~~~~A~~yi~~~~-~~Gi~~e~~yPY~~~~~---~C~~~~~-~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v~  145 (240)
                      .||+||++..||+|++ ++ .+|+++|++|||.+.++   .|..... ...+++.+|..++.++++|+.+|+++|||+|+
T Consensus       186 ~GC~GG~~~~a~~yi~-~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~  264 (348)
T PTZ00203        186 NGCGGGLMLQAFEWVL-RNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIA  264 (348)
T ss_pred             CCCCCCCHHHHHHHHH-HhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecCcCHHHHHHHHHhCCCEEEE
Confidence            8999999999999997 42 14689999999998765   6874332 23567888888877888999999999999999


Q ss_pred             EeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECCCCCcc
Q psy1727         146 INANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRW  225 (240)
Q Consensus       146 ~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~  225 (240)
                      |++..|++|++|||+    .|..  ..++|+|+|||||.+      +|++|||||||||++|||+|  ||||+||.| .|
T Consensus       265 i~a~~f~~Y~~GIy~----~c~~--~~~nHaVliVGYG~~------~g~~YWiikNSWG~~WGe~G--Y~ri~rg~n-~C  329 (348)
T PTZ00203        265 VDASSFMSYHSGVLT----SCIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKG--YVRVTMGVN-AC  329 (348)
T ss_pred             EEhhhhcCccCceee----ccCC--CCCCeEEEEEEEecC------CCceEEEEEcCCCCCcCcCc--eEEEEcCCC-cc
Confidence            999779999999995    3764  457999999999987      68999999999999999999  999999975 57


Q ss_pred             ccceeEE
Q psy1727         226 GEQVTKS  232 (240)
Q Consensus       226 g~~~~~~  232 (240)
                      |+.....
T Consensus       330 gi~~~~~  336 (348)
T PTZ00203        330 LLTGYPV  336 (348)
T ss_pred             cccceEE
Confidence            7764443


No 3  
>PTZ00021 falcipain-2; Provisional
Probab=100.00  E-value=3.3e-50  Score=364.53  Aligned_cols=212  Identities=36%  Similarity=0.668  Sum_probs=175.9

Q ss_pred             ChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC--------c-------------------c------
Q psy1727          11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE--------G-------------------T------   57 (240)
Q Consensus        11 ~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~--------g-------------------~------   57 (240)
                      .+++...+|++|+++|+|+|.+.+|+..||.||++|+++|+++|.        |                   .      
T Consensus       161 ~n~e~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l~~~~~~~~~  240 (489)
T PTZ00021        161 TNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYLTLKSFDFKS  240 (489)
T ss_pred             cChHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhcccccccccc
Confidence            356677889999999999999999999999999999999987663        1                   0      


Q ss_pred             -------------------------------------------------------eece-----------------eEec
Q psy1727          58 -------------------------------------------------------HLAL-----------------KLVD   65 (240)
Q Consensus        58 -------------------------------------------------------~g~l-----------------qlvD   65 (240)
                                                                             +++|                 ||||
T Consensus       241 ~~~~~~~~~~~~~~~~~~~~~~~~~~P~s~DWR~~g~VtpVKdQG~CGSCWAFAa~~alEs~~~I~~g~~v~LSeQqLVD  320 (489)
T PTZ00021        241 NGKKSPRVINYDDVIKKYKPKDATFDHAKYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVD  320 (489)
T ss_pred             ccccccccccccccccccccccccCCccccccccCCCCCCcccccccccHHHHHHHHHHHHHHHHHcCCCcccCHHHHhh
Confidence                                                                   0000                 8999


Q ss_pred             CCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccC-CCcccCCcccceeEEeEeeeechhHHHHHHHHHhCCCeEE
Q psy1727          66 CDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGS-NRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAV  144 (240)
Q Consensus        66 Cs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~-~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~GPv~v  144 (240)
                      |+..+.||+||++..||+|+. ++ +||++|++|||.+. .+.|........++|.+|..++  +++|+++|+..|||+|
T Consensus       321 Cs~~n~GC~GG~~~~Af~yi~-~~-gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~--~~~lk~al~~~GPVsv  396 (489)
T PTZ00021        321 CSFKNNGCYGGLIPNAFEDMI-EL-GGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIP--EDKFKEAIRFLGPISV  396 (489)
T ss_pred             hccCCCCCCCcchHhhhhhhh-hc-cccCcccccCccCCCCCccccccccccceeeeEEEec--HHHHHHHHHhcCCeEE
Confidence            998889999999999999998 66 69999999999987 4789876666678899998886  5689999999999999


Q ss_pred             EEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccc----cccCCcCEEEEecccCCCcCCCCcceEEEEC
Q psy1727         145 AINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTK----FTHKIQPYWIIKNSWGPHWGEKTMPFWIIKN  219 (240)
Q Consensus       145 ~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~----~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~  219 (240)
                      +|++.. |++|++|||++.   |+   ..++|||+|||||++...    ....+.+|||||||||+.|||+|  ||||+|
T Consensus       397 ~i~a~~~f~~YkgGIy~~~---C~---~~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~G--Y~rI~r  468 (489)
T PTZ00021        397 SIAVSDDFAFYKGGIFDGE---CG---EEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKG--FIRIET  468 (489)
T ss_pred             EEEeecccccCCCCcCCCC---CC---CccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCe--EEEEEc
Confidence            999964 999999999764   87   458999999999975210    01124689999999999999999  999999


Q ss_pred             CCC---CccccceeEEecC
Q psy1727         220 SWG---PRWGEQVTKSIYS  235 (240)
Q Consensus       220 ~~~---~~~g~~~~~~~~~  235 (240)
                      +.+   +.||+.. .+.||
T Consensus       469 ~~~g~~n~CGI~t-~a~yP  486 (489)
T PTZ00021        469 DENGLMKTCSLGT-EAYVP  486 (489)
T ss_pred             CCCCCCCCCCCcc-cceeE
Confidence            975   6899974 44554


No 4  
>PTZ00200 cysteine proteinase; Provisional
Probab=100.00  E-value=2.8e-47  Score=344.07  Aligned_cols=206  Identities=30%  Similarity=0.519  Sum_probs=167.0

Q ss_pred             hhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC------c---------------------c-e-----
Q psy1727          12 KLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE------G---------------------T-H-----   58 (240)
Q Consensus        12 ~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~------g---------------------~-~-----   58 (240)
                      +++...+|++|+++|+|.|.+.+|+..||.||++|+++|+++|.      |                     . .     
T Consensus       119 e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~~~y~lgiN~FsDlT~eEF~~~~~~~~~~~~~~~~~  198 (448)
T PTZ00200        119 EFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTS  198 (448)
T ss_pred             hHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCcCCeEEeccccccCCHHHHHHHhccCCCcccccccc
Confidence            34445789999999999999999999999999999999976663      1                     0 0     


Q ss_pred             --------------------------e------c-e--------------------------------------------
Q psy1727          59 --------------------------L------A-L--------------------------------------------   61 (240)
Q Consensus        59 --------------------------g------~-l--------------------------------------------   61 (240)
                                                .      . +                                            
T Consensus       199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~DWR~~g~vtpVkdQG~~CGSCWAFat~~aiEs~~~i~~~~  278 (448)
T PTZ00200        199 HNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDK  278 (448)
T ss_pred             cccccccccccccccccccccccccccccccccccCCCCccCCCCCCCCCcccCCCccchHHHHhHHHHHHHHHHHhcCC
Confidence                                      0      0 1                                            


Q ss_pred             -------eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeechhHHHHHH
Q psy1727          62 -------KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAK  134 (240)
Q Consensus        62 -------qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~  134 (240)
                             |||||+..+.||+||++..||+|++ ++  ||++|++|||.+..+.|..... ..+.|.+|..++ +.+.+++
T Consensus       279 ~~~LSeQqLvDC~~~~~GC~GG~~~~A~~yi~-~~--Gi~~e~~YPY~~~~~~C~~~~~-~~~~i~~y~~~~-~~~~l~~  353 (448)
T PTZ00200        279 SVDLSEQELVNCDTKSQGCSGGYPDTALEYVK-NK--GLSSSSDVPYLAKDGKCVVSST-KKVYIDSYLVAK-GKDVLNK  353 (448)
T ss_pred             CeecCHHHHhhccCccCCCCCCcHHHHHHHHh-hc--CccccccCCCCCCCCCCcCCCC-CeeEecceEecC-HHHHHHH
Confidence                   8999998789999999999999998 76  9999999999999999986543 345678887654 3444555


Q ss_pred             HHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcc
Q psy1727         135 YLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMP  213 (240)
Q Consensus       135 ~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~  213 (240)
                      ++ .+|||+|+|++.. |++|++|||++.   |..   .++|||+|||||.+..    +|.+|||||||||++|||+|  
T Consensus       354 ~l-~~GPV~v~i~~~~~f~~Yk~GIy~~~---C~~---~~nHaV~lVGyG~d~~----~g~~YWIIkNSWG~~WGe~G--  420 (448)
T PTZ00200        354 SL-VISPTVVYIAVSRELLKYKSGVYNGE---CGK---SLNHAVLLVGEGYDEK----TKKRYWIIKNSWGTDWGENG--  420 (448)
T ss_pred             HH-hcCCEEEEeecccccccCCCCccccc---cCC---CCcEEEEEEEecccCC----CCCceEEEEcCCCCCcccCe--
Confidence            54 6899999999975 999999999764   874   4899999999996421    57899999999999999999  


Q ss_pred             eEEEECCC--CCccccceeEEecCC
Q psy1727         214 FWIIKNSW--GPRWGEQVTKSIYSS  236 (240)
Q Consensus       214 y~~i~~~~--~~~~g~~~~~~~~~~  236 (240)
                      ||||+|+.  .+.||+.. .+.||.
T Consensus       421 Y~ri~r~~~g~n~CGI~~-~~~~P~  444 (448)
T PTZ00200        421 YMRLERTNEGTDKCGILT-VGLTPV  444 (448)
T ss_pred             eEEEEeCCCCCCcCCccc-cceeeE
Confidence            99999974  46799874 445553


No 5  
>KOG1543|consensus
Probab=100.00  E-value=3.7e-44  Score=313.64  Aligned_cols=152  Identities=40%  Similarity=0.689  Sum_probs=137.9

Q ss_pred             eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccC-CcccccccCCCcccCCcccceeEEeEeeeechhHHHHHHHHHhC
Q psy1727          62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEG-EKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKN  139 (240)
Q Consensus        62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~-e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~  139 (240)
                      |||||+. ++.||+||++..||+|++ ++  |+++ +.+|||.+..+.|..+.....+.+.++..++.++.+|+.+|+.+
T Consensus       162 ~lvdC~~~~~~GC~GG~~~~A~~yi~-~~--G~~t~~~~Ypy~~~~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~  238 (325)
T KOG1543|consen  162 DLVDCCGECGDGCNGGEPKNAFKYIK-KN--GGVTECENYPYIGKDGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKN  238 (325)
T ss_pred             hhhhccCCCCCCcCCCCHHHHHHHHH-Hh--CCCCCCcCCCCcCCCCCccCCCccceeEeeeeeecCcCHHHHHHHHHhc
Confidence            9999998 589999999999999999 87  6666 99999999999999888777888899999998999999999999


Q ss_pred             CCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEE
Q psy1727         140 GPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIK  218 (240)
Q Consensus       140 GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~  218 (240)
                      |||+|+|++.. |++|++|||.++  .|..  ..++|||+|||||+.      ++.+|||||||||+.|||+|  ||||.
T Consensus       239 GPv~v~~~a~~~F~~Y~~GVy~~~--~~~~--~~~~Hav~iVGyG~~------~~~~YWivkNSWG~~WGe~G--y~ri~  306 (325)
T KOG1543|consen  239 GPVSVAIDAYEDFSLYKGGVYAEE--KGDD--KEGDHAVLIVGYGTG------DGVDYWIVKNSWGTDWGEKG--YFRIA  306 (325)
T ss_pred             CCeEEEEeehhhhhhccCceEeCC--CCCC--CCCCceEEEEEEcCC------CCceeEEEEcCCCCCcccCc--eEEEe
Confidence            99999999987 999999999988  5663  159999999999993      78999999999999999999  99999


Q ss_pred             CCCCCccccc
Q psy1727         219 NSWGPRWGEQ  228 (240)
Q Consensus       219 ~~~~~~~g~~  228 (240)
                      |+.+.++++.
T Consensus       307 r~~~~~~I~~  316 (325)
T KOG1543|consen  307 RGVNKCGIAS  316 (325)
T ss_pred             cCCCchhhhc
Confidence            9998777443


No 6  
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=100.00  E-value=1.1e-40  Score=282.18  Aligned_cols=164  Identities=31%  Similarity=0.620  Sum_probs=129.9

Q ss_pred             eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCccccccc-CCCcccCCc-ccceeEEeEeeee-----chhHHHHHH
Q psy1727          62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKG-SNRACHLNK-EEIRVKIQSYVNV-----SSDETEMAK  134 (240)
Q Consensus        62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~-~~~~C~~~~-~~~~~~i~~~~~v-----~~~~~~i~~  134 (240)
                      ||+||+..+.||+||++..|++|++ +.  ||++|++|||+. ..+.|.... ....+++..|..+     ..++++|++
T Consensus        62 ~l~dC~~~~~GC~GG~~~~a~~~~~-~~--Gi~~e~~yPY~~~~~~~C~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ik~  138 (243)
T cd02621          62 HVLSCSQYSQGCDGGFPFLVGKFAE-DF--GIVTEDYFPYTADDDRPCKASPSECRRYYFSDYNYVGGCYGCTNEDEMKW  138 (243)
T ss_pred             HhhhhcCCCCCCCCCCHHHHHHHHH-hc--CcCCCceeCCCCCCCCCCCCCccccccccccceeEcccccccCCHHHHHH
Confidence            8999998788999999999999998 77  999999999998 678898654 3333444444443     236889999


Q ss_pred             HHHhCCCeEEEEeccc-cccccCCcccCCC--cccCCC------CCCCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727         135 YLVKNGPMAVAINANA-MQFYFGGVSHPLK--FLCKGG------MDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP  205 (240)
Q Consensus       135 ~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~--~~c~~~------~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~  205 (240)
                      +|+++|||+++|++.. |++|++|||+...  ..|...      ...++|+|+|||||++..    ++.+|||||||||+
T Consensus       139 ~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~----~g~~YWiirNSWG~  214 (243)
T cd02621         139 EIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEI----KGEKYWIVKNSWGS  214 (243)
T ss_pred             HHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCC----CCCcEEEEEcCCCC
Confidence            9999999999999975 9999999998751  126531      024799999999998720    27899999999999


Q ss_pred             CcCCCCcceEEEECCCCCcccc-ceeEEecC
Q psy1727         206 HWGEKTMPFWIIKNSWGPRWGE-QVTKSIYS  235 (240)
Q Consensus       206 ~WG~~G~~y~~i~~~~~~~~g~-~~~~~~~~  235 (240)
                      .|||+|  ||||+|+.| .||+ ......+|
T Consensus       215 ~WGe~G--y~~i~~~~~-~cgi~~~~~~~~~  242 (243)
T cd02621         215 SWGEKG--YFKIRRGTN-ECGIESQAVFAYP  242 (243)
T ss_pred             CCCcCC--eEEEecCCc-ccCcccceEeecc
Confidence            999999  999999975 5566 44444444


No 7  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=100.00  E-value=1.2e-40  Score=281.10  Aligned_cols=157  Identities=27%  Similarity=0.482  Sum_probs=131.2

Q ss_pred             eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCC---------------cccceeEEeEeeeec
Q psy1727          62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN---------------KEEIRVKIQSYVNVS  126 (240)
Q Consensus        62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~---------------~~~~~~~i~~~~~v~  126 (240)
                      |||||+. +.||+||++..||+|++ ++  |+++|++|||...++.|...               +....+++.+|..++
T Consensus        61 ~lldC~~-~~gC~GG~~~~a~~~~~-~~--Gl~~e~~yPY~~~~~~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~~~~~~  136 (239)
T cd02698          61 VVIDCAG-GGSCHGGDPGGVYEYAH-KH--GIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVS  136 (239)
T ss_pred             HHHhCCC-CCCccCcCHHHHHHHHH-Hc--CcCCCCeeCCcCCCCCCcCCCCCCCcccCcccccccccceEEeeeceecC
Confidence            8999998 78999999999999998 77  99999999999877666521               112346677777775


Q ss_pred             hhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727         127 SDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP  205 (240)
Q Consensus       127 ~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~  205 (240)
                       ++++|+++|+++|||+|+|.+.. |+.|++|||+..  .|.   ..++|+|+|||||++.     ++++|||||||||+
T Consensus       137 -~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~~~--~~~---~~~~HaV~IVGyG~~~-----~g~~YWiikNSWG~  205 (239)
T cd02698         137 -GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEY--VQD---PLINHIISVAGWGVDE-----NGVEYWIVRNSWGE  205 (239)
T ss_pred             -CHHHHHHHHHHcCCEEEEEEecccccccCCeEEccC--CCC---CcCCeEEEEEEEEecC-----CCCEEEEEEcCCCc
Confidence             57889999999999999999975 999999999876  454   5689999999999872     38899999999999


Q ss_pred             CcCCCCcceEEEECCC-----CCccccceeEEecC
Q psy1727         206 HWGEKTMPFWIIKNSW-----GPRWGEQVTKSIYS  235 (240)
Q Consensus       206 ~WG~~G~~y~~i~~~~-----~~~~g~~~~~~~~~  235 (240)
                      +|||+|  ||||+|+.     |++++|...+...|
T Consensus       206 ~WGe~G--y~~i~rg~~~~~~~~~~i~~~~~~~~~  238 (239)
T cd02698         206 PWGERG--WFRIVTSSYKGARYNLAIEEDCAWADP  238 (239)
T ss_pred             ccCcCc--eEEEEccCCcccccccccccceEEEee
Confidence            999999  99999998     88877764444433


No 8  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=100.00  E-value=1.2e-39  Score=274.50  Aligned_cols=152  Identities=30%  Similarity=0.536  Sum_probs=125.1

Q ss_pred             eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCc------------------ccCCcc----cceeE
Q psy1727          62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRA------------------CHLNKE----EIRVK  118 (240)
Q Consensus        62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~------------------C~~~~~----~~~~~  118 (240)
                      |||||+.. +.||+||++..||+|++ ++  |+++|++|||...+..                  |.....    ...++
T Consensus        57 ~lidC~~~~~~gC~GG~~~~a~~~i~-~~--G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~  133 (236)
T cd02620          57 DLLSCCSGCGDGCNGGYPDAAWKYLT-TT--GVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHK  133 (236)
T ss_pred             HHHhhcCCCCCCCCCCCHHHHHHHHH-hc--CCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeee
Confidence            99999985 88999999999999998 77  9999999999876532                  432211    12234


Q ss_pred             EeEeeeechhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEE
Q psy1727         119 IQSYVNVSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYW  197 (240)
Q Consensus       119 i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yW  197 (240)
                      +..+..+..++++|+++|+++|||+|+|.+.. |++|++|||+..   |..  ..++|||+|||||++      ++++||
T Consensus       134 ~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~~~---~~~--~~~~HaV~iVGyg~~------~g~~YW  202 (236)
T cd02620         134 GKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHT---SGK--QLGGHAVKIIGWGVE------NGVPYW  202 (236)
T ss_pred             ecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEeec---CCC--CcCCeEEEEEEEecc------CCeeEE
Confidence            45555565578999999999999999999965 999999999754   543  457999999999987      688999


Q ss_pred             EEecccCCCcCCCCcceEEEECCCCCcccccee
Q psy1727         198 IIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVT  230 (240)
Q Consensus       198 ivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~  230 (240)
                      |||||||+.|||+|  ||||+|+. +.||+...
T Consensus       203 ivrNSWG~~WGe~G--y~ri~~~~-~~cgi~~~  232 (236)
T cd02620         203 LAANSWGTDWGENG--YFRILRGS-NECGIESE  232 (236)
T ss_pred             EEEeCCCCCCCCCc--EEEEEccC-cccccccc
Confidence            99999999999999  99999997 57787743


No 9  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=100.00  E-value=6.3e-38  Score=259.20  Aligned_cols=151  Identities=43%  Similarity=0.823  Sum_probs=135.1

Q ss_pred             eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeech-hHHHHHHHHHhC
Q psy1727          62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSS-DETEMAKYLVKN  139 (240)
Q Consensus        62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~-~~~~i~~~l~~~  139 (240)
                      +|+||+.. +.+|.||.+..||+++. +.  |+++|++|||......|........+++.+|..++. ++++||++|+++
T Consensus        51 ~l~~c~~~~~~gC~GG~~~~a~~~~~-~~--Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~i~~~~~~~ik~~l~~~  127 (210)
T cd02248          51 QLVDCSTSGNNGCNGGNPDNAFEYVK-NG--GLASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANY  127 (210)
T ss_pred             HHhccCCCCCCCCCCCCHHHhHHHHH-HC--CcCccccCCccCCCCCccCCCCcccEEEeeEEEcCCCcHHHHHHHHhhc
Confidence            89999985 78999999999999998 76  999999999999888998776667889999998876 488999999999


Q ss_pred             CCeEEEEeccc-cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEE
Q psy1727         140 GPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIK  218 (240)
Q Consensus       140 GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~  218 (240)
                      |||++++.+.. |+.|++|||..+  .|..  ..++|||+|||||++      .+.+|||||||||+.||++|  ||||+
T Consensus       128 gPV~~~~~~~~~f~~y~~Giy~~~--~~~~--~~~~Hav~iVGy~~~------~~~~ywiv~NSWG~~WG~~G--y~~i~  195 (210)
T cd02248         128 GPVSVAIDASSSFQFYKGGIYSGP--CCSN--TNLNHAVLLVGYGTE------NGVDYWIVKNSWGTSWGEKG--YIRIA  195 (210)
T ss_pred             CCEEEEEecCcccccCCCCceeCC--CCCC--CcCCEEEEEEEEeec------CCceEEEEEcCCCCccccCc--EEEEE
Confidence            99999999975 999999999876  5532  568999999999998      57899999999999999999  99999


Q ss_pred             CCCCCccccc
Q psy1727         219 NSWGPRWGEQ  228 (240)
Q Consensus       219 ~~~~~~~g~~  228 (240)
                      ++. +.||+.
T Consensus       196 ~~~-~~cgi~  204 (210)
T cd02248         196 RGS-NLCGIA  204 (210)
T ss_pred             cCC-CccCce
Confidence            987 578887


No 10 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=100.00  E-value=4.4e-38  Score=291.83  Aligned_cols=166  Identities=23%  Similarity=0.469  Sum_probs=128.9

Q ss_pred             eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCccc---------------------------
Q psy1727          62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEE---------------------------  114 (240)
Q Consensus        62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~---------------------------  114 (240)
                      |||||+..++||+||++..|++|++ +.  ||++|.+|||++..+.|......                           
T Consensus       446 qLLDCs~~nqGC~GG~~~~A~kya~-~~--GI~tEscYPY~a~~g~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~  522 (693)
T PTZ00049        446 TVLSCSFYDQGCNGGFPYLVSKMAK-LQ--GIPLDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSETQSDM  522 (693)
T ss_pred             HhcccCCCCCCcCCCcHHHHHHHHH-HC--CCCcCCccCCcCCCCCCCCCCCCccccccccccccccccccccccccccc
Confidence            8999998889999999999999998 77  99999999999888888642111                           


Q ss_pred             ------------ceeEEeEeeeec--------hhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCC----cccCCC
Q psy1727         115 ------------IRVKIQSYVNVS--------SDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLK----FLCKGG  169 (240)
Q Consensus       115 ------------~~~~i~~~~~v~--------~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~----~~c~~~  169 (240)
                                  ..+.+.+|..+.        .++++|+++|+++|||+|+|++.. |++|++|||+...    ..|..+
T Consensus       523 ~~~~~~~~~~~~~r~y~k~y~yI~g~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d  602 (693)
T PTZ00049        523 HADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVD  602 (693)
T ss_pred             cccccccccccccceeeeeeEEecccccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCc
Confidence                        123345555553        258899999999999999999975 9999999998531    137532


Q ss_pred             ------------CCCCCccEEEEEEeecccccccCC--cCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727         170 ------------MDNLDHGVLIVGYGVHKTKFTHKI--QPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS  235 (240)
Q Consensus       170 ------------~~~~~Hav~iVGyg~~~~~~~~~g--~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~  235 (240)
                                  ...++|||+|||||.+..    +|  .+|||||||||+.||++|  ||||+||.|.|.++.....+.|
T Consensus       603 ~~~~~~~~~~~G~e~~NHAVlIVGwG~d~e----nG~~~~YWIVRNSWGt~WGenG--YfKI~RG~N~CGIEs~a~~~~p  676 (693)
T PTZ00049        603 LPKHNGVYNITGWEKVNHAIVLVGWGEEEI----NGKLYKYWIGRNSWGKNWGKEG--YFKIIRGKNFSGIESQSLFIEP  676 (693)
T ss_pred             cccccccccccccccCceEEEEEEeccccC----CCcccCEEEEECCCCCCcccCc--eEEEEcCCCccCCccceeEEee
Confidence                        124799999999997521    34  489999999999999999  9999999865544454444444


Q ss_pred             C
Q psy1727         236 S  236 (240)
Q Consensus       236 ~  236 (240)
                      +
T Consensus       677 d  677 (693)
T PTZ00049        677 D  677 (693)
T ss_pred             e
Confidence            3


No 11 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=100.00  E-value=5.8e-38  Score=287.77  Aligned_cols=164  Identities=21%  Similarity=0.464  Sum_probs=131.0

Q ss_pred             eEecCCCCCCCCCCCccchHHHHHHHhcCCCccCCccc--ccccCCC---cccCCcccceeEEeE------eeeechhHH
Q psy1727          62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY--PYKGSNR---ACHLNKEEIRVKIQS------YVNVSSDET  130 (240)
Q Consensus        62 qlvDCs~~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~y--PY~~~~~---~C~~~~~~~~~~i~~------~~~v~~~~~  130 (240)
                      |||||+..+.||+||++..|++|++ ++  ||++|++|  ||++..+   .|+.......+.+.+      |..+..+++
T Consensus       268 qLVDCs~~n~GCdGG~p~~A~~yi~-~~--GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~  344 (548)
T PTZ00364        268 HVLDCSQYGQGCAGGFPEEVGKFAE-TF--GILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPD  344 (548)
T ss_pred             HHhcccCCCCCCCCCcHHHHHHHHH-hC--CcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHH
Confidence            8999998889999999999999998 77  99999999  9987665   587654444444444      333344688


Q ss_pred             HHHHHHHhCCCeEEEEeccc-cccccCCcccCC------CcccCCC--------CCCCCccEEEEEEeecccccccCCcC
Q psy1727         131 EMAKYLVKNGPMAVAINANA-MQFYFGGVSHPL------KFLCKGG--------MDNLDHGVLIVGYGVHKTKFTHKIQP  195 (240)
Q Consensus       131 ~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~------~~~c~~~--------~~~~~Hav~iVGyg~~~~~~~~~g~~  195 (240)
                      +|+++|+++|||+|+|++.. |+.|++|||...      ...|...        ...++|||+|||||.+.     +|.+
T Consensus       345 ~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de-----~G~~  419 (548)
T PTZ00364        345 EIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDE-----NGGD  419 (548)
T ss_pred             HHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccC-----CCce
Confidence            99999999999999999975 889999998621      0012110        13579999999999753     6889


Q ss_pred             EEEEecccCC--CcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727         196 YWIIKNSWGP--HWGEKTMPFWIIKNSWGPRWGEQVTKSIYS  235 (240)
Q Consensus       196 yWivkNSWG~--~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~  235 (240)
                      |||||||||+  +|||+|  ||||+||.|+|.++...+.++|
T Consensus       420 YWIVKNSWGt~~~WGE~G--YfRI~RG~N~CGIes~~v~~~~  459 (548)
T PTZ00364        420 YWLVLDPWGSRRSWCDGG--TRKIARGVNAYNIESEVVVMYW  459 (548)
T ss_pred             EEEEECCCCCCCCcccCC--eEEEEcCCCcccccceeeeeee
Confidence            9999999999  999999  9999999987777777777776


No 12 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=100.00  E-value=1.1e-36  Score=252.40  Aligned_cols=159  Identities=34%  Similarity=0.719  Sum_probs=136.1

Q ss_pred             eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccCCcccccccCC-CcccCCcccc-eeEEeEeeeechh-HHHHHHHHH
Q psy1727          62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEI-RVKIQSYVNVSSD-ETEMAKYLV  137 (240)
Q Consensus        62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~-~~C~~~~~~~-~~~i~~~~~v~~~-~~~i~~~l~  137 (240)
                      +|+||+. .+.+|+||++..|+++++ ++ .|+++|.+|||.... ..|....... .+++..|..+..+ .++|+++|.
T Consensus        54 ~l~~~~~~~~~~c~gg~~~~a~~~~~-~~-~Gi~~e~~~pY~~~~~~~c~~~~~~~~~~~i~~~~~~~~~~~~~ik~~L~  131 (219)
T PF00112_consen   54 YLIDCSNKYNKGCDGGSPFDALKYIK-NN-NGIVTEEDYPYNGNENPTCKSKKSNSYYVKIKGYGKVKDNDIEDIKKALM  131 (219)
T ss_dssp             HHHHHSTGTSSTTBBBEHHHHHHHHH-HH-TSBEBTTTS--SSSSSCSSCHSGGGEEEBEESEEEEEESTCHHHHHHHHH
T ss_pred             cccccccccccccccCcccccceeec-cc-CcccccccccccccccccccccccccccccccccccccccchhHHHHHHh
Confidence            7999998 678999999999999999 82 399999999999887 7898765543 4788888888764 899999999


Q ss_pred             hCCCeEEEEeccc--cccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceE
Q psy1727         138 KNGPMAVAINANA--MQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFW  215 (240)
Q Consensus       138 ~~GPv~v~~~~~~--f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~  215 (240)
                      ++|||++++.+..  |+.|++|||..+  .|..  ..++|+|+|||||++      .+++|||||||||+.||++|  ||
T Consensus       132 ~~gpV~~~~~~~~~~f~~~~~gi~~~~--~~~~--~~~~Hav~iVGy~~~------~~~~~wiv~NSWG~~WG~~G--y~  199 (219)
T PF00112_consen  132 KYGPVVASIDVSSEDFQNYKSGIYDPP--DCSN--ESGGHAVLIVGYDDE------NGKGYWIVKNSWGTDWGDNG--YF  199 (219)
T ss_dssp             HHSSEEEEEEEESHHHHTEESSEECST--SSSS--SSEEEEEEEEEEEEE------TTEEEEEEE-SBTTTSTBTT--EE
T ss_pred             hCceeeeeeeccccccccccceeeecc--cccc--ccccccccccccccc------cceeeEeeehhhCCccCCCe--EE
Confidence            9999999999865  999999999987  6775  578999999999998      68999999999999999999  99


Q ss_pred             EEECCCCCccccceeEEecC
Q psy1727         216 IIKNSWGPRWGEQVTKSIYS  235 (240)
Q Consensus       216 ~i~~~~~~~~g~~~~~~~~~  235 (240)
                      ||+|+.++.||+.. .++||
T Consensus       200 ~i~~~~~~~c~i~~-~~~~~  218 (219)
T PF00112_consen  200 RISYDYNNECGIES-QAVYP  218 (219)
T ss_dssp             EEESSSSSGGGTTS-SEEEE
T ss_pred             EEeeCCCCcCccCc-eeeec
Confidence            99999988999984 44444


No 13 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=100.00  E-value=5.9e-34  Score=271.24  Aligned_cols=190  Identities=25%  Similarity=0.470  Sum_probs=136.7

Q ss_pred             cCCHHHHHHHHHHHHHHHHHHHhcCCc---------ceece-----------------eEecCCC--CCCCCCCCccch-
Q psy1727          30 YATKEEYHKRLRIFRANLKKIQIRGEG---------THLAL-----------------KLVDCDK--VDAGCGGGLMSN-   80 (240)
Q Consensus        30 Y~~~~E~~~R~~iF~~n~~~i~~~n~g---------~~g~l-----------------qlvDCs~--~~~gC~GG~~~~-   80 (240)
                      |-+ .|+-.|++-...+...|...++|         ++++|                 ||+||+.  .+.||.||+... 
T Consensus       526 ~Cn-~~yCdR~kD~~sC~s~i~VKDQG~CGSCWAFASaaaLES~~cIkgg~~v~LSeQqLVDCs~~~gn~GC~GG~~~~e  604 (1004)
T PTZ00462        526 FCN-KEFCNRLKDENNCISKIQIEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLE  604 (1004)
T ss_pred             CCC-hhhccccccCCCCCCCCCcccCCcchHHHHHHHHHHHHHHHHHhcCCCcccCHHHHHhcccccCCCCCCCCCcHHH
Confidence            543 57888887655555555555664         11111                 9999986  368999998554 


Q ss_pred             HHHHHHHhcCCCccCCccccccc--CCCcccCCcc------------------cceeEEeEeeeechh---------HHH
Q psy1727          81 AFETIISKLGGGLEGEKDYPYKG--SNRACHLNKE------------------EIRVKIQSYVNVSSD---------ETE  131 (240)
Q Consensus        81 A~~yi~~~~~~Gi~~e~~yPY~~--~~~~C~~~~~------------------~~~~~i~~~~~v~~~---------~~~  131 (240)
                      ++.|+. ++ +||++|++|||..  ..+.|+....                  ...+.+.+|..+...         +++
T Consensus       605 fl~yI~-e~-GgLptESdYPYt~k~~~g~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kgY~~~~s~~~~~n~d~~i~~  682 (1004)
T PTZ00462        605 FLQIIE-DN-GFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYRAYESEHFHDKMDAFIKI  682 (1004)
T ss_pred             HHHHHH-Hc-CCCcccccCCCccCCCCCCCCCCcccccccccccccccccccccceeeccceEEecccccccchhhHHHH
Confidence            558887 76 5699999999975  4567874321                  012334456555321         468


Q ss_pred             HHHHHHhCCCeEEEEeccccccc-cCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCC
Q psy1727         132 MAKYLVKNGPMAVAINANAMQFY-FGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK  210 (240)
Q Consensus       132 i~~~l~~~GPv~v~~~~~~f~~Y-~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~  210 (240)
                      |+++|++.|||+|+|++..|+.| ++|||...  .|+.  ..++|||+|||||.+.. ....+++|||||||||+.|||+
T Consensus       683 IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv~~--~Cgs--~~~nHAVlIVGYGt~in-~eg~gk~YWIVRNSWGt~WGEn  757 (1004)
T PTZ00462        683 IKDEIMNKGSVIAYIKAENVLGYEFNGKKVQN--LCGD--DTADHAVNIVGYGNYIN-DEDEKKSYWIVRNSWGKYWGDE  757 (1004)
T ss_pred             HHHHHHhcCCEEEEEEeehHHhhhcCCccccC--CCCC--CcCCceEEEEEeccccc-ccCCCCceEEEEcCCCCCcCCC
Confidence            99999999999999998777788 58998765  6985  45799999999997520 0013679999999999999999


Q ss_pred             CcceEEEECCCCCccccce
Q psy1727         211 TMPFWIIKNSWGPRWGEQV  229 (240)
Q Consensus       211 G~~y~~i~~~~~~~~g~~~  229 (240)
                      |  ||||+|...+.||+..
T Consensus       758 G--YFKI~r~g~n~CGin~  774 (1004)
T PTZ00462        758 G--YFKVDMYGPSHCEDNF  774 (1004)
T ss_pred             e--EEEEEeCCCCCCccch
Confidence            9  9999985456788753


No 14 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.97  E-value=1.2e-31  Score=215.95  Aligned_cols=120  Identities=47%  Similarity=0.870  Sum_probs=100.2

Q ss_pred             eEecCCCC-CCCCCCCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeechhHHHHHHHHHhCC
Q psy1727          62 KLVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNG  140 (240)
Q Consensus        62 qlvDCs~~-~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~~~~~~i~~~l~~~G  140 (240)
                      +|+||+.. +.||+||++..|++|++ ++ .|+++|++|||+.                                     
T Consensus        52 ~l~~C~~~~~~gC~GG~~~~a~~~~~-~~-~Gi~~e~~~PY~~-------------------------------------   92 (174)
T smart00645       52 QLVDCSTGGNNGCNGGLPDNAFEYIK-KN-GGLETESCYPYTG-------------------------------------   92 (174)
T ss_pred             HHhhhcCCCCCCCCCcCHHHHHHHHH-Hc-CCcccccccCccc-------------------------------------
Confidence            89999874 67999999999999998 64 5999999999975                                     


Q ss_pred             CeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCcCCCCcceEEEECC
Q psy1727         141 PMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNS  220 (240)
Q Consensus       141 Pv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~  220 (240)
                        ++.+.+..|++|++|||+..  .|..  ..++|+|+|||||.+.     ++++|||||||||+.||++|  ||||+|+
T Consensus        93 --~~~~~~~~f~~Y~~Gi~~~~--~~~~--~~~~Hav~ivGyg~~~-----~g~~yWii~NSwG~~WG~~G--~~~i~~~  159 (174)
T smart00645       93 --SVAIDASDFQFYKSGIYDHP--GCGS--GTLDHAVLIVGYGTEE-----NGKDYWIVKNSWGTDWGENG--YFRIARG  159 (174)
T ss_pred             --EEEEEcccccCCcCeEECCC--CCCC--CcccEEEEEEEEeecC-----CCeeEEEEECCCCCCcccCe--EEEEEcC
Confidence              45555556999999999875  5764  3479999999999752     57899999999999999999  9999999


Q ss_pred             CCCccccceeEEe
Q psy1727         221 WGPRWGEQVTKSI  233 (240)
Q Consensus       221 ~~~~~g~~~~~~~  233 (240)
                      ..+.||+......
T Consensus       160 ~~~~c~i~~~~~~  172 (174)
T smart00645      160 KNNECGIEASVAS  172 (174)
T ss_pred             CCCccCceeeeee
Confidence            7456777544433


No 15 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.97  E-value=3.3e-31  Score=220.03  Aligned_cols=149  Identities=26%  Similarity=0.438  Sum_probs=121.2

Q ss_pred             eEecCCCC-----CCCCCCCccchHHH-HHHHhcCCCccCCcccccccCCCcccC----CcccceeEEeEeeeech-hHH
Q psy1727          62 KLVDCDKV-----DAGCGGGLMSNAFE-TIISKLGGGLEGEKDYPYKGSNRACHL----NKEEIRVKIQSYVNVSS-DET  130 (240)
Q Consensus        62 qlvDCs~~-----~~gC~GG~~~~A~~-yi~~~~~~Gi~~e~~yPY~~~~~~C~~----~~~~~~~~i~~~~~v~~-~~~  130 (240)
                      +|+||...     ..+|.||.+..++. +++ ++  ||++|.+|||......|..    ......+++..|..+.. +++
T Consensus        50 ~l~~c~~~~~~~~~~~c~gG~~~~~~~~~~~-~~--Gi~~e~~~Py~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~  126 (223)
T cd02619          50 YLYICANDECLGINGSCDGGGPLSALLKLVA-LK--GIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIE  126 (223)
T ss_pred             HHHHhccccccccCCCCCCCcHHHHHHHHHH-Hc--CCCccccCCCCCCCCCCCCCCccchhhcceeecceeEeCchhHH
Confidence            89999873     36999999999998 777 66  9999999999988766653    23345577888888765 478


Q ss_pred             HHHHHHHhCCCeEEEEeccc-cccccCCccc---CCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCC
Q psy1727         131 EMAKYLVKNGPMAVAINANA-MQFYFGGVSH---PLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPH  206 (240)
Q Consensus       131 ~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~---~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~  206 (240)
                      +||++|.++|||++++.+.. |..|++|++.   .....|..  ..++|||+|||||++..    .+++|||||||||+.
T Consensus       127 ~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~Hav~ivGy~~~~~----~~~~~~i~~NSwG~~  200 (223)
T cd02619         127 DIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDG--DLGGHAVVIVGYDDNYV----EGKGAFIVKNSWGTD  200 (223)
T ss_pred             HHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCC--ccCCeEEEEEeecCCCC----CCCCEEEEEeCCCCc
Confidence            99999999999999999876 8899999873   11112333  56899999999998721    278999999999999


Q ss_pred             cCCCCcceEEEECCC
Q psy1727         207 WGEKTMPFWIIKNSW  221 (240)
Q Consensus       207 WG~~G~~y~~i~~~~  221 (240)
                      ||++|  |+||+++.
T Consensus       201 wg~~G--y~~i~~~~  213 (223)
T cd02619         201 WGDNG--YGRISYED  213 (223)
T ss_pred             cccCC--EEEEehhh
Confidence            99999  99999985


No 16 
>KOG1544|consensus
Probab=99.96  E-value=5.3e-30  Score=216.20  Aligned_cols=162  Identities=27%  Similarity=0.521  Sum_probs=128.3

Q ss_pred             eEecCCC-CCCCCCCCccchHHHHHHHhcCCCccCCcccccccCC----CcccCC--------------------cccce
Q psy1727          62 KLVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN----RACHLN--------------------KEEIR  116 (240)
Q Consensus        62 qlvDCs~-~~~gC~GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~----~~C~~~--------------------~~~~~  116 (240)
                      ||++|.. ...||+||..+.|+=|+. +.  |++...+|||.+.+    +.|...                    ..+..
T Consensus       264 nLlSC~~h~q~GC~gG~lDRAWWYlR-Kr--GvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~i  340 (470)
T KOG1544|consen  264 NLLSCDTHQQQGCRGGRLDRAWWYLR-KR--GVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDI  340 (470)
T ss_pred             HhcchhhhhhccCccCcccchheeee-cc--cccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCce
Confidence            8999998 468999999999999998 77  99999999997533    445321                    12234


Q ss_pred             eEEeEeeeechhHHHHHHHHHhCCCeEEEEeccc-cccccCCcccCCCcccCCC---CCCCCccEEEEEEeecccccccC
Q psy1727         117 VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGG---MDNLDHGVLIVGYGVHKTKFTHK  192 (240)
Q Consensus       117 ~~i~~~~~v~~~~~~i~~~l~~~GPv~v~~~~~~-f~~Y~~Gv~~~~~~~c~~~---~~~~~Hav~iVGyg~~~~~~~~~  192 (240)
                      +++.....|+++|++|+++|+++|||.+.|.+.. |..|++|||.+.+......   ...+.|+|.|.|||++.. ..+.
T Consensus       341 yq~tPPYrVSSnE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~-~~G~  419 (470)
T KOG1544|consen  341 YQVTPPYRVSSNEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETL-PDGR  419 (470)
T ss_pred             eeecCCeeccCCHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccC-CCCC
Confidence            5566666677789999999999999999998866 8899999998763221110   124789999999998742 1235


Q ss_pred             CcCEEEEecccCCCcCCCCcceEEEECCCCCccccce
Q psy1727         193 IQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQV  229 (240)
Q Consensus       193 g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~  229 (240)
                      ..+|||..||||+.|||+|  ||||.||.|++-+|+-
T Consensus       420 ~~KyW~aANSWG~~WGE~G--YFriLRGvNecdIEsf  454 (470)
T KOG1544|consen  420 TLKYWTAANSWGPAWGERG--YFRILRGVNECDIESF  454 (470)
T ss_pred             eeEEEEeecccccccccCc--eEEEeccccchhhhHh
Confidence            6789999999999999999  9999999998877763


No 17 
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.72  E-value=1.1e-17  Score=150.98  Aligned_cols=95  Identities=19%  Similarity=0.383  Sum_probs=73.6

Q ss_pred             EeEeeeechh--HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCc------------------ccCCCCCCCCccEE
Q psy1727         119 IQSYVNVSSD--ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKF------------------LCKGGMDNLDHGVL  178 (240)
Q Consensus       119 i~~~~~v~~~--~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~------------------~c~~~~~~~~Hav~  178 (240)
                      ...|.+++.+  ...+.++|..++||.++.++..|..|++||++....                  .|..  ...+|||+
T Consensus       286 ~~~y~Nvp~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~e--s~~tHAM~  363 (437)
T cd00585         286 PILYLNVPMDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGE--SLMTHAMV  363 (437)
T ss_pred             cceEEecCHHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcC--CcCCeEEE
Confidence            3466677654  233446888889999999998888999999965311                  1222  45789999


Q ss_pred             EEEEeecccccccCCc-CEEEEecccCCCcCCCCcceEEEECCCC
Q psy1727         179 IVGYGVHKTKFTHKIQ-PYWIIKNSWGPHWGEKTMPFWIIKNSWG  222 (240)
Q Consensus       179 iVGyg~~~~~~~~~g~-~yWivkNSWG~~WG~~G~~y~~i~~~~~  222 (240)
                      ||||+.+.     +|. .||+||||||+.||++|  ||+|++++-
T Consensus       364 ivGv~~D~-----~g~p~yw~VkNSWG~~~G~~G--y~~ms~~wf  401 (437)
T cd00585         364 LTGVDLDE-----DGKPVKWKVENSWGEKVGKKG--YFVMSDDWF  401 (437)
T ss_pred             EEEEEecC-----CCCcceEEEEcccCCCCCCCc--ceehhHHHH
Confidence            99999863     465 69999999999999999  999998863


No 18 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.58  E-value=2.9e-15  Score=129.65  Aligned_cols=136  Identities=25%  Similarity=0.344  Sum_probs=84.1

Q ss_pred             CCccchHHHHHHHhcCCCccCCcccccccCCCcccCCcccceeEEeEeeeec-----hhHHHHHHHHHhCCCeEEE--Ee
Q psy1727          75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSYVNVS-----SDETEMAKYLVKNGPMAVA--IN  147 (240)
Q Consensus        75 GG~~~~A~~yi~~~~~~Gi~~e~~yPY~~~~~~C~~~~~~~~~~i~~~~~v~-----~~~~~i~~~l~~~GPv~v~--~~  147 (240)
                      ||....+..|+. +. .|-+.+.+-||....-.|..... ...++..-..++     .++..|++++...|-++..  |+
T Consensus       168 ~g~~~m~~a~l~-e~-sgpv~et~d~y~~~s~~~~~~~p-~~k~~~~~~~i~~~~~~LdnG~i~~~~~~yg~~s~~~~id  244 (372)
T COG4870         168 GGNADMSAAYLT-EW-SGPVYETDDPYSENSYFSPTNLP-VTKHVQEAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYID  244 (372)
T ss_pred             CCcccccccccc-cc-CCcchhhcCccccccccCCcCCc-hhhccccceecccchhhhcccchHHHHhhhccccceeEEe
Confidence            566665666776 65 78888888888776655543111 111122222222     2355577777777776644  44


Q ss_pred             ccccccccCCcccCCCcccCCCCCCCCccEEEEEEeeccc----ccccCCcCEEEEecccCCCcCCCCcceEEEECCC
Q psy1727         148 ANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKT----KFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSW  221 (240)
Q Consensus       148 ~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~----~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~~~  221 (240)
                      +..+....-+.+...    +.  ...+|||+||||++...    .....|...||||||||+.||++|  ||+|++..
T Consensus       245 ~~~~~~~~~~~~~~~----s~--~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~~G--YfwisY~y  314 (372)
T COG4870         245 ATNSLGICIPYPYVD----SG--ENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENG--YFWISYYY  314 (372)
T ss_pred             cccccccccCCCCCC----cc--ccccceEEEEeccccccccccccCCCCCceEEEECccccccccCc--eEEEEeee
Confidence            444323222333322    11  46899999999998631    122235669999999999999999  99999864


No 19 
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=99.03  E-value=1.3e-09  Score=99.02  Aligned_cols=94  Identities=21%  Similarity=0.428  Sum_probs=64.1

Q ss_pred             Eeeeechh--HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccC----------------CCCCCCCccEEEEEE
Q psy1727         121 SYVNVSSD--ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCK----------------GGMDNLDHGVLIVGY  182 (240)
Q Consensus       121 ~~~~v~~~--~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~----------------~~~~~~~Hav~iVGy  182 (240)
                      .|.++|.+  .+.+.+.|...-||..+-++..+...+.||.+.......                ......+|||+|||.
T Consensus       289 ~ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv  368 (438)
T PF03051_consen  289 RYLNVPIDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGV  368 (438)
T ss_dssp             EEEE--HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEE
T ss_pred             eEeccCHHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEE
Confidence            46677765  566777888888999999998755778898875420000                001345899999999


Q ss_pred             eecccccccCCc-CEEEEecccCCCcCCCCcceEEEECCC
Q psy1727         183 GVHKTKFTHKIQ-PYWIIKNSWGPHWGEKTMPFWIIKNSW  221 (240)
Q Consensus       183 g~~~~~~~~~g~-~yWivkNSWG~~WG~~G~~y~~i~~~~  221 (240)
                      ..+.     +|. .+|+|+||||+..|.+|  ||.|+..+
T Consensus       369 ~~D~-----~g~p~~wkVeNSWG~~~g~kG--y~~msd~w  401 (438)
T PF03051_consen  369 DLDE-----DGKPVRWKVENSWGTDNGDKG--YFYMSDDW  401 (438)
T ss_dssp             EE-T-----TSSEEEEEEE-SBTTTSTBTT--EEEEEHHH
T ss_pred             Eecc-----CCCeeEEEEEcCCCCCCCCCc--EEEECHHH
Confidence            9863     565 59999999999999999  99998764


No 20 
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=98.82  E-value=4.5e-09  Score=69.06  Aligned_cols=36  Identities=39%  Similarity=0.684  Sum_probs=31.7

Q ss_pred             HHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcC
Q psy1727          19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRG   54 (240)
Q Consensus        19 f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n   54 (240)
                      |++|+++|+|.|.+++|+..|++||.+|+++|+.+|
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N   36 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHN   36 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHh
Confidence            899999999999999999999999999999998877


No 21 
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=98.52  E-value=1.2e-07  Score=61.79  Aligned_cols=37  Identities=38%  Similarity=0.624  Sum_probs=34.7

Q ss_pred             HHHHHHHhCcccCCHHHHHHHHHHHHHHHHHHHhcCC
Q psy1727          19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIRGE   55 (240)
Q Consensus        19 f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i~~~n~   55 (240)
                      |++|+.+|+|.|.+.+|...|+.||.+|++.|+.+|.
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~   37 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNK   37 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHh
Confidence            6899999999999999999999999999999988774


No 22 
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=97.57  E-value=7.3e-05  Score=64.88  Aligned_cols=83  Identities=16%  Similarity=0.359  Sum_probs=56.9

Q ss_pred             HHHHHhCCCeEEEEeccccccccCCcccCCCcc-------------cCC-C--CCCCCccEEEEEEeecccccccCCcCE
Q psy1727         133 AKYLVKNGPMAVAINANAMQFYFGGVSHPLKFL-------------CKG-G--MDNLDHGVLIVGYGVHKTKFTHKIQPY  196 (240)
Q Consensus       133 ~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~-------------c~~-~--~~~~~Hav~iVGyg~~~~~~~~~g~~y  196 (240)
                      .+.+..+-+|..+-++..+..-+.||.+..-..             .+. +  .....|||+|.|.+.++.    ...--
T Consensus       305 ~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~----g~p~r  380 (444)
T COG3579         305 IKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDET----GNPLR  380 (444)
T ss_pred             HHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccC----CCcee
Confidence            334445668888888777777788877632100             000 0  022579999999997641    23447


Q ss_pred             EEEecccCCCcCCCCcceEEEECCC
Q psy1727         197 WIIKNSWGPHWGEKTMPFWIIKNSW  221 (240)
Q Consensus       197 WivkNSWG~~WG~~G~~y~~i~~~~  221 (240)
                      |.|.||||.+=|.+|  ||-++..+
T Consensus       381 wkVENSWG~d~G~~G--yfvaSd~w  403 (444)
T COG3579         381 WKVENSWGKDVGKKG--YFVASDAW  403 (444)
T ss_pred             eEeecccccccCCCc--eEeehHhH
Confidence            999999999999999  99987654


No 23 
>PF13529 Peptidase_C39_2:  Peptidase_C39 like family; PDB: 3ERV_A.
Probab=92.05  E-value=0.28  Score=36.76  Aligned_cols=57  Identities=19%  Similarity=0.283  Sum_probs=33.5

Q ss_pred             hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEeccc
Q psy1727         128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSW  203 (240)
Q Consensus       128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSW  203 (240)
                      +.+.|++.|.+..||.+.+....- ...+..+.      .   ....|.|+|+||..+       +  +++|..+|
T Consensus        88 ~~~~i~~~i~~G~Pvi~~~~~~~~-~~~~~~~~------~---~~~~H~vvi~Gy~~~-------~--~~~v~DP~  144 (144)
T PF13529_consen   88 SFDDIKQEIDAGRPVIVSVNSGWR-PPNGDGYD------G---TYGGHYVVIIGYDED-------G--YVYVNDPW  144 (144)
T ss_dssp             -HHHHHHHHHTT--EEEEEETTSS---TTEEEE------E----TTEEEEEEEEE-SS-------E---EEEE-TT
T ss_pred             cHHHHHHHHHCCCcEEEEEEcccc-cCCCCCcC------C---CcCCEEEEEEEEeCC-------C--EEEEeCCC
Confidence            578899999998899999974210 00011111      1   347899999999975       2  78887776


No 24 
>PF14399 Transpep_BrtH:  NlpC/p60-like transpeptidase
Probab=88.92  E-value=1.6  Score=37.90  Aligned_cols=47  Identities=21%  Similarity=0.274  Sum_probs=32.6

Q ss_pred             HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeec
Q psy1727         129 ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVH  185 (240)
Q Consensus       129 ~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~  185 (240)
                      .+.|++.|.++.||.+.++...+ .|...-+       ..  ....|.|+|+||+++
T Consensus        78 ~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~--~~~~H~i~v~G~d~~  124 (317)
T PF14399_consen   78 WEELKEALDAGRPVIVWVDMYYL-PYRPNYY-------KK--HHADHYIVVYGYDEE  124 (317)
T ss_pred             HHHHHHHHhCCCceEEEeccccC-CCCcccc-------cc--ccCCcEEEEEEEeCC
Confidence            56788888877799999887543 2332211       11  346899999999975


No 25 
>PF09778 Guanylate_cyc_2:  Guanylylate cyclase;  InterPro: IPR018616  Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate. 
Probab=88.20  E-value=2.2  Score=35.28  Aligned_cols=58  Identities=22%  Similarity=0.338  Sum_probs=35.5

Q ss_pred             hHHHHHHHHHhCCCeEEEEeccccccc--cCCcccCCCcccC-CCCCCCCccEEEEEEeec
Q psy1727         128 DETEMAKYLVKNGPMAVAINANAMQFY--FGGVSHPLKFLCK-GGMDNLDHGVLIVGYGVH  185 (240)
Q Consensus       128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y--~~Gv~~~~~~~c~-~~~~~~~Hav~iVGyg~~  185 (240)
                      ..++|...|.++||+.+-+++..+.=.  +.-........|- ......+|-|+|+||+..
T Consensus       112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~  172 (212)
T PF09778_consen  112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAA  172 (212)
T ss_pred             cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCC
Confidence            478899999999988888877543200  2222211101122 112457899999999975


No 26 
>PF05543 Peptidase_C47:  Staphopain peptidase C47;  InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=82.19  E-value=8  Score=30.97  Aligned_cols=65  Identities=20%  Similarity=0.291  Sum_probs=40.8

Q ss_pred             hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccCCCc
Q psy1727         128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHW  207 (240)
Q Consensus       128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~~W  207 (240)
                      +.+++++.+.++-|+.+..+.-.     +.         ..  ...+||++||||-.-.     +|.++.++=|-|    
T Consensus        90 s~~eV~~~~~~nk~i~i~~~~v~-----~~---------~~--~~~gHAlavvGya~~~-----~g~~~y~~WNPW----  144 (175)
T PF05543_consen   90 SFDEVKKLIDNNKGIAILADRVE-----QT---------NG--PHAGHALAVVGYAKPN-----NGQKTYYFWNPW----  144 (175)
T ss_dssp             -HHHHHHHHHTT-EEEEEEEETT-----SC---------TT--B--EEEEEEEEEEEET-----TSEEEEEEE-TT----
T ss_pred             CHHHHHHHHHcCCCeEEEecccc-----cC---------CC--CccceeEEEEeeeecC-----CCCeEEEEeCCc----
Confidence            56788888888888777665211     11         11  4578999999997643     578899998988    


Q ss_pred             CCCCcceEEEECC
Q psy1727         208 GEKTMPFWIIKNS  220 (240)
Q Consensus       208 G~~G~~y~~i~~~  220 (240)
                       +++  ++.++..
T Consensus       145 -~~~--~~~~sa~  154 (175)
T PF05543_consen  145 -WND--VMIQSAK  154 (175)
T ss_dssp             --SS---EEEETT
T ss_pred             -cCC--cEEEecC
Confidence             355  6666544


No 27 
>KOG4128|consensus
Probab=81.72  E-value=0.19  Score=44.05  Aligned_cols=87  Identities=14%  Similarity=0.109  Sum_probs=53.6

Q ss_pred             HHHHHHHHH----hCCCeEEEEeccccccccCCcccCCCccc--------C--CC-------CCCCCccEEEEEEeeccc
Q psy1727         129 ETEMAKYLV----KNGPMAVAINANAMQFYFGGVSHPLKFLC--------K--GG-------MDNLDHGVLIVGYGVHKT  187 (240)
Q Consensus       129 ~~~i~~~l~----~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c--------~--~~-------~~~~~Hav~iVGyg~~~~  187 (240)
                      .+-|++.++    .+-||..+-+...+..-++|+.+-....-        .  ..       ...-.||+++.|.|.-  
T Consensus       306 ~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~k--  383 (457)
T KOG4128|consen  306 MDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLK--  383 (457)
T ss_pred             HHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhcccc--
Confidence            344444443    35577777776666677888765321000        0  00       0123699999888832  


Q ss_pred             ccccCCcCEEEEecccCCCcCCCCcceEEEEC
Q psy1727         188 KFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKN  219 (240)
Q Consensus       188 ~~~~~g~~yWivkNSWG~~WG~~G~~y~~i~~  219 (240)
                      +....+-.-|-|.||||++-|.+|  +..|..
T Consensus       384 d~~~g~~~~~rVenswgkd~gkkg--~~~mt~  413 (457)
T KOG4128|consen  384 DPATGGLNEHRVENSWGKDLGKKG--VNKMTA  413 (457)
T ss_pred             CcccCCchhhhhhchhhhhccccc--hhhhhH
Confidence            112256667999999999999999  876643


No 28 
>PTZ00203 cathepsin L protease; Provisional
Probab=69.20  E-value=4.1  Score=36.40  Aligned_cols=36  Identities=31%  Similarity=0.763  Sum_probs=28.6

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEe
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSI  233 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~  233 (240)
                      .+-++--+||.   ++|.+|++|+.+|+..||+.+++.|
T Consensus       287 nHaVliVGYG~---~~g~~YWiikNSWG~~WGe~GY~ri  322 (348)
T PTZ00203        287 NHGVLLVGYNM---TGEVPYWVIKNSWGEDWGEKGYVRV  322 (348)
T ss_pred             CeEEEEEEEec---CCCceEEEEEcCCCCCcCcCceEEE
Confidence            45555556665   3578899999999999999999877


No 29 
>KOG3404|consensus
Probab=68.73  E-value=7.7  Score=29.18  Aligned_cols=44  Identities=23%  Similarity=0.225  Sum_probs=29.4

Q ss_pred             CCCCCChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHH
Q psy1727           6 KPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK   49 (240)
Q Consensus         6 ~~~~~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~   49 (240)
                      ..+|..+...|..|.-|+..|.|+----+-+-+|.+|-.+=..+
T Consensus        34 ~~~~~~~~~~E~lwpIfqlhHQrsRYiYdlyykR~~IS~eLY~~   77 (145)
T KOG3404|consen   34 TEPHEGKRKTESLWPIFQLHHQRSRYIYDLYYKRKAISRELYDY   77 (145)
T ss_pred             cCcccCCCcchhhhhHHHHhhhhhhhHHHHHHHHHHHHHHHHHH
Confidence            34454555668999999999999743345666777776554433


No 30 
>KOG1543|consensus
Probab=66.91  E-value=7.7  Score=34.32  Aligned_cols=41  Identities=29%  Similarity=0.631  Sum_probs=35.2

Q ss_pred             CcCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecCC
Q psy1727         193 IQPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYSS  236 (240)
Q Consensus       193 g~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~~  236 (240)
                      ....+++--.||+   .+|.||+.++.+|+..||+.++..|...
T Consensus       268 ~~~Hav~iVGyG~---~~~~~YWivkNSWG~~WGe~Gy~ri~r~  308 (325)
T KOG1543|consen  268 EGDHAVLIVGYGT---GDGVDYWIVKNSWGTDWGEKGYFRIARG  308 (325)
T ss_pred             CCCceEEEEEEcC---CCCceeEEEEcCCCCCcccCceEEEecC
Confidence            3678999999999   5666799999999999999999988643


No 31 
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=66.40  E-value=5  Score=33.64  Aligned_cols=37  Identities=38%  Similarity=0.843  Sum_probs=27.7

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEe
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSI  233 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~  233 (240)
                      ..-++--.||..  ++|.+|++|+..++..||+.+.+.|
T Consensus       179 ~HaV~IVGyG~~--~~g~~YWiikNSWG~~WGe~Gy~~i  215 (239)
T cd02698         179 NHIISVAGWGVD--ENGVEYWIVRNSWGEPWGERGWFRI  215 (239)
T ss_pred             CeEEEEEEEEec--CCCCEEEEEEcCCCcccCcCceEEE
Confidence            345555556652  1278899999999999999998876


No 32 
>PTZ00200 cysteine proteinase; Provisional
Probab=65.45  E-value=5.9  Score=36.69  Aligned_cols=39  Identities=31%  Similarity=0.588  Sum_probs=29.0

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      ..-++--+||.+ -++|.+|+.|+..|+..||+.++..|-
T Consensus       387 nHaV~lVGyG~d-~~~g~~YWIIkNSWG~~WGe~GY~ri~  425 (448)
T PTZ00200        387 NHAVLLVGEGYD-EKTKKRYWIIKNSWGTDWGENGYMRLE  425 (448)
T ss_pred             cEEEEEEEeccc-CCCCCceEEEEcCCCCCcccCeeEEEE
Confidence            455666666531 035777999999999999999998773


No 33 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=64.92  E-value=5.9  Score=33.22  Aligned_cols=39  Identities=36%  Similarity=0.725  Sum_probs=29.5

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      .+-++--+||.+= .+|.+|++|+.+++..||+.+++.|-
T Consensus       187 ~HaV~iVGyg~~~-~~g~~YWiirNSWG~~WGe~Gy~~i~  225 (243)
T cd02621         187 NHAVLLVGWGEDE-IKGEKYWIVKNSWGSSWGEKGYFKIR  225 (243)
T ss_pred             CeEEEEEEeeccC-CCCCcEEEEEcCCCCCCCcCCeEEEe
Confidence            4666666777531 02667999999999999999998773


No 34 
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=63.32  E-value=16  Score=29.44  Aligned_cols=47  Identities=23%  Similarity=0.429  Sum_probs=34.4

Q ss_pred             hHHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeecccccccCCcCEEEEecccC
Q psy1727         128 DETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWG  204 (240)
Q Consensus       128 ~~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG  204 (240)
                      +..+|+..|.+..||.+-...  |        .+          ..-|+|+|.||++.          |...-++||
T Consensus       122 sl~~ik~ql~kg~PV~iw~T~--~--------~~----------~s~H~v~itgyDk~----------n~yynDpyG  168 (195)
T COG4990         122 SLSDIKGQLLKGRPVVIWVTN--F--------HS----------YSIHSVLITGYDKY----------NIYYNDPYG  168 (195)
T ss_pred             cHHHHHHHHhcCCcEEEEEec--c--------cc----------cceeeeEeeccccc----------ceEeccccc
Confidence            478999999999999877653  2        11          13599999999975          555666664


No 35 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=62.36  E-value=12  Score=29.68  Aligned_cols=39  Identities=33%  Similarity=0.690  Sum_probs=32.7

Q ss_pred             cCEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727         194 QPYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       194 ~~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      ..+-++--.||.+  ++|++|++|+.+++..||+.+.+.|.
T Consensus       119 ~~Hav~ivGyg~~--~~g~~yWii~NSwG~~WG~~G~~~i~  157 (174)
T smart00645      119 LDHAVLIVGYGTE--ENGKDYWIVKNSWGTDWGENGYFRIA  157 (174)
T ss_pred             ccEEEEEEEEeec--CCCeeEEEEECCCCCCcccCeEEEEE
Confidence            4678888888875  46888999999999999999988774


No 36 
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=59.62  E-value=11  Score=31.53  Aligned_cols=37  Identities=32%  Similarity=0.781  Sum_probs=27.9

Q ss_pred             EEEEecccCCCcCCCCcceEEEECCCCCccccceeEEecC
Q psy1727         196 YWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS  235 (240)
Q Consensus       196 yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~~  235 (240)
                      .=++--.||..   +|++|++|+.+|+..||+.++..|.-
T Consensus       186 HaV~iVGyg~~---~g~~YWivrNSWG~~WGe~Gy~ri~~  222 (236)
T cd02620         186 HAVKIIGWGVE---NGVPYWLAANSWGTDWGENGYFRILR  222 (236)
T ss_pred             eEEEEEEEecc---CCeeEEEEEeCCCCCCCCCcEEEEEc
Confidence            44444455543   67779999999999999999987743


No 37 
>KOG1542|consensus
Probab=53.78  E-value=11  Score=33.47  Aligned_cols=37  Identities=32%  Similarity=0.797  Sum_probs=28.2

Q ss_pred             CEEEEecccCCCcCCC-CcceEEEECCCCCccccceeEEec
Q psy1727         195 PYWIIKNSWGPHWGEK-TMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~-G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      +.=++--++|+.   + .+||+.|+.+|++.||+.+++.+.
T Consensus       317 ~HaVLlvGyG~~---g~~~PYWIVKNSWG~~WGE~GY~~l~  354 (372)
T KOG1542|consen  317 NHAVLLVGYGSS---GYEKPYWIVKNSWGTSWGEKGYYKLC  354 (372)
T ss_pred             CceEEEEeecCC---CCCCceEEEECCccccccccceEEEe
Confidence            344555566652   3 588999999999999999988764


No 38 
>cd02549 Peptidase_C39A A sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family of proteins with a single peptidase domain, which are 
Probab=52.91  E-value=45  Score=24.70  Aligned_cols=34  Identities=24%  Similarity=0.503  Sum_probs=24.9

Q ss_pred             HHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEe
Q psy1727         132 MAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYG  183 (240)
Q Consensus       132 i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg  183 (240)
                      +++.|....||.+.++..        .   .    .   ...+|.|+|+||.
T Consensus        70 ~~~~l~~~~Pvi~~~~~~--------~---~----~---~~~gH~vVv~g~~  103 (141)
T cd02549          70 LLRQLAAGHPVIVSVNLG--------V---S----I---TPSGHAMVVIGYD  103 (141)
T ss_pred             HHHHHHCCCeEEEEEecC--------c---c----c---CCCCeEEEEEEEc
Confidence            778888888998877641        0   1    1   2368999999998


No 39 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=52.41  E-value=13  Score=35.42  Aligned_cols=37  Identities=24%  Similarity=0.737  Sum_probs=28.3

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCC--ccccceeEEe
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGP--RWGEQVTKSI  233 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~--~~g~~~~~~~  233 (240)
                      .+-++--+||.+  ++|.+|+.|+.+|+.  .||+.++..|
T Consensus       403 nHAVlIVGYG~d--e~G~~YWIVKNSWGt~~~WGE~GYfRI  441 (548)
T PTZ00364        403 NHTVLIIGWGTD--ENGGDYWLVLDPWGSRRSWCDGGTRKI  441 (548)
T ss_pred             CeEEEEEEeccc--CCCceEEEEECCCCCCCCcccCCeEEE
Confidence            345555566642  357789999999999  9999999876


No 40 
>KOG4702|consensus
Probab=49.06  E-value=34  Score=23.03  Aligned_cols=32  Identities=19%  Similarity=0.373  Sum_probs=23.9

Q ss_pred             HHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHH
Q psy1727          17 AMFNHFLEKHNKSYATKEEYHKRLRIFRANLKK   49 (240)
Q Consensus        17 ~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~   49 (240)
                      .-|++|+.+|.+.-.. .|...|.+-|.+-++.
T Consensus        29 e~Fee~v~~~krel~p-pe~~~~~EE~~~~lRe   60 (77)
T KOG4702|consen   29 EIFEEFVRGYKRELSP-PEATKRKEEYENFLRE   60 (77)
T ss_pred             HHHHHHHHhccccCCC-hHHHhhHHHHHHHHHH
Confidence            5799999999998754 4777777777665543


No 41 
>PTZ00021 falcipain-2; Provisional
Probab=42.94  E-value=24  Score=33.06  Aligned_cols=41  Identities=32%  Similarity=0.461  Sum_probs=29.9

Q ss_pred             cCEEEEecccCCCcC-------CCCcceEEEECCCCCccccceeEEec
Q psy1727         194 QPYWIIKNSWGPHWG-------EKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       194 ~~yWivkNSWG~~WG-------~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      ...-++--+||..=|       ..|.+|+.|+.+|+..||+.+++.|-
T Consensus       420 ~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~  467 (489)
T PTZ00021        420 PNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIE  467 (489)
T ss_pred             cceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEE
Confidence            456777777875311       01245999999999999999998773


No 42 
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=42.80  E-value=25  Score=28.27  Aligned_cols=29  Identities=31%  Similarity=0.692  Sum_probs=22.7

Q ss_pred             CcCCC---CcceEEEECCCCCccccceeEEec
Q psy1727         206 HWGEK---TMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       206 ~WG~~---G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      +||+.   |++||+|+.+++..||+.+...+.
T Consensus       179 Gy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~  210 (223)
T cd02619         179 GYDDNYVEGKGAFIVKNSWGTDWGDNGYGRIS  210 (223)
T ss_pred             eecCCCCCCCCEEEEEeCCCCccccCCEEEEe
Confidence            45543   344999999999999999887764


No 43 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=40.31  E-value=27  Score=34.13  Aligned_cols=37  Identities=32%  Similarity=0.717  Sum_probs=26.4

Q ss_pred             EEEEecccCCCcCCCCc--ceEEEECCCCCccccceeEEe
Q psy1727         196 YWIIKNSWGPHWGEKTM--PFWIIKNSWGPRWGEQVTKSI  233 (240)
Q Consensus       196 yWivkNSWG~~WG~~G~--~y~~i~~~~~~~~g~~~~~~~  233 (240)
                      .-++--+||+. .++|.  +|+.|+.+|+..||+.+++.|
T Consensus       620 HAVlIVGwG~d-~enG~~~~YWIVRNSWGt~WGenGYfKI  658 (693)
T PTZ00049        620 HAIVLVGWGEE-EINGKLYKYWIGRNSWGKNWGKEGYFKI  658 (693)
T ss_pred             eEEEEEEeccc-cCCCcccCEEEEECCCCCCcccCceEEE
Confidence            34444455542 12464  699999999999999999877


No 44 
>PF12385 Peptidase_C70:  Papain-like cysteine protease AvrRpt2;  InterPro: IPR022118  This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 []. 
Probab=37.97  E-value=73  Score=25.21  Aligned_cols=38  Identities=26%  Similarity=0.357  Sum_probs=27.7

Q ss_pred             HHHHHHHHHhCCCeEEEEeccccccccCCcccCCCcccCCCCCCCCccEEEEEEeec
Q psy1727         129 ETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVH  185 (240)
Q Consensus       129 ~~~i~~~l~~~GPv~v~~~~~~f~~Y~~Gv~~~~~~~c~~~~~~~~Hav~iVGyg~~  185 (240)
                      .+.+...|.++||+-+++....                .   .-..|+++|.|-..+
T Consensus        98 ~e~~~~LL~~yGPLwv~~~~P~----------------~---~~~~H~~ViTGI~~d  135 (166)
T PF12385_consen   98 AEGLANLLREYGPLWVAWEAPG----------------D---SWVAHASVITGIDGD  135 (166)
T ss_pred             HHHHHHHHHHcCCeEEEecCCC----------------C---cceeeEEEEEeecCC
Confidence            6889999999999988854311                1   224688888888765


No 45 
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=37.65  E-value=26  Score=28.18  Aligned_cols=37  Identities=35%  Similarity=0.759  Sum_probs=27.8

Q ss_pred             CEEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727         195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       195 ~yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      .+-++--.||...| .+  ||.|+.+|+..||+.+...|-
T Consensus       159 ~Hav~iVGy~~~~~-~~--ywiv~NSWG~~WG~~Gy~~i~  195 (210)
T cd02248         159 NHAVLLVGYGTENG-VD--YWIVKNSWGTSWGEKGYIRIA  195 (210)
T ss_pred             CEEEEEEEEeecCC-ce--EEEEEcCCCCccccCcEEEEE
Confidence            45666666665443 35  999999999999999888763


No 46 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=33.38  E-value=40  Score=34.34  Aligned_cols=39  Identities=23%  Similarity=0.602  Sum_probs=28.3

Q ss_pred             EEEEecccCCCcC--CCCcceEEEECCCCCccccceeEEec
Q psy1727         196 YWIIKNSWGPHWG--EKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       196 yWivkNSWG~~WG--~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      +-++--+||..=.  .+|++|+.|+.+|+..||+.+++.|-
T Consensus       723 HAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~  763 (1004)
T PTZ00462        723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD  763 (1004)
T ss_pred             ceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEE
Confidence            4555555664321  12678999999999999999998873


No 47 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=32.26  E-value=75  Score=27.72  Aligned_cols=30  Identities=13%  Similarity=0.222  Sum_probs=23.5

Q ss_pred             CCCccEEEEEEeecccccccCCcCEEEEecccCC
Q psy1727         172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGP  205 (240)
Q Consensus       172 ~~~Hav~iVGyg~~~~~~~~~g~~yWivkNSWG~  205 (240)
                      ..+||=.|+++-...    ..+...-.+||.||.
T Consensus       234 ~~~HaY~Vl~~~~~~----~~~~~lv~lrNPWg~  263 (315)
T cd00044         234 VKGHAYSVLDVREVQ----EEGLRLLRLRNPWGV  263 (315)
T ss_pred             ccCcceEEeEEEEEc----cCceEEEEecCCccC
Confidence            468999999998651    027888999999984


No 48 
>PF03714 PUD:  Bacterial pullanase-associated domain;  InterPro: IPR005323 Domain is found in pullanase - carbohydrate de-branching - proteins. It is found both to the N or the C-termini of of the alpha-amylase active site region. This domain contains several conserved aromatic residues that are suggestive of a carbohydrate binding function.; GO: 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 2J43_A 2J44_A 2YA1_A 2FHC_A 2FHB_A 2FHF_A 2J73_B 2J72_A 2J71_A.
Probab=30.14  E-value=13  Score=26.85  Aligned_cols=41  Identities=17%  Similarity=0.240  Sum_probs=24.9

Q ss_pred             EEEEecccCCCcCCCCcceEEEE--CCCCCccccceeEEecCCCC
Q psy1727         196 YWIIKNSWGPHWGEKTMPFWIIK--NSWGPRWGEQVTKSIYSSAP  238 (240)
Q Consensus       196 yWivkNSWG~~WG~~G~~y~~i~--~~~~~~~g~~~~~~~~~~~~  238 (240)
                      -+||++++.++-+.+=  ++.+.  .+..+-+..++...+|+++|
T Consensus        61 gfIv~~~~~kd~~~D~--~i~~~~~~~~~evwi~~Gd~~vy~~~p  103 (103)
T PF03714_consen   61 GFIVRKGDWKDQGGDR--FIDLTSPQGNTEVWIVSGDETVYYSPP  103 (103)
T ss_dssp             EEEEEETTEECSSSEE--EEETTTTTTCEEEEEETTTSSEBSSTT
T ss_pred             EEEEEcCCCCCCCCCE--EEEeccCCCcEEEEEECCCccEEecCC
Confidence            4889998666522222  34441  23333577777788888887


No 49 
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=27.39  E-value=32  Score=27.61  Aligned_cols=36  Identities=28%  Similarity=0.613  Sum_probs=25.1

Q ss_pred             EEEEecccCCCcCCCCcceEEEECCCCCccccceeEEec
Q psy1727         196 YWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIY  234 (240)
Q Consensus       196 yWivkNSWG~~WG~~G~~y~~i~~~~~~~~g~~~~~~~~  234 (240)
                      +=++--+||..=   |.+||.|+.+++..||+.+...+.
T Consensus       167 Hav~iVGy~~~~---~~~~wiv~NSWG~~WG~~Gy~~i~  202 (219)
T PF00112_consen  167 HAVLIVGYDDEN---GKGYWIVKNSWGTDWGDNGYFRIS  202 (219)
T ss_dssp             EEEEEEEEEEET---TEEEEEEE-SBTTTSTBTTEEEEE
T ss_pred             cccccccccccc---ceeeEeeehhhCCccCCCeEEEEe
Confidence            445555555432   344999999999999999988764


No 50 
>PF04928 PAP_central:  Poly(A) polymerase central domain;  InterPro: IPR007012 In eukaryotes, polyadenylation of pre-mRNA plays an essential role in the initiation step of protein synthesis, as well as in the export and stability of mRNAs. Poly(A) polymerase, the enzyme at the heart of the polyadenylation machinery, is a template-independent RNA polymerase which specifically incorporates ATP at the 3' end of mRNA. The crystal structure of bovine poly(A) polymerase bound to an ATP analog at 2.5 A resolutio has been determined []. The structure revealed expected and unexpected similarities to other proteins. As expected, the catalytic domain of poly(A) polymerase shares substantial structural homology with other nucleotidyl transferases such as DNA polymerase beta and kanamycin transferase.  The central domain of Poly(A) polymerase shares structural similarity with the allosteric activity domain of ribonucleotide reductase R1, which comprises a four-helix bundle and a three-stranded mixed beta-sheet. Even though the two enzymes bind ATP, the ATP-recognition motifs are different.; GO: 0004652 polynucleotide adenylyltransferase activity, 0006351 transcription, DNA-dependent; PDB: 1Q79_A 1Q78_A 1F5A_A 2O1P_A 2HHP_A 3C66_B 1FA0_A 2Q66_A.
Probab=21.98  E-value=1.6e+02  Score=25.15  Aligned_cols=39  Identities=28%  Similarity=0.503  Sum_probs=27.1

Q ss_pred             CChhhHHHHHHHHHHHhCcccCCHHHHHHHHHHHHHHHHHH
Q psy1727          10 HDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI   50 (240)
Q Consensus        10 ~~~~~~~~~f~~~~~~~~k~Y~~~~E~~~R~~iF~~n~~~i   50 (240)
                      ..++....+..+|.+.++ .|.+++|..+|.+++.+ |+.|
T Consensus        17 ~~Dl~~s~~L~~~l~~~~-~~es~ee~~~R~~vl~~-L~~i   55 (254)
T PF04928_consen   17 EKDLKRSASLEEFLKDYG-LFESEEEEQKREEVLRK-LQQI   55 (254)
T ss_dssp             HHHHHHHHHHHHHHHHCT--S--HHHHHHHHHHHHH-HHHH
T ss_pred             hhhHHhHHHHHHHHHHcC-CCCChHHHhHHHHHHHH-HHHH
Confidence            334556688999999997 66899999999888755 4444


Done!