Query         018718
Match_columns 351
No_of_seqs    234 out of 1483
Neff          6.5 
Searched_HMMs 46136
Date          Fri Mar 29 03:27:37 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/018718.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/018718hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG0339 Dcp Zn-dependent oligo 100.0 6.3E-78 1.4E-82  616.1  21.5  330    9-346    56-544 (683)
  2 PRK10280 dipeptidyl carboxypep 100.0 1.4E-73   3E-78  600.7  23.0  332    9-351    55-546 (681)
  3 KOG2089 Metalloendopeptidase f 100.0 4.7E-72   1E-76  566.5  17.0  338    9-351    79-578 (718)
  4 PRK10911 oligopeptidase A; Pro 100.0 1.5E-69 3.2E-74  571.3  23.1  335    9-350    49-545 (680)
  5 KOG2090 Metalloendopeptidase f 100.0 5.8E-54 1.3E-58  430.5  21.8  309   33-350   106-560 (704)
  6 cd06457 M3A_MIP Peptidase M3 m 100.0 3.2E-52   7E-57  423.0  14.7  209  137-350    23-329 (458)
  7 cd06456 M3A_DCP_Oligopeptidase 100.0 1.3E-49 2.8E-54  399.9  14.3  203  139-350     1-289 (422)
  8 cd06455 M3A_TOP Peptidase M3 T 100.0 1.3E-46 2.8E-51  383.3  17.8  243   90-348     5-342 (472)
  9 PF01432 Peptidase_M3:  Peptida 100.0 2.7E-43 5.9E-48  357.0   8.7  203  139-350    30-323 (458)
 10 cd06258 Peptidase_M3_like The   99.9 2.7E-26 5.9E-31  226.2   8.4  171  142-339     2-221 (365)
 11 TIGR02289 M3_not_pepF oligoend  99.9 2.9E-22 6.3E-27  208.1  10.2  202   90-324   120-380 (549)
 12 cd06459 M3B_Oligoendopeptidase  99.9 2.7E-23   6E-28  207.9   2.0  164  129-324    58-265 (427)
 13 TIGR00181 pepF oligoendopeptid  99.8 4.1E-21 8.8E-26  201.0   6.1  274    9-324    50-421 (591)
 14 TIGR02290 M3_fam_3 oligoendope  99.7 6.1E-19 1.3E-23  184.6   2.0  152  141-324   223-417 (587)
 15 COG1164 Oligoendopeptidase F [  99.0 2.4E-10 5.1E-15  120.2   6.2   79  221-323   343-422 (598)
 16 cd06461 M2_ACE Peptidase famil  97.7 5.8E-05 1.3E-09   77.8   5.8   83  226-328   212-298 (477)
 17 COG0339 Dcp Zn-dependent oligo  95.7   0.085 1.8E-06   56.2  11.1   76   50-127    58-134 (683)
 18 cd06460 M32_Taq Peptidase fami  93.6   0.028   6E-07   56.9   0.9   42  281-324   161-204 (396)
 19 COG2856 Predicted Zn peptidase  89.3    0.21 4.5E-06   46.4   1.8   30  256-294    57-87  (213)
 20 PF06114 DUF955:  Domain of unk  85.6    0.39 8.4E-06   38.6   1.2   32  257-297    28-60  (122)
 21 PF00413 Peptidase_M10:  Matrix  85.0    0.39 8.5E-06   41.1   1.0   11  281-291   107-117 (154)
 22 PF14247 DUF4344:  Domain of un  84.9    0.47   1E-05   44.3   1.5   32  281-319    94-125 (220)
 23 PRK10280 dipeptidyl carboxypep  80.4     6.6 0.00014   42.6   8.3   48   49-96     56-104 (681)
 24 PRK10911 oligopeptidase A; Pro  77.6      10 0.00022   41.1   8.7   46   49-94     50-96  (680)
 25 cd04277 ZnMc_serralysin_like Z  77.1     1.1 2.4E-05   40.0   1.0   12  281-292   115-126 (186)
 26 PF13398 Peptidase_M50B:  Pepti  76.8     1.5 3.2E-05   40.1   1.8   20  281-300    24-43  (200)
 27 cd04279 ZnMc_MMP_like_1 Zinc-d  76.1     1.2 2.5E-05   38.7   0.9   11  281-291   106-116 (156)
 28 cd04278 ZnMc_MMP Zinc-dependen  76.1    0.95 2.1E-05   39.3   0.3   11  281-291   109-119 (157)
 29 cd04268 ZnMc_MMP_like Zinc-dep  76.0     1.2 2.7E-05   38.5   1.0   11  281-291    96-106 (165)
 30 PF14891 Peptidase_M91:  Effect  75.4     2.5 5.4E-05   37.7   2.8   18  281-298   105-122 (174)
 31 PF13058 DUF3920:  Protein of u  72.2       2 4.4E-05   35.8   1.3   13  281-293    78-90  (126)
 32 smart00235 ZnMc Zinc-dependent  70.9     1.9 4.2E-05   36.4   1.0   11  281-291    88-98  (140)
 33 cd04327 ZnMc_MMP_like_3 Zinc-d  70.4     2.1 4.4E-05   38.9   1.1   12  281-292    94-105 (198)
 34 PF04228 Zn_peptidase:  Putativ  69.1     2.6 5.6E-05   41.1   1.5   17  281-297   172-188 (292)
 35 PF13582 Reprolysin_3:  Metallo  67.7     2.5 5.4E-05   34.8   1.0   11  281-291   109-119 (124)
 36 cd00203 ZnMc Zinc-dependent me  65.1       3 6.4E-05   36.0   1.0   11  281-291    98-108 (167)
 37 PF13688 Reprolysin_5:  Metallo  60.2     4.1 8.9E-05   36.4   1.0   11  281-291   144-154 (196)
 38 PF02128 Peptidase_M36:  Fungal  59.1     2.5 5.4E-05   42.5  -0.7   17  281-297   187-203 (378)
 39 cd04280 ZnMc_astacin_like Zinc  57.4     4.9 0.00011   36.0   1.0   11  281-291    76-86  (180)
 40 PF13583 Reprolysin_4:  Metallo  56.9       5 0.00011   36.8   1.0   11  281-291   139-149 (206)
 41 PF05572 Peptidase_M43:  Pregna  56.4     5.2 0.00011   35.1   1.0   12  281-292    71-82  (154)
 42 KOG3658 Tumor necrosis factor-  56.3      11 0.00024   40.5   3.4   64  214-291   338-404 (764)
 43 PF13574 Reprolysin_2:  Metallo  53.0     6.8 0.00015   34.8   1.2   12  280-291   111-123 (173)
 44 PF09471 Peptidase_M64:  IgA Pe  52.0     6.5 0.00014   37.7   0.9   17  281-297   218-234 (264)
 45 PF01400 Astacin:  Astacin (Pep  51.6     7.4 0.00016   35.2   1.2   11  281-291    81-91  (191)
 46 PF13485 Peptidase_MA_2:  Pepti  51.0     8.7 0.00019   30.8   1.4   17  281-297    27-43  (128)
 47 PF12388 Peptidase_M57:  Dual-a  50.8     6.7 0.00015   36.4   0.8   12  281-292   135-146 (211)
 48 cd04283 ZnMc_hatching_enzyme Z  49.8     7.9 0.00017   35.1   1.1   11  281-291    79-89  (182)
 49 PF04298 Zn_peptidase_2:  Putat  49.4     7.5 0.00016   36.4   0.9   15  281-295    91-105 (222)
 50 cd04270 ZnMc_TACE_like Zinc-de  48.0     8.1 0.00018   36.4   0.9   11  281-291   169-179 (244)
 51 cd04267 ZnMc_ADAM_like Zinc-de  45.4     8.8 0.00019   34.2   0.7   11  281-291   135-145 (192)
 52 TIGR03296 M6dom_TIGR03296 M6 f  44.8     6.4 0.00014   38.0  -0.4   12  281-292   167-178 (286)
 53 cd04271 ZnMc_ADAM_fungal Zinc-  44.2     8.2 0.00018   36.1   0.2   13  281-293   147-162 (228)
 54 cd04281 ZnMc_BMP1_TLD Zinc-dep  43.1      12 0.00025   34.5   1.1   11  281-291    89-99  (200)
 55 cd06161 S2P-M50_SpoIVFB SpoIVF  42.8      12 0.00026   34.4   1.1   16  281-296    40-55  (208)
 56 cd05709 S2P-M50 Site-2 proteas  42.2      13 0.00027   32.9   1.1   18  281-298    10-27  (180)
 57 PF02163 Peptidase_M50:  Peptid  39.7      16 0.00036   32.4   1.5   24  281-304     9-32  (192)
 58 cd04276 ZnMc_MMP_like_2 Zinc-d  39.5      14  0.0003   33.9   1.0   12  281-292   118-129 (197)
 59 PF01435 Peptidase_M48:  Peptid  37.7      16 0.00034   33.0   1.0   13  281-293    91-103 (226)
 60 cd04269 ZnMc_adamalysin_II_lik  36.8      15 0.00033   32.7   0.8   11  281-291   133-143 (194)
 61 cd04272 ZnMc_salivary_gland_MP  35.8      17 0.00038   33.3   1.0   11  281-291   147-157 (220)
 62 PF13699 DUF4157:  Domain of un  35.1      28 0.00061   27.1   2.0   14  281-294    63-76  (79)
 63 cd04282 ZnMc_meprin Zinc-depen  34.9      18  0.0004   34.0   1.0   12  281-292   122-133 (230)
 64 cd06164 S2P-M50_SpoIVFB_CBS Sp  34.4      19 0.00042   33.6   1.1   15  281-295    55-69  (227)
 65 PF02031 Peptidase_M7:  Strepto  34.1      22 0.00049   30.5   1.3   12  281-292    79-90  (132)
 66 cd06159 S2P-M50_PDZ_Arch Uncha  33.5      20 0.00043   34.4   1.1   16  281-296   120-135 (263)
 67 COG2738 Predicted Zn-dependent  32.0      21 0.00046   32.9   0.9   39  281-323    94-132 (226)
 68 cd06163 S2P-M50_PDZ_RseP-like   31.2      28 0.00061   31.4   1.6   22  281-302    11-32  (182)
 69 PF10460 Peptidase_M30:  Peptid  31.1      24 0.00053   35.5   1.3   55  281-339   141-197 (366)
 70 cd04275 ZnMc_pappalysin_like Z  29.6      21 0.00045   33.4   0.4   12  281-292   139-150 (225)
 71 COG0501 HtpX Zn-dependent prot  28.9      24 0.00051   33.5   0.7   16  277-292   152-170 (302)
 72 PF13402 M60-like:  Peptidase M  28.9      27 0.00058   33.4   1.1   14  281-294   221-234 (307)
 73 cd04273 ZnMc_ADAMTS_like Zinc-  28.8      14 0.00031   33.5  -0.7   11  281-291   142-152 (207)
 74 PF05548 Peptidase_M11:  Gameto  28.8      24 0.00051   34.8   0.7   11  281-291   152-162 (314)
 75 cd06162 S2P-M50_PDZ_SREBP Ster  28.5      27 0.00059   33.8   1.1   16  281-296   137-152 (277)
 76 cd06160 S2P-M50_like_2 Unchara  28.2      28 0.00062   31.4   1.1   17  281-297    43-59  (183)
 77 PF01421 Reprolysin:  Reprolysi  27.1      29 0.00064   31.1   1.0   11  281-291   133-143 (199)
 78 PRK03982 heat shock protein Ht  26.7      29 0.00062   33.4   0.9   12  281-292   127-138 (288)
 79 PF01742 Peptidase_M27:  Clostr  26.6      37  0.0008   34.4   1.6   17  281-297   216-232 (408)
 80 COG5549 Predicted Zn-dependent  23.5      33 0.00071   32.0   0.5   18  274-291   180-199 (236)
 81 PRK02870 heat shock protein Ht  23.3      37 0.00079   33.8   0.9   13  281-293   175-187 (336)
 82 PRK03001 M48 family peptidase;  23.3      36 0.00078   32.7   0.8   13  281-293   126-138 (283)
 83 PF10462 Peptidase_M66:  Peptid  23.1      35 0.00076   33.5   0.7   11  281-291   195-205 (305)
 84 PF05547 Peptidase_M6:  Immune   22.9      19 0.00041   39.0  -1.3   11  281-291   223-233 (645)
 85 PHA03420 E4 protein; Provision  22.7      44 0.00095   28.4   1.1   13  178-190    12-24  (137)
 86 PF01447 Peptidase_M4:  Thermol  22.3      45 0.00097   29.2   1.2   11  281-291   137-147 (150)
 87 COG4227 Antirestriction protei  22.2      33 0.00072   33.0   0.4   11  281-291   205-215 (316)
 88 KOG2921 Intramembrane metallop  21.9      41 0.00089   34.2   0.9   45  256-300   104-152 (484)
 89 PRK10779 zinc metallopeptidase  21.8      47   0.001   34.1   1.4   21  281-301    17-37  (449)
 90 PRK04897 heat shock protein Ht  21.4      41 0.00089   32.6   0.8   12  281-292   139-150 (298)
 91 PRK01345 heat shock protein Ht  21.3      42 0.00091   33.0   0.9   12  281-292   126-137 (317)
 92 KOG3714 Meprin A metalloprotea  20.8      44 0.00096   34.0   1.0   13  281-293   161-173 (411)
 93 TIGR00054 RIP metalloprotease   20.5      49  0.0011   33.7   1.2   19  281-299    16-34  (420)

No 1  
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00  E-value=6.3e-78  Score=616.05  Aligned_cols=330  Identities=34%  Similarity=0.534  Sum_probs=300.5

Q ss_pred             HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718            9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK-------------   73 (351)
Q Consensus         9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~-------------   73 (351)
                      ..|+|.|+-...+.....+.++.+  .+++|+++|+|..+..- .+.++.....+...+|..+.++.             
T Consensus        56 ~~n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a~e~~~p-klse~~~~l~~~~~Lf~~~~al~~~~e~~~ld~e~~  134 (683)
T COG0339          56 WENTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREAYEEILP-KLSEFSNDLGQNPGLFARYKALWQSPESAGLDPEQK  134 (683)
T ss_pred             HHHHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHHHHHhhH-HHHHHHHHhhcCHHHHHHHHHHhcCcccccCCHHHH
Confidence            468999999999999999999987  99999999999999999 99999999999999999999882             


Q ss_pred             -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718           74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------  127 (351)
Q Consensus        74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------  127 (351)
                                   |+..+++.|+++.+|+ |||+|+++|+||+++++.+|.+++.++++|+|||+..+            
T Consensus       135 r~~e~~~rdF~~sGa~L~~~~k~r~~~I~~ela~L~~~Fs~nvL~at~~~~~~~~~~~~LaGlp~~~l~aa~~~A~~kg~  214 (683)
T COG0339         135 RVVENALRDFVLSGAKLPPEKKARLAEINTELAELGAQFSNNVLDATKAWALIVTDEAELAGLPESLLAAAAAAAEAKGL  214 (683)
T ss_pred             HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeecCcHHHhcCCCHHHHHHHHHHHHhcCC
Confidence                         6667778899999998 99999999999999999999999999999999999986            


Q ss_pred             ------------------------------HHHHhhCC-CCCCcHHHHHHHHHHHHHHHhhh------------------
Q 018718          128 ------------------------------AYVTCSFS-GDLDNTSIIDQILKLRLEKAKLL------------------  158 (351)
Q Consensus       128 ------------------------------a~~~ra~~-~~~~N~~il~eil~LR~e~A~LL------------------  158 (351)
                                                    |+.+|+++ +..+|.+|+.++++||+|+|+||                  
T Consensus       215 ~~~~itl~~p~~~p~l~~~~~r~lRe~~y~A~~~ra~~~~~~~n~~ii~~~L~LR~E~A~LLGf~~yA~~~L~~kMA~~p  294 (683)
T COG0339         215 EGYLITLDIPSYLPVLTYADNRALREKLYRAYVTRASEGGPNDNRAIIEEILKLRAELAKLLGFSNYAEYSLADKMAKTP  294 (683)
T ss_pred             CCeEEeecccchhHHHHhcccHHHHHHHHHHHhhhccccCCcccHHHHHHHHHHHHHHHHHcCcccHHHHhHHHHhhcCH
Confidence                                          67778876 45679999999999999999999                  


Q ss_pred             --------------------hHHHHHHHHHhc-CCCCCCccccCCHHHHHHHhhhhhcCCCC------------------
Q 018718          159 --------------------DIEDLRSFCKGQ-GALEPREVNHWDINFWSERLCESKYDINE------------------  199 (351)
Q Consensus       159 --------------------El~~L~~~k~~~-~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~------------------  199 (351)
                                          |+++|+++++++ +  +..+++|||++||++|+|+++|++|+                  
T Consensus       295 ~~Vl~fL~~l~~ka~~~a~~e~a~L~~~~~~~~~--~~~~l~~WD~~yyaeK~r~~~y~~de~elrpYF~l~~Vl~GlF~  372 (683)
T COG0339         295 EAVLNFLNDLAEKARPQAEKELAELQAFAAEEEG--GLPELQPWDWAYYAEKQRQEKYAFDEEELRPYFPLNKVLEGLFE  372 (683)
T ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhcc--CCcccchhhHHHHHHHHHhhhcCCCHHHhhhcCChhHHHHHHHH
Confidence                                899999999874 4  57789999999999999999999998                  


Q ss_pred             -----------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEE
Q 018718          200 -----------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHM  262 (351)
Q Consensus       200 -----------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~l  262 (351)
                                       +||||||+|+|+|.+|..+|.||+|+|+|+| |+|||||..+++|...  .+|+ .|+||+++
T Consensus       373 ~~~rLfGI~~~e~~~~~vwHpDVr~~~v~d~~g~~~g~fY~DlyaR~~-KrgGAWM~~~~~~~~~--~~~~-~q~PV~yl  448 (683)
T COG0339         373 VAKRLFGITFVERKDIPVWHPDVRVFEVFDENGELIGLFYLDLYARDG-KRGGAWMDDFVSQRRL--DDGG-GQKPVIYL  448 (683)
T ss_pred             HHHHHcCeEEEECCCCCccCCCceEEEEEcCCCCEEEEEEeecccCCC-CccchHHHHhhhcccc--cCCC-cccceEEE
Confidence                             9999999999999999999999999999999 7999999999987653  3454 58999999


Q ss_pred             eeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----cc
Q 018718          263 VCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QV  331 (351)
Q Consensus       263 v~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~  331 (351)
                      ||||++|.+|+|+||+|+  +||||||||+||+||++++|+.+|||+ |||||||+||||||+|    ++|..     +|
T Consensus       449 vCNF~~p~~gkPaLls~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~-v~wDfVElPSQ~mE~w~~~p~vL~~~a~Hy~T  527 (683)
T COG0339         449 VCNFTKPVGGKPALLSHDEVTTLFHEFGHGLHHLLTRVKYPGVSGTN-VPWDFVELPSQFMENWCWEPEVLAKYARHYQT  527 (683)
T ss_pred             eccCCCCCCCCCceeeHHHHHHHHHHhhhHHHHHhhcCCccccCCCC-CCcchhhccHHHHHHhhcCHHHHHHHHHhhcc
Confidence            999999999999999999  999999999999999999999999997 9999999999999654    56644     89


Q ss_pred             CCCCcccc--chhhccc
Q 018718          332 IPTIPEDG--YFCSFRH  346 (351)
Q Consensus       332 ~~~~~~d~--~~~~~~~  346 (351)
                      |+|||.++  .+.+.|+
T Consensus       528 Ge~lP~~ll~k~laakn  544 (683)
T COG0339         528 GEPLPKELLDKMLAAKN  544 (683)
T ss_pred             CCcCHHHHHHHHHHHHh
Confidence            99999999  3444443


No 2  
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00  E-value=1.4e-73  Score=600.75  Aligned_cols=332  Identities=22%  Similarity=0.317  Sum_probs=293.8

Q ss_pred             HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718            9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK-------------   73 (351)
Q Consensus         9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~-------------   73 (351)
                      .-|+|+|+...++.....+.++.+  .+++|+++|+|+.++.. .+.++..+..+..++|..+.++.             
T Consensus        55 ~~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~-~l~~~~~~l~~~~~Ly~~l~~~~~~~~~~~l~~e~~  133 (681)
T PRK10280         55 FNNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSA-ELAELANDIYLNGELFARVDAVWQQRESLGLDSESI  133 (681)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHH-HHHHHHHHhhCCHHHHHHHHHHHcCccccCCCHHHH
Confidence            368999999999999988888887  99999999999999999 99999999999999999999882             


Q ss_pred             -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718           74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------  127 (351)
Q Consensus        74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------  127 (351)
                                   |+..+++.|+++++|+ +|++|+++|++|++++++.|.+++.+++||+|||++++            
T Consensus       134 r~l~~~l~dF~~sG~~L~~~~r~r~~~l~~~l~~L~~~F~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~g~  213 (681)
T PRK10280        134 RLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQFNQRLLAANKSGGLVVNDIHQLAGLSEQEIALAAEAAREKGL  213 (681)
T ss_pred             HHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhcCeeecCCHHHHcCCCHHHHHHHHHHHHhcCC
Confidence                         4455556788888887 99999999999999999999888889999999999877            


Q ss_pred             -------------------------------HHHHhhCC-CCCCcHHHHHHHHHHHHHHHhhh-----------------
Q 018718          128 -------------------------------AYVTCSFS-GDLDNTSIIDQILKLRLEKAKLL-----------------  158 (351)
Q Consensus       128 -------------------------------a~~~ra~~-~~~~N~~il~eil~LR~e~A~LL-----------------  158 (351)
                                                     |+.+|++. +..+|.+++++|+++|+|+|+||                 
T Consensus       214 ~g~~~itl~~~~~~p~l~~~~dr~~Re~~~~A~~~r~~~~~~~dn~~il~~ll~lR~e~A~lLGf~~yA~~~l~~kMa~s  293 (681)
T PRK10280        214 DNRWLIPLLNTTQQPALAELRDRQTRENLFAAGWTRAEKGDANDTRAIIQRLVEIRAQQAKLLGFPHYAAWKIADQMAKT  293 (681)
T ss_pred             CCcEEEeCcCCcHhHHHhcCCCHHHHHHHHHHHHhhhccCCchhHHHHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHCCC
Confidence                                           33333432 34689999999999999999999                 


Q ss_pred             ---------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC------------------
Q 018718          159 ---------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE------------------  199 (351)
Q Consensus       159 ---------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~------------------  199 (351)
                                           |+++|+++|++.+  +...|+|||++||++|+|+++|++|+                  
T Consensus       294 pe~V~~FL~~L~~~~~~~a~~E~~~L~~~~~~~~--g~~~l~pWD~~yy~ek~r~~~~~~d~~~l~~YFpl~~Vl~~Glf  371 (681)
T PRK10280        294 PEAALNFMREIVPAARQRASDELASIQAVIDKQQ--GGFSAQAWDWAFYAEQVRREKYALDEAQLKPYFELNTVLNEGVF  371 (681)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCChhHHHHHHHHHHHHhcCCCHHHcCCcCcHHHHHHHhHH
Confidence                                 8889999987654  34579999999999999999999987                  


Q ss_pred             ------------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEE
Q 018718          200 ------------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVH  261 (351)
Q Consensus       200 ------------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~  261 (351)
                                        +|||||++|+|+|.+|.+||+||+|+|+|+| |++||||+.++.++.   .+   +|+|+++
T Consensus       372 ~l~~~LfGi~f~~~~~~~vWh~dV~~~~V~d~~g~~lG~fY~Dl~~R~g-K~~gawm~~~~~~~~---~~---~~~Pv~~  444 (681)
T PRK10280        372 WTANQLFGIKFVERFDIPVYHPDVRVWEIFDHNGVGLALFYGDFFARDS-KSGGAWMGNFVEQST---LN---ETRPVIY  444 (681)
T ss_pred             HHHHHHcCeEEEECCCCCCCCCCeeEEEEEcCCCCEEEEEEecCCCCCC-CCCCCCCcccccccc---cC---CCCCeEE
Confidence                              8999999999999888999999999999999 688999988876542   12   3679999


Q ss_pred             EeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----c
Q 018718          262 MVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----Q  330 (351)
Q Consensus       262 lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~  330 (351)
                      |||||++|++++|+||+|+  +|||||||||||+|||+++|+++|||+ |+|||||+||||||+|    +||+.     .
T Consensus       445 lvcNf~~p~~~~p~LL~~~eV~TlFHEfGHalH~lls~~~y~~~sGt~-v~~DfVE~PSq~mE~w~~~~~vL~~~a~Hy~  523 (681)
T PRK10280        445 NVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTN-TPRDFVEFPSQINEHWASHPQVFARYARHYQ  523 (681)
T ss_pred             EECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhCCCccccCCCC-CCcchhcCcHHHHHHHhcCHHHHHHHhhccC
Confidence            9999999999999999999  999999999999999999999999986 7999999999999554    67765     7


Q ss_pred             cCCCCcccc--chhhcccccccC
Q 018718          331 VIPTIPEDG--YFCSFRHKFASL  351 (351)
Q Consensus       331 ~~~~~~~d~--~~~~~~~~~~~~  351 (351)
                      ||+|||+|+  .++..|+.++|+
T Consensus       524 TgepiP~~l~~~l~~ar~~~~g~  546 (681)
T PRK10280        524 SGEAMPDELQEKMRNASLFNKGY  546 (681)
T ss_pred             CCCCCCHHHHHHHHHhhCcchHH
Confidence            999999999  789999988764


No 3  
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=4.7e-72  Score=566.46  Aligned_cols=338  Identities=37%  Similarity=0.588  Sum_probs=300.4

Q ss_pred             HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718            9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK-------------   73 (351)
Q Consensus         9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~-------------   73 (351)
                      ..|+++|++.+|+++...++.|.|  +|++++++|+||.++.+ -++++.+.++|+.++|..+.++.             
T Consensus        79 ~en~vepla~ie~el~~~~~~L~f~~~vs~~~~~R~as~~~~~-~~~~~~~r~~~r~di~~~~~~i~e~~~~dslspe~~  157 (718)
T KOG2089|consen   79 YENVVEPLAKIEVELTVVIGMLVFPQHVSPDKELRKASTEADK-KLDEFDLRLSMRKDIYNRFQAIYEKPAKDSLSPEAQ  157 (718)
T ss_pred             hHHHHhHHHHHHHHHHHHHhhhhhhhhcCCchhhhhhhhcchH-HHHHHHHHhhhhhhHHHHHHHHHHhcccccCCHHHH
Confidence            579999999999999999999999  88899999999999999 99999999999999998888773             


Q ss_pred             -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718           74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------  127 (351)
Q Consensus        74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------  127 (351)
                                   |++.+++.|+..++|+ +|++++.+|++||+++++.|++++||.+||+|+|+++|            
T Consensus       158 RylE~~ike~k~nGL~L~~~kr~~ik~ikk~l~~l~~~f~~nvne~~~~~t~litd~~el~glPps~L~~la~~~~~k~s  237 (718)
T KOG2089|consen  158 RYLEKLIKEGKLNGLHLDDDKREEIKEIKKELSELSIKFSKNVNEDTKKFTFLITDKKELEGLPPSFLQSLAKDEDDKGS  237 (718)
T ss_pred             HHHHHHHHHHHhcCCCCChhHHHHHHHHHHHHHHHHHHHHhhhhhcccccceeeecHHHhccCCHHHHHHHhhcccCCCC
Confidence                         5555666788888887 99999999999999999999999999999999999987            


Q ss_pred             ------------------------------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh-------------------
Q 018718          128 ------------------------------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL-------------------  158 (351)
Q Consensus       128 ------------------------------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL-------------------  158 (351)
                                                    ||..||..+..+|..+|++|++||.++|+||                   
T Consensus       238 gpwkiTl~~p~~~pvmk~c~~r~tR~~v~~A~~~ra~~~~~eNs~ii~~l~~lR~~lAklLGY~t~Ad~~L~~kMa~ss~  317 (718)
T KOG2089|consen  238 GPWKITLKYPHYFPVMKHCKIRQTRKTVWRAYNSRASEGDLENSAIIEQLLKLRLELAKLLGYSTYADYSLAMKMAKSSE  317 (718)
T ss_pred             CceEEEecccchhHHHHhCCcHHHHHHHHHHHhccCCcccccccHHHHHHHHHHHHHHHHhCchhHHHHHHHHHhhcCcH
Confidence                                          5566666556789999999999999999999                   


Q ss_pred             -----------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC----------------
Q 018718          159 -----------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE----------------  199 (351)
Q Consensus       159 -----------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~----------------  199 (351)
                                             +++.++.++++.|.+.++.++.||+.||.+++.+..|+||+                
T Consensus       318 tv~~fl~dL~~kL~pl~~~e~~v~~elk~~e~k~~g~~~~~~~~~wD~~yy~~~~~e~~f~vd~~~LreyFPl~~v~~Gl  397 (718)
T KOG2089|consen  318 TVVEFLDDLSQKLRPLGIDERSVLLELKKGEAKDRGAPFDGKLTAWDLRYYMKRVEESKFDVDQEDLREYFPLPVVLSGL  397 (718)
T ss_pred             HHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHhcCCCccccchhhHHHHHHHHHHHHhcCCCHHHHHhhCCcHHHHHHH
Confidence                                   34445556666777788899999999999999999999997                


Q ss_pred             -------------------ccCCceeEEEEEcC-CCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccce
Q 018718          200 -------------------VWNDDVRFYRVKDS-SGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPV  259 (351)
Q Consensus       200 -------------------vWh~dV~~~~V~d~-~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~  259 (351)
                                         |||+|||+|.|.|. +|+++|+||+|+|+|+||+++.+.+..   ++.+...+|+ +++||
T Consensus       398 ~~i~q~LFglkf~e~~da~vWh~dVr~y~v~D~~Sg~~vG~fY~D~y~RegK~gh~~~f~l---~~~~~~~~ss-~~~PV  473 (718)
T KOG2089|consen  398 FGIYQTLFGLKFEEATDAEVWHADVRVYTVKDSASGNPVGYFYLDPYPREGKYGHAAVFGL---QPGCLQKDSS-RRIPV  473 (718)
T ss_pred             HHHHHHHhCceeeecCCchhcccceeEEeccCCCCCceeeEEEeccCCCccccchhhhhcc---chhhhccCCc-cccch
Confidence                               99999999999997 599999999999999997776554333   3335556676 79999


Q ss_pred             EEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc----
Q 018718          260 VHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT----  329 (351)
Q Consensus       260 ~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~----  329 (351)
                      ++|||||++|.+++|+||.|+  +|+||||||+||+||++++|+++||+++|+|||||+||||||+|    ++|+.    
T Consensus       474 aalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~d~L~~lS~H  553 (718)
T KOG2089|consen  474 AALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKH  553 (718)
T ss_pred             HHHHHhcCCcccCCCCccchHHHHHHHHHHhHHHHHHHhcCccccccCcccCCcchhhchHHHHHHhccCchHhhhhccc
Confidence            999999999999999999999  99999999999999999999999999999999999999999554    56644    


Q ss_pred             -ccCCCCcccc--chhhcccccccC
Q 018718          330 -QVIPTIPEDG--YFCSFRHKFASL  351 (351)
Q Consensus       330 -~~~~~~~~d~--~~~~~~~~~~~~  351 (351)
                       .||+||||+.  .+...|+.++||
T Consensus       554 y~tge~l~eEl~~kl~~~r~~~~gl  578 (718)
T KOG2089|consen  554 YKTGEPLPEELLKKLILTRTVNAGL  578 (718)
T ss_pred             ccCCCccHHHHHHHHHHHHHHHHHH
Confidence             7999999999  788999988875


No 4  
>PRK10911 oligopeptidase A; Provisional
Probab=100.00  E-value=1.5e-69  Score=571.28  Aligned_cols=335  Identities=30%  Similarity=0.489  Sum_probs=294.9

Q ss_pred             HHHHhcccccccccccccccceEE--EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHhh-------------
Q 018718            9 VVFLLLPPSAVQSRFKSRNTFLCF--SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHLK-------------   73 (351)
Q Consensus         9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l~-------------   73 (351)
                      .-|+|.|+...++++.....++.+  +|++|+++|+|+.++.. .+.++.....+..++|..+.++.             
T Consensus        49 ~~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~-~l~~~~~~~~~~~~Ly~~~~~~~~~~~~~~l~~e~~  127 (680)
T PRK10911         49 WENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQTLP-LLSEYSTWVGQHEGLYQAYRDLRDGDHYATLNTAQK  127 (680)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHH-HHHHHHHHHhCCHHHHHHHHHHhccccccCCCHHHH
Confidence            368999999999999988888886  99999999999999999 99999999999999999988772             


Q ss_pred             -------------ccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------------
Q 018718           74 -------------SVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------------  127 (351)
Q Consensus        74 -------------~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------------  127 (351)
                                   |++.+++.|+++++|+ +|++|+++|++|++++++.|..++++++||+|||++++            
T Consensus       128 r~l~~~~~~F~~sG~~L~~~~r~~~~~i~~~l~~l~~~F~~n~~~~~~~~~~~~~~~~eL~Glp~~~~~~~~~~a~~~~~  207 (680)
T PRK10911        128 KAVDNALRDFELSGIGLPKEKQQRYGEIAARLSELGNQYSNNVLDATMGWTKLITDEAELAGMPESALAAAKAQAEAKEQ  207 (680)
T ss_pred             HHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccceeecCCHHHHCCCCHHHHHHHHHHHHhcCC
Confidence                         4444556777788777 99999999999999999999999999999999999876            


Q ss_pred             ------------------------------HHHHhhCC-----CCCCcHHHHHHHHHHHHHHHhhh--------------
Q 018718          128 ------------------------------AYVTCSFS-----GDLDNTSIIDQILKLRLEKAKLL--------------  158 (351)
Q Consensus       128 ------------------------------a~~~ra~~-----~~~~N~~il~eil~LR~e~A~LL--------------  158 (351)
                                                    |+.+|++.     +..+|.+++.+|+++|+|+|+||              
T Consensus       208 ~g~~~tl~~p~~~p~L~~~~drelRk~~y~A~~~r~~~~~~~~~~~dN~~il~eil~lR~e~AkLLGy~syAd~~L~~km  287 (680)
T PRK10911        208 EGYLLTLDIPSYLPVMTYCDNQALREEMYRAYSTRASDQGPNAGKWDNSEVMEEILALRHELAQLLGFENYADKSLATKM  287 (680)
T ss_pred             CCeEEEeecCcHHHHHhhCCCHHHHHHHHHHHHhhhhhccCcccccchHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHh
Confidence                                          33334432     23589999999999999999999              


Q ss_pred             ------------------------hHHHHHHHHHhcCCCCCCccccCCHHHHHHHhhhhhcCCCC---------------
Q 018718          159 ------------------------DIEDLRSFCKGQGALEPREVNHWDINFWSERLCESKYDINE---------------  199 (351)
Q Consensus       159 ------------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~~~~~~~~~~vd~---------------  199 (351)
                                              |+++|+++|++..  +...++|||+.||++++++++|++|+               
T Consensus       288 a~spe~V~~fL~~l~~~~~p~a~~El~~L~~~~k~~~--g~~~L~pWD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~g  365 (680)
T PRK10911        288 AENPQQVLDFLTDLAKRARPQGEKELAQLRAFAKAEF--GVDELQPWDIAYYSEKQKQHLYSISDEQLRPYFPENKAVNG  365 (680)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCCCCChhhHHHHhhHHHHhhcCCCHHHhcccCcHHHHHHH
Confidence                                    7778888887653  45679999999999999999999987               


Q ss_pred             --------------------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccce
Q 018718          200 --------------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPV  259 (351)
Q Consensus       200 --------------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~  259 (351)
                                          +|||||++|+|+|++|++||+||+|+|+|+| |++||||+.+.++..  ..+|+ +|.|+
T Consensus       366 l~~~~~~Lfgi~~~e~~~~~vwh~dV~~~~v~d~~~~~iG~~y~D~~~R~g-K~~ga~~~~~~~~~~--~~~g~-~~~Pv  441 (680)
T PRK10911        366 LFEVVKRIYGITAKERKDVDVWHPDVRFFELYDENNELRGSFYLDLYAREN-KRGGAWMDDCVGQMR--KADGS-LQKPV  441 (680)
T ss_pred             HHHHHHHHcCeeEEecCCCCCCCCcceEEEEEeCCCCeEEEEEeeccCCCC-CCCCccccccccccc--cCCCc-eeCCe
Confidence                                9999999999999888999999999999999 688889988765433  34665 68999


Q ss_pred             EEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc----
Q 018718          260 VHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT----  329 (351)
Q Consensus       260 ~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~----  329 (351)
                      ++++|||++|++++|+||+|+  +||||||||++|++|++++|++++||++++|||||+|||+||+|    +||+.    
T Consensus       442 ~~l~~Nf~~p~~~~p~LL~~~~v~tlfHEfGHalH~~ls~~~~~~~sGt~~~~~D~vE~pS~~~E~~~~~~~vL~~~a~H  521 (680)
T PRK10911        442 AYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGH  521 (680)
T ss_pred             EEEECCCCCCCCCCCcccCHHHHHHHHHHHhHHHHHHHhCCCcCcCCCcCCCCchHhhccHHHHHHHhcCHHHHHHHHHH
Confidence            999999999999999999999  99999999999999999999999999899999999999999554    67765    


Q ss_pred             -ccCCCCcccc--chhhccccccc
Q 018718          330 -QVIPTIPEDG--YFCSFRHKFAS  350 (351)
Q Consensus       330 -~~~~~~~~d~--~~~~~~~~~~~  350 (351)
                       .|++|||+++  .++..|+.++|
T Consensus       522 ~~tgeplp~~l~~~l~~~~~~~~~  545 (680)
T PRK10911        522 YETGEPLPKELLDKMLAAKNYQAA  545 (680)
T ss_pred             hcCCCCCCHHHHHHHHHhhhhhhH
Confidence             7999999999  67777777765


No 5  
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=5.8e-54  Score=430.48  Aligned_cols=309  Identities=23%  Similarity=0.375  Sum_probs=253.5

Q ss_pred             EeCCCHHHHHHHHHHHHHHHhhhhhcccchhhHHHHHHHh--------------------------hccCCCHHHHHHHH
Q 018718           33 SICPSPASAATSMAATGAVVDGLSLEDNPLLQDFEIIGHL--------------------------KSVKNNPDLRSAYE   86 (351)
Q Consensus        33 ~~~~~~~~r~as~~a~~~~~~~~~~~~~~l~~v~~~~~~l--------------------------~~~~~~~~~R~~~~   86 (351)
                      -.||+++.+.|+.+|..++. .+-...+--..+|..+..+                          .|++.+++.|+.+-
T Consensus       106 ~aHPd~~fv~aAe~a~~~~~-e~ve~LNTn~~LY~~Lk~~l~~~~~l~~~d~e~~v~~lll~DFE~sGIhL~~~kr~kfv  184 (704)
T KOG2090|consen  106 QAHPDPEFVEAAEEACRSMF-ELVESLNTNVALYQKLKKVLQDSSRLDDLDPETYVARLLLDDFEKSGIHLDPEKREKFV  184 (704)
T ss_pred             HhCCCHHHHHHHHHHHHHHH-HHHHHhccCHHHHHHHHHHhcCcccccccCHHHHHHHHHHHHHHhhcccCCHHHHHHHH
Confidence            67999999999999998443 4444445555555555443                          37788888999988


Q ss_pred             HHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH-------------------------------HHHHhhC
Q 018718           87 EVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL-------------------------------AYVTCSF  134 (351)
Q Consensus        87 ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l-------------------------------a~~~ra~  134 (351)
                      +++ +|..|+.+|.+|...-.+.  .+.-..+-|-+.-+-.+                               .|. ...
T Consensus       185 ~Ls~eI~~lg~~F~~n~~~~P~~--~l~~s~~~l~~~~~~~~~~k~knF~~~t~~~~i~~ll~~~~d~~vRk~vY~-~~~  261 (704)
T KOG2090|consen  185 QLSSEIFDLGREFQNNTDRFPDN--KLPNSLERLPFSKKNFIESKRKNFDQATDPYIIYGLLSSSEDESVRKMVYN-TFH  261 (704)
T ss_pred             HHhHHHHHHHHHHHhccccCCcc--ccchhhhhccCcchhhHHHHhcCCCcCCCCeeeehhhhcCCCHHHHHHHHH-Hhc
Confidence            887 9999999999998743211  12222222222111111                               111 122


Q ss_pred             CCCCCcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCC
Q 018718          135 SGDLDNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPR  176 (351)
Q Consensus       135 ~~~~~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~  176 (351)
                      .....+..+|++++..|+|+|+|.                                      |++.|..+|++++..+..
T Consensus       262 ~~~~~q~~~L~~Li~sRheLa~l~G~~SfA~~al~~~~a~~pk~V~~Fl~~Ls~k~~~~~~kel~~i~~mk~k~~~~~~~  341 (704)
T KOG2090|consen  262 SPSDIQVKLLEHLISSRHELAKLVGKSSFAHRALEGKLAKNPKTVRSFLEELSEKLSEKTDKELAVIRDMKKKENNNNNA  341 (704)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHhCcchHHHHHhhccccCChHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhcCCCCC
Confidence            245789999999999999999999                                      788889998887654567


Q ss_pred             ccccCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcCCCCCcc
Q 018718          177 EVNHWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDSSGSPIA  219 (351)
Q Consensus       177 ~l~pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~~g~~lG  219 (351)
                      ++.|||..||..++|++.++++.                                     +|||||+...|.+++.+.+|
T Consensus       342 e~~~WD~~YYT~~~r~~~~~~~~~~~~~fFslg~~ieGLs~L~~~LyGirl~~~~l~pGE~WhpdV~KL~vv~E~eg~lG  421 (704)
T KOG2090|consen  342 EIEPWDRPYYTSMYRQSNNSLNPSIYSPFFSLGSCIEGLSTLFQRLYGIRLIPEPLAPGEVWHPDVRKLNVVHEQEGLLG  421 (704)
T ss_pred             ccccccchhhhhHhhcccCCCCcceeeccccHHHHHHHHHHHHHHHhCeeeeccCCCCccccCchhhhheeecCCCCcee
Confidence            89999999999999999988876                                     99999999999999889999


Q ss_pred             ccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhc
Q 018718          220 YFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       220 ~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~  297 (351)
                      |+|+|+|.|+||..|.|++++.++|..   +||+ +|.||++|+|||+.+....|++|++.  +|||||||||||+||++
T Consensus       422 ~IY~Dlf~R~gK~~g~aHFTIr~sr~l---~Dg~-yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSmLGr  497 (704)
T KOG2090|consen  422 YIYCDLFERPGKTVGDAHFTIRGSRQL---SDGT-YQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGR  497 (704)
T ss_pred             EEEEEeeccCCCCCCCceEEeeccccC---CCCC-eeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHHhcc
Confidence            999999999998888899999988653   7898 69999999999999999999999999  99999999999999999


Q ss_pred             CCccccccCCCccceeeeechhhch----hhhhhhc-----ccCCCCcccc--chhhccccccc
Q 018718          298 QDEGLVAGSRGIEWDAVELSTNNSW----KIGVTRT-----QVIPTIPEDG--YFCSFRHKFAS  350 (351)
Q Consensus       298 ~~y~~~sGt~~v~~DfvE~PSq~~e----~~~~l~~-----~~~~~~~~d~--~~~~~~~~~~~  350 (351)
                      |+||+++|||| |.||+|+||.+||    +|+|++.     .+++|||+||  .+|.+++.||+
T Consensus       498 T~YQhvtGTRc-~tDfaEiPSiLMEyFa~D~rVl~~~aRhy~t~e~l~~~mv~~l~~s~n~~Aa  560 (704)
T KOG2090|consen  498 THYQHVTGTRC-PTDFAEIPSILMEYFANDYRVLRFFARHYSTGEPLPEDMVNRLCESRNSFAA  560 (704)
T ss_pred             chhccccCccc-chhHhhhhHHHHHHHhcchHHHHHHHHHcCCCCCCCHHHHHHHHHHhccchh
Confidence            99999999986 6999999999984    6788865     7999999999  79999999986


No 6  
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00  E-value=3.2e-52  Score=422.97  Aligned_cols=209  Identities=28%  Similarity=0.464  Sum_probs=179.4

Q ss_pred             CCCcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCcc
Q 018718          137 DLDNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREV  178 (351)
Q Consensus       137 ~~~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l  178 (351)
                      ..+|.+++.+|+++|+++|+||                                      |+++|++++++.+......|
T Consensus        23 ~~~n~~il~~l~~lR~e~A~llGy~sya~~~l~~~ma~spe~v~~fL~~l~~~~~p~~~~e~~~L~~~~~~~~~~~~~~L  102 (458)
T cd06457          23 SAEQLELLEELLSSRAELAQLLGFESYAHRALRGKMAKNPENVMEFLTELSEKLRPRAEEELAVLRDMKRKVLGKPLSSL  102 (458)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCC
Confidence            4689999999999999999999                                      77788888876531124579


Q ss_pred             ccCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcCCCCCcccc
Q 018718          179 NHWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDSSGSPIAYF  221 (351)
Q Consensus       179 ~pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~~g~~lG~~  221 (351)
                      .|||+.||.+++|+++|++|+                                     +|||||++|+|+|++|++||+|
T Consensus       103 ~pwD~~yy~~~~~~~~~~~d~~~l~~YFpl~~vl~gl~~~~~~lfgi~~~~~~~~~~~~Wh~dV~~~~v~d~~~~~lG~~  182 (458)
T cd06457         103 APWDRDYYTGQYRQSRFDSEPSNLSPYFSLGTVMEGLSRLFSRLYGIRLVPVPLAPGEVWHPDVRKLDVVHEDEGLLGVI  182 (458)
T ss_pred             CHHHHHHHHHHHHHhhcCCChHHhcccCcHHHHHHHHHHHHHHHhCeEEEecCCCCCCCcCccceEEEEEeCCCCEEEEE
Confidence            999999999999999999886                                     8999999999999888899999


Q ss_pred             ccCccCCCCCCCCCceeeecccccccccCC----------CCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhH
Q 018718          222 YFDPYSCPSEKKGSARICEVFSQSRVLTLD----------GATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGH  289 (351)
Q Consensus       222 YlDl~~R~gKk~~ga~~~~~~~~~~~~~~d----------G~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGH  289 (351)
                      |||+|+|+| |++||||..++++..  ..+          |+ +|+|+++|+|||++|++++|+||+|+  .||||||||
T Consensus       183 YlDl~~R~~-K~~~a~~~~~~~~~~--~~~~~~~~~~~~~g~-~q~Pv~~lvcnf~~p~~~~p~lL~~~~v~TLfHEfGH  258 (458)
T cd06457         183 YCDLFSRPG-KPPGAAHFTIRCSRR--LDDDDVAERGGRGGT-YQLPVVALMCNFPPPSPSGPTLLSPHEVETLFHEMGH  258 (458)
T ss_pred             EeecCCCCC-CCCCcceeccccccc--cCcccccccccCCCc-eeCCeEEEECCCCCCCCCCCCCcCHHHHHHHHHHHhH
Confidence            999999999 577888877665322  123          55 68999999999999999999999999  999999999


Q ss_pred             HHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718          290 ALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS  350 (351)
Q Consensus       290 alH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~  350 (351)
                      +||++|++++|+.++||+ +||||||+||||||.|    .+|+.     .+++|||+++  .+...|+.++|
T Consensus       259 alH~~ls~~~~~~~sgt~-~~~d~vE~pS~~~E~~~~~~~~L~~~a~h~~t~e~ip~~l~~~l~~~~~~~~~  329 (458)
T cd06457         259 AMHSMLGRTEYQHVSGTR-CATDFVEVPSILMEYFASDPRVLKLFARHYSTGEPLPEEMLARLLASKNSFAA  329 (458)
T ss_pred             HHHHHHcCCCccccCCCC-CCcchhhcCHHHHHHHHhhHHHHHHHhcccCCCCcCcHHHHHHHHHhhhhhHH
Confidence            999999999999999986 7999999999999654    56543     6899999999  56666666543


No 7  
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00  E-value=1.3e-49  Score=399.89  Aligned_cols=203  Identities=40%  Similarity=0.710  Sum_probs=179.2

Q ss_pred             CcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCcccc
Q 018718          139 DNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREVNH  180 (351)
Q Consensus       139 ~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l~p  180 (351)
                      +|.+++++|+++|+|+|++|                                      |++.|+++|++.+  +...|+|
T Consensus         1 ~N~~~l~~l~~lR~e~A~llG~~s~a~~~l~~~ma~sp~~v~~Fl~~l~~~~~p~~~~e~~~L~~~~~~~~--~~~~l~~   78 (422)
T cd06456           1 DNRPLIEEILALRAEKAKLLGFENYAEYSLADKMAKSPEAVLEFLEDLAPKAKPQAKKELAELQAFAKEEG--GEDELEP   78 (422)
T ss_pred             ChHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCCH
Confidence            69999999999999999999                                      8888999998876  4457999


Q ss_pred             CCHHHHHHHhhhhhcCCCC-----------------------------------ccCCceeEEEEEcCCCCCccccccCc
Q 018718          181 WDINFWSERLCESKYDINE-----------------------------------VWNDDVRFYRVKDSSGSPIAYFYFDP  225 (351)
Q Consensus       181 WD~~Yy~~~~~~~~~~vd~-----------------------------------vWh~dV~~~~V~d~~g~~lG~~YlDl  225 (351)
                      ||+.||.+++|+++|++|+                                   +|||||++|+|+|.+|++||+||+|+
T Consensus        79 wD~~yy~~~~~~~~~~~d~~~l~~YFpl~~v~~gl~~~~~~lfgi~~~~~~~~~~Wh~dV~~~~v~d~~~~~lG~~ylDl  158 (422)
T cd06456          79 WDWAYYSEKLRKEKYDLDEEELRPYFPLEKVLDGLFELAERLYGITFKERTDLPVWHPDVRVYEVFDKDGSHIGLFYLDL  158 (422)
T ss_pred             HHHHHHHHHHHHHhcCCCHHHHhccCCHHHHHHHHHHHHHHHcCeeEEECCCCCCCCCCceEEEEEeCCCCeEEEEEEec
Confidence            9999999999999988875                                   99999999999998889999999999


Q ss_pred             cCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCcccc
Q 018718          226 YSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLV  303 (351)
Q Consensus       226 ~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~  303 (351)
                      |+|+| |++||||+..+++..    +|. +|.|+++++|||++|++++|+||+++  .|||||||||+|++|++++|+.+
T Consensus       159 ~~R~~-K~~ga~~~~~~~~~~----~~~-~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l  232 (422)
T cd06456         159 YAREG-KRGGAWMNNLRSQSK----NGL-GQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSL  232 (422)
T ss_pred             cCCCC-CCCCceeeccccccc----CCC-CCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCcccc
Confidence            99999 577889988765321    454 68999999999999999999999999  99999999999999999999999


Q ss_pred             ccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718          304 AGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS  350 (351)
Q Consensus       304 sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~  350 (351)
                      +|++ +++||||+|||+||.+    .+|+.     .+++|||+++  .+...|+.++|
T Consensus       233 ~~~~-~~~d~~E~pS~~~E~~~~d~~vL~~~s~h~~t~~~lp~~l~~~~~~~~~~~~~  289 (422)
T cd06456         233 GGTN-VEWDFVELPSQFMENWAWEPEVLKLFAKHYETGEPLPDELIDKLLAARNFNSG  289 (422)
T ss_pred             CCCc-CchhHhhccHHHHHHHhcCHHHHHHHHhhcCCCCCCCHHHHHHHHHHhhHhHH
Confidence            9985 7899999999999654    56655     6999999999  56676666654


No 8  
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=100.00  E-value=1.3e-46  Score=383.27  Aligned_cols=243  Identities=31%  Similarity=0.534  Sum_probs=194.4

Q ss_pred             HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh-----
Q 018718           90 ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL-----  158 (351)
Q Consensus        90 ~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL-----  158 (351)
                      +|.+|+++|++|+.++...+      ..-|.-.|+..+      ++.++   +..+|.+++.+|+++|+++|+++     
T Consensus         5 ~~~~~~~~f~~~~~~~~~~~------~~~l~~~~d~~~Rk~~~~a~~~~---~~~~n~~~l~~ll~~R~~~A~l~Gy~s~   75 (472)
T cd06455           5 ELSELCQEFSKNLNEKYPDY------FPVMKYAKNAETRKAMWVAFQNR---GGPENVPLLEELVALRHELARLLGYKSH   75 (472)
T ss_pred             HHHHHHHHHHHhhhhcCccH------HHHHhcCCCHHHHHHHHHHHHhh---cchhHHHHHHHHHHHHHHHHHHcCCCCH
Confidence            67899999999999843221      223444433322      33333   24799999999999999999999     


Q ss_pred             ---------------------------------hHHHHHHHHHhcCCCCCC--ccccCCHHHHHHHhhhhhcCCCC----
Q 018718          159 ---------------------------------DIEDLRSFCKGQGALEPR--EVNHWDINFWSERLCESKYDINE----  199 (351)
Q Consensus       159 ---------------------------------El~~L~~~k~~~~~~~~~--~l~pWD~~Yy~~~~~~~~~~vd~----  199 (351)
                                                       |++.|.++|++..  +.+  +++|||+.||.++++++.+++|+    
T Consensus        76 a~~~l~~~m~~s~e~v~~fl~~l~~~~~p~~~~e~~~l~~~k~~~~--~~~~~~l~~wD~~y~~~~~~~~~~~~~~~~~~  153 (472)
T cd06455          76 ADYVLEDRMAKSPETVRDFLEDLSKKLKPLAEKELAELLELKKKEV--PEAGDRIYPWDLAYYMERVEEEKYDVDQEKIR  153 (472)
T ss_pred             HHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCCCHhhHHHHHHHHHHHhcCCCHHHHh
Confidence                                             5667888887653  222  69999999999999887666553    


Q ss_pred             -------------------------------ccCCceeEEEEEcCC-CCCccccccCccCCCCCCCCCceeeeccccccc
Q 018718          200 -------------------------------VWNDDVRFYRVKDSS-GSPIAYFYFDPYSCPSEKKGSARICEVFSQSRV  247 (351)
Q Consensus       200 -------------------------------vWh~dV~~~~V~d~~-g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~  247 (351)
                                                     +|||||++|+|+|++ |+++|++|+|+|+|+| |++||||...++..  
T Consensus       154 ~yf~~~~~~~~i~~~~~~lfg~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~g~~ylD~~~R~g-K~~Ga~~~~~~~~~--  230 (472)
T cd06455         154 EYFPLEVVIEGMLDIYQRLFGLRFEEVPDASVWHEDVRLYSVWDADTGEFLGYFYLDLHPREG-KYGHAANFGLQPGF--  230 (472)
T ss_pred             ccCcHHHHHHHHHHHHHHHhCeEEEeCCCCCCCCCcceEEEEEECCCCCEEEEEEeecCCCCC-CCCCccccccccce--
Confidence                                           899999999999975 7899999999999999 57788987765432  


Q ss_pred             ccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh-
Q 018718          248 LTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI-  324 (351)
Q Consensus       248 ~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~-  324 (351)
                      ...+|. +|.|+++++|||++|.+++|+||+++  .||||||||++|++|++++|+.++|++ +++||||+|||+||.+ 
T Consensus       231 ~~~~g~-~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~~~~~~sg~~-~~~d~aE~pS~~~E~~~  308 (472)
T cd06455         231 LLPDGS-RQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRTKYARFSGTR-VERDFVEAPSQMLENWC  308 (472)
T ss_pred             ecCCCC-EeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCCCccccCCCc-CChhhhhcchHHHHHHh
Confidence            235665 68899999999999999999999999  999999999999999999999999985 7899999999999654 


Q ss_pred             ---hhhhc-----ccCCCCcccc--chhhccccc
Q 018718          325 ---GVTRT-----QVIPTIPEDG--YFCSFRHKF  348 (351)
Q Consensus       325 ---~~l~~-----~~~~~~~~d~--~~~~~~~~~  348 (351)
                         ++++.     .+++|||+++  .+...|+.+
T Consensus       309 ~~~~~l~~l~~h~~t~e~i~~~li~~~~~~~~~~  342 (472)
T cd06455         309 WEPEVLKRLSKHYKTGEKIPDELIERLIASRHFN  342 (472)
T ss_pred             cCHHHHHHHhhccCCCCCCCHHHHHHHHHhhhhh
Confidence               45644     5899999998  445554433


No 9  
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00  E-value=2.7e-43  Score=356.95  Aligned_cols=203  Identities=35%  Similarity=0.588  Sum_probs=163.8

Q ss_pred             CcHHHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCC-ccc
Q 018718          139 DNTSIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPR-EVN  179 (351)
Q Consensus       139 ~N~~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~-~l~  179 (351)
                      +|..++.+++++|+++|+++                                      +++.|++++++..  +.+ +|+
T Consensus        30 ~n~~~l~~l~~~R~~~A~llGy~s~~~~~l~~~m~~~~~~v~~fL~~~~~~~~p~~~~~~~~l~~~~~~~~--g~~~~l~  107 (458)
T PF01432_consen   30 DNAAILNELLKLRNELAKLLGYPSYAEYSLQDKMAKNPENVLDFLDELVKKLKPLLERELELLKKLKKKRL--GLEKKLR  107 (458)
T ss_dssp             HHHHHHHHHHHHHHHHHHHTT-SSHHHHHHTTSTTTSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT--T-SSSBB
T ss_pred             hHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccc
Confidence            79999999999999999999                                      6777777776653  344 899


Q ss_pred             cCCHHHHHHHhhhhhcCCCC-------------------------------------ccCCceeEEEEEcC--CCCCccc
Q 018718          180 HWDINFWSERLCESKYDINE-------------------------------------VWNDDVRFYRVKDS--SGSPIAY  220 (351)
Q Consensus       180 pWD~~Yy~~~~~~~~~~vd~-------------------------------------vWh~dV~~~~V~d~--~g~~lG~  220 (351)
                      |||++||.+++++..+++|+                                     +|||||++|+|+|.  ++.++|+
T Consensus       108 ~wD~~y~~~~~~~~~~~~~~~~~~~~f~~~~~~~g~~~l~~~lfgi~~~~~~~~~~~~w~~dv~~~~v~d~~~~~~~ig~  187 (458)
T PF01432_consen  108 PWDVAYYMEQYRQERYDLDEEELSPYFPLEEVLEGLFELAERLFGIRFEEVPDADGEVWHPDVRKFEVWDEIFSGMFIGY  187 (458)
T ss_dssp             GGGHHHHHHHHHHHHTSSSHHHHGGG-BHHHHHHHHHHHHHHHHTEEEEECTCGGHHHSSTT-EEEEEEETHTTHCECEE
T ss_pred             cchhHHHhhHHHHHHhccchhhcCCcCcHHHHHHHHHHHHHHHhcEEEEecccccccceecceeEEEEeehhhcccchhc
Confidence            99999999999998888775                                     69999999999997  6889999


Q ss_pred             cccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcC
Q 018718          221 FYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQ  298 (351)
Q Consensus       221 ~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~  298 (351)
                      +|+|+|+|+| |++||||+..+..  +  .+|+ +|.|+++++|||++|.+++|+||+++  .|||||||||+|++|+++
T Consensus       188 ~ylDl~~R~g-K~~ga~~~~~~~~--~--~~~~-~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tLfHE~GHa~H~~ls~~  261 (458)
T PF01432_consen  188 IYLDLYPRPG-KRSGAFCFTLRPS--R--SDGE-RQLPVPYIFCNFTGPSAGKPSLLSHDDVETLFHEFGHAMHSLLSRT  261 (458)
T ss_dssp             EEEEES--TT-S-SS-EEEEEEC---B--TTST-CECEEEEEEEEE-S-BTTC--B-SHHHHHHHHHHHHHHHHHHHCCC
T ss_pred             ccccchhcCC-CCCCceeCCccCc--c--cccc-CCCCceEEEecCCCCCCCCCCccChhhHHHHHHHHhHHHHHHHhcc
Confidence            9999999999 5777888776542  1  4676 79999999999999999999999999  999999999999999999


Q ss_pred             CccccccCCCccceeeeechhhchhh----hhhhc-----ccCCCCcccc--chhhccccccc
Q 018718          299 DEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVIPTIPEDG--YFCSFRHKFAS  350 (351)
Q Consensus       299 ~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~~~~~~d~--~~~~~~~~~~~  350 (351)
                      +++.++|+ .+|+||||+||||||.+    .++..     .++++||+++  .+..+|+.+++
T Consensus       262 ~~~~~sg~-~~~~d~aE~~S~~~E~~~~~~~~l~~~~~~~~~~~~i~~~~l~~~~~~~~~~~~  323 (458)
T PF01432_consen  262 KYQHLSGT-RVPMDFAEFPSQFMENWLWDPLVLKAFSRHYETGEPIPEELLEDLIASRNFFAA  323 (458)
T ss_dssp             SSGGGSTT-SS-CHHCHHHHHHHHHHGGCHHHHHHH-BSTTTHHHHHHHHHHHHHHTTTTTHH
T ss_pred             ccccccCC-chhHHHHhcchHHHHHhhhchhhhhhhccChhhhhhHHHHHHHhhhhhhhhhHH
Confidence            99999999 57899999999999664    45543     5889999998  45556665543


No 10 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=99.93  E-value=2.7e-26  Score=226.17  Aligned_cols=171  Identities=20%  Similarity=0.270  Sum_probs=122.2

Q ss_pred             HHHHHHHHHHHHHHhhh--------------------------------------hHHHHHHHHHhcCCCCCCccccCCH
Q 018718          142 SIIDQILKLRLEKAKLL--------------------------------------DIEDLRSFCKGQGALEPREVNHWDI  183 (351)
Q Consensus       142 ~il~eil~LR~e~A~LL--------------------------------------El~~L~~~k~~~~~~~~~~l~pWD~  183 (351)
                      ++|++|+.+|+|+|+++                                      +++.|+++++++.  +...+.||..
T Consensus         2 ~~l~~lv~lr~e~A~~lGy~~~~d~~l~~~~~~~~e~v~~f~~~l~~~~~p~~~~~~~~l~~~~~~~~--~~~~~~~~~~   79 (365)
T cd06258           2 ALLEELVSLRNQLARLLGYENFADYKLALQEAKSPETVEGFFEELKRKLRPLLAKLREEISAAKQKEE--EIYGELPARY   79 (365)
T ss_pred             hHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCcccccccc
Confidence            68999999999999999                                      4555555554432  2345778811


Q ss_pred             HHHHHHhhhhhcCCCCccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEe
Q 018718          184 NFWSERLCESKYDINEVWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMV  263 (351)
Q Consensus       184 ~Yy~~~~~~~~~~vd~vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv  263 (351)
                      . .+.....+.|.++.+|.-             ....+|+|+++|+| |++|+||+...+..       + ++.|.++++
T Consensus        80 ~-~~~~~~~~~f~~~~~~~~-------------~~~~~~lD~~~R~g-K~~~a~~~~~~~~~-------~-~~~~~~~i~  136 (365)
T cd06258          80 D-VDSALLKEFFDAERPWEG-------------ALPFFYLDLYDRKG-KYPHGFCTGLDPGF-------N-RQDKDVRIL  136 (365)
T ss_pred             c-CCHhhccCcCChHHHHHH-------------HHHhHhccCCCCCC-CCCCCeeccccCCC-------C-CCCCeEEEE
Confidence            1 011111112232222221             11222999999999 56677886654311       1 478999999


Q ss_pred             eccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh----hhhhc-----ccC
Q 018718          264 CNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI----GVTRT-----QVI  332 (351)
Q Consensus       264 ~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~----~~l~~-----~~~  332 (351)
                      |||++|++++|+|++++  .||||||||++|+++++++|+.++|+ ++++||+|+||++||.+    .+|+.     .++
T Consensus       137 ~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~-~~~~~~~E~~S~~~E~~~~~~~~L~~~~~~~~~~  215 (365)
T cd06258         137 ANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERT-PTSTDFAEAQSMFLESFATDPEWLERYARHYQGG  215 (365)
T ss_pred             ccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCC-CCCccHHhccHHHHHHHHCCHHHHHHHhhhcCCC
Confidence            99999999999999999  99999999999999999999988997 47899999999999754    45543     344


Q ss_pred             CCCcccc
Q 018718          333 PTIPEDG  339 (351)
Q Consensus       333 ~~~~~d~  339 (351)
                       |+|++.
T Consensus       216 -~~~~~~  221 (365)
T cd06258         216 -VVPDEL  221 (365)
T ss_pred             -CCcHHH
Confidence             788777


No 11 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=99.87  E-value=2.9e-22  Score=208.09  Aligned_cols=202  Identities=16%  Similarity=0.152  Sum_probs=126.0

Q ss_pred             HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHH----------HH-----HHHHhhCCCCCCcHHHHHHHHHHHHHH
Q 018718           90 ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPAT----------AL-----AYVTCSFSGDLDNTSIIDQILKLRLEK  154 (351)
Q Consensus        90 ~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~----------~l-----a~~~ra~~~~~~N~~il~eil~LR~e~  154 (351)
                      +-.++++.|.+.+...+     |.++.+++. +++-          +.     ++.++......+|..++.+++++|++.
T Consensus       120 ~~~~~~~~y~~l~~~~~-----~~~~Ge~~~-l~~~~~~l~~~dr~~Rk~a~~a~~~~~~~~~~~~~~il~~lv~~R~~~  193 (549)
T TIGR02289       120 KENILSTKYREIISNIS-----IIFEGEEKT-LSQLKPYLQDPNRSTRKKAWYARSEFFAVVEEELDRIYDELVQIRTQI  193 (549)
T ss_pred             HHHHHHHHHHHHHhccE-----EEECCEecc-HHHhhHHhhCCCHHHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHH
Confidence            44567788888777554     455555432 2221          11     233333334567889999999999999


Q ss_pred             Hhhh------------------------------------hHHHHHHHHHhcCCCCCCccccCCHHHHH--HHhhhhhcC
Q 018718          155 AKLL------------------------------------DIEDLRSFCKGQGALEPREVNHWDINFWS--ERLCESKYD  196 (351)
Q Consensus       155 A~LL------------------------------------El~~L~~~k~~~~~~~~~~l~pWD~~Yy~--~~~~~~~~~  196 (351)
                      |+++                                    -...+.+++++..  |.+.+.+||..|..  ... ...++
T Consensus       194 Akl~Gy~s~~e~~~~~~~r~~~~~e~v~~~l~~v~~~~~p~~~~~~~~k~k~l--G~~~l~~wD~~~~~~~~~~-~~~~~  270 (549)
T TIGR02289       194 ALNLGFSNYRDYMYKLKNRTDYSAEDCYKYRESILKYVVPLWTKLRKIKKKRL--GIKTLRPWDESAVFLDGNV-KPFGN  270 (549)
T ss_pred             HHHCCCCCHHHHHHHHhcccCCCHHHHHHHHHHHHHHhHHHHHHHHHHHHHhC--CCCcCChhhccCCCCCCCc-CCCCC
Confidence            9999                                    0112233344332  45679999987532  111 12234


Q ss_pred             CCCccCCceeEEEEEcCC-----CCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCC
Q 018718          197 INEVWNDDVRFYRVKDSS-----GSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVG  271 (351)
Q Consensus       197 vd~vWh~dV~~~~V~d~~-----g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~  271 (351)
                      .+++..--.+.|...+..     ..+++..|+|+++|+| |++||||+.++.           ...|.  ++|||++.  
T Consensus       271 ~e~~~~~~~~~~~~l~~~~~e~~~~~~~~~~iD~~~r~g-K~~Gayc~~~~~-----------~~~P~--I~~Nf~~t--  334 (549)
T TIGR02289       271 VDFLLEKAEKMYKELSLEFYEFFNFMREKNLLDLESRKG-KAAGGYCTYLPK-----------YKAPF--IFSNFNGT--  334 (549)
T ss_pred             HHHHHHHHHHHhcCCCcHHHHHHHHHHhcCccccCCCCC-CCCCcccCCCCC-----------CCCcE--EEEeCCCC--
Confidence            333222112333333321     2257788999999999 688889877532           23674  46999871  


Q ss_pred             CCCCCCCCc-hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718          272 DKPSLMTFS-ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI  324 (351)
Q Consensus       272 ~~P~LL~~~-~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~  324 (351)
                            ..| .||+||||||+|+++++..  .....+..|+|++|+|||+||.+
T Consensus       335 ------~~dv~TL~HElGHa~H~~~s~~~--~~~~~~~~~~~~aE~aS~~~E~l  380 (549)
T TIGR02289       335 ------SGDIDVLTHEAGHAFHVYESRKD--LLPEYRWPTYEAAELASMSMELL  380 (549)
T ss_pred             ------hhHHHHHHHHhhHHHHHHHhcCC--cccccccCcchhhhhhHHHHHHH
Confidence                  344 9999999999999999852  22333345799999999999764


No 12 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=99.87  E-value=2.7e-23  Score=207.91  Aligned_cols=164  Identities=17%  Similarity=0.136  Sum_probs=114.8

Q ss_pred             HHHhhCCCCCCcHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHHhcCCCCCC
Q 018718          129 YVTCSFSGDLDNTSIIDQILKLRLEKAKLL--------------------------------DIEDLRSFCKGQGALEPR  176 (351)
Q Consensus       129 ~~~ra~~~~~~N~~il~eil~LR~e~A~LL--------------------------------El~~L~~~k~~~~~~~~~  176 (351)
                      +.++......+|.+++.+++++|.+.|+++                                -+..+.+.+++..  +.+
T Consensus        58 ~~~~~~~~~~~~~~~l~~lv~~r~~~A~~~Gy~~~~~~~l~~~~~~~~~~~~l~~v~~~~~p~~~~~~~~~~~~l--g~~  135 (427)
T cd06459          58 LYKAYEKYENTLAAILNTLVKLRLTLAKLRGYDSYLEAALFNNNIPEDVYDFLIAVVKENVPLLHRYLKLKKKLL--GLD  135 (427)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHhhccCCCCHHHHHHHHHHHHHhcHHHHHHHHHHHHHh--CCC
Confidence            333333234678999999999999999999                                1222223333322  356


Q ss_pred             ccccCCHHHHHHHhhhhhcCCCC----------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccc
Q 018718          177 EVNHWDINFWSERLCESKYDINE----------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSR  246 (351)
Q Consensus       177 ~l~pWD~~Yy~~~~~~~~~~vd~----------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~  246 (351)
                      .+.|||+.|+..+.++..|++++          .|++++.  ++.   +..++..|+|+++|+| |++||||+...+   
T Consensus       136 ~l~~wD~~~~~~~~~~~~~~~~~~~~~v~~~~~~l~~~~~--~~~---~~~~~~~~iD~~~r~g-K~~gaf~~~~~~---  206 (427)
T cd06459         136 KLRPYDLYAPLVSGNPPKYTYEEAKELVLEALSPLGPEYA--EFA---KRAFEERWIDVEPRKG-KRSGAYCTGLPP---  206 (427)
T ss_pred             cCcHhhcCCCCCCCCCCcCcHHHHHHHHHHHHHccCHHHH--HHH---HHHhhCCCeeccCCCC-CCCCeecCCCCC---
Confidence            79999999987777666777776          3443321  111   2355677999999999 677888866421   


Q ss_pred             cccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718          247 VLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI  324 (351)
Q Consensus       247 ~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~  324 (351)
                              ...|  +++|||.+         +++  .||||||||++|.++++..++ +.+. ++|+||+|+|||+||.+
T Consensus       207 --------~~~p--~i~~n~~~---------~~~~v~tl~HE~GHa~h~~~~~~~~~-~~~~-~~~~~~~E~~S~~~E~~  265 (427)
T cd06459         207 --------GKHP--FILMNFNG---------TLDDVFTLAHELGHAFHSYLSRDNQP-YLYS-DYPIFLAEIASTFNELL  265 (427)
T ss_pred             --------CCCC--eEEecCCC---------ChhhHHHHHHHhhHHHHHHHHccCCC-cccC-CCCchhhHHHHHHHHHH
Confidence                    1235  56899976         345  999999999999999987554 3333 47899999999999765


No 13 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=99.83  E-value=4.1e-21  Score=200.95  Aligned_cols=274  Identities=14%  Similarity=0.137  Sum_probs=155.9

Q ss_pred             HHHHhcccccccccccccccceEE--EeC-CCHHHHHHHHHHHHHHHhhhhhcccc-hhhHHHHHHHh-hc-cCCCHH--
Q 018718            9 VVFLLLPPSAVQSRFKSRNTFLCF--SIC-PSPASAATSMAATGAVVDGLSLEDNP-LLQDFEIIGHL-KS-VKNNPD--   80 (351)
Q Consensus         9 ~~~~~~p~~~~en~~~~~~~~l~f--~~~-~~~~~r~as~~a~~~~~~~~~~~~~~-l~~v~~~~~~l-~~-~~~~~~--   80 (351)
                      +.+.+.++-+.+............  .+. .|++.++...++.. ++.++...... -..++.+-... .. ....++  
T Consensus        50 l~~~l~~~e~~~~~~~~l~~Ya~l~~s~d~~d~~~~~~~~~~~~-l~s~~~~~~sf~~~eL~~l~~~~~~~~l~~~~~L~  128 (591)
T TIGR00181        50 FLEALALEEKILILLNRLYNYASMKLSTDVTDPEANAISQKLSN-LYTKVASATSFFEPEILEIEEKIIKEWLKDPEELA  128 (591)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhccccccCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCHHHHHHHHhcChhhH
Confidence            344555555555555444444444  444 55788998888888 88777666552 22233322111 11 112221  


Q ss_pred             -HHHHHHHH-------------------H-HHHHHHHHHHHHHHHhhcccccccc-ChhhhcCCCHHH----H-------
Q 018718           81 -LRSAYEEV-------------------R-ELGKLSIDFSENVLDATKRFETFVT-DKKEIQGLPATA----L-------  127 (351)
Q Consensus        81 -~R~~~~ei-------------------~-~ls~L~~~F~~Nl~~~~~~~~~v~~-~~~eL~GlP~~~----l-------  127 (351)
                       .+..+.++                   + .++..+..|.+..+.+. .|..+.. +++++ .+....    +       
T Consensus       129 ~y~~~l~~~~r~k~h~Ls~e~E~lLa~~~~~~~a~~~~f~~l~~ad~-~f~~~~~~~G~~~-~l~~~~~~~~l~~~~dr~  206 (591)
T TIGR00181       129 DYKRALEEIFRDKPHILSEEVEKLLSALSEVFGSPSDIYSTLTNADM-DFGSIEDYKGKKY-PITNSTYENFLQKNKDRE  206 (591)
T ss_pred             HHHHHHHHHHHhCCCCCCHHHHHHHHhCchhhccHHHHHHHHHhccC-CCCceecCCCcEE-eccHHHHHHHHhcCCCHH
Confidence             22222222                   1 34566677877655443 2222211 12221 122211    1       


Q ss_pred             -------HHHHhhCCCCCCcHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHH
Q 018718          128 -------AYVTCSFSGDLDNTSIIDQILKLRLEKAKLL--------------------------------DIEDLRSFCK  168 (351)
Q Consensus       128 -------a~~~ra~~~~~~N~~il~eil~LR~e~A~LL--------------------------------El~~L~~~k~  168 (351)
                             ++.++.......|..++.+++++|.+.|+++                                .+....++|+
T Consensus       207 ~Rk~a~~a~~~~~~~~~~~~a~~l~~~v~~r~~~A~lrGy~s~~~~~l~~~~i~~~v~~~l~~~v~~~~~~~~~y~~~k~  286 (591)
T TIGR00181       207 IRKKAYESFYKAYRKHKNTFAALYYGNVQKNVFLAKLRNYESYIDASLFSDEVPREVYDNLYDTIKKNAPVLQRYYKLRK  286 (591)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHhccCCCCHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence                   2333332234578999999999999999998                                1223333444


Q ss_pred             hcCCCCCCccccCCHHHHHHHhhh--hhcCCCC--------------ccCCceeEEEEEcCCCCCccccccCccCCCCCC
Q 018718          169 GQGALEPREVNHWDINFWSERLCE--SKYDINE--------------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEK  232 (351)
Q Consensus       169 ~~~~~~~~~l~pWD~~Yy~~~~~~--~~~~vd~--------------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk  232 (351)
                      +..  |.+++++||+..  ...+.  ..|++++              .|.+-|+.         ..+.=|+|+++|+| |
T Consensus       287 k~l--g~~~l~~~D~~~--~~~~~~~~~~~~~~a~~~v~~~~~~~g~~~~~~~~~---------~~~~~wiD~~~r~g-K  352 (591)
T TIGR00181       287 KVL--KLDKMEPYDLYL--PLVKEKNPKFSIEEAKELILKSLEPLGEEYIKILKR---------AFNERWVDYAENKG-K  352 (591)
T ss_pred             HHh--CCCcccHHHccC--CCCCCcCCcccHHHHHHHHHHHHhccCHHHHHHHHH---------HhhCCCeeecCCCC-C
Confidence            433  456799999872  22222  2233333              12222211         01112999999999 6


Q ss_pred             CCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCccccccCCCcc
Q 018718          233 KGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDEGLVAGSRGIE  310 (351)
Q Consensus       233 ~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y~~~sGt~~v~  310 (351)
                      ++||||+...            ...|.  ++|||++         +++  .||||||||++|+++++...+...+  .+|
T Consensus       353 ~~Ga~~~~~~------------~~~p~--il~N~~~---------~~~dv~TLaHElGHa~H~~~~~~~~~~~~~--~~~  407 (591)
T TIGR00181       353 RSGAYSIGGY------------KVKPY--ILMNWDG---------TLNSVFTLAHELGHSMHSYFSSKHQPYPNS--DYS  407 (591)
T ss_pred             CCCcccCCCC------------CCCCe--EEEecCC---------CcchHHHHHHHhhhHHHHHHHccCCCCccC--CCC
Confidence            7788875541            12454  4789987         555  9999999999999999765554554  478


Q ss_pred             ceeeeechhhchhh
Q 018718          311 WDAVELSTNNSWKI  324 (351)
Q Consensus       311 ~DfvE~PSq~~e~~  324 (351)
                      +|+||+||++||.+
T Consensus       408 ~~~aE~aS~~~E~l  421 (591)
T TIGR00181       408 IFYAEIASTFNELL  421 (591)
T ss_pred             chhhhHhHHHHHHH
Confidence            99999999999764


No 14 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=99.73  E-value=6.1e-19  Score=184.58  Aligned_cols=152  Identities=16%  Similarity=0.197  Sum_probs=97.7

Q ss_pred             HHHHHHHHHHHHHHHhhh-----------------------------hHHHHHH---HHHhcCCCCCCccccCCHHHHHH
Q 018718          141 TSIIDQILKLRLEKAKLL-----------------------------DIEDLRS---FCKGQGALEPREVNHWDINFWSE  188 (351)
Q Consensus       141 ~~il~eil~LR~e~A~LL-----------------------------El~~L~~---~k~~~~~~~~~~l~pWD~~Yy~~  188 (351)
                      ..++..+++.|...|++.                             .+..+++   .+++..  |.+.+++||..+...
T Consensus       223 a~~l~~~v~~~~~~a~~rgy~s~~e~~l~~~~i~~~~~~~l~~~v~~~~p~~~~y~~~k~~~l--gl~~l~~~D~~~p~~  300 (587)
T TIGR02290       223 AAILNALAGDRLTEYRLRGYDHPLEPRLLYNRIDQETLDAMLEAIKENYPLFRRYYKLKAKLL--GKEKLDFYDLYAPLG  300 (587)
T ss_pred             HHHHHHHHHHHHHHHHhhCCCCHHHHhhccCCCCHHHHHHHHHHHHHhhHHHHHHHHHHHHHh--CCCcccHHHcCCCCC
Confidence            346789999999999997                             1112222   222222  457799999865433


Q ss_pred             HhhhhhcCCCC----------ccCCceeEEEEEcCCCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccc
Q 018718          189 RLCESKYDINE----------VWNDDVRFYRVKDSSGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLP  258 (351)
Q Consensus       189 ~~~~~~~~vd~----------vWh~dV~~~~V~d~~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P  258 (351)
                      ...+..|.+++          .|.|+..-+..     .....-++|+++|+| |++||||+...+        +   ..|
T Consensus       301 ~~~~~~~~~e~a~~~v~~~~~~l~~e~~~~~~-----~~~~~~~iD~~~r~g-K~~Ga~~~~~~~--------~---~~p  363 (587)
T TIGR02290       301 DSSAKTYTFDEAKELVLEAFGKFSPEMADFAE-----KAFEEGWIDAEPRPG-KRGGAFCTGFPP--------S---KEP  363 (587)
T ss_pred             CCCCCeecHHHHHHHHHHHHHhcCHHHHHHHH-----HHHHcCCcccCCCCC-CCCCcccCCCCC--------C---CCC
Confidence            22234455543          33333321110     011222899999999 678888866421        1   236


Q ss_pred             eEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchhh
Q 018718          259 VVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWKI  324 (351)
Q Consensus       259 ~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~~  324 (351)
                        +++|||++.        ..+ .||||||||++|++++++..  ..++ ++++|+||+||||||.+
T Consensus       364 --~i~~N~~~~--------~~~v~TL~HE~GHa~H~~ls~~~~--~~~~-~~~~~~aE~~S~~~E~l  417 (587)
T TIGR02290       364 --RVLMNYDGS--------RRDVSTLAHELGHAYHSELAKDQP--LLNA-RYPMTLAETASIFAEML  417 (587)
T ss_pred             --EEEEecCCC--------chhHHHHHHHhhHHHHHHHHccCC--cccC-CCCchhhhHHHHHHHHH
Confidence              477999872        234 99999999999999998643  4443 47899999999999754


No 15 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=99.03  E-value=2.4e-10  Score=120.21  Aligned_cols=79  Identities=16%  Similarity=0.244  Sum_probs=58.3

Q ss_pred             cccCccCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhcCC
Q 018718          221 FYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTKQD  299 (351)
Q Consensus       221 ~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~~~  299 (351)
                      =++|+++|+| |++||++++..        +|   ..|.+.  +|+.+-        -.| .||+||+||++|+.+++..
T Consensus       343 ~WiD~~~~~g-KrsGaYs~~~~--------~~---~~p~Il--mN~~gt--------~~dV~TLaHElGHs~Hs~~s~~~  400 (598)
T COG1164         343 RWIDVYPRKG-KRSGAYSIGFY--------KG---DHPFIL--MNYDGT--------LRDVFTLAHELGHSVHSYFSRKH  400 (598)
T ss_pred             CCeeccCCCC-CCCCcccCCCC--------CC---CCCeEE--EeCCCc--------hhHHHHHHHHccHHHHHHHHhcc
Confidence            3799999999 68888876542        11   357654  677651        345 8999999999999999854


Q ss_pred             ccccccCCCccceeeeechhhchh
Q 018718          300 EGLVAGSRGIEWDAVELSTNNSWK  323 (351)
Q Consensus       300 y~~~sGt~~v~~DfvE~PSq~~e~  323 (351)
                      -+-+.+.  .++.++|+||+++|.
T Consensus       401 qp~~~~~--~~i~~AEiAS~fnE~  422 (598)
T COG1164         401 QPYLYAD--YSIFLAEIASTFNEM  422 (598)
T ss_pred             CCccccC--CchHHHHHHHHHHHH
Confidence            4424442  469999999999865


No 16 
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=97.67  E-value=5.8e-05  Score=77.76  Aligned_cols=83  Identities=11%  Similarity=-0.025  Sum_probs=48.9

Q ss_pred             cCCCCCCCCCceeeecccccccccCCCCccccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhh-hcCCcccc
Q 018718          226 YSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRML-TKQDEGLV  303 (351)
Q Consensus       226 ~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~ll-s~~~y~~~  303 (351)
                      ++|+|| ++| .|+...-  .+   ..  ..-|.+.  +|+..        -..| .|+.||+||+.-.+. ...++--.
T Consensus       212 ~~rp~k-~~~-~C~~saw--d~---~~--~~d~rI~--~c~~~--------t~~D~~t~~HE~GH~~yy~~y~~~p~~~r  272 (477)
T cd06461         212 LEKPTD-REV-VCHASAW--DF---YN--GKDFRIK--MCTKV--------NMEDFVTVHHEMGHIQYYLQYKDQPVLFR  272 (477)
T ss_pred             ccCCCC-CCC-ccCcccc--cC---CC--CCCccee--eCCCC--------CHHHHHHHHHHHHHHHHHHHhccCCHHHh
Confidence            999995 544 5555211  00   01  2356555  44433        1456 999999999994434 33333323


Q ss_pred             ccCCCccc--eeeeechhhchhhhhhh
Q 018718          304 AGSRGIEW--DAVELSTNNSWKIGVTR  328 (351)
Q Consensus       304 sGt~~v~~--DfvE~PSq~~e~~~~l~  328 (351)
                      .|. +...  +.+|+.|+.|-.|+.|.
T Consensus       273 ~~a-np~fheav~e~~smS~~tpe~L~  298 (477)
T cd06461         273 EGA-NPGFHEAVGDAIALSVSTPKHLH  298 (477)
T ss_pred             CCC-CCChHHHHHHHHHHhcCCHHHHh
Confidence            443 2446  88999999995555554


No 17 
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=95.73  E-value=0.085  Score=56.20  Aligned_cols=76  Identities=29%  Similarity=0.449  Sum_probs=58.6

Q ss_pred             HHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccChhhhcCCCHHHH
Q 018718           50 AVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKLSIDFSENVLDATKRFETFVTDKKEIQGLPATAL  127 (351)
Q Consensus        50 ~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L~~~F~~Nl~~~~~~~~~v~~~~~eL~GlP~~~l  127 (351)
                      |++.+++..+.+|.+||.+|.+++++++++++|+.++++. +++++.+.-++|..=-. .+..++.+++ .+|+.+...
T Consensus        58 n~v~~Le~~~~~L~rv~~~~~~l~~v~~~~elr~a~e~~~pklse~~~~l~~~~~Lf~-~~~al~~~~e-~~~ld~e~~  134 (683)
T COG0339          58 NTVEPLERAGDRLGRVWSVVSHLNSVHNSPELREAYEEILPKLSEFSNDLGQNPGLFA-RYKALWQSPE-SAGLDPEQK  134 (683)
T ss_pred             HHHHHHHHHhhHHHHHHHHHHHHHhcCCCHHHHHHHHHhhHHHHHHHHHhhcCHHHHH-HHHHHhcCcc-cccCCHHHH
Confidence            5788899999999999999999999999999999999998 77777777666644222 2333555554 677776644


No 18 
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=93.55  E-value=0.028  Score=56.86  Aligned_cols=42  Identities=24%  Similarity=0.192  Sum_probs=29.9

Q ss_pred             hhhHhhhhHHHhhh-hhcCCc-cccccCCCccceeeeechhhchhh
Q 018718          281 ETVFHEFGHALQRM-LTKQDE-GLVAGSRGIEWDAVELSTNNSWKI  324 (351)
Q Consensus       281 ~TLFHEfGHalH~l-ls~~~y-~~~sGt~~v~~DfvE~PSq~~e~~  324 (351)
                      .|+.||+|||+|.. +...-. +.+.+  ++++=+.|..|.|+|+.
T Consensus       161 ~t~iHE~GHalye~~l~~~~~~~pl~~--~~sm~ihESqS~f~E~~  204 (396)
T cd06460         161 FSTIHETGHALYEQGLPPELRGTPLGG--GASMGIHESQSRLWENQ  204 (396)
T ss_pred             HHHHHHhhHHHHHhcCCHhHccCCccc--CCCchhHHHHHHHHHHH
Confidence            68999999999998 652111 22222  34688999999999863


No 19 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=89.30  E-value=0.21  Score=46.43  Aligned_cols=30  Identities=33%  Similarity=0.430  Sum_probs=23.1

Q ss_pred             ccceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhh
Q 018718          256 RLPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRM  294 (351)
Q Consensus       256 ~~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~l  294 (351)
                      +.|+++++.|-+.         ... .||+||+||++=|-
T Consensus        57 ~~~~I~iN~n~~~---------~r~rFtlAHELGH~llH~   87 (213)
T COG2856          57 EKPVIYINANNSL---------ERKRFTLAHELGHALLHT   87 (213)
T ss_pred             cCceEEEeCCCCH---------HHHHHHHHHHHhHHHhcc
Confidence            4599999888743         345 89999999997543


No 20 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=85.60  E-value=0.39  Score=38.58  Aligned_cols=32  Identities=28%  Similarity=0.410  Sum_probs=21.9

Q ss_pred             cceEEEeeccCCCCCCCCCCCCCc-hhhHhhhhHHHhhhhhc
Q 018718          257 LPVVHMVCNQTPPVGDKPSLMTFS-ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       257 ~P~~~lv~Nf~~p~~~~P~LL~~~-~TLFHEfGHalH~lls~  297 (351)
                      .|++++..|-++         ... .|++||+||.+.+--..
T Consensus        28 ~~~I~in~~~~~---------~~~~f~laHELgH~~~~~~~~   60 (122)
T PF06114_consen   28 NPIIFINSNLSP---------ERQRFTLAHELGHILLHHGDE   60 (122)
T ss_dssp             TTEEEEESSS-H---------HHHHHHHHHHHHHHHHHH-HH
T ss_pred             CCEEEECCCCCH---------HHHHHHHHHHHHHHHhhhccc
Confidence            577777776321         233 89999999999876553


No 21 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=84.98  E-value=0.39  Score=41.08  Aligned_cols=11  Identities=55%  Similarity=0.936  Sum_probs=10.3

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||+|
T Consensus       107 ~v~~HEiGHaL  117 (154)
T PF00413_consen  107 SVAIHEIGHAL  117 (154)
T ss_dssp             HHHHHHHHHHT
T ss_pred             hhhhhcccccc
Confidence            79999999997


No 22 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=84.94  E-value=0.47  Score=44.33  Aligned_cols=32  Identities=41%  Similarity=0.453  Sum_probs=22.2

Q ss_pred             hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechh
Q 018718          281 ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTN  319 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq  319 (351)
                      .||+||+||++-+.+.=   + +.|   -+=|.|-.-+.
T Consensus        94 ~~l~HE~GHAlI~~~~l---P-v~G---rEEDaaD~lAa  125 (220)
T PF14247_consen   94 FTLYHELGHALIDDLDL---P-VLG---REEDAADQLAA  125 (220)
T ss_pred             HHHHHHHHHHHHHHhcC---C-ccc---chhhHHHHHHH
Confidence            79999999999998732   2 445   24676664443


No 23 
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=80.44  E-value=6.6  Score=42.62  Aligned_cols=48  Identities=17%  Similarity=0.194  Sum_probs=41.0

Q ss_pred             HHHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHHHH
Q 018718           49 GAVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKLSI   96 (351)
Q Consensus        49 ~~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L~~   96 (351)
                      +|++.+++..+..+.+++.++.++++++.++++|+.+.++. ++++...
T Consensus        56 ~n~i~~ld~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~~l~~~~~  104 (681)
T PRK10280         56 NNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSAELAELAN  104 (681)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHH
Confidence            35888999999999999999999999999999999999776 4444433


No 24 
>PRK10911 oligopeptidase A; Provisional
Probab=77.55  E-value=10  Score=41.13  Aligned_cols=46  Identities=37%  Similarity=0.512  Sum_probs=39.9

Q ss_pred             HHHHhhhhhcccchhhHHHHHHHhhccCCCHHHHHHHHHHH-HHHHH
Q 018718           49 GAVVDGLSLEDNPLLQDFEIIGHLKSVKNNPDLRSAYEEVR-ELGKL   94 (351)
Q Consensus        49 ~~~~~~~~~~~~~l~~v~~~~~~l~~~~~~~~~R~~~~ei~-~ls~L   94 (351)
                      +|++.+++..+..+.++++++.+++++++++++|+...++. ++++.
T Consensus        50 ~n~i~~l~~~~~~l~~~~~~~~~l~~v~~~~~~r~a~~~~~~~l~~~   96 (680)
T PRK10911         50 ENLCQPLAEVDDVLGRIFSPVSHLNSVKNSPELREAYEQTLPLLSEY   96 (680)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence            35888999999999999999999999999999999999776 44433


No 25 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=77.11  E-value=1.1  Score=40.02  Aligned_cols=12  Identities=58%  Similarity=1.024  Sum_probs=10.9

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .|+.||+||+|=
T Consensus       115 ~t~~HEiGHaLG  126 (186)
T cd04277         115 QTIIHEIGHALG  126 (186)
T ss_pred             HHHHHHHHHHhc
Confidence            899999999974


No 26 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=76.80  E-value=1.5  Score=40.11  Aligned_cols=20  Identities=45%  Similarity=0.594  Sum_probs=16.7

Q ss_pred             hhhHhhhhHHHhhhhhcCCc
Q 018718          281 ETVFHEFGHALQRMLTKQDE  300 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y  300 (351)
                      .|++||+||++=.+++-.+.
T Consensus        24 ~t~~HE~gHal~a~l~G~~v   43 (200)
T PF13398_consen   24 VTFVHELGHALAALLTGGRV   43 (200)
T ss_pred             HHHHHHHHHHHHHHHhCCCc
Confidence            78999999999999875433


No 27 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=76.15  E-value=1.2  Score=38.69  Aligned_cols=11  Identities=55%  Similarity=0.972  Sum_probs=10.2

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||+|
T Consensus       106 ~~~~HEiGHaL  116 (156)
T cd04279         106 AIALHELGHAL  116 (156)
T ss_pred             HHHHHHhhhhh
Confidence            78999999997


No 28 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=76.12  E-value=0.95  Score=39.34  Aligned_cols=11  Identities=55%  Similarity=0.836  Sum_probs=10.3

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||+|
T Consensus       109 ~~~~HEiGHaL  119 (157)
T cd04278         109 SVAAHEIGHAL  119 (157)
T ss_pred             HHHHHHhcccc
Confidence            79999999996


No 29 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=76.00  E-value=1.2  Score=38.48  Aligned_cols=11  Identities=64%  Similarity=0.915  Sum_probs=10.3

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||+|
T Consensus        96 ~~~~HEiGHaL  106 (165)
T cd04268          96 NTAEHELGHAL  106 (165)
T ss_pred             HHHHHHHHHHh
Confidence            79999999997


No 30 
>PF14891 Peptidase_M91:  Effector protein
Probab=75.39  E-value=2.5  Score=37.71  Aligned_cols=18  Identities=28%  Similarity=0.351  Sum_probs=16.0

Q ss_pred             hhhHhhhhHHHhhhhhcC
Q 018718          281 ETVFHEFGHALQRMLTKQ  298 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~  298 (351)
                      +.|+||+.||.|.+-+..
T Consensus       105 v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  105 VVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHCCCC
Confidence            789999999999988764


No 31 
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=72.19  E-value=2  Score=35.77  Aligned_cols=13  Identities=54%  Similarity=0.963  Sum_probs=12.1

Q ss_pred             hhhHhhhhHHHhh
Q 018718          281 ETVFHEFGHALQR  293 (351)
Q Consensus       281 ~TLFHEfGHalH~  293 (351)
                      .||+|||-|++++
T Consensus        78 ktllhefrh~mQh   90 (126)
T PF13058_consen   78 KTLLHEFRHAMQH   90 (126)
T ss_pred             HHHHHHHHHHHHh
Confidence            8999999999994


No 32 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=70.92  E-value=1.9  Score=36.44  Aligned_cols=11  Identities=55%  Similarity=0.836  Sum_probs=9.9

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -|+.||+||+|
T Consensus        88 ~~~~HEigHaL   98 (140)
T smart00235       88 GVAAHELGHAL   98 (140)
T ss_pred             ccHHHHHHHHh
Confidence            47999999997


No 33 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=70.36  E-value=2.1  Score=38.94  Aligned_cols=12  Identities=67%  Similarity=1.121  Sum_probs=10.4

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .|+.|||||+|=
T Consensus        94 ~~i~HElgHaLG  105 (198)
T cd04327          94 RVVLHEFGHALG  105 (198)
T ss_pred             HHHHHHHHHHhc
Confidence            689999999974


No 34 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=69.06  E-value=2.6  Score=41.06  Aligned_cols=17  Identities=35%  Similarity=0.622  Sum_probs=14.9

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      -+|.||+||.+++++.-
T Consensus       172 yVlAHEyGHHVQ~l~Gi  188 (292)
T PF04228_consen  172 YVLAHEYGHHVQNLLGI  188 (292)
T ss_pred             HHHHHHHHHHHHHHhhh
Confidence            57999999999999863


No 35 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=67.67  E-value=2.5  Score=34.80  Aligned_cols=11  Identities=55%  Similarity=0.906  Sum_probs=9.9

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.|
T Consensus       109 ~~~~HEiGH~l  119 (124)
T PF13582_consen  109 DTFAHEIGHNL  119 (124)
T ss_dssp             THHHHHHHHHT
T ss_pred             eEeeehhhHhc
Confidence            78999999986


No 36 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=65.15  E-value=3  Score=36.00  Aligned_cols=11  Identities=64%  Similarity=1.078  Sum_probs=10.4

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||+|
T Consensus        98 ~~~~HElGH~L  108 (167)
T cd00203          98 QTIAHELGHAL  108 (167)
T ss_pred             hhHHHHHHHHh
Confidence            88999999998


No 37 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=60.18  E-value=4.1  Score=36.39  Aligned_cols=11  Identities=55%  Similarity=0.742  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.|
T Consensus       144 ~~~AHEiGH~l  154 (196)
T PF13688_consen  144 ITFAHEIGHNL  154 (196)
T ss_dssp             HHHHHHHHHHT
T ss_pred             hhhHHhHHHhc
Confidence            89999999975


No 38 
>PF02128 Peptidase_M36:  Fungalysin metallopeptidase (M36);  InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=59.14  E-value=2.5  Score=42.53  Aligned_cols=17  Identities=29%  Similarity=0.622  Sum_probs=14.9

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      ..+.||.||+|+.-|+-
T Consensus       187 ~Ii~HEy~HGiSnRLvg  203 (378)
T PF02128_consen  187 GIIAHEYGHGISNRLVG  203 (378)
T ss_pred             CeeEEeecccccccccC
Confidence            78999999999988754


No 39 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=57.36  E-value=4.9  Score=36.00  Aligned_cols=11  Identities=55%  Similarity=0.770  Sum_probs=9.8

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -|+.|||||||
T Consensus        76 g~v~HE~~Hal   86 (180)
T cd04280          76 GTIVHELMHAL   86 (180)
T ss_pred             chhHHHHHHHh
Confidence            67899999997


No 40 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=56.90  E-value=5  Score=36.81  Aligned_cols=11  Identities=55%  Similarity=0.875  Sum_probs=9.9

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.|
T Consensus       139 ~~~aHEiGH~l  149 (206)
T PF13583_consen  139 QTFAHEIGHNL  149 (206)
T ss_pred             hHHHHHHHHHh
Confidence            67999999986


No 41 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=56.35  E-value=5.2  Score=35.11  Aligned_cols=12  Identities=50%  Similarity=0.695  Sum_probs=10.6

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .||-||+||-|-
T Consensus        71 ~TltHEvGH~LG   82 (154)
T PF05572_consen   71 KTLTHEVGHWLG   82 (154)
T ss_dssp             HHHHHHHHHHTT
T ss_pred             cchhhhhhhhhc
Confidence            899999999764


No 42 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=56.25  E-value=11  Score=40.55  Aligned_cols=64  Identities=17%  Similarity=0.240  Sum_probs=33.0

Q ss_pred             CCCCccccccCccCCCCCCCCCceeeecccccccccCCCCccccceEEE--eeccCCCCCCCCCCCCCc-hhhHhhhhHH
Q 018718          214 SGSPIAYFYFDPYSCPSEKKGSARICEVFSQSRVLTLDGATTRLPVVHM--VCNQTPPVGDKPSLMTFS-ETVFHEFGHA  290 (351)
Q Consensus       214 ~g~~lG~~YlDl~~R~gKk~~ga~~~~~~~~~~~~~~dG~~~~~P~~~l--v~Nf~~p~~~~P~LL~~~-~TLFHEfGHa  290 (351)
                      +|+.||.-|.-- ||.| ..||     +|..... ..+|..+ +=-+-+  +-|+.     ++..-.-. .||.||+||.
T Consensus       338 e~GtLGLAwVgs-p~~~-saGG-----IC~k~~~-y~~G~~~-sLNtGi~T~~NYg-----~~Vp~kvs~lt~AHEiGHN  403 (764)
T KOG3658|consen  338 EGGTLGLAWVGS-PRSN-SAGG-----ICEKAYT-YNEGKKR-SLNTGISTSVNYG-----KRVPTKVSDLTLAHEIGHN  403 (764)
T ss_pred             ccceeeeEEecc-CccC-CCCc-----cccccee-ccCcceE-EeecceeeeeecC-----CccCcchhheeehhhhccc
Confidence            578889888865 6665 3444     3332211 1224211 111111  13343     33333444 8999999997


Q ss_pred             H
Q 018718          291 L  291 (351)
Q Consensus       291 l  291 (351)
                      +
T Consensus       404 f  404 (764)
T KOG3658|consen  404 F  404 (764)
T ss_pred             c
Confidence            5


No 43 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=53.02  E-value=6.8  Score=34.82  Aligned_cols=12  Identities=50%  Similarity=0.872  Sum_probs=10.6

Q ss_pred             c-hhhHhhhhHHH
Q 018718          280 S-ETVFHEFGHAL  291 (351)
Q Consensus       280 ~-~TLFHEfGHal  291 (351)
                      + .|+.||+||.+
T Consensus       111 ~~~~~aHElGH~l  123 (173)
T PF13574_consen  111 GIDTFAHELGHQL  123 (173)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             eeeeehhhhHhhc
Confidence            6 78999999976


No 44 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=51.98  E-value=6.5  Score=37.71  Aligned_cols=17  Identities=35%  Similarity=0.751  Sum_probs=11.5

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      .++-|||||++-.|--.
T Consensus       218 ~v~vHE~GHsf~~LaDE  234 (264)
T PF09471_consen  218 QVVVHEFGHSFGGLADE  234 (264)
T ss_dssp             HHHHHHHHHHTT-----
T ss_pred             ceeeeeccccccccccc
Confidence            89999999999988754


No 45 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=51.60  E-value=7.4  Score=35.21  Aligned_cols=11  Identities=64%  Similarity=1.132  Sum_probs=10.1

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -|+.|||||||
T Consensus        81 ~~i~HEl~HaL   91 (191)
T PF01400_consen   81 GTILHELGHAL   91 (191)
T ss_dssp             HHHHHHHHHHH
T ss_pred             cchHHHHHHHH
Confidence            68999999997


No 46 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=50.98  E-value=8.7  Score=30.78  Aligned_cols=17  Identities=18%  Similarity=0.159  Sum_probs=14.8

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      .+|.||++|.+|.-...
T Consensus        27 ~~l~HE~~H~~~~~~~~   43 (128)
T PF13485_consen   27 RVLAHELAHQWFGNYFG   43 (128)
T ss_pred             HHHHHHHHHHHHHHHcC
Confidence            89999999999987754


No 47 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=50.76  E-value=6.7  Score=36.43  Aligned_cols=12  Identities=33%  Similarity=0.744  Sum_probs=10.8

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .++-||+||+|=
T Consensus       135 hvi~HEiGH~IG  146 (211)
T PF12388_consen  135 HVITHEIGHCIG  146 (211)
T ss_pred             HHHHHHhhhhcc
Confidence            689999999985


No 48 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=49.84  E-value=7.9  Score=35.06  Aligned_cols=11  Identities=45%  Similarity=0.567  Sum_probs=9.7

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -|+.|||||||
T Consensus        79 G~i~HEl~HaL   89 (182)
T cd04283          79 GIIQHELLHAL   89 (182)
T ss_pred             chHHHHHHHHh
Confidence            47899999997


No 49 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=49.35  E-value=7.5  Score=36.38  Aligned_cols=15  Identities=40%  Similarity=0.558  Sum_probs=11.9

Q ss_pred             hhhHhhhhHHHhhhh
Q 018718          281 ETVFHEFGHALQRML  295 (351)
Q Consensus       281 ~TLFHEfGHalH~ll  295 (351)
                      -+=.||.|||+|+--
T Consensus        91 aVAAHEvGHAiQ~a~  105 (222)
T PF04298_consen   91 AVAAHEVGHAIQHAE  105 (222)
T ss_pred             HHHHHHHhHHHhccc
Confidence            345799999999763


No 50 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=48.05  E-value=8.1  Score=36.38  Aligned_cols=11  Identities=36%  Similarity=0.631  Sum_probs=10.1

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.+
T Consensus       169 ~t~AHElGHnl  179 (244)
T cd04270         169 LVTAHELGHNF  179 (244)
T ss_pred             HHHHHHHHHhc
Confidence            79999999986


No 51 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=45.43  E-value=8.8  Score=34.20  Aligned_cols=11  Identities=55%  Similarity=0.815  Sum_probs=10.3

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.|
T Consensus       135 ~~~aHElGH~l  145 (192)
T cd04267         135 LTMAHELGHNL  145 (192)
T ss_pred             hhhhhhHHhhc
Confidence            89999999997


No 52 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=44.81  E-value=6.4  Score=37.98  Aligned_cols=12  Identities=42%  Similarity=0.747  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      -|+.|||||+|-
T Consensus       167 gv~~HE~gH~lG  178 (286)
T TIGR03296       167 GVIAHELGHDLG  178 (286)
T ss_pred             eeeehhhhcccC
Confidence            678999999763


No 53 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=44.18  E-value=8.2  Score=36.06  Aligned_cols=13  Identities=31%  Similarity=0.672  Sum_probs=11.6

Q ss_pred             hhhHhhhhHHH---hh
Q 018718          281 ETVFHEFGHAL---QR  293 (351)
Q Consensus       281 ~TLFHEfGHal---H~  293 (351)
                      .|+.||+||.|   |+
T Consensus       147 ~t~AHElGHnLGm~HD  162 (228)
T cd04271         147 QVFAHEIGHTFGAVHD  162 (228)
T ss_pred             eehhhhhhhhcCCCCC
Confidence            78999999998   65


No 54 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=43.06  E-value=12  Score=34.50  Aligned_cols=11  Identities=45%  Similarity=0.884  Sum_probs=9.7

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -|+.||+||+|
T Consensus        89 Gti~HEl~HaL   99 (200)
T cd04281          89 GIVVHELGHVI   99 (200)
T ss_pred             chHHHHHHHHh
Confidence            57899999997


No 55 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=42.76  E-value=12  Score=34.35  Aligned_cols=16  Identities=38%  Similarity=0.557  Sum_probs=12.4

Q ss_pred             hhhHhhhhHHHhhhhh
Q 018718          281 ETVFHEFGHALQRMLT  296 (351)
Q Consensus       281 ~TLFHEfGHalH~lls  296 (351)
                      ..++||+||++=...-
T Consensus        40 ~v~iHElgH~~~A~~~   55 (208)
T cd06161          40 SVLLHELGHALVARRY   55 (208)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            5689999999865543


No 56 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=42.22  E-value=13  Score=32.92  Aligned_cols=18  Identities=33%  Similarity=0.390  Sum_probs=13.2

Q ss_pred             hhhHhhhhHHHhhhhhcC
Q 018718          281 ETVFHEFGHALQRMLTKQ  298 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~  298 (351)
                      ..+.||+||++=...-..
T Consensus        10 ~i~iHE~gH~~~A~~~G~   27 (180)
T cd05709          10 SVTVHELGHALVARRLGV   27 (180)
T ss_pred             HHHHHHHHHHHHHHHcCC
Confidence            457899999986655443


No 57 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=39.69  E-value=16  Score=32.38  Aligned_cols=24  Identities=33%  Similarity=0.416  Sum_probs=15.5

Q ss_pred             hhhHhhhhHHHhhhhhcCCccccc
Q 018718          281 ETVFHEFGHALQRMLTKQDEGLVA  304 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~~~s  304 (351)
                      ..+.||+||++=..+...+...+.
T Consensus         9 ~i~~HE~gH~~~a~~~G~~~~~~~   32 (192)
T PF02163_consen    9 SIVLHELGHALAARLYGDKVPRFE   32 (192)
T ss_dssp             HHHHHHHHHHHHHHTTT--B--EE
T ss_pred             cccccccccccccccccccccccc
Confidence            357899999998877665555553


No 58 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.53  E-value=14  Score=33.89  Aligned_cols=12  Identities=42%  Similarity=0.595  Sum_probs=10.6

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .++.||+||+|=
T Consensus       118 ~~~~he~gh~lG  129 (197)
T cd04276         118 YLLAHEVGHTLG  129 (197)
T ss_pred             HHHHHHHHHHhc
Confidence            789999999973


No 59 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=37.74  E-value=16  Score=32.96  Aligned_cols=13  Identities=38%  Similarity=0.685  Sum_probs=11.4

Q ss_pred             hhhHhhhhHHHhh
Q 018718          281 ETVFHEFGHALQR  293 (351)
Q Consensus       281 ~TLFHEfGHalH~  293 (351)
                      -.|.||+||..|.
T Consensus        91 aVlaHElgH~~~~  103 (226)
T PF01435_consen   91 AVLAHELGHIKHR  103 (226)
T ss_dssp             HHHHHHHHHHHTT
T ss_pred             HHHHHHHHHHHcC
Confidence            6899999999874


No 60 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=36.76  E-value=15  Score=32.73  Aligned_cols=11  Identities=55%  Similarity=0.842  Sum_probs=10.4

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .++.||+||.+
T Consensus       133 ~~~AHElGH~l  143 (194)
T cd04269         133 VTMAHELGHNL  143 (194)
T ss_pred             HHHHHHHHhhc
Confidence            89999999997


No 61 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=35.82  E-value=17  Score=33.28  Aligned_cols=11  Identities=45%  Similarity=0.609  Sum_probs=10.4

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+.||+||.|
T Consensus       147 ~~~AHElGH~l  157 (220)
T cd04272         147 YTMTHELAHLL  157 (220)
T ss_pred             HHHHHHHHHHh
Confidence            89999999987


No 62 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=35.10  E-value=28  Score=27.07  Aligned_cols=14  Identities=29%  Similarity=0.494  Sum_probs=11.8

Q ss_pred             hhhHhhhhHHHhhh
Q 018718          281 ETVFHEFGHALQRM  294 (351)
Q Consensus       281 ~TLFHEfGHalH~l  294 (351)
                      ..|.||+.|.++.-
T Consensus        63 ~llaHEl~Hv~Qq~   76 (79)
T PF13699_consen   63 ALLAHELAHVVQQR   76 (79)
T ss_pred             hhHhHHHHHHHhhc
Confidence            56899999999864


No 63 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=34.91  E-value=18  Score=33.99  Aligned_cols=12  Identities=67%  Similarity=0.711  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      -|..||+||+|=
T Consensus       122 Gti~HEl~HalG  133 (230)
T cd04282         122 ATVEHEFLHALG  133 (230)
T ss_pred             chHHHHHHHHhC
Confidence            578999999973


No 64 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=34.36  E-value=19  Score=33.57  Aligned_cols=15  Identities=33%  Similarity=0.607  Sum_probs=11.8

Q ss_pred             hhhHhhhhHHHhhhh
Q 018718          281 ETVFHEFGHALQRML  295 (351)
Q Consensus       281 ~TLFHEfGHalH~ll  295 (351)
                      ..++||+||++=...
T Consensus        55 ~v~iHElgH~~~A~~   69 (227)
T cd06164          55 SVLLHELGHSLVARR   69 (227)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            567999999986544


No 65 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=34.05  E-value=22  Score=30.51  Aligned_cols=12  Identities=42%  Similarity=0.667  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      ....||+||.|=
T Consensus        79 RIaaHE~GHiLG   90 (132)
T PF02031_consen   79 RIAAHELGHILG   90 (132)
T ss_dssp             HHHHHHHHHHHT
T ss_pred             eeeeehhccccC
Confidence            678899999864


No 66 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=33.52  E-value=20  Score=34.35  Aligned_cols=16  Identities=25%  Similarity=0.455  Sum_probs=12.6

Q ss_pred             hhhHhhhhHHHhhhhh
Q 018718          281 ETVFHEFGHALQRMLT  296 (351)
Q Consensus       281 ~TLFHEfGHalH~lls  296 (351)
                      ..+.||+||++=...-
T Consensus       120 sv~iHElgHa~~Ar~~  135 (263)
T cd06159         120 GVVVHELSHGILARVE  135 (263)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            5789999999876543


No 67 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=32.04  E-value=21  Score=32.87  Aligned_cols=39  Identities=23%  Similarity=0.229  Sum_probs=21.8

Q ss_pred             hhhHhhhhHHHhhhhhcCCccccccCCCccceeeeechhhchh
Q 018718          281 ETVFHEFGHALQRMLTKQDEGLVAGSRGIEWDAVELSTNNSWK  323 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~~~sGt~~v~~DfvE~PSq~~e~  323 (351)
                      -.-.||-||++++-   .+|+.+-= |+.=.-.+-+-|++-|-
T Consensus        94 aVAAHEVGHAiQd~---~~Y~~L~~-R~~lvPv~~~gSn~a~~  132 (226)
T COG2738          94 AVAAHEVGHAIQDQ---EDYAFLVL-RHALVPVANFGSNLAPL  132 (226)
T ss_pred             HHHHHHhhHHHhhh---cccHHHHH-hhcccceeccccchhHH
Confidence            34689999999974   34543211 11113445566776543


No 68 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=31.17  E-value=28  Score=31.43  Aligned_cols=22  Identities=27%  Similarity=0.205  Sum_probs=16.1

Q ss_pred             hhhHhhhhHHHhhhhhcCCccc
Q 018718          281 ETVFHEFGHALQRMLTKQDEGL  302 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~~  302 (351)
                      ..+.||+||.+=......+-..
T Consensus        11 ~v~iHElGH~~~Ar~~Gv~v~~   32 (182)
T cd06163          11 LIFVHELGHFLVAKLFGVKVEE   32 (182)
T ss_pred             HHHHHHHHHHHHHHHcCCeeeE
Confidence            4678999999987776554443


No 69 
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=31.10  E-value=24  Score=35.46  Aligned_cols=55  Identities=15%  Similarity=0.102  Sum_probs=31.3

Q ss_pred             hhhHhhhhHHHhhhhhcCCccccccC-CCccceeeeechhhchhhhhhhc-ccCCCCcccc
Q 018718          281 ETVFHEFGHALQRMLTKQDEGLVAGS-RGIEWDAVELSTNNSWKIGVTRT-QVIPTIPEDG  339 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~~~sGt-~~v~~DfvE~PSq~~e~~~~l~~-~~~~~~~~d~  339 (351)
                      .||.|||-|.+|.--....    .|. ...+.=.=|.-|+++|++-..+. ....+++..+
T Consensus       141 sTlAHEfQHmInfy~~~v~----~g~~~~~dtWLnE~lS~~aEdl~s~~~~~~~n~i~d~R  197 (366)
T PF10460_consen  141 STLAHEFQHMINFYQRGVL----HGKQYAMDTWLNEMLSMSAEDLYSSKIDPGYNNIRDSR  197 (366)
T ss_pred             HHHHHHHHHHHHHHHHHHh----cCCCcccccHHHHHHHHHHHHHHhcCCCcccCcccccc
Confidence            8999999999996432211    121 11223345888999988643333 2334444433


No 70 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=29.57  E-value=21  Score=33.41  Aligned_cols=12  Identities=50%  Similarity=0.689  Sum_probs=10.7

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      .|+-||+||.|-
T Consensus       139 ~t~~HEvGH~lG  150 (225)
T cd04275         139 DTATHEVGHWLG  150 (225)
T ss_pred             ceeEEeccceee
Confidence            899999999874


No 71 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=28.90  E-value=24  Score=33.48  Aligned_cols=16  Identities=25%  Similarity=0.434  Sum_probs=13.3

Q ss_pred             CCCc---hhhHhhhhHHHh
Q 018718          277 MTFS---ETVFHEFGHALQ  292 (351)
Q Consensus       277 L~~~---~TLFHEfGHalH  292 (351)
                      |+.+   -.|.||+||..|
T Consensus       152 l~~dEl~aVlaHElgHi~~  170 (302)
T COG0501         152 LNDDELEAVLAHELGHIKN  170 (302)
T ss_pred             CCHHHHHHHHHHHHHHHhc
Confidence            4555   689999999998


No 72 
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=28.88  E-value=27  Score=33.43  Aligned_cols=14  Identities=36%  Similarity=0.456  Sum_probs=10.3

Q ss_pred             hhhHhhhhHHHhhh
Q 018718          281 ETVFHEFGHALQRM  294 (351)
Q Consensus       281 ~TLFHEfGHalH~l  294 (351)
                      --+.||+||.++.-
T Consensus       221 WG~~HE~GH~~Q~~  234 (307)
T PF13402_consen  221 WGPWHELGHNHQQG  234 (307)
T ss_dssp             HHHHHHHHHHH-BG
T ss_pred             eeehhhhhhhcCcc
Confidence            35799999998843


No 73 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=28.82  E-value=14  Score=33.50  Aligned_cols=11  Identities=55%  Similarity=0.839  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .++.||+||.|
T Consensus       142 ~~~aHElGH~L  152 (207)
T cd04273         142 FTIAHELGHVL  152 (207)
T ss_pred             Eeeeeechhhc
Confidence            78999999987


No 74 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=28.78  E-value=24  Score=34.75  Aligned_cols=11  Identities=55%  Similarity=1.063  Sum_probs=9.6

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|+|||+||-+
T Consensus       152 ~~~~HElgHN~  162 (314)
T PF05548_consen  152 ATIMHELGHNL  162 (314)
T ss_pred             HHHHHHhhhhc
Confidence            68999999965


No 75 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=28.50  E-value=27  Score=33.77  Aligned_cols=16  Identities=31%  Similarity=0.567  Sum_probs=12.3

Q ss_pred             hhhHhhhhHHHhhhhh
Q 018718          281 ETVFHEFGHALQRMLT  296 (351)
Q Consensus       281 ~TLFHEfGHalH~lls  296 (351)
                      ..+.||+||++=...-
T Consensus       137 svvvHElgHal~A~~~  152 (277)
T cd06162         137 SGVVHEMGHGVAAVRE  152 (277)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            5678999999865543


No 76 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=28.23  E-value=28  Score=31.42  Aligned_cols=17  Identities=29%  Similarity=0.261  Sum_probs=12.7

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      ..+.||+||++=...-.
T Consensus        43 ~l~iHElgH~~~A~~~G   59 (183)
T cd06160          43 ILGIHEMGHYLAARRHG   59 (183)
T ss_pred             HHHHHHHHHHHHHHHCC
Confidence            56799999998655543


No 77 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=27.09  E-value=29  Score=31.07  Aligned_cols=11  Identities=45%  Similarity=0.794  Sum_probs=9.9

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .+++||+||.|
T Consensus       133 ~~~AHelGH~l  143 (199)
T PF01421_consen  133 VIIAHELGHNL  143 (199)
T ss_dssp             HHHHHHHHHHT
T ss_pred             HHHHHHHHHhc
Confidence            89999999975


No 78 
>PRK03982 heat shock protein HtpX; Provisional
Probab=26.67  E-value=29  Score=33.43  Aligned_cols=12  Identities=25%  Similarity=0.398  Sum_probs=10.9

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      -.|.||+||.-|
T Consensus       127 AVlAHElgHi~~  138 (288)
T PRK03982        127 GVIAHELTHIKN  138 (288)
T ss_pred             HHHHHHHHHHHc
Confidence            689999999987


No 79 
>PF01742 Peptidase_M27:  Clostridial neurotoxin zinc protease This family is a subset of the Prosite family;  InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding []. The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=26.58  E-value=37  Score=34.44  Aligned_cols=17  Identities=24%  Similarity=0.364  Sum_probs=15.6

Q ss_pred             hhhHhhhhHHHhhhhhc
Q 018718          281 ETVFHEFGHALQRMLTK  297 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~  297 (351)
                      .+|.||+=|+||.|.+-
T Consensus       216 l~LmheLIh~Lh~LYGi  232 (408)
T PF01742_consen  216 LELMHELIHSLHGLYGI  232 (408)
T ss_dssp             HHHHHHHHHHHHHHTTT
T ss_pred             HHHHHHHHHHHHhhccc
Confidence            79999999999999874


No 80 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=23.48  E-value=33  Score=32.01  Aligned_cols=18  Identities=44%  Similarity=0.635  Sum_probs=13.2

Q ss_pred             CCCCCCc--hhhHhhhhHHH
Q 018718          274 PSLMTFS--ETVFHEFGHAL  291 (351)
Q Consensus       274 P~LL~~~--~TLFHEfGHal  291 (351)
                      |..+...  .|.-||+||+|
T Consensus       180 pg~~~e~L~~tarhElGhaL  199 (236)
T COG5549         180 PGELRENLNPTARHELGHAL  199 (236)
T ss_pred             cccchhhhhHHHHHhhcchh
Confidence            3444444  78999999998


No 81 
>PRK02870 heat shock protein HtpX; Provisional
Probab=23.32  E-value=37  Score=33.78  Aligned_cols=13  Identities=23%  Similarity=0.373  Sum_probs=11.6

Q ss_pred             hhhHhhhhHHHhh
Q 018718          281 ETVFHEFGHALQR  293 (351)
Q Consensus       281 ~TLFHEfGHalH~  293 (351)
                      -.+.||+||.-|.
T Consensus       175 aVlAHELgHik~~  187 (336)
T PRK02870        175 AVMAHELSHIRHG  187 (336)
T ss_pred             HHHHHHHHHHHcc
Confidence            6899999999885


No 82 
>PRK03001 M48 family peptidase; Provisional
Probab=23.27  E-value=36  Score=32.66  Aligned_cols=13  Identities=31%  Similarity=0.526  Sum_probs=11.5

Q ss_pred             hhhHhhhhHHHhh
Q 018718          281 ETVFHEFGHALQR  293 (351)
Q Consensus       281 ~TLFHEfGHalH~  293 (351)
                      -.|.||+||.-|.
T Consensus       126 aVlAHElgHi~~~  138 (283)
T PRK03001        126 GVMAHELAHVKHR  138 (283)
T ss_pred             HHHHHHHHHHhCC
Confidence            6899999999884


No 83 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=23.14  E-value=35  Score=33.50  Aligned_cols=11  Identities=36%  Similarity=0.727  Sum_probs=10.2

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .++.||+||++
T Consensus       195 ~~f~HE~GH~~  205 (305)
T PF10462_consen  195 NEFSHELGHNF  205 (305)
T ss_dssp             HHHHHHHHHTT
T ss_pred             ceeehhhhhhc
Confidence            89999999986


No 84 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=22.88  E-value=19  Score=38.95  Aligned_cols=11  Identities=55%  Similarity=0.905  Sum_probs=9.5

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      =++.|||||+|
T Consensus       223 GVfaHEfGH~L  233 (645)
T PF05547_consen  223 GVFAHEFGHDL  233 (645)
T ss_pred             EEEEeeccccC
Confidence            57889999987


No 85 
>PHA03420 E4 protein; Provisional
Probab=22.73  E-value=44  Score=28.36  Aligned_cols=13  Identities=23%  Similarity=0.685  Sum_probs=10.5

Q ss_pred             cccCCHHHHHHHh
Q 018718          178 VNHWDINFWSERL  190 (351)
Q Consensus       178 l~pWD~~Yy~~~~  190 (351)
                      -.|||..||.+.+
T Consensus        12 ~hPWdTPYYrrll   24 (137)
T PHA03420         12 HHPWDTPYYRRLL   24 (137)
T ss_pred             CCCcccHHHHHHH
Confidence            3599999997764


No 86 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=22.29  E-value=45  Score=29.20  Aligned_cols=11  Identities=36%  Similarity=0.685  Sum_probs=9.8

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      -.+.|||+|++
T Consensus       137 DVvaHEltHGV  147 (150)
T PF01447_consen  137 DVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHH
T ss_pred             ceeeecccccc
Confidence            67899999997


No 87 
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=22.21  E-value=33  Score=33.04  Aligned_cols=11  Identities=45%  Similarity=0.776  Sum_probs=10.0

Q ss_pred             hhhHhhhhHHH
Q 018718          281 ETVFHEFGHAL  291 (351)
Q Consensus       281 ~TLFHEfGHal  291 (351)
                      .|++||+||..
T Consensus       205 aTl~HElghwt  215 (316)
T COG4227         205 ATLLHELGHWT  215 (316)
T ss_pred             HHHHHHhcccc
Confidence            89999999975


No 88 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=21.88  E-value=41  Score=34.25  Aligned_cols=45  Identities=24%  Similarity=0.258  Sum_probs=25.4

Q ss_pred             ccceEEEe--eccCCCCCCCCCCCCCc--hhhHhhhhHHHhhhhhcCCc
Q 018718          256 RLPVVHMV--CNQTPPVGDKPSLMTFS--ETVFHEFGHALQRMLTKQDE  300 (351)
Q Consensus       256 ~~P~~~lv--~Nf~~p~~~~P~LL~~~--~TLFHEfGHalH~lls~~~y  300 (351)
                      ..|+..-+  =.+.-|-++-|-.++--  ...+||+|||+-.....++.
T Consensus       104 hnpv~le~iiPg~nLpl~~I~yf~t~lvi~~vvHElGHalAA~segV~v  152 (484)
T KOG2921|consen  104 HNPVFLESIIPGTNLPLSGIAYFLTSLVITVVVHELGHALAAASEGVQV  152 (484)
T ss_pred             CCceEEEeecCccccccccchhhhhhHHHHHHHHHhhHHHHHHhcCcee
Confidence            45655322  22222434444334333  56799999999877765533


No 89 
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=21.76  E-value=47  Score=34.08  Aligned_cols=21  Identities=24%  Similarity=0.173  Sum_probs=14.7

Q ss_pred             hhhHhhhhHHHhhhhhcCCcc
Q 018718          281 ETVFHEFGHALQRMLTKQDEG  301 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~y~  301 (351)
                      ..++||+||.+=.-+...+-.
T Consensus        17 li~vHElGHfl~Ar~~gv~V~   37 (449)
T PRK10779         17 LITVHEFGHFWVARRCGVRVE   37 (449)
T ss_pred             HHHHHHHHHHHHHHHcCCeee
Confidence            467999999987665544333


No 90 
>PRK04897 heat shock protein HtpX; Provisional
Probab=21.41  E-value=41  Score=32.61  Aligned_cols=12  Identities=25%  Similarity=0.329  Sum_probs=10.8

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      -.+.||+||.-|
T Consensus       139 aVlAHElgHi~~  150 (298)
T PRK04897        139 GVIGHEISHIRN  150 (298)
T ss_pred             HHHHHHHHHHhc
Confidence            689999999986


No 91 
>PRK01345 heat shock protein HtpX; Provisional
Probab=21.32  E-value=42  Score=32.96  Aligned_cols=12  Identities=25%  Similarity=0.426  Sum_probs=11.0

Q ss_pred             hhhHhhhhHHHh
Q 018718          281 ETVFHEFGHALQ  292 (351)
Q Consensus       281 ~TLFHEfGHalH  292 (351)
                      -.|.||+||.-|
T Consensus       126 aVlAHElgHi~~  137 (317)
T PRK01345        126 GVMAHELAHVKN  137 (317)
T ss_pred             HHHHHHHHHHHc
Confidence            689999999988


No 92 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=20.82  E-value=44  Score=34.02  Aligned_cols=13  Identities=46%  Similarity=0.544  Sum_probs=10.5

Q ss_pred             hhhHhhhhHHHhh
Q 018718          281 ETVFHEFGHALQR  293 (351)
Q Consensus       281 ~TLFHEfGHalH~  293 (351)
                      -|..||+||||=.
T Consensus       161 G~i~HEl~HaLGf  173 (411)
T KOG3714|consen  161 GTIVHELMHALGF  173 (411)
T ss_pred             chhHHHHHHHhhh
Confidence            4688999999853


No 93 
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=20.50  E-value=49  Score=33.71  Aligned_cols=19  Identities=32%  Similarity=0.373  Sum_probs=14.3

Q ss_pred             hhhHhhhhHHHhhhhhcCC
Q 018718          281 ETVFHEFGHALQRMLTKQD  299 (351)
Q Consensus       281 ~TLFHEfGHalH~lls~~~  299 (351)
                      ..++||+||.+=.-....+
T Consensus        16 ~v~~HE~gH~~~a~~~g~~   34 (420)
T TIGR00054        16 LIFVHELGHFLAARLCGIK   34 (420)
T ss_pred             HHHHHhHHHHHHHHHcCCE
Confidence            5789999999876655433


Done!