Query         037540
Match_columns 340
No_of_seqs    345 out of 1585
Neff          6.5 
Searched_HMMs 46136
Date          Fri Mar 29 13:17:13 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037540.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037540hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 9.5E-51 2.1E-55  408.0  20.7  220   63-302    29-269 (469)
  2 PF00413 Peptidase_M10:  Matrix 100.0 2.4E-30 5.2E-35  223.4  15.4  133  149-294     1-154 (154)
  3 cd04278 ZnMc_MMP Zinc-dependen 100.0   3E-30 6.4E-35  225.3  14.3  137  149-294     1-157 (157)
  4 cd04279 ZnMc_MMP_like_1 Zinc-d  99.9 2.1E-23 4.5E-28  181.8  13.3  139  156-294     2-156 (156)
  5 cd04268 ZnMc_MMP_like Zinc-dep  99.9 2.3E-21 4.9E-26  169.3  10.6  132  156-293     2-165 (165)
  6 smart00235 ZnMc Zinc-dependent  99.9 2.6E-21 5.5E-26  165.4   9.9  134  147-295     2-140 (140)
  7 cd04277 ZnMc_serralysin_like Z  99.8 4.6E-20   1E-24  165.2  12.2  122  173-294    32-186 (186)
  8 cd04327 ZnMc_MMP_like_3 Zinc-d  99.7 1.6E-17 3.4E-22  150.6  12.2  103  149-262     2-110 (198)
  9 cd04280 ZnMc_astacin_like Zinc  99.6 2.8E-15 6.1E-20  134.2  11.7   90  157-262     3-92  (180)
 10 PF01400 Astacin:  Astacin (Pep  99.6 2.3E-15 4.9E-20  136.0  10.1  129  149-296     2-190 (191)
 11 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.6 1.1E-14 2.4E-19  132.4  13.0   99  147-262     7-105 (200)
 12 cd00203 ZnMc Zinc-dependent me  99.6 9.7E-15 2.1E-19  127.4  10.8  117  173-293    20-167 (167)
 13 cd04282 ZnMc_meprin Zinc-depen  99.5 4.2E-13 9.1E-18  124.3  13.1   96  147-262    43-138 (230)
 14 cd04283 ZnMc_hatching_enzyme Z  99.5 4.6E-13   1E-17  120.0  11.5  123  156-294     4-181 (182)
 15 KOG3714 Meprin A metalloprotea  99.5 3.9E-13 8.4E-18  134.6  12.0  137  145-298    77-272 (411)
 16 cd04276 ZnMc_MMP_like_2 Zinc-d  99.4 1.9E-12 4.2E-17  117.4  12.8  134  156-295     8-197 (197)
 17 PF12388 Peptidase_M57:  Dual-a  99.2 8.5E-11 1.8E-15  107.1  11.8  138  148-293    32-211 (211)
 18 PF01471 PG_binding_1:  Putativ  99.1 1.5E-10 3.2E-15   84.1   6.9   55   62-119     3-57  (57)
 19 PF02031 Peptidase_M7:  Strepto  98.7 6.7E-08 1.4E-12   81.1   9.3  125  156-294     2-130 (132)
 20 COG5549 Predicted Zn-dependent  98.6 2.4E-07 5.3E-12   83.7   9.8  112  174-295   103-234 (236)
 21 TIGR02869 spore_SleB spore cor  97.9 2.7E-05 5.9E-10   71.1   6.9   69   56-128     8-76  (201)
 22 PF13583 Reprolysin_4:  Metallo  97.7 4.9E-05 1.1E-09   69.6   4.7   47  244-292   137-197 (206)
 23 PRK10594 murein L,D-transpepti  97.4 0.00045 9.7E-09   72.2   7.6   67   56-122   236-343 (608)
 24 PF13574 Reprolysin_2:  Metallo  97.4 7.9E-05 1.7E-09   66.3   1.7   47  244-292   111-170 (173)
 25 PF11350 DUF3152:  Protein of u  97.3 0.00067 1.4E-08   61.6   7.3  108  156-264    29-159 (203)
 26 COG1913 Predicted Zn-dependent  97.3 0.00013 2.7E-09   64.7   1.9   27  245-271   125-151 (181)
 27 cd04267 ZnMc_ADAM_like Zinc-de  97.2 0.00035 7.6E-09   62.6   4.1   48  243-291   132-189 (192)
 28 PF13688 Reprolysin_5:  Metallo  97.1  0.0002 4.3E-09   64.2   1.5   23  243-265   141-163 (196)
 29 PRK13267 archaemetzincin-like   97.1 0.00035 7.5E-09   62.7   2.6   27  244-270   125-151 (179)
 30 PF13582 Reprolysin_3:  Metallo  97.0 0.00029 6.3E-09   58.4   1.3   18  244-261   107-124 (124)
 31 cd04273 ZnMc_ADAMTS_like Zinc-  97.0 0.00053 1.2E-08   62.4   3.1   54  241-294   137-199 (207)
 32 PF07998 Peptidase_M54:  Peptid  96.8 0.00067 1.4E-08   61.6   2.3   27  244-271   145-171 (194)
 33 cd04271 ZnMc_ADAM_fungal Zinc-  96.7  0.0019   4E-08   60.1   4.6   49  246-296   147-220 (228)
 34 PF05572 Peptidase_M43:  Pregna  96.7 0.00077 1.7E-08   59.0   1.8   25  241-265    66-90  (154)
 35 cd04269 ZnMc_adamalysin_II_lik  96.6  0.0024 5.3E-08   57.2   4.6   49  244-294   131-185 (194)
 36 COG2989 Uncharacterized protei  96.3  0.0072 1.6E-07   62.0   6.3   65   56-121   233-298 (561)
 37 cd04275 ZnMc_pappalysin_like Z  96.2 0.00077 1.7E-08   62.6  -1.4   26  241-266   134-159 (225)
 38 PF01421 Reprolysin:  Reprolysi  95.8   0.014 3.1E-07   52.5   5.1   52  241-293   128-187 (199)
 39 cd04270 ZnMc_TACE_like Zinc-de  95.7   0.015 3.2E-07   54.6   4.7   50  245-294   168-230 (244)
 40 KOG4525 Jacalin-like lectin do  95.4   0.023 4.9E-07   57.1   5.2   57  245-303   302-380 (614)
 41 PF05548 Peptidase_M11:  Gameto  94.8   0.018 3.8E-07   56.2   2.3   21  242-262   148-168 (314)
 42 COG3409 Putative peptidoglycan  94.5    0.08 1.7E-06   46.4   5.8   61   59-121    43-104 (185)
 43 cd04272 ZnMc_salivary_gland_MP  94.2   0.029 6.2E-07   51.5   2.2   20  243-262   144-163 (220)
 44 PF04298 Zn_peptidase_2:  Putat  94.1   0.062 1.3E-06   49.8   4.2   54  202-263    51-106 (222)
 45 PF10462 Peptidase_M66:  Peptid  93.6   0.036 7.8E-07   53.8   1.7   20  243-262   192-211 (305)
 46 COG3023 ampD N-acetyl-anhydrom  93.4    0.22 4.7E-06   47.0   6.5   71   45-120   169-252 (257)
 47 PF12044 Metallopep:  Putative   93.1   0.051 1.1E-06   55.0   2.0   22  245-268   317-338 (423)
 48 COG2738 Predicted Zn-dependent  92.2    0.14 3.1E-06   46.4   3.4   76  176-257    20-105 (226)
 49 TIGR03296 M6dom_TIGR03296 M6 f  91.0   0.061 1.3E-06   51.6  -0.3   20  245-264   166-185 (286)
 50 COG3824 Predicted Zn-dependent  88.6    0.18   4E-06   42.3   0.8   17  244-260   109-125 (136)
 51 PF08823 PG_binding_2:  Putativ  87.4     1.6 3.4E-05   33.6   5.3   53   64-119    18-74  (74)
 52 PF11150 DUF2927:  Protein of u  86.8      15 0.00032   34.0  12.3  137  148-296    28-196 (213)
 53 PTZ00337 surface protease GP63  85.9    0.86 1.9E-05   48.0   4.1   19  243-261   228-246 (567)
 54 KOG3607 Meltrins, fertilins an  84.0     1.1 2.3E-05   48.6   3.9   45  244-289   323-374 (716)
 55 PF09471 Peptidase_M64:  IgA Pe  83.9    0.54 1.2E-05   44.8   1.5   24  243-266   215-239 (264)
 56 PF06262 DUF1025:  Possibl zinc  82.2    0.73 1.6E-05   37.3   1.4   16  244-259    73-88  (97)
 57 PF05547 Peptidase_M6:  Immune   81.5     0.3 6.4E-06   52.1  -1.4   17  245-261   222-238 (645)
 58 PF01457 Peptidase_M8:  Leishma  80.6    0.83 1.8E-05   47.6   1.5   55  243-297   209-268 (521)
 59 PF06114 DUF955:  Domain of unk  78.3     1.3 2.8E-05   35.2   1.7   14  243-256    41-54  (122)
 60 PF13398 Peptidase_M50B:  Pepti  74.7     1.5 3.2E-05   39.9   1.2   16  243-258    21-36  (200)
 61 COG5549 Predicted Zn-dependent  72.4     2.2 4.8E-05   39.3   1.7   29  245-273   154-182 (236)
 62 PF09374 PG_binding_3:  Predict  72.3     3.9 8.4E-05   31.1   2.8   26   94-119     1-30  (72)
 63 PF14247 DUF4344:  Domain of un  71.6     1.8   4E-05   40.2   1.0   14  243-256    91-104 (220)
 64 COG3409 Putative peptidoglycan  71.5      13 0.00029   32.1   6.5   58   61-120   127-184 (185)
 65 COG4783 Putative Zn-dependent   70.5       2 4.2E-05   44.1   1.0   16  242-257   128-143 (484)
 66 COG2856 Predicted Zn peptidase  69.9     2.1 4.5E-05   39.6   1.0   18  244-263    72-89  (213)
 67 KOG3658 Tumor necrosis factor-  69.6     2.9 6.3E-05   44.5   2.1   49  245-293   393-453 (764)
 68 PTZ00257 Glycoprotein GP63 (le  68.6     2.5 5.5E-05   44.8   1.4   54  243-296   255-313 (622)
 69 cd06455 M3A_TOP Peptidase M3 T  67.2      15 0.00032   37.7   6.7   14  243-256   262-275 (472)
 70 PF02163 Peptidase_M50:  Peptid  66.8     2.9 6.2E-05   37.1   1.2   21  245-265     8-28  (192)
 71 TIGR02290 M3_fam_3 oligoendope  65.3     3.4 7.3E-05   43.5   1.6   19  171-189   307-325 (587)
 72 cd06163 S2P-M50_PDZ_RseP-like   65.3     3.2 6.9E-05   37.3   1.2   14  245-258    10-23  (182)
 73 cd05709 S2P-M50 Site-2 proteas  64.4     3.4 7.3E-05   36.4   1.2   20  245-264     9-28  (180)
 74 PF01435 Peptidase_M48:  Peptid  64.1     3.6 7.8E-05   37.0   1.4   20  242-262    87-106 (226)
 75 KOG3538 Disintegrin metallopro  62.7     4.7  0.0001   44.5   2.2   32  243-274   316-353 (845)
 76 cd06161 S2P-M50_SpoIVFB SpoIVF  61.5     4.1 8.8E-05   37.2   1.2   18  244-261    38-55  (208)
 77 COG0501 HtpX Zn-dependent prot  60.4       4 8.7E-05   38.6   1.0   14  242-255   155-168 (302)
 78 PRK03982 heat shock protein Ht  59.7     4.2 9.1E-05   39.0   1.0   15  242-256   123-137 (288)
 79 PRK03001 M48 family peptidase;  57.4     4.9 0.00011   38.4   1.0   15  242-256   122-136 (283)
 80 cd06258 Peptidase_M3_like The   57.2     5.2 0.00011   39.2   1.2   15  242-256   152-166 (365)
 81 cd06459 M3B_Oligoendopeptidase  56.8     4.9 0.00011   40.0   1.0   15  242-256   220-234 (427)
 82 PRK03072 heat shock protein Ht  56.5     5.1 0.00011   38.5   1.0   16  241-256   124-139 (288)
 83 PRK01345 heat shock protein Ht  56.4     5.2 0.00011   39.1   1.0   15  242-256   122-136 (317)
 84 cd06164 S2P-M50_SpoIVFB_CBS Sp  55.6     5.9 0.00013   36.8   1.2   18  244-261    53-70  (227)
 85 PF01432 Peptidase_M3:  Peptida  54.5     6.1 0.00013   40.1   1.2   12  245-256   243-254 (458)
 86 PF12725 DUF3810:  Protein of u  52.8     6.2 0.00013   38.6   0.9   23  241-263   193-215 (318)
 87 PRK04897 heat shock protein Ht  52.5     6.5 0.00014   38.0   1.0   14  242-255   135-148 (298)
 88 cd06162 S2P-M50_PDZ_SREBP Ster  52.4     7.1 0.00015   37.6   1.2   16  244-259   135-150 (277)
 89 cd06159 S2P-M50_PDZ_Arch Uncha  52.2     7.1 0.00015   37.2   1.2   18  245-262   119-136 (263)
 90 PRK02870 heat shock protein Ht  51.3       7 0.00015   38.6   1.0   15  241-255   170-184 (336)
 91 PRK05457 heat shock protein Ht  51.3       7 0.00015   37.5   1.0   15  241-255   131-145 (284)
 92 PF13485 Peptidase_MA_2:  Pepti  50.7     8.2 0.00018   30.8   1.2   14  243-256    24-37  (128)
 93 PRK01265 heat shock protein Ht  50.7     7.2 0.00016   38.3   1.0   14  242-255   138-151 (324)
 94 PRK02391 heat shock protein Ht  50.5     7.3 0.00016   37.7   1.0   15  241-255   130-144 (296)
 95 TIGR02289 M3_not_pepF oligoend  50.5     7.7 0.00017   40.6   1.3   16  241-256   334-349 (549)
 96 cd06457 M3A_MIP Peptidase M3 m  48.3     8.6 0.00019   39.4   1.2   25  176-200   132-156 (458)
 97 KOG2921 Intramembrane metallop  48.0     8.6 0.00019   38.7   1.1   18  245-262   132-149 (484)
 98 TIGR00181 pepF oligoendopeptid  47.8     9.2  0.0002   40.2   1.4   15  242-256   376-390 (591)
 99 PF03272 Enhancin:  Viral enhan  47.8      21 0.00045   39.2   4.1  136  148-294   128-289 (775)
100 cd06160 S2P-M50_like_2 Unchara  47.4     9.6 0.00021   34.3   1.2   23  245-267    42-64  (183)
101 PF01447 Peptidase_M4:  Thermol  46.9     9.6 0.00021   33.2   1.1   12  245-256   136-147 (150)
102 cd06456 M3A_DCP_Oligopeptidase  46.1      10 0.00022   38.5   1.3   14  243-256   207-220 (422)
103 PF04228 Zn_peptidase:  Putativ  44.3      11 0.00023   36.6   1.0   11  245-255   171-181 (292)
104 PF01434 Peptidase_M41:  Peptid  43.5      11 0.00024   34.4   1.0   16  241-256    25-40  (213)
105 PLN02791 Nudix hydrolase homol  42.9      11 0.00024   41.3   1.0   20  244-263   585-604 (770)
106 PF01863 DUF45:  Protein of unk  41.0      16 0.00036   32.4   1.7   26  243-268   163-188 (205)
107 cd06460 M32_Taq Peptidase fami  40.6      13 0.00028   37.6   1.0   12  245-256   160-171 (396)
108 TIGR00054 RIP metalloprotease   40.5      13 0.00028   37.7   1.0   12  245-256    15-26  (420)
109 cd06158 S2P-M50_like_1 Unchara  40.4      14 0.00031   33.0   1.2   13  244-256     9-21  (181)
110 PRK10779 zinc metallopeptidase  39.9      13 0.00029   37.8   1.0   12  245-256    16-27  (449)
111 PF08823 PG_binding_2:  Putativ  39.7      71  0.0015   24.5   4.8   32   90-121    15-49  (74)
112 COG1164 Oligoendopeptidase F [  38.6      14 0.00031   39.3   1.0   14  243-256   379-392 (598)
113 PF08434 CLCA_N:  Calcium-activ  37.7      14  0.0003   35.3   0.7   12  244-255   149-160 (262)
114 cd02641 R3H_Smubp-2_like R3H d  35.4      19 0.00042   26.3   1.0   19  244-262    29-47  (60)
115 PF10281 Ish1:  Putative stress  35.0      87  0.0019   20.5   4.1   29   65-97      7-35  (38)
116 PRK10911 oligopeptidase A; Pro  35.0      17 0.00038   39.2   1.0   13  244-256   463-475 (680)
117 KOG2719 Metalloprotease [Gener  33.2      19 0.00042   36.6   0.9   14  242-255   278-291 (428)
118 KOG2089 Metalloendopeptidase f  31.0   1E+02  0.0022   33.3   5.7   43  177-220   391-435 (718)
119 PF13699 DUF4157:  Domain of un  30.6      25 0.00054   27.2   1.0   13  244-256    61-73  (79)
120 KOG3988 Protein-tyrosine sulfo  29.8      68  0.0015   31.3   3.9   38   44-81    295-332 (378)
121 PF03571 Peptidase_M49:  Peptid  29.6      22 0.00047   37.5   0.7   15  244-258   293-308 (549)
122 PF06167 Peptidase_M90:  Glucos  29.5      27 0.00058   33.1   1.2   20  242-261   152-171 (253)
123 COG4784 Putative Zn-dependent   27.2 1.3E+02  0.0028   30.2   5.4   16  241-256   121-136 (479)
124 PF10463 Peptidase_U49:  Peptid  26.3      37 0.00081   31.2   1.5   43  245-296   102-144 (206)
125 COG4227 Antirestriction protei  25.9      30 0.00065   33.1   0.8   22  240-261   199-220 (316)
126 PRK03094 hypothetical protein;  25.6      49  0.0011   25.9   1.8   17   62-78      8-24  (80)
127 PRK10280 dipeptidyl carboxypep  25.4      32  0.0007   37.2   1.1   31  178-208   364-397 (681)
128 KOG2661 Peptidase family M48 [  23.6      40 0.00087   33.4   1.2   18  244-262   275-292 (424)
129 PF03698 UPF0180:  Uncharacteri  22.3      62  0.0013   25.3   1.8   17   62-78      8-24  (80)
130 PRK10733 hflB ATP-dependent me  21.9      42 0.00091   36.0   1.1   14  243-256   407-420 (644)
131 cd02639 R3H_RRM R3H domain of   21.8      60  0.0013   23.9   1.6   22  244-265    29-50  (60)
132 PF08006 DUF1700:  Protein of u  21.4      82  0.0018   27.7   2.7   47   64-123     7-54  (181)
133 COG1059 Thermostable 8-oxoguan  21.2 1.3E+02  0.0028   27.5   3.9   46   67-113   153-198 (210)
134 PF12315 DUF3633:  Protein of u  20.8      49  0.0011   30.6   1.1   11  245-255    94-104 (212)
135 PF01431 Peptidase_M13:  Peptid  20.2      51  0.0011   29.4   1.1   16  244-259    36-51  (206)
136 cd06461 M2_ACE Peptidase famil  20.1      52  0.0011   34.1   1.3   16  243-258   246-261 (477)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=9.5e-51  Score=407.99  Aligned_cols=220  Identities=45%  Similarity=0.799  Sum_probs=200.4

Q ss_pred             ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcCCCCCCCCcccccccccccccc
Q 037540           63 GMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMSPRCGVSDTSFRQKIHATKRYS  142 (340)
Q Consensus        63 ~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~PRCGvpD~~~~~~~~~~~~~~  142 (340)
                      +...++.||++|||+++.+...+...+..+++||+.||++++|++||++|.+|++.|++|||||||          ++++
T Consensus        29 ~~~~~~~yl~~~~y~~~~~~~~~~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd----------~~~~   98 (469)
T KOG1565|consen   29 DKVALQDYLECYGYLPPTDLTATRASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPD----------GRYR   98 (469)
T ss_pred             chhHHHHHhhhcccCCCccccccccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCC----------Cccc
Confidence            678899999999999876543222257899999999999999999999999999999999999999          2488


Q ss_pred             cccCCCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-
Q 037540          143 YFYGKPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-  221 (340)
Q Consensus       143 ~~~~~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-  221 (340)
                      ++++++||++.   +|||+|.  |++++++..+++.++++||+.|+++++++|+|+...+.|||+|.|..++|||++|| 
T Consensus        99 ~~~~~~kW~k~---~lT~ri~--n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFD  173 (469)
T KOG1565|consen   99 YFPGKPKWNKE---HLTYRIK--NYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFD  173 (469)
T ss_pred             cCcccCccccc---ccceecc--ccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCccc
Confidence            89999999999   9999999  77899999999999999999999999999999998789999999999999998887 


Q ss_pred             --------------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCC
Q 037540          222 --------------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDL  281 (340)
Q Consensus       222 --------------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L  281 (340)
                                          |||++|.|+++     ...++|++.||+|||||+|||.||.++++||||+|++...++.|
T Consensus       174 G~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-----~~~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L  248 (469)
T KOG1565|consen  174 GPGGVLAHAFFPGPGIGGDLHFDKDETWTYG-----DSNGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDL  248 (469)
T ss_pred             CCCCceecccCCCCCCCCccccCcccceecc-----CCccchhHHHhhhhcccccccCCCCCcccccccccccCCCCccc
Confidence                                99999999986     13489999999999999999999999999999999976678899


Q ss_pred             CHHHHHHHHHHhCCCCCCCcc
Q 037540          282 KVDDVEGVQALYGSNPNFKFS  302 (340)
Q Consensus       282 ~~dDI~~Iq~LYG~~~~~~~~  302 (340)
                      ++|||.|||.|||.+++....
T Consensus       249 ~~DDv~giq~lYG~~~~~~~~  269 (469)
T KOG1565|consen  249 SQDDVRGIQHLYGGPPRDYFM  269 (469)
T ss_pred             ChhhhhhhHHHhCCCcccccc
Confidence            999999999999998876553


No 2  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.97  E-value=2.4e-30  Score=223.38  Aligned_cols=133  Identities=44%  Similarity=0.834  Sum_probs=110.3

Q ss_pred             CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-------
Q 037540          149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-------  221 (340)
Q Consensus       149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-------  221 (340)
                      ||++.   +|||+|...  .+.++.+++++++++||+.|+++++++|+|+.+ ..+||+|.|....++++..|       
T Consensus         1 ~W~~~---~ity~i~~~--~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~   74 (154)
T PF00413_consen    1 KWPKK---TITYSISNT--TPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTL   74 (154)
T ss_dssp             SSSSS---EEEEEESSB--CTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSES
T ss_pred             CCCCC---cEEEEEECC--CCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeee
Confidence            79998   999999954  467778999999999999999999999999985 57999999987766654433       


Q ss_pred             --------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHH
Q 037540          222 --------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVE  287 (340)
Q Consensus       222 --------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~  287 (340)
                                    +|+.++.|...      ..+.++..|++|||||+|||.|+++++|||||.+.+... ..|+.+||.
T Consensus        75 ~~~~~~~~~~~~~i~~~~~~~~~~~------~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~-~~l~~~Di~  147 (154)
T PF00413_consen   75 AHAYFPNNIVSGDIHFNDDESWTID------DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDN-KTLSEDDID  147 (154)
T ss_dssp             EEEEESSSTTTTEEEEETTSHEESS------SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSS-TSTTHHHHH
T ss_pred             eccccccccccccccccccccchhh------hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCC-CCCCHHHHH
Confidence                          34444445443      235678999999999999999999999999999986542 389999999


Q ss_pred             HHHHHhC
Q 037540          288 GVQALYG  294 (340)
Q Consensus       288 ~Iq~LYG  294 (340)
                      +||+|||
T Consensus       148 ~i~~lYg  154 (154)
T PF00413_consen  148 GIQYLYG  154 (154)
T ss_dssp             HHHHHHS
T ss_pred             HHHHHhC
Confidence            9999998


No 3  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.97  E-value=3e-30  Score=225.31  Aligned_cols=137  Identities=43%  Similarity=0.780  Sum_probs=112.8

Q ss_pred             CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-------
Q 037540          149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-------  221 (340)
Q Consensus       149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-------  221 (340)
                      ||+++   +|||+|..  ..+.++...+++++++|++.|+++++|+|+|+.+...|||+|.|....|+++.+|       
T Consensus         1 kW~~~---~itY~i~~--~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l   75 (157)
T cd04278           1 KWSKT---NLTYRILN--YPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTL   75 (157)
T ss_pred             CCCCC---ceeEEEEC--CCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccc
Confidence            79999   99999974  3444667999999999999999999999999987558999999998777653222       


Q ss_pred             -------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHHH
Q 037540          222 -------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVEG  288 (340)
Q Consensus       222 -------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~  288 (340)
                                   +|+.++.|.+..    ...+.+++.|++|||||+|||.|+.++++||||.+.+......|+.+||.+
T Consensus        76 ~~a~~p~~~~g~i~~~~~~~~~~~~----~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~  151 (157)
T cd04278          76 AHAFFPGGIGGDIHFDDDEQWTLGS----DSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRG  151 (157)
T ss_pred             ccccCCCCcceeEEECCCcccccCC----CCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHH
Confidence                         344555555432    124678999999999999999999999999999998653456899999999


Q ss_pred             HHHHhC
Q 037540          289 VQALYG  294 (340)
Q Consensus       289 Iq~LYG  294 (340)
                      ||+|||
T Consensus       152 ~~~lYg  157 (157)
T cd04278         152 IQALYG  157 (157)
T ss_pred             HHHhcC
Confidence            999998


No 4  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.90  E-value=2.1e-23  Score=181.82  Aligned_cols=139  Identities=27%  Similarity=0.311  Sum_probs=100.0

Q ss_pred             CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccC-CCccEEEEeecC----CCCCC--Cccc-----c
Q 037540          156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDY-DSADIHIGFYHG----DHGDG--QPFH-----F  223 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~-~~ADI~I~f~~g----~hgdg--~pfh-----f  223 (340)
                      ..|+|.+.+.+..+.-...++++++++|++.|+++++|+|+++.+. .++||+|.|...    ..|+.  .++.     +
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~   81 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDG   81 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCC
Confidence            3789999865332223357899999999999999999999998653 379999999853    11111  1110     1


Q ss_pred             cCCcceeecccccccc--ccccchhHHHHhhhhhhCCCcCCCCC-CccccccCCCC-CCCCCCHHHHHHHHHHhC
Q 037540          224 DAAETWAVDMESVKSR--VAVDLESVATHEIGHVLGLAHTSVKE-AVMYPSLTPRT-RKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       224 d~~e~wt~~~~~~~~~--~~~dl~~Va~HEIGHaLGL~Hs~d~~-SIMyP~~~~~~-~~~~L~~dDI~~Iq~LYG  294 (340)
                      +....|..........  .+.+++.|++|||||+|||.|++++. +||||++.... .+..|+++|+++||.|||
T Consensus        82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~lY~  156 (156)
T cd04279          82 NRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRLYG  156 (156)
T ss_pred             cccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHHhC
Confidence            1223333332211111  25678999999999999999999999 99999998642 356899999999999997


No 5  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.85  E-value=2.3e-21  Score=169.32  Aligned_cols=132  Identities=21%  Similarity=0.320  Sum_probs=93.4

Q ss_pred             CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC--CCCCcc--cc---cCCcc
Q 037540          156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH--GDGQPF--HF---DAAET  228 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h--gdg~pf--hf---d~~e~  228 (340)
                      ..|+|.|... +     ...+|+++++||+.|+++++++|+|+.+...+||+|.+....+  ..++++  +.   ...+.
T Consensus         2 ~~I~y~i~~~-~-----~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i   75 (165)
T cd04268           2 KPITYYIDDS-V-----PDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEI   75 (165)
T ss_pred             CCEEEEEcCC-C-----CHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCCccE
Confidence            3899999852 2     2679999999999999999999999987558999999876321  112222  11   11233


Q ss_pred             eeecccccc---ccccccchhHHHHhhhhhhCCCcCCC----------------CCCccccccCCC------CCCCCCCH
Q 037540          229 WAVDMESVK---SRVAVDLESVATHEIGHVLGLAHTSV----------------KEAVMYPSLTPR------TRKVDLKV  283 (340)
Q Consensus       229 wt~~~~~~~---~~~~~dl~~Va~HEIGHaLGL~Hs~d----------------~~SIMyP~~~~~------~~~~~L~~  283 (340)
                      |........   ...+..+..|++|||||+|||.|+++                ..|||++.....      .....|+.
T Consensus        76 ~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~~~  155 (165)
T cd04268          76 LLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTIGP  155 (165)
T ss_pred             EeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCCCH
Confidence            321111000   01234578999999999999999998                899998654421      13468999


Q ss_pred             HHHHHHHHHh
Q 037540          284 DDVEGVQALY  293 (340)
Q Consensus       284 dDI~~Iq~LY  293 (340)
                      +||.+||.||
T Consensus       156 ~Di~ai~~lY  165 (165)
T cd04268         156 YDIAAIKKLY  165 (165)
T ss_pred             HHHHHHHhcC
Confidence            9999999998


No 6  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.85  E-value=2.6e-21  Score=165.36  Aligned_cols=134  Identities=31%  Similarity=0.491  Sum_probs=101.1

Q ss_pred             CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC-CCCCcc-ccc
Q 037540          147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH-GDGQPF-HFD  224 (340)
Q Consensus       147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h-gdg~pf-hfd  224 (340)
                      ..+|++.   +|+|.|..    +.++..++++++++|++.|+++++++|+|+.+  .+||.|.|...++ |+..++ .+.
T Consensus         2 ~~~W~~~---~v~Y~i~~----~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~g~~~a~~g~~   72 (140)
T smart00235        2 SKKWPKG---TVPYVIDS----SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGSGCTLSHAGRP   72 (140)
T ss_pred             CCcCCCC---cEEEEEcC----CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCCCcceeeeecC
Confidence            4699998   99999972    23344556999999999999999999999865  7999999999887 431132 122


Q ss_pred             CCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCC---ccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 037540          225 AAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEA---VMYPSLTPRTRKVDLKVDDVEGVQALYGS  295 (340)
Q Consensus       225 ~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~S---IMyP~~~~~~~~~~L~~dDI~~Iq~LYG~  295 (340)
                      .+..+ ++..     .++....|++|||||+|||.|++++.+   +|++.+......+.++++|..+++.+||+
T Consensus        73 ~g~~~-~~~~-----~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~~yg~  140 (140)
T smart00235       73 GGDQH-FSLG-----NGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPYDYGS  140 (140)
T ss_pred             CCceE-EEcc-----CCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchhccCc
Confidence            22232 2211     133345699999999999999998888   99999875544567888899999999984


No 7  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.83  E-value=4.6e-20  Score=165.17  Aligned_cols=122  Identities=25%  Similarity=0.342  Sum_probs=84.7

Q ss_pred             HHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCC--CCccccc-------CCcceeeccccc-cccccc
Q 037540          173 LTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGD--GQPFHFD-------AAETWAVDMESV-KSRVAV  242 (340)
Q Consensus       173 ~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgd--g~pfhfd-------~~e~wt~~~~~~-~~~~~~  242 (340)
                      ....++++++|++.|+++++|+|+|+.+...+||+|.+....++.  |+++.-.       .+..|....... ....+.
T Consensus        32 ~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~g~  111 (186)
T cd04277          32 SAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSPGS  111 (186)
T ss_pred             CHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCCCh
Confidence            467999999999999999999999998866899999987753221  2222111       112232111100 002255


Q ss_pred             cchhHHHHhhhhhhCCCcCCCC----------------CCccccccCCC-------CCCCCCCHHHHHHHHHHhC
Q 037540          243 DLESVATHEIGHVLGLAHTSVK----------------EAVMYPSLTPR-------TRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~----------------~SIMyP~~~~~-------~~~~~L~~dDI~~Iq~LYG  294 (340)
                      ....|++|||||+|||.|+++.                .|||.+.....       .....++..||++||+|||
T Consensus       112 ~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~AlQ~lYG  186 (186)
T cd04277         112 YGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIAALQYLYG  186 (186)
T ss_pred             hhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHHHHHHhhC
Confidence            6789999999999999999853                46895433221       2346789999999999998


No 8  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74  E-value=1.6e-17  Score=150.64  Aligned_cols=103  Identities=26%  Similarity=0.413  Sum_probs=70.1

Q ss_pred             CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccC--
Q 037540          149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDA--  225 (340)
Q Consensus       149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~--  225 (340)
                      +|+++  +.|+|.|... .     .+..+++|++||+.|++++||+|+|++. .++||+|.|..++ |+ +++ .-+.  
T Consensus         2 ~W~~~--~~~~~~f~~~-~-----~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-Gc-~S~vG~~~~~   70 (198)
T cd04327           2 LWRNG--TVLRIAFLGG-P-----DAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-GY-WSYVGTDALL   70 (198)
T ss_pred             CCCCC--CeEEEEeCCC-C-----cHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-CC-CCCcCCcccc
Confidence            79987  3788998742 1     3778999999999999999999999976 5799999998753 32 222 1110  


Q ss_pred             --Ccceeeccccc-cccccccchhHHHHhhhhhhCCCcCC
Q 037540          226 --AETWAVDMESV-KSRVAVDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       226 --~e~wt~~~~~~-~~~~~~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                        ...-++..... ....+++...|++|||||||||.|++
T Consensus        71 ~~~~~~t~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh  110 (198)
T cd04327          71 IGADAPTMNLGWFTDDTPDPEFSRVVLHEFGHALGFIHEH  110 (198)
T ss_pred             cCCCCceeeeeeecCCCchhhHHHHHHHHHHHHhcCcccc
Confidence              01111111100 00123466789999999999999974


No 9  
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.62  E-value=2.8e-15  Score=134.18  Aligned_cols=90  Identities=16%  Similarity=0.132  Sum_probs=63.2

Q ss_pred             cEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCCcceeeccccc
Q 037540          157 TLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESV  236 (340)
Q Consensus       157 ~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~  236 (340)
                      .|.|.|..+ ++     ...++.|++|++.|++++||+|+|+..   ++..|.|..+ .|+........++ -.+...  
T Consensus         3 ~VpY~i~~~-~~-----~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc~S~vG~~~~~-q~i~l~--   69 (180)
T cd04280           3 TVPYVIDGS-FD-----ESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGCWSYVGRVGGR-QVVSLG--   69 (180)
T ss_pred             EEEEEECCC-CC-----HHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCcceecCccCCc-eeEEeC--
Confidence            799999852 32     679999999999999999999999875   4445666655 3432222111111 122222  


Q ss_pred             cccccccchhHHHHhhhhhhCCCcCC
Q 037540          237 KSRVAVDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       237 ~~~~~~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                         .++....+++|||||||||.|++
T Consensus        70 ---~~c~~~g~v~HE~~HalG~~HEh   92 (180)
T cd04280          70 ---SGCFSLGTIVHELMHALGFYHEQ   92 (180)
T ss_pred             ---CCcCcCchhHHHHHHHhcCcchh
Confidence               25556799999999999999988


No 10 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.61  E-value=2.3e-15  Score=135.96  Aligned_cols=129  Identities=19%  Similarity=0.399  Sum_probs=79.1

Q ss_pred             CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccCCc
Q 037540          149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDAAE  227 (340)
Q Consensus       149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~~e  227 (340)
                      ||+..   +|.|.|.. ++     ....++.|++|++.|++.+||+|+++.. . ....|.+..+ .|+ +++ ....++
T Consensus         2 ~Wp~~---~IpY~~~~-~~-----~~~~~~~I~~Am~~~e~~TcI~F~~~~~-~-~~~~i~~~~~-~gC-~S~vG~~~g~   68 (191)
T PF01400_consen    2 KWPNG---TIPYYIDP-SF-----SSSQRQRIRKAMDEWEKNTCIRFVERTE-N-EDDYISFSNG-SGC-WSYVGRQGGE   68 (191)
T ss_dssp             S-GGG---EEEEEEET-TS------HHHHHHHHHHHHHHHHHSSEEEEE-SS-S-SSSEEEEESS-SSE-EEESS--SSE
T ss_pred             cCCCC---EEEEEECC-CC-----CHHHHHHHHHHHHHHHhCCCEEEEECCC-C-CceEEEeecC-ccc-cchhhhcCcc
Confidence            89999   99999985 33     2678899999999999999999999983 3 3334555543 333 332 122222


Q ss_pred             ceeeccccccccccccchhHHHHhhhhhhCCCcCC---------------------------------------CCCCcc
Q 037540          228 TWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS---------------------------------------VKEAVM  268 (340)
Q Consensus       228 ~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~---------------------------------------d~~SIM  268 (340)
                      + .+...     .++....|++|||||||||.|++                                       |+.|||
T Consensus        69 q-~i~l~-----~~c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIM  142 (191)
T PF01400_consen   69 Q-TINLG-----DGCFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIM  142 (191)
T ss_dssp             E-EEEE------TTC-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT
T ss_pred             e-eEEec-----ceeCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCee
Confidence            2 23322     24556789999999999999988                                       567999


Q ss_pred             -ccccCCC-------------------CCCCCCCHHHHHHHHHHhCCC
Q 037540          269 -YPSLTPR-------------------TRKVDLKVDDVEGVQALYGSN  296 (340)
Q Consensus       269 -yP~~~~~-------------------~~~~~L~~dDI~~Iq~LYG~~  296 (340)
                       |+.....                   .+...|+..|+.+|+.+|+.+
T Consensus       143 HY~~~~~s~~~~~~ti~~~~~~~~~~iG~~~~lS~~D~~~in~~Y~C~  190 (191)
T PF01400_consen  143 HYSSYAFSKNGSQPTITPKDPKYQETIGQRNRLSFTDIKQINKMYNCP  190 (191)
T ss_dssp             ---TTTTBSSTTS-SEEESSTTCTS-GGG-SS--HHHHHHHHHHTT--
T ss_pred             cccCcccccCCCCCeEEecCCCCcccccccCCCCHHHHHHHHHhcCCC
Confidence             4332211                   012358999999999999753


No 11 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.60  E-value=1.1e-14  Score=132.37  Aligned_cols=99  Identities=21%  Similarity=0.351  Sum_probs=67.1

Q ss_pred             CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCC
Q 037540          147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAA  226 (340)
Q Consensus       147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~  226 (340)
                      ..+|+++   .|.|.|.. +++     ...+++|++|++.|++.|||+|++..+ +...|.  |..+..|+.+.......
T Consensus         7 ~~~Wp~~---~VpY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~yi~--f~~~~~GC~S~vG~~~~   74 (200)
T cd04281           7 ERIWPGG---VIPYVIDG-NFT-----GSQRAMFKQAMRHWENFTCVTFVERTP-EENYIV--FTYRPCGCCSYVGRRGN   74 (200)
T ss_pred             cCcCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceEEEECCC-CCCEEE--EEECCCCeeEcCCCcCC
Confidence            5699999   99999974 332     567889999999999999999999765 344444  44333343221111111


Q ss_pred             cceeeccccccccccccchhHHHHhhhhhhCCCcCC
Q 037540          227 ETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       227 e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      ..-.+...     .++....+++|||||||||.|++
T Consensus        75 g~q~isl~-----~~C~~~Gti~HEl~HaLGf~HEh  105 (200)
T cd04281          75 GPQAISIG-----KNCDKFGIVVHELGHVIGFWHEH  105 (200)
T ss_pred             CceeeecC-----CCcCcCchHHHHHHHHhcCcchh
Confidence            11222222     35666789999999999999988


No 12 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.58  E-value=9.7e-15  Score=127.37  Aligned_cols=117  Identities=21%  Similarity=0.255  Sum_probs=84.0

Q ss_pred             HHHHHHHHHHHHHHhhccCcceEEEeccC-CCccEEEEeecCCCC---CCCccc-----ccCCcceeecccccccccccc
Q 037540          173 LTDVKAAFKSSFQRWSSVIPVNFTEIYDY-DSADIHIGFYHGDHG---DGQPFH-----FDAAETWAVDMESVKSRVAVD  243 (340)
Q Consensus       173 ~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~-~~ADI~I~f~~g~hg---dg~pfh-----fd~~e~wt~~~~~~~~~~~~d  243 (340)
                      ...+++++++|++.|+++++++|+++... ..+||.+.+...+.+   .|.++.     ......|.....    ..+..
T Consensus        20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~~~~~~~~~~----~~~~~   95 (167)
T cd00203          20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQ----SGTKE   95 (167)
T ss_pred             HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCCCcEEEecCC----ccccc
Confidence            47799999999999999999999999863 389999988764422   123331     011222321111    11345


Q ss_pred             chhHHHHhhhhhhCCCcCCC--------------------CCCccccccCCC--CCCCCCCHHHHHHHHHHh
Q 037540          244 LESVATHEIGHVLGLAHTSV--------------------KEAVMYPSLTPR--TRKVDLKVDDVEGVQALY  293 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d--------------------~~SIMyP~~~~~--~~~~~L~~dDI~~Iq~LY  293 (340)
                      ...|++|||||+|||.|..+                    ..+||+|.....  .....++..||+.||++|
T Consensus        96 ~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~~~i~~~Y  167 (167)
T cd00203          96 GAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDIDQINKLY  167 (167)
T ss_pred             chhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHHHHHHhhC
Confidence            78999999999999999876                    468998766532  245689999999999998


No 13 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.48  E-value=4.2e-13  Score=124.30  Aligned_cols=96  Identities=20%  Similarity=0.308  Sum_probs=65.7

Q ss_pred             CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCC
Q 037540          147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAA  226 (340)
Q Consensus       147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~  226 (340)
                      ..+|++    .|.|.|.. +++     ...++.|++|++.|++.|||+|++... +...|  .|..+ .|+.+......+
T Consensus        43 ~~~Wp~----~VPY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~rt~-e~~yi--~i~~~-~GC~S~vG~~gg  108 (230)
T cd04282          43 TYRWPF----PIPYILDD-SLD-----LNAKGVILKAFEMYRLKSCVDFKPYEG-ESNYI--FFFKG-SGCWSMVGDQQG  108 (230)
T ss_pred             ccCCCc----ceeEEECC-CCC-----HHHHHHHHHHHHHHHhCCCeeEEECCC-CCcEE--EEEcC-CCeeeccCccCC
Confidence            469986    59999974 343     567899999999999999999998764 33344  44443 243222211112


Q ss_pred             cceeeccccccccccccchhHHHHhhhhhhCCCcCC
Q 037540          227 ETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       227 e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      . -.+...     .++....+++|||||||||.|++
T Consensus       109 ~-q~isl~-----~~C~~~Gti~HEl~HalGf~HEq  138 (230)
T cd04282         109 G-QNLSIG-----AGCDYKATVEHEFLHALGFYHEQ  138 (230)
T ss_pred             e-EEEEEC-----CCcCCCchHHHHHHHHhCCcccc
Confidence            1 223222     35667899999999999999988


No 14 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.46  E-value=4.6e-13  Score=120.05  Aligned_cols=123  Identities=17%  Similarity=0.279  Sum_probs=82.7

Q ss_pred             CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccCCcceeeccc
Q 037540          156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDAAETWAVDME  234 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~~e~wt~~~~  234 (340)
                      ..|.|.|.. +++     ...++.|++|++.|++.+||+|++... +...|+|  ..+ .|+ +++ ....+ ...+...
T Consensus         4 ~~IpY~i~~-~~~-----~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~--~~~-~gC-~S~vG~~gg-~q~i~l~   71 (182)
T cd04283           4 VYVPYVISP-QYS-----ENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNI--ESR-SGC-WSYIGRQGG-RQTVSLQ   71 (182)
T ss_pred             EEEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEE--EcC-CCc-eEecCccCC-ceeEecC
Confidence            478999974 332     567899999999999999999998764 3444544  433 233 332 11222 2223322


Q ss_pred             cccccccccchhHHHHhhhhhhCCCcCC------------------------------------CCCCcc-ccccCC---
Q 037540          235 SVKSRVAVDLESVATHEIGHVLGLAHTS------------------------------------VKEAVM-YPSLTP---  274 (340)
Q Consensus       235 ~~~~~~~~dl~~Va~HEIGHaLGL~Hs~------------------------------------d~~SIM-yP~~~~---  274 (340)
                          ..++....+++|||||||||.|++                                    |++||| |+....   
T Consensus        72 ----~~~C~~~G~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~~~~~~pYDy~SiMHY~~~afs~~  147 (182)
T cd04283          72 ----KQGCMYKGIIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDTNNLGTPYDYSSVMHYGRYAFSIN  147 (182)
T ss_pred             ----CCCcCccchHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccccccCCcccceeeeccCCcccccC
Confidence                125666789999999999999988                                    557899 432210   


Q ss_pred             -------C-------CCCCCCCHHHHHHHHHHhC
Q 037540          275 -------R-------TRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       275 -------~-------~~~~~L~~dDI~~Iq~LYG  294 (340)
                             .       .+...|+..||..|+.||.
T Consensus       148 g~~Ti~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~  181 (182)
T cd04283         148 GKPTIVPIPDPNVPIGQRQGMSNLDILRINKLYN  181 (182)
T ss_pred             CCCeEEECCcccccccCCCCCCHHHHHHHHHhcC
Confidence                   0       1224689999999999996


No 15 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.46  E-value=3.9e-13  Score=134.61  Aligned_cols=137  Identities=18%  Similarity=0.235  Sum_probs=94.7

Q ss_pred             cCCCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCccccc
Q 037540          145 YGKPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFD  224 (340)
Q Consensus       145 ~~~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd  224 (340)
                      ....+|+..   .|.|.|... ++     ...|++|++||+.|++.||++|+|..+ .+.+..+-+..+  |+.......
T Consensus        77 ~~~~~Wp~~---~ipY~i~~~-~~-----~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--gC~S~VGr~  144 (411)
T KOG3714|consen   77 NPERRWPNG---VIPYYIDGS-FT-----SSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--GCYSYVGRR  144 (411)
T ss_pred             ChhhcCCCC---eeeeEECCC-CC-----HHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--cceeeeCcc
Confidence            346799998   999999863 43     789999999999999999999999874 334444444332  443222222


Q ss_pred             CCcceeeccccccccccccchhHHHHhhhhhhCCCcCC---------------------------------------CCC
Q 037540          225 AAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS---------------------------------------VKE  265 (340)
Q Consensus       225 ~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~---------------------------------------d~~  265 (340)
                      ....-.+.     ...++....|++||||||||+.|++                                       |++
T Consensus       145 gg~~q~~s-----l~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDyg  219 (411)
T KOG3714|consen  145 GGGQQLLS-----LGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYG  219 (411)
T ss_pred             CCCcccee-----cCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCC
Confidence            22111121     2246777999999999999999988                                       568


Q ss_pred             Cccccc-cC-----------CC--------CCCCCCCHHHHHHHHHHhCCCCC
Q 037540          266 AVMYPS-LT-----------PR--------TRKVDLKVDDVEGVQALYGSNPN  298 (340)
Q Consensus       266 SIMyP~-~~-----------~~--------~~~~~L~~dDI~~Iq~LYG~~~~  298 (340)
                      |||++. +.           +.        .+...++..||..|..||.....
T Consensus       220 SvMHY~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~  272 (411)
T KOG3714|consen  220 SVMHYAPYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPEC  272 (411)
T ss_pred             cccccCCcccCcCCCCCceecccccccccccccCcCCHHHHHHHHhhcCCCcc
Confidence            999522 11           00        11236899999999999987543


No 16 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.42  E-value=1.9e-12  Score=117.44  Aligned_cols=134  Identities=22%  Similarity=0.331  Sum_probs=81.6

Q ss_pred             CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhcc------CcceEEEeccC--CCccEE---EEeecCCCC-CCC-ccc
Q 037540          156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSV------IPVNFTEIYDY--DSADIH---IGFYHGDHG-DGQ-PFH  222 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~v------t~l~F~ev~~~--~~ADI~---I~f~~g~hg-dg~-pfh  222 (340)
                      ..|||+|..  .++    .++++++++||+.|+++      .+...+++...  ...||+   |.|...+.+ .++ |..
T Consensus         8 ~pItyyI~~--~~p----~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~~~~~~~gps~   81 (197)
T cd04276           8 EPIVYYLDN--TFP----EKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSV   81 (197)
T ss_pred             CCEEEEecC--CCc----HHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecCCCcceecccc
Confidence            489999984  332    78999999999999987      35655565431  236888   555554311 111 222


Q ss_pred             cc--CCcceeecc--cccc---------ccccccchhHHHHhhhhhhCCCcCCC---------------------CCCcc
Q 037540          223 FD--AAETWAVDM--ESVK---------SRVAVDLESVATHEIGHVLGLAHTSV---------------------KEAVM  268 (340)
Q Consensus       223 fd--~~e~wt~~~--~~~~---------~~~~~dl~~Va~HEIGHaLGL~Hs~d---------------------~~SIM  268 (340)
                      .|  .+|....+.  ....         ...+..+..+++||+||+|||.|...                     ..|||
T Consensus        82 ~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~SVM  161 (197)
T cd04276          82 VDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSSVM  161 (197)
T ss_pred             cCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcceee
Confidence            22  234332111  1000         01123367899999999999999651                     35788


Q ss_pred             -ccccC--CC------CCCCCCCHHHHHHHHHHhCC
Q 037540          269 -YPSLT--PR------TRKVDLKVDDVEGVQALYGS  295 (340)
Q Consensus       269 -yP~~~--~~------~~~~~L~~dDI~~Iq~LYG~  295 (340)
                       |+.+.  ..      .....+...||.+||.+||.
T Consensus       162 dY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~  197 (197)
T cd04276         162 DYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP  197 (197)
T ss_pred             cCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence             42221  11      12346899999999999983


No 17 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.23  E-value=8.5e-11  Score=107.10  Aligned_cols=138  Identities=20%  Similarity=0.330  Sum_probs=84.8

Q ss_pred             CCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhcc-CcceEEEe--ccCCCccEEEEeecCCC--CC-CCcc
Q 037540          148 PRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSV-IPVNFTEI--YDYDSADIHIGFYHGDH--GD-GQPF  221 (340)
Q Consensus       148 ~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~v-t~l~F~ev--~~~~~ADI~I~f~~g~h--gd-g~pf  221 (340)
                      .-|... +++|+..+.+.+. ..+ ...++.++++|++.|+.. ..|+|+-+  .....+||.|.-.....  |. +.+ 
T Consensus        32 nlV~~~-~~~I~i~~~~~~~-~~l-~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsA-  107 (211)
T PF12388_consen   32 NLVGSS-PRTITIIGYTGGS-QSL-NSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSA-  107 (211)
T ss_pred             eeecCC-CCEEEEEeCCCcc-ccc-cHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceec-
Confidence            345433 4588888875321 223 367999999999999986 35777522  23356788764332211  11 122 


Q ss_pred             ccc-C-----CcceeeccccccccccccchhHHHHhhhhhhCCCcCC------------------------------CCC
Q 037540          222 HFD-A-----AETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS------------------------------VKE  265 (340)
Q Consensus       222 hfd-~-----~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~------------------------------d~~  265 (340)
                      .|. .     ...+..+..   .........|++|||||++||.|+.                              |.+
T Consensus       108 GFP~s~G~P~~~I~I~~~~---~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGTPt~~d~~  184 (211)
T PF12388_consen  108 GFPTSNGNPYKFIQIYGLS---NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGTPTGADPN  184 (211)
T ss_pred             cCCCCCCCCCceEEEEecC---CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCCCCCCCCc
Confidence            122 1     111221111   1112235789999999999999976                              457


Q ss_pred             CccccccCCCCCCCCCCHHHHHHHHHHh
Q 037540          266 AVMYPSLTPRTRKVDLKVDDVEGVQALY  293 (340)
Q Consensus       266 SIMyP~~~~~~~~~~L~~dDI~~Iq~LY  293 (340)
                      |||...+... ...+++..||.+++.||
T Consensus       185 SiM~ac~~~~-~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  185 SIMNACFSSG-EDGEFTSNDITALNYLY  211 (211)
T ss_pred             hhhhccccCC-CCCCcChhhHHHHHhhC
Confidence            8897776543 34479999999999999


No 18 
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.13  E-value=1.5e-10  Score=84.07  Aligned_cols=55  Identities=35%  Similarity=0.643  Sum_probs=46.8

Q ss_pred             cChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHh
Q 037540           62 TGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTI  119 (340)
Q Consensus        62 ~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m  119 (340)
                      +.+..++++|.++||++.   ..+..+++.+++||+.||+.+||++||++|.+|+++|
T Consensus         3 ~~v~~lq~~L~~~gy~~~---~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen    3 PDVKALQQYLNRLGYYPG---PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             HHHHHHHHHHHHTTTT-S---STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             HHHHHHHHHHHHcCCCCC---CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence            356789999999999943   2356789999999999999999999999999999987


No 19 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.73  E-value=6.7e-08  Score=81.12  Aligned_cols=125  Identities=21%  Similarity=0.257  Sum_probs=69.4

Q ss_pred             CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhh-ccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCCcceeeccc
Q 037540          156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWS-SVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDME  234 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws-~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~  234 (340)
                      ++|+|.-..        ..+.+..|.+|.+.|+ .|.+++|++..+   ++|++.-.. +.+..++ .-+.....++...
T Consensus         2 vtv~Ydas~--------A~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~-~~~~sya-~~~g~G~G~I~l~   68 (132)
T PF02031_consen    2 VTVYYDASR--------APEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN-DPRGSYA-STDGLGSGYIFLD   68 (132)
T ss_dssp             EEEEEEEEE---------GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE---TT--EE-EE-SSS-EEEEEE
T ss_pred             EEEEEeCCC--------CchhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec-CCCCccc-ccCCCCcEEEEec
Confidence            478888763        2568999999999997 688999999754   888876432 1111111 1111222222221


Q ss_pred             cccccccccchhHHHHhhhhhhCCCc--CCCCCCccccccCCC-CCCCCCCHHHHHHHHHHhC
Q 037540          235 SVKSRVAVDLESVATHEIGHVLGLAH--TSVKEAVMYPSLTPR-TRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       235 ~~~~~~~~dl~~Va~HEIGHaLGL~H--s~d~~SIMyP~~~~~-~~~~~L~~dDI~~Iq~LYG  294 (340)
                      .. ...+++-..|++|||||.|||..  +.....+|.-.-.+- -.+..+....+..++++|.
T Consensus        69 ~~-~~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~a  130 (132)
T PF02031_consen   69 YQ-QNQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLFA  130 (132)
T ss_dssp             HH-HHHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHCT
T ss_pred             hH-HhhCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHhc
Confidence            11 13467788899999999999964  334567887433221 1244578899999999985


No 20 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.61  E-value=2.4e-07  Score=83.68  Aligned_cols=112  Identities=21%  Similarity=0.376  Sum_probs=73.3

Q ss_pred             HHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCc-c----------ccc--------CCcceeeccc
Q 037540          174 TDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQP-F----------HFD--------AAETWAVDME  234 (340)
Q Consensus       174 ~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~p-f----------hfd--------~~e~wt~~~~  234 (340)
                      ...+.++..|++.|++.-|+..+|-  .+.|||+|..... .|-|.. |          +|=        -.....+...
T Consensus       103 p~wq~a~~tava~wa~~fpl~ive~--~eeaDItie~~n~-pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~  179 (236)
T COG5549         103 PRWQGAYLTAVAGWAKTFPLIIVER--FEEADITIEVGNP-PGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYP  179 (236)
T ss_pred             hhHHHHHHHHHHHHHHhCCceeeec--ceeeeEEEEecCC-CCCcccccchHHHHHHHHHHhhhccCcccccccccccCC
Confidence            3678899999999999977766655  4689999976543 222211 1          110        0011112111


Q ss_pred             cccccccccchhHHHHhhhhhhCCC-cCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 037540          235 SVKSRVAVDLESVATHEIGHVLGLA-HTSVKEAVMYPSLTPRTRKVDLKVDDVEGVQALYGS  295 (340)
Q Consensus       235 ~~~~~~~~dl~~Va~HEIGHaLGL~-Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~  295 (340)
                        ....+.++..+|.||+|||||+. |+.- .++|.+..+    ...++..|++.+.++|-.
T Consensus       180 --pg~~~e~L~~tarhElGhaLgi~ghsd~-ks~aly~~s----~~~is~rdv~tL~r~Yeq  234 (236)
T COG5549         180 --PGELRENLNPTARHELGHALGIWGHSDL-KSDALYGIS----WQGISRRDVKTLERKYEQ  234 (236)
T ss_pred             --cccchhhhhHHHHHhhcchheecccccc-hhhhhcchh----hcccCHHHHHHHHHHhcC
Confidence              12335568899999999999998 8764 556653333    224899999999999974


No 21 
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.90  E-value=2.7e-05  Score=71.05  Aligned_cols=69  Identities=23%  Similarity=0.308  Sum_probs=57.7

Q ss_pred             cCCCCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcCCCCCCCC
Q 037540           56 EMGSNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMSPRCGVSD  128 (340)
Q Consensus        56 ~~g~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~PRCGvpD  128 (340)
                      ++|++-+.+..+++.|.+.||...+   .+..++..+++||+.||+..||+++|.+|..|...|..| +.++.
T Consensus         8 ~~G~~g~~V~~LQ~~L~~lG~~~g~---idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~~   76 (201)
T TIGR02869         8 QRGSTGSDVIEIQRRLKAWGYYNGK---VDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQS   76 (201)
T ss_pred             CCCCCcHHHHHHHHHHHHcCCCCCC---CCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-cccCc
Confidence            4588889999999999999998533   245678999999999999999999999999999999543 34443


No 22 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.67  E-value=4.9e-05  Score=69.55  Aligned_cols=47  Identities=28%  Similarity=0.352  Sum_probs=29.8

Q ss_pred             chhHHHHhhhhhhCCCcCCCC--------------CCccccccCCCCCCCCCCHHHHHHHHHH
Q 037540          244 LESVATHEIGHVLGLAHTSVK--------------EAVMYPSLTPRTRKVDLKVDDVEGVQAL  292 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d~--------------~SIMyP~~~~~~~~~~L~~dDI~~Iq~L  292 (340)
                      ...+++|||||+|||.|..+.              .+||.+.-.+ .. ..++.-+|..|+..
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~~-~~-~~FS~~~i~~i~~~  197 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASRP-SR-PYFSPPSIEYIREV  197 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCCC-CC-CCCCchhHHHHhCC
Confidence            347899999999999998742              2577521111 11 22666677666543


No 23 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=97.37  E-value=0.00045  Score=72.24  Aligned_cols=67  Identities=21%  Similarity=0.267  Sum_probs=56.1

Q ss_pred             cCCCCCcChHHHHHHHHhcCCCCCCC------C-----------------------------------CCCCcchHHHHH
Q 037540           56 EMGSNITGMSELKKYFTRFGYLSLSY------T-----------------------------------NFTDVFDTQLQS   94 (340)
Q Consensus        56 ~~g~~~~~~~~~~~YL~~fGYl~~~~------~-----------------------------------~~~~~~d~~l~~   94 (340)
                      ++|+.-+.+..+++=|...|++....      .                                   ..+..+|.+|.+
T Consensus       236 rpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~YD~~Lv~  315 (608)
T PRK10594        236 RPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPAVRAAYDNELVE  315 (608)
T ss_pred             CCCCCCchHHHHHHHHHHcCCcccccccccccccccccccccccccccccccccccccccccccccccccccccCHHHHH
Confidence            67999999999999999999985210      0                                   012357999999


Q ss_pred             HHHHHHHhcCCCCCccCCHHHHHHhcCC
Q 037540           95 AVVSYQTKLGLPVTGKLDSETISTIMSP  122 (340)
Q Consensus        95 Ai~~~Q~~~gLpvTG~LD~~T~~~m~~P  122 (340)
                      |||+||+..||..+|+++..|++.|+.|
T Consensus       316 AVK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        316 AVKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             HHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            9999999999999999999999998754


No 24 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.36  E-value=7.9e-05  Score=66.27  Aligned_cols=47  Identities=19%  Similarity=0.212  Sum_probs=19.1

Q ss_pred             chhHHHHhhhhhhCCCcCCCCC-------------CccccccCCCCCCCCCCHHHHHHHHHH
Q 037540          244 LESVATHEIGHVLGLAHTSVKE-------------AVMYPSLTPRTRKVDLKVDDVEGVQAL  292 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d~~-------------SIMyP~~~~~~~~~~L~~dDI~~Iq~L  292 (340)
                      ...|++|||||.||+.|..+.+             .||++.  .......+++--|+.|+.+
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~--~~~~~~~fS~cS~~~i~~~  170 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYA--GICNNDKFSPCSIRQIRAV  170 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG--------------
T ss_pred             eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeC--CCCCCcccccccccccccc
Confidence            3568999999999999988643             688744  2223345566666665543


No 25 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.33  E-value=0.00067  Score=61.60  Aligned_cols=108  Identities=19%  Similarity=0.269  Sum_probs=66.0

Q ss_pred             CcEEEEeecCCcCCC---CCHHHHHHHHHHHH---HHhhccCcceEEEeccCCCccEEEEeecCCCCC--CCcc------
Q 037540          156 MTLTYAFSKTDMIDY---IDLTDVKAAFKSSF---QRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGD--GQPF------  221 (340)
Q Consensus       156 ~~LTY~f~~~n~~~~---l~~~~~r~ai~~Af---~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgd--g~pf------  221 (340)
                      ..++|++.-++-...   .+.++..++|.+.+   +-|.....+.|+.|.+ +.+|++|...+...-+  +.+.      
T Consensus        29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e~  107 (203)
T PF11350_consen   29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGET  107 (203)
T ss_pred             ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCcee
Confidence            489999987643221   12355666666655   5699888999999987 4599999987653211  1011      


Q ss_pred             --cc-------cCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCC
Q 037540          222 --HF-------DAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVK  264 (340)
Q Consensus       222 --hf-------d~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~  264 (340)
                        +.       -+..+|..+...-...-+.....|+=||.||+||-+|..=+
T Consensus       108 SC~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cp  159 (203)
T PF11350_consen  108 SCRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCP  159 (203)
T ss_pred             EeecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCC
Confidence              11       01234544332211111233567999999999999887643


No 26 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.27  E-value=0.00013  Score=64.74  Aligned_cols=27  Identities=41%  Similarity=0.572  Sum_probs=24.3

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCccccc
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAVMYPS  271 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~  271 (340)
                      ..=++||+||+|||.|=.++.-||++.
T Consensus       125 ~KEv~HElGH~~GL~HC~N~~CVM~FS  151 (181)
T COG1913         125 VKEVLHELGHLLGLSHCPNPRCVMNFS  151 (181)
T ss_pred             HHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence            456899999999999999999999854


No 27 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.20  E-value=0.00035  Score=62.56  Aligned_cols=48  Identities=19%  Similarity=0.263  Sum_probs=32.8

Q ss_pred             cchhHHHHhhhhhhCCCcCCCC----------CCccccccCCCCCCCCCCHHHHHHHHH
Q 037540          243 DLESVATHEIGHVLGLAHTSVK----------EAVMYPSLTPRTRKVDLKVDDVEGVQA  291 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~----------~SIMyP~~~~~~~~~~L~~dDI~~Iq~  291 (340)
                      ....+++||+||.|||.|....          .-||.|.... .....+|.-.++.|+.
T Consensus       132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~  189 (192)
T cd04267         132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE  189 (192)
T ss_pred             eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence            4567999999999999997753          2489876543 2234566655555553


No 28 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.11  E-value=0.0002  Score=64.21  Aligned_cols=23  Identities=35%  Similarity=0.430  Sum_probs=14.9

Q ss_pred             cchhHHHHhhhhhhCCCcCCCCC
Q 037540          243 DLESVATHEIGHVLGLAHTSVKE  265 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~~  265 (340)
                      ....|++|||||.||+.|..+..
T Consensus       141 ~~~~~~AHEiGH~lGa~HD~~~~  163 (196)
T PF13688_consen  141 NGAITFAHEIGHNLGAPHDGDYS  163 (196)
T ss_dssp             HHHHHHHHHHHHHTT-----SSS
T ss_pred             ceehhhHHhHHHhcCCCCCCCCC
Confidence            34679999999999999987753


No 29 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.06  E-value=0.00035  Score=62.74  Aligned_cols=27  Identities=41%  Similarity=0.608  Sum_probs=24.0

Q ss_pred             chhHHHHhhhhhhCCCcCCCCCCcccc
Q 037540          244 LESVATHEIGHVLGLAHTSVKEAVMYP  270 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d~~SIMyP  270 (340)
                      +..+++||+||.|||.|-.++.-||+.
T Consensus       125 ~~k~~~HElGH~lGL~HC~~~~CvM~~  151 (179)
T PRK13267        125 VRKEVTHELGHTLGLEHCDNPRCVMNF  151 (179)
T ss_pred             HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence            345699999999999999999999984


No 30 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.98  E-value=0.00029  Score=58.40  Aligned_cols=18  Identities=50%  Similarity=0.682  Sum_probs=12.6

Q ss_pred             chhHHHHhhhhhhCCCcC
Q 037540          244 LESVATHEIGHVLGLAHT  261 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs  261 (340)
                      ...|++|||||.|||.|.
T Consensus       107 ~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  107 GVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             STTHHHHHHHHHTT----
T ss_pred             cceEeeehhhHhcCCCCC
Confidence            358999999999999995


No 31 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.97  E-value=0.00053  Score=62.44  Aligned_cols=54  Identities=22%  Similarity=0.345  Sum_probs=38.8

Q ss_pred             cccchhHHHHhhhhhhCCCcCCC---------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 037540          241 AVDLESVATHEIGHVLGLAHTSV---------KEAVMYPSLTPRTRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaLGL~Hs~d---------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG  294 (340)
                      +.....|++|||||.||+.|-.+         ..-||+|.......+...|.=-+..++.+..
T Consensus       137 ~~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~  199 (207)
T cd04273         137 GLSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLD  199 (207)
T ss_pred             CceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHc
Confidence            34456799999999999999875         2689998876432344667766666665554


No 32 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.82  E-value=0.00067  Score=61.61  Aligned_cols=27  Identities=41%  Similarity=0.588  Sum_probs=19.3

Q ss_pred             chhHHHHhhhhhhCCCcCCCCCCccccc
Q 037540          244 LESVATHEIGHVLGLAHTSVKEAVMYPS  271 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d~~SIMyP~  271 (340)
                      +..+++||+||++||.|=.+ .-+|++.
T Consensus       145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s  171 (194)
T PF07998_consen  145 VCKEAVHELGHLFGLDHCEN-RCVMNFS  171 (194)
T ss_dssp             HHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred             HHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence            35799999999999999999 9999964


No 33 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.72  E-value=0.0019  Score=60.13  Aligned_cols=49  Identities=29%  Similarity=0.493  Sum_probs=37.2

Q ss_pred             hHHHHhhhhhhCCCcCCCC-------------------------CCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 037540          246 SVATHEIGHVLGLAHTSVK-------------------------EAVMYPSLTPRTRKVDLKVDDVEGVQALYGSN  296 (340)
Q Consensus       246 ~Va~HEIGHaLGL~Hs~d~-------------------------~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~~  296 (340)
                      .|++|||||.||+.|=.+.                         .=||+|....  ....++.--+..++.+...+
T Consensus       147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~  220 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN  220 (228)
T ss_pred             eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence            5999999999999996431                         1199987554  45578888888888887654


No 34 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.70  E-value=0.00077  Score=59.03  Aligned_cols=25  Identities=40%  Similarity=0.522  Sum_probs=15.8

Q ss_pred             cccchhHHHHhhhhhhCCCcCCCCC
Q 037540          241 AVDLESVATHEIGHVLGLAHTSVKE  265 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaLGL~Hs~d~~  265 (340)
                      ..+...|++||+||-|||.|.....
T Consensus        66 ~~~~g~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT---TT---
T ss_pred             ccccccchhhhhhhhhcccccccCC
Confidence            3456789999999999999998544


No 35 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.64  E-value=0.0024  Score=57.23  Aligned_cols=49  Identities=24%  Similarity=0.365  Sum_probs=32.6

Q ss_pred             chhHHHHhhhhhhCCCcCCC------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 037540          244 LESVATHEIGHVLGLAHTSV------KEAVMYPSLTPRTRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG  294 (340)
                      .-.+++|||||.||+.|-.+      ..-||+|....  .....|.=-...++.+..
T Consensus       131 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~  185 (194)
T cd04269         131 FAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLS  185 (194)
T ss_pred             HHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHh
Confidence            35799999999999999543      36789886542  233455544555555444


No 36 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.34  E-value=0.0072  Score=62.04  Aligned_cols=65  Identities=23%  Similarity=0.400  Sum_probs=55.2

Q ss_pred             cCCCCCcChHHHHHHHHhcCC-CCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcC
Q 037540           56 EMGSNITGMSELKKYFTRFGY-LSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMS  121 (340)
Q Consensus        56 ~~g~~~~~~~~~~~YL~~fGY-l~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~  121 (340)
                      ++|+....+..+.+=|.+-|= ++.. ...+..+|.++.+|||+||+..||+..|+.++.|+++|+-
T Consensus       233 rpG~~~~~v~aL~~~L~~~~~d~~~a-~~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~  298 (561)
T COG2989         233 RPGVTSPDVPALRARLARSGMDLPSA-AGSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV  298 (561)
T ss_pred             CCCCCchhHHHHHHHHHhcCccchhh-ccCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence            679999999999999999883 3322 2235578999999999999999999999999999999974


No 37 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.19  E-value=0.00077  Score=62.64  Aligned_cols=26  Identities=46%  Similarity=0.622  Sum_probs=22.5

Q ss_pred             cccchhHHHHhhhhhhCCCcCCCCCC
Q 037540          241 AVDLESVATHEIGHVLGLAHTSVKEA  266 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaLGL~Hs~d~~S  266 (340)
                      ..++..|++||+||.|||.|.....+
T Consensus       134 ~~n~g~t~~HEvGH~lGL~HtF~~~~  159 (225)
T cd04275         134 PYNLGDTATHEVGHWLGLYHTFQGGS  159 (225)
T ss_pred             cccccceeEEeccceeeeeeeecCCC
Confidence            45677899999999999999997764


No 38 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=95.81  E-value=0.014  Score=52.51  Aligned_cols=52  Identities=21%  Similarity=0.443  Sum_probs=31.3

Q ss_pred             cccchhHHHHhhhhhhCCCcCCC--------CCCccccccCCCCCCCCCCHHHHHHHHHHh
Q 037540          241 AVDLESVATHEIGHVLGLAHTSV--------KEAVMYPSLTPRTRKVDLKVDDVEGVQALY  293 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaLGL~Hs~d--------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LY  293 (340)
                      ...+-.+++|||||.||+.|-..        ..-||.|...+.. ....|.=-+..++.+.
T Consensus       128 ~~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~-~~~fS~CS~~~~~~~l  187 (199)
T PF01421_consen  128 GLSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSS-SDKFSNCSRRQFEEFL  187 (199)
T ss_dssp             HHHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSS-TSSS-HHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCC-CCCcCHHHHHHHHHHH
Confidence            33455789999999999999766        3568998554322 2256665555555543


No 39 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=95.67  E-value=0.015  Score=54.63  Aligned_cols=50  Identities=20%  Similarity=0.383  Sum_probs=33.7

Q ss_pred             hhHHHHhhhhhhCCCcCCC-----C------CCccccccCC--CCCCCCCCHHHHHHHHHHhC
Q 037540          245 ESVATHEIGHVLGLAHTSV-----K------EAVMYPSLTP--RTRKVDLKVDDVEGVQALYG  294 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d-----~------~SIMyP~~~~--~~~~~~L~~dDI~~Iq~LYG  294 (340)
                      ..|++|||||.||+.|-.+     +      .=||+|.-..  ......++.=-++.++...-
T Consensus       168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~  230 (244)
T cd04270         168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE  230 (244)
T ss_pred             HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence            4699999999999999554     1      3489986421  12345677666666665554


No 40 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=95.42  E-value=0.023  Score=57.06  Aligned_cols=57  Identities=25%  Similarity=0.319  Sum_probs=39.0

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCcc---ccccCCC----C---------------CCCCCCHHHHHHHHHHhCCCCCCCcc
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAVM---YPSLTPR----T---------------RKVDLKVDDVEGVQALYGSNPNFKFS  302 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SIM---yP~~~~~----~---------------~~~~L~~dDI~~Iq~LYG~~~~~~~~  302 (340)
                      ...++|||||.||+.|.  ++.||   |++++..    +               .+-+.++.-+++++-+|-...+-.++
T Consensus       302 lGA~~HElGH~lgcpHq--~~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~d  379 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQ--SEGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPGD  379 (614)
T ss_pred             HHHHHHHhhhccCCCCC--CCceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCCC
Confidence            46889999999999997  46788   5554421    0               12356778889999999654333344


Q ss_pred             c
Q 037540          303 S  303 (340)
Q Consensus       303 ~  303 (340)
                      +
T Consensus       380 p  380 (614)
T KOG4525|consen  380 P  380 (614)
T ss_pred             C
Confidence            3


No 41 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.75  E-value=0.018  Score=56.22  Aligned_cols=21  Identities=43%  Similarity=0.580  Sum_probs=18.0

Q ss_pred             ccchhHHHHhhhhhhCCCcCC
Q 037540          242 VDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       242 ~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      .....|++||+||-|||.|+.
T Consensus       148 ~~~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  148 VQDWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             cccHHHHHHHhhhhccccccC
Confidence            344579999999999999996


No 42 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=94.55  E-value=0.08  Score=46.37  Aligned_cols=61  Identities=25%  Similarity=0.372  Sum_probs=49.4

Q ss_pred             CCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCccCCHHHHHHhcC
Q 037540           59 SNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGL-PVTGKLDSETISTIMS  121 (340)
Q Consensus        59 ~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gL-pvTG~LD~~T~~~m~~  121 (340)
                      ..-..+..++.-|.+.||.+.  ...+..+....+.||+.||+..+| +++|..+++|...+..
T Consensus        43 ~~~~~v~~lq~~L~~~g~~~~--~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~  104 (185)
T COG3409          43 AEGPSVRILQAALNALGYYPD--GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPS  104 (185)
T ss_pred             CCCchHHHHHHHHHhcCCCCC--CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHH
Confidence            355677888999999999873  112334567889999999999999 8999999999998865


No 43 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=94.16  E-value=0.029  Score=51.54  Aligned_cols=20  Identities=35%  Similarity=0.461  Sum_probs=17.1

Q ss_pred             cchhHHHHhhhhhhCCCcCC
Q 037540          243 DLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      ....+++||+||.||+.|-.
T Consensus       144 ~~~~~~AHElGH~lG~~HD~  163 (220)
T cd04272         144 YGVYTMTHELAHLLGAPHDG  163 (220)
T ss_pred             ccHHHHHHHHHHHhCCCCCC
Confidence            34689999999999999954


No 44 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.09  E-value=0.062  Score=49.79  Aligned_cols=54  Identities=24%  Similarity=0.412  Sum_probs=32.0

Q ss_pred             CCccEEEEeecCCCCCCCcccccCCcceeeccccccccccccc--hhHHHHhhhhhhCCCcCCC
Q 037540          202 DSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESVKSRVAVDL--ESVATHEIGHVLGLAHTSV  263 (340)
Q Consensus       202 ~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~~~~~~~dl--~~Va~HEIGHaLGL~Hs~d  263 (340)
                      +-.|++|.-..|.-+|    |||..++. +...++ ...+.++  ..||+||.|||+  .|...
T Consensus        51 gl~~V~Ve~~~G~LtD----HYdP~~k~-vrLS~~-vy~~~SiaAvaVAAHEvGHAi--Q~a~~  106 (222)
T PF04298_consen   51 GLSDVRVERVPGELTD----HYDPRNKV-VRLSED-VYNGRSIAAVAVAAHEVGHAI--QHAEG  106 (222)
T ss_pred             CCCCeeEEEeCCCCCC----CcCCCCCE-EEeCCc-cCCCCCHHHHHHHHHHHhHHH--hcccc
Confidence            4557788776664445    67766654 322211 1223333  679999999994  55543


No 45 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=93.55  E-value=0.036  Score=53.75  Aligned_cols=20  Identities=35%  Similarity=0.639  Sum_probs=14.8

Q ss_pred             cchhHHHHhhhhhhCCCcCC
Q 037540          243 DLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      ....+++||+||++||.|-.
T Consensus       192 s~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  192 SYGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             -SHHHHHHHHHHTTT--SST
T ss_pred             CccceeehhhhhhcCCCCCC
Confidence            35789999999999999943


No 46 
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=93.43  E-value=0.22  Score=46.99  Aligned_cols=71  Identities=25%  Similarity=0.367  Sum_probs=54.9

Q ss_pred             chHhhHhhhhc------------cCCCCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCccC
Q 037540           45 TWYDFARLLDA------------EMGSNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGL-PVTGKL  111 (340)
Q Consensus        45 ~~~~~~~~~~~------------~~g~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gL-pvTG~L  111 (340)
                      +|..+...+|-            ..+.+..++..+++-|.+|||--+.     +.+|++.+.+|+.||.-|.- ..+|+.
T Consensus       169 pWk~l~~~~gigaw~~~~~~~~~~~~~~~~~v~~lq~~L~~YGY~v~~-----~~~d~~t~~vv~aFQ~hfrp~~~dg~~  243 (257)
T COG3023         169 PWKRLAEALGIGAWLDTAQVQKYLALLKGEDVAALQEMLARYGYGVEI-----GVFDQETQQVVRAFQMHFRPGLYDGEA  243 (257)
T ss_pred             CHHHHHhhcCcccCCcHhhhhhhhhhhccCCHHHHHHHHHHhCcCCCc-----chhhHHHHHHHHHHHHHhCCCCCCCCC
Confidence            58777666441            1123338889999999999997643     45789999999999999985 479999


Q ss_pred             CHHHHHHhc
Q 037540          112 DSETISTIM  120 (340)
Q Consensus       112 D~~T~~~m~  120 (340)
                      |-+|++.+.
T Consensus       244 d~et~a~l~  252 (257)
T COG3023         244 DVETIAILQ  252 (257)
T ss_pred             ChHHHHHHH
Confidence            999998764


No 47 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=93.06  E-value=0.051  Score=54.97  Aligned_cols=22  Identities=36%  Similarity=0.619  Sum_probs=19.4

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCcc
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAVM  268 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SIM  268 (340)
                      ....+|||||++||.|..+  .||
T Consensus       317 iGA~lHEiGH~fg~pH~~~--GIM  338 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQED--GIM  338 (423)
T ss_pred             hHHHHHHHHHhcCCCCCCC--Cee
Confidence            4689999999999999865  888


No 48 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=92.17  E-value=0.14  Score=46.42  Aligned_cols=76  Identities=22%  Similarity=0.382  Sum_probs=42.3

Q ss_pred             HHHHHHHHHHHhhccCc---ceEEEec-----cCCCccEEEEeecCCCCCCCcccccCCcceeecccccccccccc--ch
Q 037540          176 VKAAFKSSFQRWSSVIP---VNFTEIY-----DYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESVKSRVAVD--LE  245 (340)
Q Consensus       176 ~r~ai~~Af~~Ws~vt~---l~F~ev~-----~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~~~~~~~d--l~  245 (340)
                      .+.-++.++++|++|..   ++=.||.     +.+--|+.+.-.+|.-.|    |||...+ .+...+ ....|..  -.
T Consensus        20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTD----HYDP~~k-vvrLSe-~~y~g~Sia~~   93 (226)
T COG2738          20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTD----HYDPRRK-VVRLSE-ANYYGPSIAAI   93 (226)
T ss_pred             HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCccc----ccChhhh-eeeccc-cccCCccHHHH
Confidence            45567889999998753   3322332     123456666655543333    6665433 222221 1223333  36


Q ss_pred             hHHHHhhhhhhC
Q 037540          246 SVATHEIGHVLG  257 (340)
Q Consensus       246 ~Va~HEIGHaLG  257 (340)
                      .|++||.|||+=
T Consensus        94 aVAAHEVGHAiQ  105 (226)
T COG2738          94 AVAAHEVGHAIQ  105 (226)
T ss_pred             HHHHHHhhHHHh
Confidence            799999999963


No 49 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=90.97  E-value=0.061  Score=51.65  Aligned_cols=20  Identities=40%  Similarity=0.552  Sum_probs=16.8

Q ss_pred             hhHHHHhhhhhhCCCcCCCC
Q 037540          245 ESVATHEIGHVLGLAHTSVK  264 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~  264 (340)
                      ..|++||+||.|||.+--+.
T Consensus       166 igv~~HE~gH~lGLPDlY~~  185 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYDT  185 (286)
T ss_pred             eeeeehhhhcccCCCccccC
Confidence            57999999999999875543


No 50 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=88.57  E-value=0.18  Score=42.26  Aligned_cols=17  Identities=47%  Similarity=0.812  Sum_probs=14.2

Q ss_pred             chhHHHHhhhhhhCCCc
Q 037540          244 LESVATHEIGHVLGLAH  260 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~H  260 (340)
                      ...|++|||||-.||+.
T Consensus       109 vthvliHEIgHhFGLsD  125 (136)
T COG3824         109 VTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             hhhhhhhhhhhhcCCCh
Confidence            45799999999988864


No 51 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=87.38  E-value=1.6  Score=33.63  Aligned_cols=53  Identities=19%  Similarity=0.359  Sum_probs=40.4

Q ss_pred             hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC----CCccCCHHHHHHh
Q 037540           64 MSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLP----VTGKLDSETISTI  119 (340)
Q Consensus        64 ~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLp----vTG~LD~~T~~~m  119 (340)
                      ..++++=|.+-||+..+.   +..+|+++++|++.||..-|+.    .-|++|...++-|
T Consensus        18 ~~evq~~L~~lGyy~g~~---~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L   74 (74)
T PF08823_consen   18 AREVQEALKRLGYYKGEA---DGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL   74 (74)
T ss_pred             HHHHHHHHHHcCCccCCC---CCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence            356888999999987442   4567899999999999998875    3455887776543


No 52 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=86.80  E-value=15  Score=33.98  Aligned_cols=137  Identities=15%  Similarity=0.199  Sum_probs=79.8

Q ss_pred             CCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC---------CCC
Q 037540          148 PRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH---------GDG  218 (340)
Q Consensus       148 ~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h---------gdg  218 (340)
                      .||...    |+..+..  .+. -......+.+..-++.-+.++.+....+.+. +|+|.|.|...+.         +..
T Consensus        28 ~Rw~~P----Vrv~v~~--~~~-~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~   99 (213)
T PF11150_consen   28 RRWEGP----VRVRVEG--VPP-ADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGA   99 (213)
T ss_pred             ccCCCC----eEEEEec--cCh-hhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccc
Confidence            589865    7777762  111 1234555666666888899999999888875 9999999987654         111


Q ss_pred             Ccccc------cC--------Ccceeeccc-------cccccccccchhHHHHhhhhhhCCCcCC--CCCCccccccCCC
Q 037540          219 QPFHF------DA--------AETWAVDME-------SVKSRVAVDLESVATHEIGHVLGLAHTS--VKEAVMYPSLTPR  275 (340)
Q Consensus       219 ~pfhf------d~--------~e~wt~~~~-------~~~~~~~~dl~~Va~HEIGHaLGL~Hs~--d~~SIMyP~~~~~  275 (340)
                      ..+..      ..        ...|.....       -.......-+...+..||-.+|||.--+  .+.||-+    ..
T Consensus       100 ~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFN----Dd  175 (213)
T PF11150_consen  100 ACFVVPNFSSLPRYARCLVFARSNWSGEITRARAVIFIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFN----DD  175 (213)
T ss_pred             ccccchhhhcccccceEEEEEEecCCCceEEEEEEEEEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceee----CC
Confidence            11100      00        001110000       0000111235678888899999996322  2334332    22


Q ss_pred             CCCCCCCHHHHHHHHHHhCCC
Q 037540          276 TRKVDLKVDDVEGVQALYGSN  296 (340)
Q Consensus       276 ~~~~~L~~dDI~~Iq~LYG~~  296 (340)
                      .....|+.-|..-++.||...
T Consensus       176 ~~~~~LT~~D~lLLr~LYdP~  196 (213)
T PF11150_consen  176 NEFALLTGFDELLLRMLYDPR  196 (213)
T ss_pred             CccccccHHHHHHHHHHcCcc
Confidence            344579999999999999754


No 53 
>PTZ00337 surface protease GP63; Provisional
Probab=85.86  E-value=0.86  Score=47.96  Aligned_cols=19  Identities=42%  Similarity=0.681  Sum_probs=16.1

Q ss_pred             cchhHHHHhhhhhhCCCcC
Q 037540          243 DLESVATHEIGHVLGLAHT  261 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs  261 (340)
                      ....|++|||.||||+...
T Consensus       228 ~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        228 GDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             HHHHHHHHHHHHHHccCHH
Confidence            4578999999999999654


No 54 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=84.00  E-value=1.1  Score=48.58  Aligned_cols=45  Identities=24%  Similarity=0.320  Sum_probs=29.2

Q ss_pred             chhHHHHhhhhhhCCCcCC-------CCCCccccccCCCCCCCCCCHHHHHHH
Q 037540          244 LESVATHEIGHVLGLAHTS-------VKEAVMYPSLTPRTRKVDLKVDDVEGV  289 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~-------d~~SIMyP~~~~~~~~~~L~~dDI~~I  289 (340)
                      ...|++|||||.||+.|-.       ...-|| +...+...+..++.--++.+
T Consensus       323 ~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~  374 (716)
T KOG3607|consen  323 FAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF  374 (716)
T ss_pred             HHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence            4579999999999999976       223578 66553322334555444444


No 55 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=83.92  E-value=0.54  Score=44.81  Aligned_cols=24  Identities=33%  Similarity=0.542  Sum_probs=13.3

Q ss_pred             cchhHHHHhhhhhh-CCCcCCCCCC
Q 037540          243 DLESVATHEIGHVL-GLAHTSVKEA  266 (340)
Q Consensus       243 dl~~Va~HEIGHaL-GL~Hs~d~~S  266 (340)
                      ....|++||+||++ ||+.+-..+.
T Consensus       215 ~~~~v~vHE~GHsf~~LaDEY~~~~  239 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADEYYYGE  239 (264)
T ss_dssp             THHHHHHHHHHHHTT------ECS-
T ss_pred             cccceeeeecccccccccccccCCC
Confidence            46789999999965 5666654443


No 56 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=82.22  E-value=0.73  Score=37.35  Aligned_cols=16  Identities=31%  Similarity=0.690  Sum_probs=12.6

Q ss_pred             chhHHHHhhhhhhCCC
Q 037540          244 LESVATHEIGHVLGLA  259 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~  259 (340)
                      ...|++|||||.+|+.
T Consensus        73 I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   73 IRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHcCCC
Confidence            3679999999999985


No 57 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=81.47  E-value=0.3  Score=52.07  Aligned_cols=17  Identities=47%  Similarity=0.661  Sum_probs=14.6

Q ss_pred             hhHHHHhhhhhhCCCcC
Q 037540          245 ESVATHEIGHVLGLAHT  261 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs  261 (340)
                      ..|++||+||.|||-.-
T Consensus       222 iGVfaHEfGH~LGLPDl  238 (645)
T PF05547_consen  222 IGVFAHEFGHDLGLPDL  238 (645)
T ss_pred             eEEEEeeccccCCCCCc
Confidence            57999999999999643


No 58 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=80.62  E-value=0.83  Score=47.56  Aligned_cols=55  Identities=20%  Similarity=0.253  Sum_probs=27.0

Q ss_pred             cchhHHHHhhhhhhCCCcCCCCCCccc---cccCCCC--CCCCCCHHHHHHHHHHhCCCC
Q 037540          243 DLESVATHEIGHVLGLAHTSVKEAVMY---PSLTPRT--RKVDLKVDDVEGVQALYGSNP  297 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~~SIMy---P~~~~~~--~~~~L~~dDI~~Iq~LYG~~~  297 (340)
                      .+..|++|||.|+||+.-..-...-|.   +...+..  ..+--++.=++-.+.-||.+.
T Consensus       209 ~~~~~~~HEi~HaLGFs~~~~~~~~~~~~~~~~~~~~~~~~~l~tp~v~~~ar~hf~C~~  268 (521)
T PF01457_consen  209 EFFRTVIHEIAHALGFSSSFFRYYGMPRTVSNVRGKPTTVTVLVTPNVVEFARKHFGCPT  268 (521)
T ss_dssp             HHHHHHHHHHHHHTT-SHHHHHHTT-EEEES-GGG-SS-EEEE--HHHHHHHHHHHT-TT
T ss_pred             cccceeeeeeeeeeeecccccccccccccccceeecCceeEEEeCcHHHHHHHHHhCCCc
Confidence            357899999999999976211110110   0011111  111225566666788888643


No 59 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=78.29  E-value=1.3  Score=35.22  Aligned_cols=14  Identities=36%  Similarity=0.648  Sum_probs=11.5

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      ....+++|||||.+
T Consensus        41 ~~~f~laHELgH~~   54 (122)
T PF06114_consen   41 RQRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHH
Confidence            35679999999974


No 60 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=74.70  E-value=1.5  Score=39.89  Aligned_cols=16  Identities=38%  Similarity=0.542  Sum_probs=12.9

Q ss_pred             cchhHHHHhhhhhhCC
Q 037540          243 DLESVATHEIGHVLGL  258 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL  258 (340)
                      ....|+.||+||++-=
T Consensus        21 ~~l~t~~HE~gHal~a   36 (200)
T PF13398_consen   21 RLLVTFVHELGHALAA   36 (200)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3567999999999753


No 61 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=72.36  E-value=2.2  Score=39.27  Aligned_cols=29  Identities=34%  Similarity=0.604  Sum_probs=24.9

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCccccccC
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAVMYPSLT  273 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~  273 (340)
                      ..+.-|++||++||+|.+++..||+|.+.
T Consensus       154 al~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         154 ALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             HHHHHHHhhhccCcccccccccccCCccc
Confidence            45778999999999999999999976543


No 62 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=72.31  E-value=3.9  Score=31.13  Aligned_cols=26  Identities=27%  Similarity=0.486  Sum_probs=20.6

Q ss_pred             HHHHHHHHhcC----CCCCccCCHHHHHHh
Q 037540           94 SAVVSYQTKLG----LPVTGKLDSETISTI  119 (340)
Q Consensus        94 ~Ai~~~Q~~~g----LpvTG~LD~~T~~~m  119 (340)
                      .|+|.+|+..|    +.+.|.+.+.|+..+
T Consensus         1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al   30 (72)
T PF09374_consen    1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL   30 (72)
T ss_dssp             HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred             CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence            38999999999    999999999999999


No 63 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=71.61  E-value=1.8  Score=40.16  Aligned_cols=14  Identities=36%  Similarity=0.582  Sum_probs=12.0

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      ....|+.||+||||
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46789999999985


No 64 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=71.53  E-value=13  Score=32.14  Aligned_cols=58  Identities=24%  Similarity=0.371  Sum_probs=41.1

Q ss_pred             CcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhc
Q 037540           61 ITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIM  120 (340)
Q Consensus        61 ~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~  120 (340)
                      ......++.-+..-||.....  .+..|....+.|++.||+..+|+++|..-..|...+.
T Consensus       127 ~~~~~~~~~~~~~~~~~~~~~--~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~  184 (185)
T COG3409         127 GGDVATLQQPLPLLGYRSGIR--VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR  184 (185)
T ss_pred             ccchHHHHHHHHhcccccCCC--CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence            334444455555555554321  2345678899999999999999999999999887653


No 65 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=70.45  E-value=2  Score=44.10  Aligned_cols=16  Identities=50%  Similarity=0.765  Sum_probs=13.7

Q ss_pred             ccchhHHHHhhhhhhC
Q 037540          242 VDLESVATHEIGHVLG  257 (340)
Q Consensus       242 ~dl~~Va~HEIGHaLG  257 (340)
                      ..+-+|++|||||..|
T Consensus       128 sElagViAHEigHv~q  143 (484)
T COG4783         128 SELAGVIAHEIGHVAQ  143 (484)
T ss_pred             HHHHHHHHHHHHHHhh
Confidence            4578999999999876


No 66 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=69.88  E-value=2.1  Score=39.57  Aligned_cols=18  Identities=39%  Similarity=0.623  Sum_probs=13.2

Q ss_pred             chhHHHHhhhhhhCCCcCCC
Q 037540          244 LESVATHEIGHVLGLAHTSV  263 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d  263 (340)
                      -.-|++||+||+|  -|...
T Consensus        72 ~rFtlAHELGH~l--lH~~~   89 (213)
T COG2856          72 KRFTLAHELGHAL--LHTDL   89 (213)
T ss_pred             HHHHHHHHHhHHH--hcccc
Confidence            3569999999995  45443


No 67 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=69.64  E-value=2.9  Score=44.48  Aligned_cols=49  Identities=20%  Similarity=0.359  Sum_probs=32.0

Q ss_pred             hhHHHHhhhhhhCCCcCCC----------CCCccccccCC-C-CCCCCCCHHHHHHHHHHh
Q 037540          245 ESVATHEIGHVLGLAHTSV----------KEAVMYPSLTP-R-TRKVDLKVDDVEGVQALY  293 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d----------~~SIMyP~~~~-~-~~~~~L~~dDI~~Iq~LY  293 (340)
                      .-|++|||||..|=.|-.+          -+=|||++-.. + .++..++.--+.-|...-
T Consensus       393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~VL  453 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKVL  453 (764)
T ss_pred             heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHHH
Confidence            3589999999999999664          23489986442 2 234456665665555443


No 68 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=68.62  E-value=2.5  Score=44.81  Aligned_cols=54  Identities=22%  Similarity=0.284  Sum_probs=30.3

Q ss_pred             cchhHHHHhhhhhhCCCcCCCCCCcccccc---CCCCC--CCCCCHHHHHHHHHHhCCC
Q 037540          243 DLESVATHEIGHVLGLAHTSVKEAVMYPSL---TPRTR--KVDLKVDDVEGVQALYGSN  296 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~---~~~~~--~~~L~~dDI~~Iq~LYG~~  296 (340)
                      ....|++|||.|+||+.-+.-.+--|.-..   .+...  ....++.-.+..++-||.+
T Consensus       255 ~~~rv~~HEi~HALGFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~  313 (622)
T PTZ00257        255 GTTRTVTHEVAHALGFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP  313 (622)
T ss_pred             HHHHHHHHHHHHHhcCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence            357899999999999964332211111111   11111  1112667777788888764


No 69 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=67.19  E-value=15  Score=37.70  Aligned_cols=14  Identities=43%  Similarity=0.598  Sum_probs=11.8

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      +-..|+.||+||+|
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34678999999998


No 70 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=66.77  E-value=2.9  Score=37.11  Aligned_cols=21  Identities=33%  Similarity=0.385  Sum_probs=15.3

Q ss_pred             hhHHHHhhhhhhCCCcCCCCC
Q 037540          245 ESVATHEIGHVLGLAHTSVKE  265 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~  265 (340)
                      ..+++||+||++--....++.
T Consensus         8 i~i~~HE~gH~~~a~~~G~~~   28 (192)
T PF02163_consen    8 ISIVLHELGHALAARLYGDKV   28 (192)
T ss_dssp             HHHHHHHHHHHHHHHTTT--B
T ss_pred             ccccccccccccccccccccc
Confidence            578999999998776665543


No 71 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=65.30  E-value=3.4  Score=43.55  Aligned_cols=19  Identities=16%  Similarity=0.357  Sum_probs=14.0

Q ss_pred             CCHHHHHHHHHHHHHHhhc
Q 037540          171 IDLTDVKAAFKSSFQRWSS  189 (340)
Q Consensus       171 l~~~~~r~ai~~Af~~Ws~  189 (340)
                      ++.++.++.+.+||+.|..
T Consensus       307 ~~~e~a~~~v~~~~~~l~~  325 (587)
T TIGR02290       307 YTFDEAKELVLEAFGKFSP  325 (587)
T ss_pred             ecHHHHHHHHHHHHHhcCH
Confidence            3456778888888887764


No 72 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=65.27  E-value=3.2  Score=37.35  Aligned_cols=14  Identities=36%  Similarity=0.491  Sum_probs=11.2

Q ss_pred             hhHHHHhhhhhhCC
Q 037540          245 ESVATHEIGHVLGL  258 (340)
Q Consensus       245 ~~Va~HEIGHaLGL  258 (340)
                      ..|++||+||.+=-
T Consensus        10 ~~v~iHElGH~~~A   23 (182)
T cd06163          10 ILIFVHELGHFLVA   23 (182)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46899999998643


No 73 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=64.39  E-value=3.4  Score=36.39  Aligned_cols=20  Identities=45%  Similarity=0.564  Sum_probs=14.9

Q ss_pred             hhHHHHhhhhhhCCCcCCCC
Q 037540          245 ESVATHEIGHVLGLAHTSVK  264 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~  264 (340)
                      ..+.+||+||++=-.+-.++
T Consensus         9 i~i~iHE~gH~~~A~~~G~~   28 (180)
T cd05709           9 ISVTVHELGHALVARRLGVK   28 (180)
T ss_pred             HHHHHHHHHHHHHHHHcCCC
Confidence            57899999999855554443


No 74 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=64.15  E-value=3.6  Score=36.96  Aligned_cols=20  Identities=35%  Similarity=0.576  Sum_probs=15.3

Q ss_pred             ccchhHHHHhhhhhhCCCcCC
Q 037540          242 VDLESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       242 ~dl~~Va~HEIGHaLGL~Hs~  262 (340)
                      ..+..|++||+||.. .+|..
T Consensus        87 ~el~aVlaHElgH~~-~~h~~  106 (226)
T PF01435_consen   87 DELAAVLAHELGHIK-HRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHHH-TTHCC
T ss_pred             HHHHHHHHHHHHHHH-cCCcc
Confidence            357899999999986 45544


No 75 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=62.71  E-value=4.7  Score=44.54  Aligned_cols=32  Identities=25%  Similarity=0.367  Sum_probs=24.9

Q ss_pred             cchhHHHHhhhhhhCCCcCCCCC------CccccccCC
Q 037540          243 DLESVATHEIGHVLGLAHTSVKE------AVMYPSLTP  274 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~~------SIMyP~~~~  274 (340)
                      ...-+++||+||.+|+.|-...+      .||.|....
T Consensus       316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~  353 (845)
T KOG3538|consen  316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG  353 (845)
T ss_pred             ccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence            34579999999999999976543      589876553


No 76 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=61.53  E-value=4.1  Score=37.20  Aligned_cols=18  Identities=39%  Similarity=0.464  Sum_probs=13.6

Q ss_pred             chhHHHHhhhhhhCCCcC
Q 037540          244 LESVATHEIGHVLGLAHT  261 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs  261 (340)
                      +..|++||+||++=-.+-
T Consensus        38 ~~~v~iHElgH~~~A~~~   55 (208)
T cd06161          38 FLSVLLHELGHALVARRY   55 (208)
T ss_pred             HHHHHHHHHHHHHHHHHc
Confidence            468999999998754443


No 77 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=60.43  E-value=4  Score=38.56  Aligned_cols=14  Identities=50%  Similarity=0.937  Sum_probs=12.2

Q ss_pred             ccchhHHHHhhhhh
Q 037540          242 VDLESVATHEIGHV  255 (340)
Q Consensus       242 ~dl~~Va~HEIGHa  255 (340)
                      ..+..|+.||+||.
T Consensus       155 dEl~aVlaHElgHi  168 (302)
T COG0501         155 DELEAVLAHELGHI  168 (302)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45889999999996


No 78 
>PRK03982 heat shock protein HtpX; Provisional
Probab=59.69  E-value=4.2  Score=38.95  Aligned_cols=15  Identities=40%  Similarity=0.651  Sum_probs=12.5

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      ..+..|++||+||.-
T Consensus       123 ~El~AVlAHElgHi~  137 (288)
T PRK03982        123 DELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            358899999999963


No 79 
>PRK03001 M48 family peptidase; Provisional
Probab=57.39  E-value=4.9  Score=38.42  Aligned_cols=15  Identities=33%  Similarity=0.618  Sum_probs=12.5

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      .++..|++||+||.-
T Consensus       122 ~El~aVlAHElgHi~  136 (283)
T PRK03001        122 REIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            458899999999963


No 80 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=57.23  E-value=5.2  Score=39.22  Aligned_cols=15  Identities=33%  Similarity=0.414  Sum_probs=12.3

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      .+-..++.||+||++
T Consensus       152 ~~~v~tl~HE~GHa~  166 (365)
T cd06258         152 HDDINTLFHEFGHAV  166 (365)
T ss_pred             HHHHHHHHHHHhHHH
Confidence            344679999999998


No 81 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=56.78  E-value=4.9  Score=39.99  Aligned_cols=15  Identities=33%  Similarity=0.541  Sum_probs=12.3

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      .+-..+++||+||++
T Consensus       220 ~~~v~tl~HE~GHa~  234 (427)
T cd06459         220 LDDVFTLAHELGHAF  234 (427)
T ss_pred             hhhHHHHHHHhhHHH
Confidence            445789999999985


No 82 
>PRK03072 heat shock protein HtpX; Provisional
Probab=56.48  E-value=5.1  Score=38.50  Aligned_cols=16  Identities=38%  Similarity=0.601  Sum_probs=12.9

Q ss_pred             cccchhHHHHhhhhhh
Q 037540          241 AVDLESVATHEIGHVL  256 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaL  256 (340)
                      ..++..|++||+||.-
T Consensus       124 ~~El~aVlAHElgHi~  139 (288)
T PRK03072        124 ERELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3458899999999953


No 83 
>PRK01345 heat shock protein HtpX; Provisional
Probab=56.37  E-value=5.2  Score=39.09  Aligned_cols=15  Identities=33%  Similarity=0.589  Sum_probs=12.8

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      ..+..|++|||||.-
T Consensus       122 dEL~aVlAHElgHi~  136 (317)
T PRK01345        122 EEVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            358899999999975


No 84 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=55.65  E-value=5.9  Score=36.80  Aligned_cols=18  Identities=39%  Similarity=0.495  Sum_probs=13.4

Q ss_pred             chhHHHHhhhhhhCCCcC
Q 037540          244 LESVATHEIGHVLGLAHT  261 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs  261 (340)
                      +..|++||+||++=-.+-
T Consensus        53 ~~~v~iHElgH~~~A~~~   70 (227)
T cd06164          53 FASVLLHELGHSLVARRY   70 (227)
T ss_pred             HHHHHHHHHHHHHHHHHc
Confidence            367999999998754443


No 85 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=54.47  E-value=6.1  Score=40.08  Aligned_cols=12  Identities=33%  Similarity=0.584  Sum_probs=10.8

Q ss_pred             hhHHHHhhhhhh
Q 037540          245 ESVATHEIGHVL  256 (340)
Q Consensus       245 ~~Va~HEIGHaL  256 (340)
                      ..|+.||+|||+
T Consensus       243 v~tLfHE~GHa~  254 (458)
T PF01432_consen  243 VETLFHEFGHAM  254 (458)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHhHHH
Confidence            579999999986


No 86 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=52.84  E-value=6.2  Score=38.60  Aligned_cols=23  Identities=30%  Similarity=0.433  Sum_probs=19.5

Q ss_pred             cccchhHHHHhhhhhhCCCcCCC
Q 037540          241 AVDLESVATHEIGHVLGLAHTSV  263 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaLGL~Hs~d  263 (340)
                      ...+..|++||++|-+|...+.+
T Consensus       193 ~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  193 PYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             cccccHHHHHHHHHHhCCCCHHH
Confidence            45678899999999999998773


No 87 
>PRK04897 heat shock protein HtpX; Provisional
Probab=52.51  E-value=6.5  Score=37.96  Aligned_cols=14  Identities=50%  Similarity=0.795  Sum_probs=12.2

Q ss_pred             ccchhHHHHhhhhh
Q 037540          242 VDLESVATHEIGHV  255 (340)
Q Consensus       242 ~dl~~Va~HEIGHa  255 (340)
                      ..+..|++||+||.
T Consensus       135 ~El~aVlAHElgHi  148 (298)
T PRK04897        135 EELEGVIGHEISHI  148 (298)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45889999999995


No 88 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=52.42  E-value=7.1  Score=37.56  Aligned_cols=16  Identities=38%  Similarity=0.478  Sum_probs=12.3

Q ss_pred             chhHHHHhhhhhhCCC
Q 037540          244 LESVATHEIGHVLGLA  259 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~  259 (340)
                      +..+++||+||++=-.
T Consensus       135 ~isvvvHElgHal~A~  150 (277)
T cd06162         135 LISGVVHEMGHGVAAV  150 (277)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3579999999986433


No 89 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=52.22  E-value=7.1  Score=37.21  Aligned_cols=18  Identities=22%  Similarity=0.231  Sum_probs=13.6

Q ss_pred             hhHHHHhhhhhhCCCcCC
Q 037540          245 ESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~  262 (340)
                      ..+++||+||++=-.+..
T Consensus       119 isv~iHElgHa~~Ar~~G  136 (263)
T cd06159         119 VGVVVHELSHGILARVEG  136 (263)
T ss_pred             HHHHHHHHHHHHHHHHcC
Confidence            579999999987554444


No 90 
>PRK02870 heat shock protein HtpX; Provisional
Probab=51.30  E-value=7  Score=38.60  Aligned_cols=15  Identities=33%  Similarity=0.694  Sum_probs=12.7

Q ss_pred             cccchhHHHHhhhhh
Q 037540          241 AVDLESVATHEIGHV  255 (340)
Q Consensus       241 ~~dl~~Va~HEIGHa  255 (340)
                      ...+..|++||+||.
T Consensus       170 ~dEL~aVlAHELgHi  184 (336)
T PRK02870        170 RDELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345889999999997


No 91 
>PRK05457 heat shock protein HtpX; Provisional
Probab=51.29  E-value=7  Score=37.54  Aligned_cols=15  Identities=40%  Similarity=0.702  Sum_probs=12.5

Q ss_pred             cccchhHHHHhhhhh
Q 037540          241 AVDLESVATHEIGHV  255 (340)
Q Consensus       241 ~~dl~~Va~HEIGHa  255 (340)
                      ...+..|++||+||.
T Consensus       131 ~~El~aVlAHElgHi  145 (284)
T PRK05457        131 RDEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            346889999999994


No 92 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=50.73  E-value=8.2  Score=30.77  Aligned_cols=14  Identities=36%  Similarity=0.496  Sum_probs=11.9

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      .+..|++||+.|.+
T Consensus        24 ~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   24 WLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35689999999986


No 93 
>PRK01265 heat shock protein HtpX; Provisional
Probab=50.66  E-value=7.2  Score=38.30  Aligned_cols=14  Identities=43%  Similarity=0.866  Sum_probs=12.0

Q ss_pred             ccchhHHHHhhhhh
Q 037540          242 VDLESVATHEIGHV  255 (340)
Q Consensus       242 ~dl~~Va~HEIGHa  255 (340)
                      .++..|++||+||.
T Consensus       138 ~El~aVlAHElgHi  151 (324)
T PRK01265        138 DEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45889999999994


No 94 
>PRK02391 heat shock protein HtpX; Provisional
Probab=50.52  E-value=7.3  Score=37.66  Aligned_cols=15  Identities=47%  Similarity=0.751  Sum_probs=12.4

Q ss_pred             cccchhHHHHhhhhh
Q 037540          241 AVDLESVATHEIGHV  255 (340)
Q Consensus       241 ~~dl~~Va~HEIGHa  255 (340)
                      ..++..|++||+||.
T Consensus       130 ~~El~aVlaHElgHi  144 (296)
T PRK02391        130 PDELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345889999999994


No 95 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=50.47  E-value=7.7  Score=40.58  Aligned_cols=16  Identities=38%  Similarity=0.453  Sum_probs=12.9

Q ss_pred             cccchhHHHHhhhhhh
Q 037540          241 AVDLESVATHEIGHVL  256 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaL  256 (340)
                      ..+-..|++||+||++
T Consensus       334 t~~dv~TL~HElGHa~  349 (549)
T TIGR02289       334 TSGDIDVLTHEAGHAF  349 (549)
T ss_pred             ChhHHHHHHHHhhHHH
Confidence            3445779999999997


No 96 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=48.33  E-value=8.6  Score=39.36  Aligned_cols=25  Identities=4%  Similarity=0.069  Sum_probs=17.7

Q ss_pred             HHHHHHHHHHHhhccCcceEEEecc
Q 037540          176 VKAAFKSSFQRWSSVIPVNFTEIYD  200 (340)
Q Consensus       176 ~r~ai~~Af~~Ws~vt~l~F~ev~~  200 (340)
                      +..+++--+..-+..-+|+|+|+..
T Consensus       132 l~~vl~gl~~~~~~lfgi~~~~~~~  156 (458)
T cd06457         132 LGTVMEGLSRLFSRLYGIRLVPVPL  156 (458)
T ss_pred             HHHHHHHHHHHHHHHhCeEEEecCC
Confidence            3455566667777888899999753


No 97 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=48.05  E-value=8.6  Score=38.72  Aligned_cols=18  Identities=39%  Similarity=0.615  Sum_probs=13.6

Q ss_pred             hhHHHHhhhhhhCCCcCC
Q 037540          245 ESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~  262 (340)
                      ..+++||+||||--.-+.
T Consensus       132 i~~vvHElGHalAA~seg  149 (484)
T KOG2921|consen  132 ITVVVHELGHALAAASEG  149 (484)
T ss_pred             HHHHHHHhhHHHHHHhcC
Confidence            568899999998654443


No 98 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=47.83  E-value=9.2  Score=40.23  Aligned_cols=15  Identities=27%  Similarity=0.521  Sum_probs=12.5

Q ss_pred             ccchhHHHHhhhhhh
Q 037540          242 VDLESVATHEIGHVL  256 (340)
Q Consensus       242 ~dl~~Va~HEIGHaL  256 (340)
                      .+-..|++||+||++
T Consensus       376 ~~dv~TLaHElGHa~  390 (591)
T TIGR00181       376 LNSVFTLAHELGHSM  390 (591)
T ss_pred             cchHHHHHHHhhhHH
Confidence            445789999999997


No 99 
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=47.82  E-value=21  Score=39.23  Aligned_cols=136  Identities=15%  Similarity=0.218  Sum_probs=69.9

Q ss_pred             CCCCCCCCCcEEEEeecCCc----CCC-----------CCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEE-Eee
Q 037540          148 PRWVRESPMTLTYAFSKTDM----IDY-----------IDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHI-GFY  211 (340)
Q Consensus       148 ~kW~~~~~~~LTY~f~~~n~----~~~-----------l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I-~f~  211 (340)
                      ..|.+.   +-.|.+.....    .+.           .+.+++.+--.+.|+...+.+.|.|..... .+.++.= .|.
T Consensus       128 ~~w~~~---~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~-~~~n~~~kYF~  203 (775)
T PF03272_consen  128 SEWDDS---DSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDP-VDKNFNNKYFA  203 (775)
T ss_pred             hhhhhc---CCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCcc-cccccccceEE
Confidence            468877   77787764321    111           123344445567888888889998865432 2222211 133


Q ss_pred             cC-CCCCCCcccccCCcceeecccccccc--ccccchhHHHHhhhhhhCCCcCCCC-------CCccccccCCCCCCCCC
Q 037540          212 HG-DHGDGQPFHFDAAETWAVDMESVKSR--VAVDLESVATHEIGHVLGLAHTSVK-------EAVMYPSLTPRTRKVDL  281 (340)
Q Consensus       212 ~g-~hgdg~pfhfd~~e~wt~~~~~~~~~--~~~dl~~Va~HEIGHaLGL~Hs~d~-------~SIMyP~~~~~~~~~~L  281 (340)
                      .. .||.|.+|.   +..|+.....+...  ...-.-..++|||||+-=..=.++.       +.|+.-.|+    ...+
T Consensus       204 KAD~~G~G~AYY---~~~w~a~ss~s~~~~L~~~~~nW~~LHEIgHgYd~~F~~n~~~~~EVWnNI~~d~yQ----~~~~  276 (775)
T PF03272_consen  204 KADKSGPGAAYY---GSNWTAQSSSSLSFYLNPSPTNWGALHEIGHGYDFGFTRNGTYLNEVWNNILADRYQ----YTYM  276 (775)
T ss_pred             EecCCCCCCccc---cccceecCchhHHHHhCCCCCCchhhhhhhhhcceeEeeCCcchhhhhhhhhhhhhh----hhhc
Confidence            33 367676652   34566543321100  0001235899999997433322111       122222222    2346


Q ss_pred             CHHHHHHHHHHhC
Q 037540          282 KVDDVEGVQALYG  294 (340)
Q Consensus       282 ~~dDI~~Iq~LYG  294 (340)
                      +.++.+..-.||+
T Consensus       277 ~~~e~~~~~wly~  289 (775)
T PF03272_consen  277 TQDERQTDGWLYD  289 (775)
T ss_pred             Chhhhhhccceec
Confidence            7788888888884


No 100
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=47.42  E-value=9.6  Score=34.26  Aligned_cols=23  Identities=30%  Similarity=0.528  Sum_probs=17.2

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCc
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAV  267 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SI  267 (340)
                      ..+.+||+||++--..-.++...
T Consensus        42 ~~l~iHElgH~~~A~~~G~~~~~   64 (183)
T cd06160          42 AILGIHEMGHYLAARRHGVKASL   64 (183)
T ss_pred             HHHHHHHHHHHHHHHHCCCCccc
Confidence            56889999999877666655443


No 101
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=46.93  E-value=9.6  Score=33.21  Aligned_cols=12  Identities=33%  Similarity=0.490  Sum_probs=10.1

Q ss_pred             hhHHHHhhhhhh
Q 037540          245 ESVATHEIGHVL  256 (340)
Q Consensus       245 ~~Va~HEIGHaL  256 (340)
                      .-|++|||+|.+
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            459999999974


No 102
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=46.11  E-value=10  Score=38.52  Aligned_cols=14  Identities=43%  Similarity=0.605  Sum_probs=11.7

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      +-..|+.||+||+|
T Consensus       207 ~~v~tLfHEfGHal  220 (422)
T cd06456         207 DEVTTLFHEFGHAL  220 (422)
T ss_pred             HHHHHHHHHHHHHH
Confidence            34578999999998


No 103
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=44.28  E-value=11  Score=36.62  Aligned_cols=11  Identities=45%  Similarity=0.619  Sum_probs=9.5

Q ss_pred             hhHHHHhhhhh
Q 037540          245 ESVATHEIGHV  255 (340)
Q Consensus       245 ~~Va~HEIGHa  255 (340)
                      ..|++||.||.
T Consensus       171 ayVlAHEyGHH  181 (292)
T PF04228_consen  171 AYVLAHEYGHH  181 (292)
T ss_pred             HHHHHHHHHHH
Confidence            46999999995


No 104
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.53  E-value=11  Score=34.41  Aligned_cols=16  Identities=31%  Similarity=0.538  Sum_probs=13.2

Q ss_pred             cccchhHHHHhhhhhh
Q 037540          241 AVDLESVATHEIGHVL  256 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaL  256 (340)
                      ..+...+|.||-||||
T Consensus        25 ~~~~~~~A~HEAGhAv   40 (213)
T PF01434_consen   25 EEEKRRIAYHEAGHAV   40 (213)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3456889999999996


No 105
>PLN02791 Nudix hydrolase homolog
Probab=42.94  E-value=11  Score=41.25  Aligned_cols=20  Identities=20%  Similarity=0.307  Sum_probs=16.4

Q ss_pred             chhHHHHhhhhhhCCCcCCC
Q 037540          244 LESVATHEIGHVLGLAHTSV  263 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d  263 (340)
                      +..|++||+||.+|=.+.-.
T Consensus       585 ~~~v~lHElgHGsG~~~~~~  604 (770)
T PLN02791        585 FTHTICHECCHGIGPHTITL  604 (770)
T ss_pred             HHHHHHHHhhccccccceec
Confidence            46899999999999877544


No 106
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=41.00  E-value=16  Score=32.45  Aligned_cols=26  Identities=23%  Similarity=0.258  Sum_probs=20.9

Q ss_pred             cchhHHHHhhhhhhCCCcCCCCCCcc
Q 037540          243 DLESVATHEIGHVLGLAHTSVKEAVM  268 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL~Hs~d~~SIM  268 (340)
                      -+..|++||+-|..-..||..--+.|
T Consensus       163 ~idYVvvHEL~Hl~~~nHs~~Fw~~v  188 (205)
T PF01863_consen  163 VIDYVVVHELCHLRHPNHSKRFWALV  188 (205)
T ss_pred             HHHHHHHHHHHHhccCCCCHHHHHHH
Confidence            35789999999999999997644444


No 107
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=40.57  E-value=13  Score=37.62  Aligned_cols=12  Identities=42%  Similarity=0.531  Sum_probs=10.1

Q ss_pred             hhHHHHhhhhhh
Q 037540          245 ESVATHEIGHVL  256 (340)
Q Consensus       245 ~~Va~HEIGHaL  256 (340)
                      ..+++||+||+|
T Consensus       160 l~t~iHE~GHal  171 (396)
T cd06460         160 LFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHhhHHH
Confidence            458899999986


No 108
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=40.47  E-value=13  Score=37.70  Aligned_cols=12  Identities=42%  Similarity=0.719  Sum_probs=10.2

Q ss_pred             hhHHHHhhhhhh
Q 037540          245 ESVATHEIGHVL  256 (340)
Q Consensus       245 ~~Va~HEIGHaL  256 (340)
                      ..|++||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            468999999975


No 109
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=40.41  E-value=14  Score=32.95  Aligned_cols=13  Identities=31%  Similarity=0.562  Sum_probs=11.1

Q ss_pred             chhHHHHhhhhhh
Q 037540          244 LESVATHEIGHVL  256 (340)
Q Consensus       244 l~~Va~HEIGHaL  256 (340)
                      +.++.+||+||++
T Consensus         9 ~~~i~~HE~aHa~   21 (181)
T cd06158           9 LLAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHHH
Confidence            3578999999997


No 110
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=39.91  E-value=13  Score=37.84  Aligned_cols=12  Identities=33%  Similarity=0.620  Sum_probs=9.7

Q ss_pred             hhHHHHhhhhhh
Q 037540          245 ESVATHEIGHVL  256 (340)
Q Consensus       245 ~~Va~HEIGHaL  256 (340)
                      .-|++||+||-|
T Consensus        16 ~li~vHElGHfl   27 (449)
T PRK10779         16 VLITVHEFGHFW   27 (449)
T ss_pred             HHHHHHHHHHHH
Confidence            358899999964


No 111
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=39.68  E-value=71  Score=24.47  Aligned_cols=32  Identities=19%  Similarity=0.259  Sum_probs=21.6

Q ss_pred             HHHHHHHHHHHHhcCC---CCCccCCHHHHHHhcC
Q 037540           90 TQLQSAVVSYQTKLGL---PVTGKLDSETISTIMS  121 (340)
Q Consensus        90 ~~l~~Ai~~~Q~~~gL---pvTG~LD~~T~~~m~~  121 (340)
                      .++.+-|+..=+..|.   +++|..|++|.+.+..
T Consensus        15 ~~~~~evq~~L~~lGyy~g~~~g~~d~a~~~Al~~   49 (74)
T PF08823_consen   15 GDVAREVQEALKRLGYYKGEADGVWDEATEDALRA   49 (74)
T ss_pred             HHHHHHHHHHHHHcCCccCCCCCcccHHHHHHHHH
Confidence            3444444444444555   8999999999999853


No 112
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=38.62  E-value=14  Score=39.27  Aligned_cols=14  Identities=29%  Similarity=0.387  Sum_probs=11.7

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      +-..|++||+||++
T Consensus       379 ~dV~TLaHElGHs~  392 (598)
T COG1164         379 RDVFTLAHELGHSV  392 (598)
T ss_pred             hHHHHHHHHccHHH
Confidence            34679999999986


No 113
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=37.73  E-value=14  Score=35.32  Aligned_cols=12  Identities=33%  Similarity=0.390  Sum_probs=10.0

Q ss_pred             chhHHHHhhhhh
Q 037540          244 LESVATHEIGHV  255 (340)
Q Consensus       244 l~~Va~HEIGHa  255 (340)
                      ...|++||.+|.
T Consensus       149 ~grv~VhEWAhl  160 (262)
T PF08434_consen  149 RGRVFVHEWAHL  160 (262)
T ss_pred             cceeeeehhhhh
Confidence            357999999995


No 114
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=35.36  E-value=19  Score=26.33  Aligned_cols=19  Identities=37%  Similarity=0.609  Sum_probs=16.9

Q ss_pred             chhHHHHhhhhhhCCCcCC
Q 037540          244 LESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~  262 (340)
                      .+.-++||+.+.+||.|.+
T Consensus        29 ~eR~~vH~lA~~~gL~s~S   47 (60)
T cd02641          29 HDRLLVHELAEELGLRHES   47 (60)
T ss_pred             HHHHHHHHHHHHcCCceEe
Confidence            5678999999999999975


No 115
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=35.03  E-value=87  Score=20.48  Aligned_cols=29  Identities=17%  Similarity=0.167  Sum_probs=18.7

Q ss_pred             HHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHH
Q 037540           65 SELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVV   97 (340)
Q Consensus        65 ~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~   97 (340)
                      .++++||..+|...++..    ...++|-+.++
T Consensus         7 ~~L~~wL~~~gi~~~~~~----~~rd~Ll~~~k   35 (38)
T PF10281_consen    7 SDLKSWLKSHGIPVPKSA----KTRDELLKLAK   35 (38)
T ss_pred             HHHHHHHHHcCCCCCCCC----CCHHHHHHHHH
Confidence            679999999998765432    12355544444


No 116
>PRK10911 oligopeptidase A; Provisional
Probab=35.01  E-value=17  Score=39.21  Aligned_cols=13  Identities=38%  Similarity=0.360  Sum_probs=11.2

Q ss_pred             chhHHHHhhhhhh
Q 037540          244 LESVATHEIGHVL  256 (340)
Q Consensus       244 l~~Va~HEIGHaL  256 (340)
                      -..|+.||+||+|
T Consensus       463 ~v~tlfHEfGHal  475 (680)
T PRK10911        463 EVITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHHhHHH
Confidence            3569999999998


No 117
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=33.23  E-value=19  Score=36.62  Aligned_cols=14  Identities=43%  Similarity=0.695  Sum_probs=12.0

Q ss_pred             ccchhHHHHhhhhh
Q 037540          242 VDLESVATHEIGHV  255 (340)
Q Consensus       242 ~dl~~Va~HEIGHa  255 (340)
                      ..+.+|+.||+||-
T Consensus       278 eel~AVl~HELGHW  291 (428)
T KOG2719|consen  278 EELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHHhhHH
Confidence            45789999999995


No 118
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=31.00  E-value=1e+02  Score=33.33  Aligned_cols=43  Identities=21%  Similarity=0.408  Sum_probs=29.8

Q ss_pred             HHHHHHHHHHhhccCcceEEEeccC--CCccEEEEeecCCCCCCCc
Q 037540          177 KAAFKSSFQRWSSVIPVNFTEIYDY--DSADIHIGFYHGDHGDGQP  220 (340)
Q Consensus       177 r~ai~~Af~~Ws~vt~l~F~ev~~~--~~ADI~I~f~~g~hgdg~p  220 (340)
                      ..++.--|...+..-+|.|.|+++.  -.+|++| |..+|.+.|.+
T Consensus       391 ~~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~-y~v~D~~Sg~~  435 (718)
T KOG2089|consen  391 PVVLSGLFGIYQTLFGLKFEEATDAEVWHADVRV-YTVKDSASGNP  435 (718)
T ss_pred             HHHHHHHHHHHHHHhCceeeecCCchhcccceeE-EeccCCCCCce
Confidence            3466777888899999999998873  3578885 44445444443


No 119
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=30.60  E-value=25  Score=27.20  Aligned_cols=13  Identities=31%  Similarity=0.575  Sum_probs=11.1

Q ss_pred             chhHHHHhhhhhh
Q 037540          244 LESVATHEIGHVL  256 (340)
Q Consensus       244 l~~Va~HEIGHaL  256 (340)
                      -..+++||+.|++
T Consensus        61 ~~~llaHEl~Hv~   73 (79)
T PF13699_consen   61 GRALLAHELAHVV   73 (79)
T ss_pred             cchhHhHHHHHHH
Confidence            4679999999985


No 120
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=29.76  E-value=68  Score=31.30  Aligned_cols=38  Identities=8%  Similarity=0.197  Sum_probs=33.6

Q ss_pred             cchHhhHhhhhccCCCCCcChHHHHHHHHhcCCCCCCC
Q 037540           44 ATWYDFARLLDAEMGSNITGMSELKKYFTRFGYLSLSY   81 (340)
Q Consensus        44 ~~~~~~~~~~~~~~g~~~~~~~~~~~YL~~fGYl~~~~   81 (340)
                      -.-+++.+-.||=+|+.+.+|+.+--.|.+.||=|..+
T Consensus       295 VNl~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~an  332 (378)
T KOG3988|consen  295 VNLEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYAN  332 (378)
T ss_pred             ccHHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCC
Confidence            34688999999999999999999999999999987544


No 121
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=29.60  E-value=22  Score=37.48  Aligned_cols=15  Identities=40%  Similarity=0.601  Sum_probs=11.1

Q ss_pred             chhHHHHhh-hhhhCC
Q 037540          244 LESVATHEI-GHVLGL  258 (340)
Q Consensus       244 l~~Va~HEI-GHaLGL  258 (340)
                      ...|++||+ ||..|=
T Consensus       293 ~~~v~lHEllGHGsGk  308 (549)
T PF03571_consen  293 FVQVGLHELLGHGSGK  308 (549)
T ss_dssp             HHHHHHHHHCCCCS--
T ss_pred             hhhhhHHhhccCcCcc
Confidence            357999998 998884


No 122
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=29.47  E-value=27  Score=33.11  Aligned_cols=20  Identities=30%  Similarity=0.400  Sum_probs=14.2

Q ss_pred             ccchhHHHHhhhhhhCCCcC
Q 037540          242 VDLESVATHEIGHVLGLAHT  261 (340)
Q Consensus       242 ~dl~~Va~HEIGHaLGL~Hs  261 (340)
                      .|-.+|++||+-|.|=+...
T Consensus       152 ~dg~NVviHEfAH~LD~~~g  171 (253)
T PF06167_consen  152 NDGHNVVIHEFAHKLDMEDG  171 (253)
T ss_dssp             SSS--HHHHHHHHHHHCTTS
T ss_pred             CCCcchHHHHHHHHHHhhcC
Confidence            34578999999999977543


No 123
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=27.15  E-value=1.3e+02  Score=30.16  Aligned_cols=16  Identities=38%  Similarity=0.736  Sum_probs=12.9

Q ss_pred             cccchhHHHHhhhhhh
Q 037540          241 AVDLESVATHEIGHVL  256 (340)
Q Consensus       241 ~~dl~~Va~HEIGHaL  256 (340)
                      ..++-.|+.||+||..
T Consensus       121 ~sEvAAVl~HEmgHVt  136 (479)
T COG4784         121 SSEVAAVLAHEMGHVT  136 (479)
T ss_pred             HHHHHHHHHhhhhhee
Confidence            3457889999999974


No 124
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=26.27  E-value=37  Score=31.23  Aligned_cols=43  Identities=21%  Similarity=0.280  Sum_probs=27.5

Q ss_pred             hhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 037540          245 ESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVEGVQALYGSN  296 (340)
Q Consensus       245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~~  296 (340)
                      .-++.||++| +=|+|..+.    .|...   ..-+ -..|..+...|++..
T Consensus       102 ~fil~HE~~H-v~~~h~~~~----~~~~~---~~eE-~~AD~~A~~~il~~~  144 (206)
T PF10463_consen  102 AFILLHELAH-VVLGHEGDS----SPSQS---IQEE-KEADSYATEMILSDV  144 (206)
T ss_pred             HHHHHHHHHH-HHHcCcccc----ccchh---HHHH-HhhhHHHHHHHHHHH
Confidence            4689999999 678998765    11111   1111 257888888877643


No 125
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=25.92  E-value=30  Score=33.13  Aligned_cols=22  Identities=27%  Similarity=0.388  Sum_probs=17.2

Q ss_pred             ccccchhHHHHhhhhhhCCCcC
Q 037540          240 VAVDLESVATHEIGHVLGLAHT  261 (340)
Q Consensus       240 ~~~dl~~Va~HEIGHaLGL~Hs  261 (340)
                      ...+.+.|+.||+||--|=.+-
T Consensus       199 da~~yyaTl~HElghwtgh~~r  220 (316)
T COG4227         199 DAINYYATLLHELGHWTGHEAR  220 (316)
T ss_pred             chHhHHHHHHHHhccccCchhh
Confidence            3567889999999999875443


No 126
>PRK03094 hypothetical protein; Provisional
Probab=25.59  E-value=49  Score=25.89  Aligned_cols=17  Identities=12%  Similarity=0.411  Sum_probs=15.1

Q ss_pred             cChHHHHHHHHhcCCCC
Q 037540           62 TGMSELKKYFTRFGYLS   78 (340)
Q Consensus        62 ~~~~~~~~YL~~fGYl~   78 (340)
                      +||..+++||++-||--
T Consensus         8 ~~Ls~i~~~L~~~GYeV   24 (80)
T PRK03094          8 QSLTDVQQALKQKGYEV   24 (80)
T ss_pred             cCcHHHHHHHHHCCCEE
Confidence            68899999999999954


No 127
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=25.42  E-value=32  Score=37.20  Aligned_cols=31  Identities=16%  Similarity=0.385  Sum_probs=19.4

Q ss_pred             HHH-HHHHHHhhccCcceEEEecc--CCCccEEE
Q 037540          178 AAF-KSSFQRWSSVIPVNFTEIYD--YDSADIHI  208 (340)
Q Consensus       178 ~ai-~~Af~~Ws~vt~l~F~ev~~--~~~ADI~I  208 (340)
                      .++ +-.|..-+..-.|+|+|+.+  .-..|+++
T Consensus       364 ~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~  397 (681)
T PRK10280        364 TVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV  397 (681)
T ss_pred             HHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence            455 34566677778899998653  23456554


No 128
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=23.58  E-value=40  Score=33.44  Aligned_cols=18  Identities=39%  Similarity=0.639  Sum_probs=14.1

Q ss_pred             chhHHHHhhhhhhCCCcCC
Q 037540          244 LESVATHEIGHVLGLAHTS  262 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~  262 (340)
                      +-.|+.||++|++ +.|+.
T Consensus       275 lAtvLgHE~aHaV-arH~A  292 (424)
T KOG2661|consen  275 LATVLGHEIAHAV-ARHAA  292 (424)
T ss_pred             HHHHHHHHHHHHH-HHHHH
Confidence            5789999999996 45554


No 129
>PF03698 UPF0180:  Uncharacterised protein family (UPF0180);  InterPro: IPR005370 The members of this family are small uncharacterised proteins.
Probab=22.29  E-value=62  Score=25.30  Aligned_cols=17  Identities=29%  Similarity=0.509  Sum_probs=15.2

Q ss_pred             cChHHHHHHHHhcCCCC
Q 037540           62 TGMSELKKYFTRFGYLS   78 (340)
Q Consensus        62 ~~~~~~~~YL~~fGYl~   78 (340)
                      +||..+++||++-||--
T Consensus         8 ~~Ls~v~~~L~~~GyeV   24 (80)
T PF03698_consen    8 EGLSNVKEALREKGYEV   24 (80)
T ss_pred             CCchHHHHHHHHCCCEE
Confidence            68899999999999964


No 130
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=21.94  E-value=42  Score=36.00  Aligned_cols=14  Identities=50%  Similarity=0.700  Sum_probs=12.1

Q ss_pred             cchhHHHHhhhhhh
Q 037540          243 DLESVATHEIGHVL  256 (340)
Q Consensus       243 dl~~Va~HEIGHaL  256 (340)
                      +...|+.||+||||
T Consensus       407 ~~~~~a~he~gha~  420 (644)
T PRK10733        407 QKESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35789999999997


No 131
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.75  E-value=60  Score=23.90  Aligned_cols=22  Identities=32%  Similarity=0.336  Sum_probs=18.6

Q ss_pred             chhHHHHhhhhhhCCCcCCCCC
Q 037540          244 LESVATHEIGHVLGLAHTSVKE  265 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~Hs~d~~  265 (340)
                      -+.-++|.+-|-|||.|.++.+
T Consensus        29 ~eRriih~la~~lGL~~~s~G~   50 (60)
T cd02639          29 AERRIVHLLASRLGLNHVSDGT   50 (60)
T ss_pred             HHHHHHHHHHHHcCCceEEeCC
Confidence            4678999999999999987654


No 132
>PF08006 DUF1700:  Protein of unknown function (DUF1700);  InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=21.42  E-value=82  Score=27.75  Aligned_cols=47  Identities=26%  Similarity=0.483  Sum_probs=34.8

Q ss_pred             hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC-CCccCCHHHHHHhcCCC
Q 037540           64 MSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLP-VTGKLDSETISTIMSPR  123 (340)
Q Consensus        64 ~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLp-vTG~LD~~T~~~m~~PR  123 (340)
                      ++++++||++   +|          +++.++++..|.+.+.-. ..|+-+++.++.+..|+
T Consensus         7 L~~L~~~L~~---lp----------~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~   54 (181)
T PF08006_consen    7 LNELEKYLKK---LP----------EEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPK   54 (181)
T ss_pred             HHHHHHHHHc---CC----------HHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHH
Confidence            4667788874   33          357889999999988654 46888888888886664


No 133
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=21.20  E-value=1.3e+02  Score=27.54  Aligned_cols=46  Identities=17%  Similarity=0.442  Sum_probs=31.7

Q ss_pred             HHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCH
Q 037540           67 LKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDS  113 (340)
Q Consensus        67 ~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~  113 (340)
                      +-+||.++||+.+.....+...--+.++.++.+-+..|+++ |+||=
T Consensus       153 Ilr~l~r~g~i~e~~kt~t~K~YLe~E~ilr~iae~~g~s~-gelDL  198 (210)
T COG1059         153 ILRWLVRYGLIDENPKTLTRKLYLEIEEILRSIAEEVGISL-GELDL  198 (210)
T ss_pred             HHHHHHHhcccccCcccccHHHHHHHHHHHHHHHHHhCCCc-chhHH
Confidence            35688888888654322333333578899999999999984 66663


No 134
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=20.78  E-value=49  Score=30.56  Aligned_cols=11  Identities=36%  Similarity=0.573  Sum_probs=10.0

Q ss_pred             hhHHHHhhhhh
Q 037540          245 ESVATHEIGHV  255 (340)
Q Consensus       245 ~~Va~HEIGHa  255 (340)
                      -++++||++||
T Consensus        94 gsiLAHE~mHa  104 (212)
T PF12315_consen   94 GSILAHELMHA  104 (212)
T ss_pred             hhHHHHHHHHH
Confidence            57999999998


No 135
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=20.17  E-value=51  Score=29.40  Aligned_cols=16  Identities=25%  Similarity=0.370  Sum_probs=12.8

Q ss_pred             chhHHHHhhhhhhCCC
Q 037540          244 LESVATHEIGHVLGLA  259 (340)
Q Consensus       244 l~~Va~HEIGHaLGL~  259 (340)
                      +-.+++|||+|++.-.
T Consensus        36 lG~ilahel~hafd~~   51 (206)
T PF01431_consen   36 LGFILAHELMHAFDPE   51 (206)
T ss_dssp             HHHHHHHHHHHCTSTT
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            5679999999987553


No 136
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=20.05  E-value=52  Score=34.11  Aligned_cols=16  Identities=25%  Similarity=0.331  Sum_probs=12.7

Q ss_pred             cchhHHHHhhhhhhCC
Q 037540          243 DLESVATHEIGHVLGL  258 (340)
Q Consensus       243 dl~~Va~HEIGHaLGL  258 (340)
                      +-..|+.||+||+...
T Consensus       246 ~D~~t~~HE~GH~~yy  261 (477)
T cd06461         246 EDFVTVHHEMGHIQYY  261 (477)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3467899999999853


Done!