Query         039411
Match_columns 222
No_of_seqs    226 out of 1513
Neff          7.9 
Searched_HMMs 46136
Date          Fri Mar 29 09:25:39 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/039411.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/039411hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 1.5E-54 3.2E-59  391.9  13.5  193    1-221    62-262 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen 100.0 2.2E-39 4.7E-44  256.2  15.9  150   69-220     1-157 (157)
  3 PF00413 Peptidase_M10:  Matrix 100.0 7.3E-37 1.6E-41  240.0  14.0  147   69-220     1-154 (154)
  4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 9.9E-30 2.1E-34  200.7  14.5  141   75-220     2-156 (156)
  5 cd04268 ZnMc_MMP_like Zinc-dep  99.9 8.6E-27 1.9E-31  184.9   8.8  142   74-219     1-165 (165)
  6 smart00235 ZnMc Zinc-dependent  99.9 4.1E-25   9E-30  171.1  11.2  132   68-221     3-140 (140)
  7 cd04277 ZnMc_serralysin_like Z  99.9 8.9E-24 1.9E-28  171.3  14.2  123   87-220    33-186 (186)
  8 cd04327 ZnMc_MMP_like_3 Zinc-d  99.8 7.8E-20 1.7E-24  149.7  13.0  109   69-189     2-110 (198)
  9 cd00203 ZnMc Zinc-dependent me  99.8 6.1E-19 1.3E-23  139.8  13.0  129   76-219     2-167 (167)
 10 cd04280 ZnMc_astacin_like Zinc  99.7 5.5E-17 1.2E-21  131.0  11.5   91   75-189     2-92  (180)
 11 cd04276 ZnMc_MMP_like_2 Zinc-d  99.7 1.6E-16 3.5E-21  129.5  11.6  144   74-221     7-197 (197)
 12 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.7 3.8E-16 8.2E-21  127.7  11.9   99   67-189     7-105 (200)
 13 PF12388 Peptidase_M57:  Dual-a  99.7 6.3E-16 1.4E-20  126.0  13.0  137   69-219    33-211 (211)
 14 PF01400 Astacin:  Astacin (Pep  99.7   2E-16 4.3E-21  128.9   8.6   96   69-189     2-97  (191)
 15 cd04282 ZnMc_meprin Zinc-depen  99.6 2.2E-15 4.9E-20  125.3  12.3   97   66-189    42-138 (230)
 16 cd04283 ZnMc_hatching_enzyme Z  99.6 1.2E-14 2.5E-19  117.2  11.9   93   74-189     3-95  (182)
 17 KOG3714 Meprin A metalloprotea  99.5 4.3E-14 9.3E-19  127.5  10.6  103   63-189    75-177 (411)
 18 PF02031 Peptidase_M7:  Strepto  99.1 2.5E-10 5.4E-15   85.5   7.3  122   76-220     3-130 (132)
 19 COG5549 Predicted Zn-dependent  99.1 3.1E-10 6.7E-15   91.5   7.5  121   88-221   104-234 (236)
 20 cd04267 ZnMc_ADAM_like Zinc-de  98.1   2E-05 4.4E-10   63.8   9.8   48  170-217   132-189 (192)
 21 PF13583 Reprolysin_4:  Metallo  98.1 5.5E-06 1.2E-10   68.3   5.1   48  171-219   137-198 (206)
 22 cd04273 ZnMc_ADAMTS_like Zinc-  98.0 3.1E-05 6.8E-10   63.7   7.5  116   93-218    72-197 (207)
 23 PF01471 PG_binding_1:  Putativ  97.9 4.8E-06   1E-10   54.2   1.7   24    1-24     34-57  (57)
 24 PF05572 Peptidase_M43:  Pregna  97.9 8.8E-06 1.9E-10   64.1   3.3   23  169-191    67-89  (154)
 25 PF11350 DUF3152:  Protein of u  97.9 2.3E-05   5E-10   63.6   5.7  115   74-192    28-160 (203)
 26 PF13574 Reprolysin_2:  Metallo  97.7 1.1E-05 2.3E-10   64.8   0.5   48  171-219   111-171 (173)
 27 COG1913 Predicted Zn-dependent  97.6 2.9E-05 6.3E-10   61.5   1.9   30  171-200   124-153 (181)
 28 cd04271 ZnMc_ADAM_fungal Zinc-  97.5 9.7E-05 2.1E-09   61.8   3.8   47  173-220   147-218 (228)
 29 PF13582 Reprolysin_3:  Metallo  97.5 3.7E-05 8.1E-10   57.7   1.2   17  172-188   108-124 (124)
 30 PRK13267 archaemetzincin-like   97.4 7.6E-05 1.7E-09   60.1   2.4   27  171-197   125-151 (179)
 31 PF13688 Reprolysin_5:  Metallo  97.4 4.7E-05   1E-09   61.8   1.0   23  169-191   140-162 (196)
 32 cd04269 ZnMc_adamalysin_II_lik  97.4 0.00017 3.6E-09   58.6   4.2   48  170-218   130-183 (194)
 33 cd04275 ZnMc_pappalysin_like Z  97.4 3.4E-05 7.4E-10   64.4  -0.4   24  169-192   135-158 (225)
 34 PF07998 Peptidase_M54:  Peptid  97.3 0.00012 2.7E-09   59.5   1.8   29  171-200   145-173 (194)
 35 PF01421 Reprolysin:  Reprolysi  97.0 0.00095 2.1E-08   54.4   4.2   50  169-218   129-186 (199)
 36 cd04270 ZnMc_TACE_like Zinc-de  97.0  0.0012 2.6E-08   55.8   4.8   48  171-218   167-228 (244)
 37 cd04272 ZnMc_salivary_gland_MP  96.9  0.0013 2.8E-08   54.6   4.6   49  170-218   144-210 (220)
 38 TIGR03296 M6dom_TIGR03296 M6 f  96.2  0.0093   2E-07   51.6   5.6   19  172-190   166-184 (286)
 39 PF11150 DUF2927:  Protein of u  95.7    0.37 7.9E-06   40.0  12.7  145   68-220    28-194 (213)
 40 PF10462 Peptidase_M66:  Peptid  95.6  0.0053 1.2E-07   53.4   1.4   21  169-189   191-211 (305)
 41 PF01457 Peptidase_M8:  Leishma  95.6  0.0082 1.8E-07   56.2   2.6   60  116-187   159-226 (521)
 42 PF05548 Peptidase_M11:  Gameto  95.4  0.0088 1.9E-07   52.4   2.2   20  170-189   149-168 (314)
 43 PTZ00337 surface protease GP63  95.4   0.026 5.7E-07   53.2   5.2   37  146-188   210-246 (567)
 44 PF12044 Metallopep:  Putative   95.0   0.015 3.2E-07   52.9   2.3   25  171-197   316-340 (423)
 45 PTZ00257 Glycoprotein GP63 (le  94.2   0.059 1.3E-06   51.1   4.4   61  116-188   205-273 (622)
 46 COG2738 Predicted Zn-dependent  93.8   0.082 1.8E-06   42.9   3.8   81   89-187    20-108 (226)
 47 PF04298 Zn_peptidase_2:  Putat  93.3   0.079 1.7E-06   44.0   3.0   76   90-183    18-101 (222)
 48 KOG3658 Tumor necrosis factor-  93.1   0.082 1.8E-06   50.2   3.1   46  172-217   393-451 (764)
 49 COG3824 Predicted Zn-dependent  92.5   0.058 1.3E-06   40.3   1.1   18  170-187   108-125 (136)
 50 TIGR02869 spore_SleB spore cor  91.9    0.11 2.3E-06   42.7   2.1   32    1-33     45-76  (201)
 51 COG4783 Putative Zn-dependent   91.5   0.088 1.9E-06   48.2   1.3   18  167-184   126-143 (484)
 52 KOG4525 Jacalin-like lectin do  91.1    0.15 3.2E-06   46.2   2.3   24  171-196   301-324 (614)
 53 PF05547 Peptidase_M6:  Immune   91.0    0.04 8.7E-07   52.7  -1.4   20  171-190   221-240 (645)
 54 PF09471 Peptidase_M64:  IgA Pe  90.5    0.14 3.1E-06   43.8   1.6   24  170-193   215-239 (264)
 55 KOG3607 Meltrins, fertilins an  90.3    0.14 3.1E-06   49.6   1.6   45  170-215   322-374 (716)
 56 PF06262 DUF1025:  Possibl zinc  88.5    0.27 5.8E-06   35.6   1.6   16  171-186    73-88  (97)
 57 PRK10594 murein L,D-transpepti  88.0    0.32 6.9E-06   46.2   2.1   27    1-27    317-343 (608)
 58 PF04228 Zn_peptidase:  Putativ  87.8     1.1 2.4E-05   38.9   5.2   40  145-186   141-189 (292)
 59 PF12725 DUF3810:  Protein of u  87.7    0.27 5.9E-06   43.2   1.4   38  145-190   178-215 (318)
 60 PF13398 Peptidase_M50B:  Pepti  86.6    0.33 7.2E-06   39.7   1.2   15  170-184    21-35  (200)
 61 PF01863 DUF45:  Protein of unk  86.1     3.7   8E-05   33.1   7.2   77   91-195   112-188 (205)
 62 PF14247 DUF4344:  Domain of un  84.9    0.44 9.6E-06   39.7   1.2   14  170-183    91-104 (220)
 63 COG2856 Predicted Zn peptidase  84.8     0.4 8.6E-06   39.8   0.8   16  172-187    73-88  (213)
 64 PF06114 DUF955:  Domain of unk  84.8    0.41 8.9E-06   34.6   0.8   15  169-183    40-54  (122)
 65 COG5549 Predicted Zn-dependent  82.5    0.73 1.6E-05   37.9   1.5  110   91-200    49-182 (236)
 66 PF01435 Peptidase_M48:  Peptid  81.4    0.77 1.7E-05   37.4   1.3   20  169-189    87-106 (226)
 67 cd06456 M3A_DCP_Oligopeptidase  81.2      14  0.0003   33.8   9.5   24   90-113   107-130 (422)
 68 COG2989 Uncharacterized protei  81.0    0.91   2E-05   42.3   1.7   26    1-26    273-298 (561)
 69 cd06457 M3A_MIP Peptidase M3 m  80.8      13 0.00029   34.2   9.3   25   89-113   132-156 (458)
 70 PRK02870 heat shock protein Ht  80.2     1.3 2.8E-05   39.3   2.4   33  146-182   152-184 (336)
 71 cd06163 S2P-M50_PDZ_RseP-like   79.7    0.94   2E-05   36.6   1.2   12  172-183    10-21  (182)
 72 PRK03982 heat shock protein Ht  79.7    0.93   2E-05   39.1   1.3   15  169-183   123-137 (288)
 73 COG0501 HtpX Zn-dependent prot  79.0       1 2.2E-05   38.6   1.3   16  168-183   154-169 (302)
 74 cd06161 S2P-M50_SpoIVFB SpoIVF  78.8       1 2.2E-05   37.0   1.2   14  170-183    37-50  (208)
 75 PRK03072 heat shock protein Ht  78.7       1 2.3E-05   38.9   1.3   17  168-184   124-140 (288)
 76 PRK03001 M48 family peptidase;  78.6     1.1 2.3E-05   38.7   1.3   16  168-183   121-136 (283)
 77 PRK01345 heat shock protein Ht  78.4     1.1 2.3E-05   39.4   1.3   19  169-188   122-140 (317)
 78 cd06455 M3A_TOP Peptidase M3 T  76.0      22 0.00047   32.9   9.2   14  170-183   262-275 (472)
 79 PF13699 DUF4157:  Domain of un  75.9       3 6.5E-05   28.9   2.7   30  149-183    44-73  (79)
 80 PRK04897 heat shock protein Ht  75.4     1.5 3.2E-05   38.2   1.3   15  169-183   135-149 (298)
 81 cd06164 S2P-M50_SpoIVFB_CBS Sp  75.4     1.5 3.2E-05   36.7   1.2   15  171-185    53-67  (227)
 82 cd06258 Peptidase_M3_like The   75.3     1.4 2.9E-05   39.1   1.1   15  169-183   152-166 (365)
 83 KOG3538 Disintegrin metallopro  74.9    0.87 1.9E-05   45.3  -0.3   31  170-200   316-352 (845)
 84 PRK05457 heat shock protein Ht  74.7     1.6 3.4E-05   37.8   1.3   15  168-182   131-145 (284)
 85 cd06459 M3B_Oligoendopeptidase  74.5     1.4 3.1E-05   39.6   1.0   15  169-183   220-234 (427)
 86 PF01432 Peptidase_M3:  Peptida  74.5     1.5 3.2E-05   40.3   1.1   25   89-113   136-160 (458)
 87 PRK02391 heat shock protein Ht  73.9     1.7 3.7E-05   37.8   1.3   15  168-182   130-144 (296)
 88 PRK01265 heat shock protein Ht  73.7     1.7 3.7E-05   38.3   1.3   15  168-182   137-151 (324)
 89 PF13485 Peptidase_MA_2:  Pepti  73.2     2.2 4.8E-05   30.9   1.7   14  170-183    24-37  (128)
 90 cd06162 S2P-M50_PDZ_SREBP Ster  72.2     1.9 4.2E-05   37.1   1.2   13  171-183   135-147 (277)
 91 TIGR00181 pepF oligoendopeptid  72.2     2.4 5.2E-05   40.3   2.0   15  169-183   376-390 (591)
 92 cd06159 S2P-M50_PDZ_Arch Uncha  71.8       2 4.3E-05   36.8   1.2   15  171-185   118-132 (263)
 93 TIGR02289 M3_not_pepF oligoend  71.2     2.1 4.6E-05   40.3   1.4   15  169-183   335-349 (549)
 94 TIGR02290 M3_fam_3 oligoendope  70.3     1.9   4E-05   41.0   0.8   14  170-183   374-387 (587)
 95 PF01447 Peptidase_M4:  Thermol  69.4     2.2 4.7E-05   33.4   0.8   12  172-183   136-147 (150)
 96 cd06160 S2P-M50_like_2 Unchara  68.6     2.6 5.7E-05   34.0   1.2   20  171-190    41-60  (183)
 97 PF09374 PG_binding_3:  Predict  67.7     2.6 5.6E-05   28.6   0.9   24    1-24      3-30  (72)
 98 PLN02791 Nudix hydrolase homol  65.4     3.3 7.1E-05   40.7   1.4   21  170-190   584-604 (770)
 99 COG4227 Antirestriction protei  64.3     4.9 0.00011   34.3   2.1   23  168-190   200-222 (316)
100 KOG2921 Intramembrane metallop  63.4     3.5 7.5E-05   37.2   1.0   19  171-189   131-149 (484)
101 cd06158 S2P-M50_like_1 Unchara  61.8     4.2 9.2E-05   32.7   1.2   13  171-183     9-21  (181)
102 PF14891 Peptidase_M91:  Effect  61.2     8.7 0.00019   30.5   2.9   17  170-186   102-118 (174)
103 COG3409 Putative peptidoglycan  61.1     5.2 0.00011   31.4   1.6   27    1-27     78-105 (185)
104 cd06460 M32_Taq Peptidase fami  60.5     3.8 8.2E-05   37.2   0.8   15  172-186   160-177 (396)
105 COG2317 Zn-dependent carboxype  60.5     4.7  0.0001   37.2   1.4   12  171-183   260-271 (497)
106 KOG2089 Metalloendopeptidase f  60.5      24 0.00052   34.1   6.0   33   91-123   392-426 (718)
107 COG1164 Oligoendopeptidase F [  60.1       4 8.7E-05   39.0   0.9   15  169-183   378-392 (598)
108 PRK10911 oligopeptidase A; Pro  58.6     4.2 9.1E-05   39.5   0.8   22   91-112   361-382 (680)
109 PF01434 Peptidase_M41:  Peptid  58.0     5.3 0.00011   32.9   1.2   15  169-183    26-40  (213)
110 COG4784 Putative Zn-dependent   57.7     5.9 0.00013   35.3   1.5   16  168-183   121-136 (479)
111 KOG2719 Metalloprotease [Gener  57.1     5.5 0.00012   36.4   1.2   14  169-182   278-291 (428)
112 PF02102 Peptidase_M35:  Deuter  56.9      13 0.00028   33.3   3.5   47  138-185   262-311 (359)
113 PF04450 BSP:  Peptidase of pla  56.5      81  0.0018   25.9   7.9   38  147-184    71-109 (205)
114 TIGR00054 RIP metalloprotease   56.2     5.6 0.00012   36.3   1.2   12  172-183    15-26  (420)
115 PRK10779 zinc metallopeptidase  54.8     5.4 0.00012   36.7   0.8   12  172-183    16-27  (449)
116 PF03571 Peptidase_M49:  Peptid  52.7     6.9 0.00015   37.0   1.2   15  171-185   293-308 (549)
117 cd02641 R3H_Smubp-2_like R3H d  51.1     7.9 0.00017   25.3   0.9   21  170-190    28-48  (60)
118 COG2321 Predicted metalloprote  50.2     7.5 0.00016   33.4   0.9   39  145-183   138-179 (295)
119 PRK10280 dipeptidyl carboxypep  49.6     7.3 0.00016   37.9   0.8   32   90-121   363-397 (681)
120 KOG2661 Peptidase family M48 [  46.9      10 0.00022   33.7   1.2   19  170-189   274-292 (424)
121 PF06167 Peptidase_M90:  Glucos  45.0     9.5 0.00021   32.4   0.7   21  168-188   151-171 (253)
122 PF12315 DUF3633:  Protein of u  42.8      13 0.00029   30.6   1.3   16  170-185    92-108 (212)
123 PF01431 Peptidase_M13:  Peptid  40.3      16 0.00034   29.5   1.4   17  171-187    36-52  (206)
124 cd02639 R3H_RRM R3H domain of   39.9      16 0.00034   24.0   1.0   23  170-192    28-50  (60)
125 COG0339 Dcp Zn-dependent oligo  39.0      13 0.00029   35.9   0.8   33   90-122   364-398 (683)
126 cd06461 M2_ACE Peptidase famil  38.5      16 0.00035   34.0   1.2   17  169-185   245-261 (477)
127 KOG2090 Metalloendopeptidase f  38.0      14 0.00031   35.2   0.8   22   13-34    318-341 (704)
128 COG3590 PepO Predicted metallo  37.6     9.7 0.00021   36.1  -0.3   14  171-184   487-500 (654)
129 PF08434 CLCA_N:  Calcium-activ  36.3      14 0.00031   31.6   0.5   26  172-200   150-176 (262)
130 PRK15410 DgsA anti-repressor M  35.2      18 0.00039   30.9   0.9   20  170-189   141-160 (260)
131 COG3091 SprT Zn-dependent meta  34.9       9 0.00019   30.0  -0.9   39  140-183    35-73  (156)
132 smart00731 SprT SprT homologue  34.5      20 0.00042   27.6   1.0   32  148-183    40-71  (146)
133 PF02074 Peptidase_M32:  Carbox  33.8      15 0.00033   34.3   0.3   15  174-188   262-279 (494)
134 PF10263 SprT-like:  SprT-like   33.5      23  0.0005   27.2   1.2   16  169-184    58-73  (157)
135 PF10026 DUF2268:  Predicted Zn  33.4 1.2E+02  0.0025   24.5   5.4   19  169-187    63-81  (195)
136 PRK10733 hflB ATP-dependent me  32.9      21 0.00047   34.4   1.2   14  170-183   407-420 (644)
137 PHA00657 crystallin beta/gamma  32.3      34 0.00074   35.7   2.4   13  170-182   790-802 (2052)
138 PF13402 M60-like:  Peptidase M  31.0      21 0.00045   30.7   0.7   10  174-183   222-231 (307)
139 CHL00176 ftsH cell division pr  30.7      21 0.00046   34.5   0.7   14  170-183   437-450 (638)
140 COG5504 Predicted Zn-dependent  30.3   1E+02  0.0022   26.4   4.6   78   89-191    81-160 (280)
141 cd02640 R3H_NRF R3H domain of   30.2      30 0.00065   22.6   1.2   20  170-189    28-47  (60)
142 PRK04860 hypothetical protein;  29.7      15 0.00032   29.1  -0.5   32  147-182    43-74  (160)
143 COG4412 Uncharacterized protei  29.6      15 0.00033   34.6  -0.4   18  173-190   348-365 (760)
144 PF10460 Peptidase_M30:  Peptid  28.6      38 0.00082   30.5   1.9   16  169-184   137-152 (366)
145 PF09768 Peptidase_M76:  Peptid  28.5      49  0.0011   26.5   2.3   35  145-185    51-85  (173)
146 cd06007 R3H_DEXH_helicase R3H   28.2      33 0.00072   22.3   1.1   20  170-189    27-46  (59)
147 KOG1047 Bifunctional leukotrie  28.0      31 0.00067   32.7   1.3   20  167-186   284-304 (613)
148 PRK04351 hypothetical protein;  26.6      27 0.00057   27.3   0.5   32  147-182    41-72  (149)
149 PF14521 Aspzincin_M35:  Lysine  25.6      48   0.001   25.6   1.8   39  150-190    77-115 (148)
150 PF13058 DUF3920:  Protein of u  23.8      46   0.001   24.7   1.3   15  170-184    75-89  (126)
151 TIGR00043 metalloprotein, YbeY  22.7      69  0.0015   23.5   2.1   22  170-191    72-93  (110)
152 PF02130 UPF0054:  Uncharacteri  22.5      69  0.0015   24.7   2.1   22  169-190   106-127 (145)
153 PF03272 Enhancin:  Viral enhan  22.3      34 0.00073   33.9   0.4   18   94-111   173-190 (775)
154 TIGR01241 FtsH_fam ATP-depende  22.1      43 0.00094   31.1   1.1   14  170-183   310-323 (495)
155 COG5570 Uncharacterized small   22.0      44 0.00096   21.3   0.8   18   13-30     14-31  (57)
156 PRK00016 metal-binding heat sh  22.0      63  0.0014   25.3   1.9   43  148-190    87-132 (159)
157 KOG3314 Ku70-binding protein [  21.4      73  0.0016   25.4   2.0   15  169-183    89-103 (194)
158 KOG3624 M13 family peptidase [  21.3      43 0.00094   32.5   0.9   14  171-184   518-531 (687)
159 PF10463 Peptidase_U49:  Peptid  20.9      42 0.00091   27.7   0.7   19  172-191   102-120 (206)
160 PF13203 DUF2201_N:  Putative m  20.4      52  0.0011   28.2   1.1   22  169-191    58-79  (292)
161 PF15639 Tox-MPTase3:  Metallop  20.2 1.8E+02  0.0038   22.2   3.8   36   91-126    23-59  (135)
162 PF15127 DUF4565:  Protein of u  20.1      89  0.0019   22.2   2.1   14   90-103    63-76  (91)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=1.5e-54  Score=391.87  Aligned_cols=193  Identities=45%  Similarity=0.789  Sum_probs=178.1

Q ss_pred             ChhhhhcCCCCCccCCCHHHHHHhcCCCCCCCCCCCCcccccCCCcccCCCCCcccccceeeeCCCCCCCCCCCceeEEE
Q 039411            1 MKTYQLNYHLNSTGVLDGNTVAMMTRPRCGVADIVNGTTRMRSGMKSHQHGANHFHTVSHYEFFPGRPRWPPSNSHLTYA   80 (222)
Q Consensus         1 i~~~Q~~~~L~~TG~ld~~T~~~m~~pRCg~pD~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~W~~~~~~lty~   80 (222)
                      |+.||++++|++||+||.+|+++|.+||||+||.                         +|.+++++.||+  +.+|||+
T Consensus        62 l~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~-------------------------~~~~~~~~~kW~--k~~lT~r  114 (469)
T KOG1565|consen   62 LKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG-------------------------RYRYFPGKPKWN--KEHLTYR  114 (469)
T ss_pred             HHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------------------------ccccCcccCccc--cccccee
Confidence            5789999999999999999999999999999991                         456677999999  9999999


Q ss_pred             ecCCC----hHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCC---CCceee
Q 039411           81 FLPGT----PAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPP---RGGIFD  153 (222)
Q Consensus        81 ~~~~~----~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P---~~g~i~  153 (222)
                      |.+.+    ..+++.++++||+.|+++++|+|.|+.....|||+|.|..+.|+|++||||++|+|||||+|   .+|++|
T Consensus       115 i~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~h  194 (469)
T KOG1565|consen  115 IKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLH  194 (469)
T ss_pred             ccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCCCCccc
Confidence            99885    33899999999999999999999999986699999999999999999999999999999999   689999


Q ss_pred             ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCC-CCCCCCCHhhHHHHHHHhcC
Q 039411          154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDP-GATKGLHTDDVQGIRALYNN  221 (222)
Q Consensus       154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~-~~~~~l~~~Di~~iq~LYg~  221 (222)
                      ||++|.|+++ ...+.+++.||+|||||+|||.||.++++||||+|.+ .....|+.|||.+||.|||.
T Consensus       195 fD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~  262 (469)
T KOG1565|consen  195 FDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGG  262 (469)
T ss_pred             cCcccceecc-CCccchhHHHhhhhcccccccCCCCCcccccccccccCCCCcccChhhhhhhHHHhCC
Confidence            9999999997 3457899999999999999999999999999999994 34568999999999999996


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00  E-value=2.2e-39  Score=256.16  Aligned_cols=150  Identities=44%  Similarity=0.798  Sum_probs=135.7

Q ss_pred             CCCCCCceeEEEecCCC----hHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeee
Q 039411           69 RWPPSNSHLTYAFLPGT----PAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHA  144 (222)
Q Consensus        69 ~W~~~~~~lty~~~~~~----~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a  144 (222)
                      ||+  +++|+|+|.+..    ...++++|++||+.|+++++|+|+|+.+.+.|||+|.|....++|+++|+|.++++|||
T Consensus         1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a   78 (157)
T cd04278           1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA   78 (157)
T ss_pred             CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence            799  999999998754    35889999999999999999999999876689999999999999889999999999999


Q ss_pred             cCC--CCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCCC-CCCCCCHhhHHHHHHHhc
Q 039411          145 APP--RGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDPG-ATKGLHTDDVQGIRALYN  220 (222)
Q Consensus       145 ~~P--~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~~-~~~~l~~~Di~~iq~LYg  220 (222)
                      ++|  ..|+|+||.++.|+......+.+++.|++|||||+|||.|+.++.+||||++... ....|+.+||.+||+|||
T Consensus        79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~~~lYg  157 (157)
T cd04278          79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGIQALYG  157 (157)
T ss_pred             cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHHHHhcC
Confidence            998  7899999999999887534567899999999999999999999999999999843 256899999999999998


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00  E-value=7.3e-37  Score=240.03  Aligned_cols=147  Identities=42%  Similarity=0.798  Sum_probs=130.9

Q ss_pred             CCCCCCceeEEEecCCCh----HHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeee
Q 039411           69 RWPPSNSHLTYAFLPGTP----AEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHA  144 (222)
Q Consensus        69 ~W~~~~~~lty~~~~~~~----~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a  144 (222)
                      ||+  +++|||+|.+..+    .++++++++||+.|+++++|+|+++.+. ++||+|.|....+++...|++.+++++++
T Consensus         1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~~~~~~~   77 (154)
T PF00413_consen    1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSGGTLAHA   77 (154)
T ss_dssp             SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred             CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccceeeeecc
Confidence            799  8999999987542    3788999999999999999999999853 69999999988888888899999999999


Q ss_pred             cCCC---CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCCCCCCCCCHhhHHHHHHHhc
Q 039411          145 APPR---GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDPGATKGLHTDDVQGIRALYN  220 (222)
Q Consensus       145 ~~P~---~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~~~~~~l~~~Di~~iq~LYg  220 (222)
                      +.|.   .|+++|+.++.|...  ..+.++..|++|||||+|||.|+.+++|||||++.......|+.+||.+||+|||
T Consensus        78 ~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~~~l~~~Di~~i~~lYg  154 (154)
T PF00413_consen   78 YFPNNIVSGDIHFNDDESWTID--DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDNKTLSEDDIDGIQYLYG  154 (154)
T ss_dssp             EESSSTTTTEEEEETTSHEESS--SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSSTSTTHHHHHHHHHHHS
T ss_pred             ccccccccccccccccccchhh--hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCCCCCCHHHHHHHHHHhC
Confidence            9984   799999999999876  3457899999999999999999999999999999854435899999999999998


No 4  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.97  E-value=9.9e-30  Score=200.68  Aligned_cols=141  Identities=29%  Similarity=0.427  Sum_probs=109.6

Q ss_pred             ceeEEEecCCC------hHHHHHHHHHHHHHHHhcccceEEEecCC-CCceEEEEEeecCCCCCCCCCCCCCeeeeecCC
Q 039411           75 SHLTYAFLPGT------PAEAINAVGRAFQTWAAISHFKFSQTQDY-TNANLKISFQRGWHGDSSPFDGRGGTIAHAAPP  147 (222)
Q Consensus        75 ~~lty~~~~~~------~~~~~~~i~~A~~~W~~~~~l~F~ev~~~-~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P  147 (222)
                      ..|+|++.+..      +.+++++|++||+.|+++++|+|+++... .++||+|.|....     +++|.|+++||+++|
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p   76 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP   76 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence            46888888653      34789999999999999999999998753 3799999998532     788999999999999


Q ss_pred             CCce---eeecCCCC-cccCCCCCccchhHHHHHhhhhhcCCCCCCCCC-CceeeeeCCCC--CCCCCHhhHHHHHHHhc
Q 039411          148 RGGI---FDYDADEQ-WAVGARPGAFDLETVALHEIGHLLGLDHSLVEG-AIMYAYIDPGA--TKGLHTDDVQGIRALYN  220 (222)
Q Consensus       148 ~~g~---i~fd~~~~-w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~-siM~p~~~~~~--~~~l~~~Di~~iq~LYg  220 (222)
                      ..|.   ..++.... +.......+.+++.|++|||||+|||.|+.++. ++|||++..+.  ...|+.+|+++|++|||
T Consensus        77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~lY~  156 (156)
T cd04279          77 LISDGNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRLYG  156 (156)
T ss_pred             ccCCCcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHHhC
Confidence            6432   22221111 111111124588999999999999999999998 99999998543  35899999999999997


No 5  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.94  E-value=8.6e-27  Score=184.87  Aligned_cols=142  Identities=22%  Similarity=0.273  Sum_probs=106.5

Q ss_pred             CceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceee
Q 039411           74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFD  153 (222)
Q Consensus        74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~  153 (222)
                      ++.|+|+|.+..+...+++|++||+.|+++++|+|+|+...+.+||+|.+....+..    +|..+.+++...|..|+|+
T Consensus         1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~~g~i~   76 (165)
T cd04268           1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPLTGEIL   76 (165)
T ss_pred             CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCCCccEE
Confidence            468999999987788999999999999999999999998766799999987632111    1111122222245678999


Q ss_pred             ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC----------------CCCceeeeeCC-------CCCCCCCHh
Q 039411          154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV----------------EGAIMYAYIDP-------GATKGLHTD  210 (222)
Q Consensus       154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~----------------~~siM~p~~~~-------~~~~~l~~~  210 (222)
                      ++....|.......+..+..|++|||||+|||.|+.+                ..|||++....       +....|+..
T Consensus        77 ~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~~~~  156 (165)
T cd04268          77 LARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTIGPY  156 (165)
T ss_pred             eeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCCCHH
Confidence            8776544311001234688999999999999999998                89999876542       134689999


Q ss_pred             hHHHHHHHh
Q 039411          211 DVQGIRALY  219 (222)
Q Consensus       211 Di~~iq~LY  219 (222)
                      ||.+||+||
T Consensus       157 Di~ai~~lY  165 (165)
T cd04268         157 DIAAIKKLY  165 (165)
T ss_pred             HHHHHHhcC
Confidence            999999999


No 6  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.92  E-value=4.1e-25  Score=171.12  Aligned_cols=132  Identities=30%  Similarity=0.533  Sum_probs=107.1

Q ss_pred             CCCCCCCceeEEEec-CCChH-HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411           68 PRWPPSNSHLTYAFL-PGTPA-EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAA  145 (222)
Q Consensus        68 ~~W~~~~~~lty~~~-~~~~~-~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~  145 (222)
                      .+|+  +.+|+|+|. +..+. +++++|++||+.|+++++|+|+|+.+  .+||.|.|....++        |..+++++
T Consensus         3 ~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~~~a~~g   70 (140)
T smart00235        3 KKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GCTLSHAG   70 (140)
T ss_pred             CcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Ccceeeee
Confidence            5899  899999997 54444 45999999999999999999999875  79999999986651        22268888


Q ss_pred             CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCC---ceeeeeCCC-CCCCCCHhhHHHHHHHhcC
Q 039411          146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGA---IMYAYIDPG-ATKGLHTDDVQGIRALYNN  221 (222)
Q Consensus       146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~s---iM~p~~~~~-~~~~l~~~Di~~iq~LYg~  221 (222)
                      .| .|+++|+. +.|...        ..|++|||||+|||.|++++.+   +|++.+..- ..+.|+++|..+++++||.
T Consensus        71 ~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~~yg~  140 (140)
T smart00235       71 RP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPYDYGS  140 (140)
T ss_pred             cC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchhccCc
Confidence            88 78899875 455432        3599999999999999999888   999999732 1457888899999999984


No 7  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.91  E-value=8.9e-24  Score=171.26  Aligned_cols=123  Identities=22%  Similarity=0.280  Sum_probs=96.3

Q ss_pred             HHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC-------CceeeecCCCC
Q 039411           87 AEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR-------GGIFDYDADEQ  159 (222)
Q Consensus        87 ~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~-------~g~i~fd~~~~  159 (222)
                      ...+++|++||+.|+++++|+|+|+.+...+||+|.+.....         ++..+.|++|.       .|+|+++....
T Consensus        33 ~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~~  103 (186)
T cd04277          33 AAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSYD  103 (186)
T ss_pred             HHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCcc
Confidence            378899999999999999999999987668999998765321         34667788773       46899998776


Q ss_pred             cccCCCCCccchhHHHHHhhhhhcCCCCCCCC----------------CCceeeeeCCC--------CCCCCCHhhHHHH
Q 039411          160 WAVGARPGAFDLETVALHEIGHLLGLDHSLVE----------------GAIMYAYIDPG--------ATKGLHTDDVQGI  215 (222)
Q Consensus       160 w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~----------------~siM~p~~~~~--------~~~~l~~~Di~~i  215 (222)
                      +...  ..+.....|++|||||+|||.|+++.                -+||......+        ....++..||++|
T Consensus       104 ~~~~--~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~Al  181 (186)
T cd04277         104 TNSD--SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIAAL  181 (186)
T ss_pred             cccC--CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHHHH
Confidence            6543  23456789999999999999999853                36886544321        2347889999999


Q ss_pred             HHHhc
Q 039411          216 RALYN  220 (222)
Q Consensus       216 q~LYg  220 (222)
                      |+|||
T Consensus       182 Q~lYG  186 (186)
T cd04277         182 QYLYG  186 (186)
T ss_pred             HHhhC
Confidence            99998


No 8  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.83  E-value=7.8e-20  Score=149.68  Aligned_cols=109  Identities=28%  Similarity=0.460  Sum_probs=78.0

Q ss_pred             CCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC
Q 039411           69 RWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR  148 (222)
Q Consensus        69 ~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~  148 (222)
                      +|+. .+.|+|.|..+.+.+.++.|++||+.|++++||+|++++.. .++|+|.|..+. | ++++.|..+..   ..+.
T Consensus         2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-G-c~S~vG~~~~~---~~~~   74 (198)
T cd04327           2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-G-YWSYVGTDALL---IGAD   74 (198)
T ss_pred             CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-C-CCCCcCCcccc---cCCC
Confidence            6996 66788999887666889999999999999999999998764 789999998643 4 45666654321   1112


Q ss_pred             CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ...+.+.    |.... ..+++...|++|||||||||.|++
T Consensus        75 ~~t~~l~----~~~~~-~~~~~~~~~i~HElgHaLG~~HEh  110 (198)
T cd04327          75 APTMNLG----WFTDD-TPDPEFSRVVLHEFGHALGFIHEH  110 (198)
T ss_pred             Cceeeee----eecCC-CchhhHHHHHHHHHHHHhcCcccc
Confidence            2344443    21110 123567789999999999999973


No 9  
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.80  E-value=6.1e-19  Score=139.82  Aligned_cols=129  Identities=22%  Similarity=0.267  Sum_probs=95.1

Q ss_pred             eeEEEecCC--------ChHHHHHHHHHHHHHHHhcccceEEEecCC-CCceEEEEEeecCCCCCCCCCCCCCeeeeecC
Q 039411           76 HLTYAFLPG--------TPAEAINAVGRAFQTWAAISHFKFSQTQDY-TNANLKISFQRGWHGDSSPFDGRGGTIAHAAP  146 (222)
Q Consensus        76 ~lty~~~~~--------~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~-~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~  146 (222)
                      .|+|.|...        .+..++++|+.|++.|+++++++|+++... ..+||.+.+.....        .++.++.|+.
T Consensus         2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~   73 (167)
T cd00203           2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYL   73 (167)
T ss_pred             EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEec
Confidence            455555543        345788999999999999999999999864 47999988865332        2234566666


Q ss_pred             CC-----CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC--------------------CCCceeeeeCC
Q 039411          147 PR-----GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV--------------------EGAIMYAYIDP  201 (222)
Q Consensus       147 P~-----~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~--------------------~~siM~p~~~~  201 (222)
                      |.     .+.+.+.....+       ......|++|||||+|||.|...                    ..+||+|....
T Consensus        74 ~~~c~~~~~~~~~~~~~~~-------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~  146 (167)
T cd00203          74 GRVCDSLRGVGVLQDNQSG-------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGS  146 (167)
T ss_pred             CCccCCCCCcEEEecCCcc-------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccc
Confidence            52     355655443222       23567899999999999999876                    36899987764


Q ss_pred             ---CCCCCCCHhhHHHHHHHh
Q 039411          202 ---GATKGLHTDDVQGIRALY  219 (222)
Q Consensus       202 ---~~~~~l~~~Di~~iq~LY  219 (222)
                         .....++..||+.||+||
T Consensus       147 ~~~~~~~~fS~~d~~~i~~~Y  167 (167)
T cd00203         147 FSDGQRKDFSQCDIDQINKLY  167 (167)
T ss_pred             cCcccCCCcCHHHHHHHHhhC
Confidence               245689999999999998


No 10 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.72  E-value=5.5e-17  Score=130.95  Aligned_cols=91  Identities=18%  Similarity=0.219  Sum_probs=65.5

Q ss_pred             ceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeee
Q 039411           75 SHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDY  154 (222)
Q Consensus        75 ~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~f  154 (222)
                      ..|.|.|.+..+...++.|++||+.|++++||+|+|.+.   ++..|.|..+ .|| ++..|.-+        ....+.+
T Consensus         2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc-~S~vG~~~--------~~q~i~l   68 (180)
T cd04280           2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGC-WSYVGRVG--------GRQVVSL   68 (180)
T ss_pred             CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCc-ceecCccC--------CceeEEe
Confidence            579999998766688999999999999999999999875   4445666654 443 33333211        1233444


Q ss_pred             cCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          155 DADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       155 d~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ..           ++....+++|||||||||.|++
T Consensus        69 ~~-----------~c~~~g~v~HE~~HalG~~HEh   92 (180)
T cd04280          69 GS-----------GCFSLGTIVHELMHALGFYHEQ   92 (180)
T ss_pred             CC-----------CcCcCchhHHHHHHHhcCcchh
Confidence            21           2344689999999999999977


No 11 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.70  E-value=1.6e-16  Score=129.51  Aligned_cols=144  Identities=16%  Similarity=0.133  Sum_probs=89.8

Q ss_pred             CceeEEEecCCChHHHHHHHHHHHHHHHhc------ccceEEEecCC--CCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411           74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAI------SHFKFSQTQDY--TNANLKISFQRGWHGDSSPFDGRGGTIAHAA  145 (222)
Q Consensus        74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~------~~l~F~ev~~~--~~adI~I~f~~~~~gd~~~f~g~g~~la~a~  145 (222)
                      |+.|||+|.+.+|.+++++|++||+.|+++      .++.+.++...  ...||++...+--+.+    ++..+...+..
T Consensus         7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~~~gps~~   82 (197)
T cd04276           7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGWAYGPSVV   82 (197)
T ss_pred             CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcceeccccc
Confidence            899999999998889999999999999987      34555555332  2468884433322221    12222333444


Q ss_pred             CCCC-----ceeeecCCCCcccCCC---CCccchhHHHHHhhhhhcCCCCCCC---------------------CCCcee
Q 039411          146 PPRG-----GIFDYDADEQWAVGAR---PGAFDLETVALHEIGHLLGLDHSLV---------------------EGAIMY  196 (222)
Q Consensus       146 ~P~~-----g~i~fd~~~~w~~~~~---~~~~~~~~va~HEiGHaLGL~Hs~~---------------------~~siM~  196 (222)
                      .|..     ++|.++..........   ..+..+..+++||+||+|||.|...                     ..|||-
T Consensus        83 dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~SVMd  162 (197)
T cd04276          83 DPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSSVMD  162 (197)
T ss_pred             CCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcceeec
Confidence            5644     4566654332111100   0123467899999999999999642                     246662


Q ss_pred             e---eeCC-------CCCCCCCHhhHHHHHHHhcC
Q 039411          197 A---YIDP-------GATKGLHTDDVQGIRALYNN  221 (222)
Q Consensus       197 p---~~~~-------~~~~~l~~~Di~~iq~LYg~  221 (222)
                      .   .+..       .....+...||.+||+|||.
T Consensus       163 Y~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~  197 (197)
T cd04276         163 YPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP  197 (197)
T ss_pred             CccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence            1   1110       12347889999999999984


No 12 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.68  E-value=3.8e-16  Score=127.66  Aligned_cols=99  Identities=20%  Similarity=0.353  Sum_probs=68.8

Q ss_pred             CCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecC
Q 039411           67 RPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAP  146 (222)
Q Consensus        67 ~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~  146 (222)
                      ..+|+  +..|.|.|....+...+++|++|++.|++.+||+|++..+. ..  .|.|..+..|| +++.|.-+.-     
T Consensus         7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~-~~--yi~f~~~~~GC-~S~vG~~~~g-----   75 (200)
T cd04281           7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPE-EN--YIVFTYRPCGC-CSYVGRRGNG-----   75 (200)
T ss_pred             cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCCC-CC--EEEEEECCCCe-eEcCCCcCCC-----
Confidence            46899  99999999877666778999999999999999999997652 33  34454443454 4444432100     


Q ss_pred             CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                        .-.+.+        +   .++....|++|||||||||.|++
T Consensus        76 --~q~isl--------~---~~C~~~Gti~HEl~HaLGf~HEh  105 (200)
T cd04281          76 --PQAISI--------G---KNCDKFGIVVHELGHVIGFWHEH  105 (200)
T ss_pred             --ceeeec--------C---CCcCcCchHHHHHHHHhcCcchh
Confidence              012222        2   12344689999999999999977


No 13 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.68  E-value=6.3e-16  Score=126.05  Aligned_cols=137  Identities=22%  Similarity=0.321  Sum_probs=94.8

Q ss_pred             CCCCCCceeEEEecC----CChHHHHHHHHHHHHHHHhc-ccceEEEe--cCCCCceEEEEEeecCCCCCCCCCCCCCee
Q 039411           69 RWPPSNSHLTYAFLP----GTPAEAINAVGRAFQTWAAI-SHFKFSQT--QDYTNANLKISFQRGWHGDSSPFDGRGGTI  141 (222)
Q Consensus        69 ~W~~~~~~lty~~~~----~~~~~~~~~i~~A~~~W~~~-~~l~F~ev--~~~~~adI~I~f~~~~~gd~~~f~g~g~~l  141 (222)
                      .|....++|+..+.+    ..+..++.++.+|++.|+.+ ..|+|+-+  .....+||.|.-...        +.+.|.+
T Consensus        33 lV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~g  104 (211)
T PF12388_consen   33 LVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAG  104 (211)
T ss_pred             eecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcc
Confidence            453226788888887    34457889999999999997 45777422  222356776642211        1233456


Q ss_pred             eeecCC-CCc----eeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC---------------------------
Q 039411          142 AHAAPP-RGG----IFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL---------------------------  189 (222)
Q Consensus       142 a~a~~P-~~g----~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~---------------------------  189 (222)
                      +.|.|| .+|    .|.+.     ... .........|++|||||+|||.|++                           
T Consensus       105 gsAGFP~s~G~P~~~I~I~-----~~~-~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGTP  178 (211)
T PF12388_consen  105 GSAGFPTSNGNPYKFIQIY-----GLS-NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGTP  178 (211)
T ss_pred             eeccCCCCCCCCCceEEEE-----ecC-CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCCC
Confidence            778888 555    45551     111 1122345679999999999999986                           


Q ss_pred             ---CCCCceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411          190 ---VEGAIMYAYIDPGATKGLHTDDVQGIRALY  219 (222)
Q Consensus       190 ---~~~siM~p~~~~~~~~~l~~~Di~~iq~LY  219 (222)
                         +++|||..++..+....++..||.+|++||
T Consensus       179 t~~d~~SiM~ac~~~~~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  179 TGADPNSIMNACFSSGEDGEFTSNDITALNYLY  211 (211)
T ss_pred             CCCCCchhhhccccCCCCCCcChhhHHHHHhhC
Confidence               457899998876677799999999999999


No 14 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.67  E-value=2e-16  Score=128.87  Aligned_cols=96  Identities=25%  Similarity=0.439  Sum_probs=63.1

Q ss_pred             CCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC
Q 039411           69 RWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR  148 (222)
Q Consensus        69 ~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~  148 (222)
                      +|+  +.+|.|.|.+..+...++.|++|+..|++.+||+|+++. . .....|.|.. ..|| +++.|..+        .
T Consensus         2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~-~~gC-~S~vG~~~--------g   67 (191)
T PF01400_consen    2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSN-GSGC-WSYVGRQG--------G   67 (191)
T ss_dssp             S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEES-SSSE-EEESS--S--------S
T ss_pred             cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeec-Cccc-cchhhhcC--------c
Confidence            799  889999999887778889999999999999999999998 3 3334455543 3343 34444221        0


Q ss_pred             CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      .-.|.+.           .++....|++|||||||||.|++
T Consensus        68 ~q~i~l~-----------~~c~~~~~i~HEl~HaLG~~HEh   97 (191)
T PF01400_consen   68 EQTINLG-----------DGCFSVGTILHELGHALGFWHEH   97 (191)
T ss_dssp             EEEEEE------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred             ceeEEec-----------ceeCCccchHHHHHHHHhhhhhh
Confidence            1233332           12345679999999999999977


No 15 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.64  E-value=2.2e-15  Score=125.29  Aligned_cols=97  Identities=24%  Similarity=0.342  Sum_probs=68.5

Q ss_pred             CCCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411           66 GRPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAA  145 (222)
Q Consensus        66 ~~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~  145 (222)
                      ...+|+  + .|.|.|....+...++.|++|++.|++.+||+|++.+.. ..  .|.|..+ .|| +++.|.-+      
T Consensus        42 ~~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~-~GC-~S~vG~~g------  107 (230)
T cd04282          42 DTYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKG-SGC-WSMVGDQQ------  107 (230)
T ss_pred             cccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcC-CCe-eeccCccC------
Confidence            457999  7 899999887766788999999999999999999987643 33  3445443 344 44444311      


Q ss_pred             CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                        ....+.+        +   .++....+++|||||||||.|++
T Consensus       108 --g~q~isl--------~---~~C~~~Gti~HEl~HalGf~HEq  138 (230)
T cd04282         108 --GGQNLSI--------G---AGCDYKATVEHEFLHALGFYHEQ  138 (230)
T ss_pred             --CeEEEEE--------C---CCcCCCchHHHHHHHHhCCcccc
Confidence              0123333        2   13455689999999999999987


No 16 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.60  E-value=1.2e-14  Score=117.20  Aligned_cols=93  Identities=17%  Similarity=0.270  Sum_probs=62.8

Q ss_pred             CceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceee
Q 039411           74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFD  153 (222)
Q Consensus        74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~  153 (222)
                      .-.|.|.|....+...++.|++|++.|++.+||+|++.+.. ...|.  |..+ .|| ++..|..+.        ...|.
T Consensus         3 ~~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~yi~--~~~~-~gC-~S~vG~~gg--------~q~i~   69 (182)
T cd04283           3 IVYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RDYLN--IESR-SGC-WSYIGRQGG--------RQTVS   69 (182)
T ss_pred             eEEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-CcEEE--EEcC-CCc-eEecCccCC--------ceeEe
Confidence            35789999887666778999999999999999999987642 34444  4432 343 444443210        11222


Q ss_pred             ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      +.          ..++....+++|||||||||.|++
T Consensus        70 l~----------~~~C~~~G~i~HEl~HaLG~~HEh   95 (182)
T cd04283          70 LQ----------KQGCMYKGIIQHELLHALGFYHEQ   95 (182)
T ss_pred             cC----------CCCcCccchHHHHHHHHhCCcccc
Confidence            21          123445689999999999999987


No 17 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.53  E-value=4.3e-14  Score=127.46  Aligned_cols=103  Identities=22%  Similarity=0.350  Sum_probs=72.5

Q ss_pred             eCCCCCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeee
Q 039411           63 FFPGRPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIA  142 (222)
Q Consensus        63 ~~~~~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la  142 (222)
                      +.....+|+  ...|.|.|........+++|++||+.|++.++|+|+|..+. ..+..+.+...  || +++.|..|.  
T Consensus        75 ~~~~~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC-~S~VGr~gg--  146 (411)
T KOG3714|consen   75 TSNPERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GC-YSYVGRRGG--  146 (411)
T ss_pred             ccChhhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cc-eeeeCccCC--
Confidence            334567999  89999999988666889999999999999999999998743 34444444332  44 344443221  


Q ss_pred             eecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411          143 HAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       143 ~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~  189 (222)
                             +.      +.-+++   .++..+.|++|||+||||+.|++
T Consensus       147 -------~~------q~~sl~---~~C~~~G~i~HEl~HaLGf~Heh  177 (411)
T KOG3714|consen  147 -------GQ------QLLSLG---DGCDRFGTIVHELMHALGFWHEH  177 (411)
T ss_pred             -------Cc------cceecC---CCcCcCchhHHHHHHHhhhhhcc
Confidence                   10      111222   34566899999999999999987


No 18 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.11  E-value=2.5e-10  Score=85.46  Aligned_cols=122  Identities=19%  Similarity=0.303  Sum_probs=70.2

Q ss_pred             eeEEEecCCChHHHHHHHHHHHHHHHh-cccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeee
Q 039411           76 HLTYAFLPGTPAEAINAVGRAFQTWAA-ISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDY  154 (222)
Q Consensus        76 ~lty~~~~~~~~~~~~~i~~A~~~W~~-~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~f  154 (222)
                      +|+|.-...  .+.+..|.+|.+.|+. |.+++|++..   .++|+|.-...          +++..+..-....|.|.+
T Consensus         3 tv~Ydas~A--~~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~----------~~~sya~~~g~G~G~I~l   67 (132)
T PF02031_consen    3 TVYYDASRA--PEFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGND----------PRGSYASTDGLGSGYIFL   67 (132)
T ss_dssp             EEEEEEEE---GGGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE------------TT--EEEE-SSS-EEEEE
T ss_pred             EEEEeCCCC--chhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecC----------CCCcccccCCCCcEEEEe
Confidence            566654433  2678999999999976 8999999975   48888764321          233444333335788988


Q ss_pred             cCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC-CC-CCceeeeeCCCC---CCCCCHhhHHHHHHHhc
Q 039411          155 DADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL-VE-GAIMYAYIDPGA---TKGLHTDDVQGIRALYN  220 (222)
Q Consensus       155 d~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~-~~-~siM~p~~~~~~---~~~l~~~Di~~iq~LYg  220 (222)
                      |..  +.     .+.+-..|++|||||+|||..-- -+ .++|.-.-. |.   .......++..+.+||.
T Consensus        68 ~~~--~~-----qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~a-G~sCtN~~Pna~E~a~V~~~~a  130 (132)
T PF02031_consen   68 DYQ--QN-----QGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSA-GTSCTNAYPNAAERARVNSLFA  130 (132)
T ss_dssp             EHH--HH-----HHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT--TT----S--HHHHHHHHHHCT
T ss_pred             chH--Hh-----hCCccceeeeehhccccCCCCCCCCccHHhhcCCCC-CCCCCCCCCCHHHHHHHHHHhc
Confidence            742  21     24677889999999999996433 23 457763222 21   23567889999999985


No 19 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.09  E-value=3.1e-10  Score=91.54  Aligned_cols=121  Identities=21%  Similarity=0.205  Sum_probs=75.8

Q ss_pred             HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCC---------eeeeecCCCCceeeecCCC
Q 039411           88 EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGG---------TIAHAAPPRGGIFDYDADE  158 (222)
Q Consensus        88 ~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~---------~la~a~~P~~g~i~fd~~~  158 (222)
                      .+++++..|++.|++.-|+..+|-.  +.|||+|...+.. |  .+.++.|.         .+.|+..|    -|++...
T Consensus       104 ~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~p-g--tg~~~~grArtal~~~efl~~~~~~----~~lnhr~  174 (236)
T COG5549         104 RWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNPP-G--TGWRQYGRARTALIAYEFLGHALGL----GHLNHRG  174 (236)
T ss_pred             hHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCCC-C--CcccccchHHHHHHHHHHhhhccCc----ccccccc
Confidence            5688999999999999777666544  4899999876532 2  12222211         12222222    1222211


Q ss_pred             CcccCCCCCccchhHHHHHhhhhhcCCC-CCCCCCCceeeeeCCCCCCCCCHhhHHHHHHHhcC
Q 039411          159 QWAVGARPGAFDLETVALHEIGHLLGLD-HSLVEGAIMYAYIDPGATKGLHTDDVQGIRALYNN  221 (222)
Q Consensus       159 ~w~~~~~~~~~~~~~va~HEiGHaLGL~-Hs~~~~siM~p~~~~~~~~~l~~~Di~~iq~LYg~  221 (222)
                      .-.+.......++..+|.||+|||||+. ||+-+ +.|.+..+   ...++..|++.+..+|-.
T Consensus       175 ~i~~~pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s---~~~is~rdv~tL~r~Yeq  234 (236)
T COG5549         175 DIMYPPGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS---WQGISRRDVKTLERKYEQ  234 (236)
T ss_pred             cccCCcccchhhhhHHHHHhhcchheecccccch-hhhhcchh---hcccCHHHHHHHHHHhcC
Confidence            1111111234578899999999999996 98754 45554333   345999999999999953


No 20 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=98.15  E-value=2e-05  Score=63.82  Aligned_cols=48  Identities=23%  Similarity=0.208  Sum_probs=35.3

Q ss_pred             chhHHHHHhhhhhcCCCCCCCC----------CCceeeeeCCCCCCCCCHhhHHHHHH
Q 039411          170 DLETVALHEIGHLLGLDHSLVE----------GAIMYAYIDPGATKGLHTDDVQGIRA  217 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~~----------~siM~p~~~~~~~~~l~~~Di~~iq~  217 (222)
                      ....|++||+||.|||.|....          .-||.|.........+|+..+..|+.
T Consensus       132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~~~~~~FS~CS~~~i~~  189 (192)
T cd04267         132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSGLNSYRFSQCSIGSIRE  189 (192)
T ss_pred             eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccCCCCCccChhhHHHHHH
Confidence            4456999999999999997652          24899876531345788877777765


No 21 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=98.07  E-value=5.5e-06  Score=68.27  Aligned_cols=48  Identities=27%  Similarity=0.272  Sum_probs=32.8

Q ss_pred             hhHHHHHhhhhhcCCCCCCCC--------------CCceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411          171 LETVALHEIGHLLGLDHSLVE--------------GAIMYAYIDPGATKGLHTDDVQGIRALY  219 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~--------------~siM~p~~~~~~~~~l~~~Di~~iq~LY  219 (222)
                      ...+++|||||+|||.|..+.              .+||.+ .......-+++-+|..|+...
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY-~~~~~~~~FS~~~i~~i~~~~  198 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSY-ASRPSRPYFSPPSIEYIREVV  198 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCC-CCCCCCCCCCchhHHHHhCCC
Confidence            346799999999999997642              268862 221222348888888886543


No 22 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.96  E-value=3.1e-05  Score=63.68  Aligned_cols=116  Identities=16%  Similarity=0.182  Sum_probs=65.4

Q ss_pred             HHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCCCccchh
Q 039411           93 VGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFDLE  172 (222)
Q Consensus        93 i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~  172 (222)
                      .-++|..|.......-.  .+....|+.+.+...+....   ++++++++.|+.-   .+- .......+.. ..+....
T Consensus        72 ~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~g---giC-~~~~s~~i~~-~~~~~~a  141 (207)
T cd04273          72 SLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNCDTLGLAPVG---GMC-SPSRSCSINE-DTGLSSA  141 (207)
T ss_pred             HHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCCCceEEeccc---cCC-CCCcceEEEc-CCCceeE
Confidence            45789999764332110  01235777776655332211   3566777777642   110 0000011111 1123345


Q ss_pred             HHHHHhhhhhcCCCCCCC---------CCCceeeeeCCC-CCCCCCHhhHHHHHHH
Q 039411          173 TVALHEIGHLLGLDHSLV---------EGAIMYAYIDPG-ATKGLHTDDVQGIRAL  218 (222)
Q Consensus       173 ~va~HEiGHaLGL~Hs~~---------~~siM~p~~~~~-~~~~l~~~Di~~iq~L  218 (222)
                      .|++|||||.||+.|-.+         .+-||+|..... ....+|.--+..++.+
T Consensus       142 ~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~  197 (207)
T cd04273         142 FTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSF  197 (207)
T ss_pred             EeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHH
Confidence            799999999999999775         268999988742 3456777766666654


No 23 
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=97.92  E-value=4.8e-06  Score=54.20  Aligned_cols=24  Identities=33%  Similarity=0.423  Sum_probs=21.8

Q ss_pred             ChhhhhcCCCCCccCCCHHHHHHh
Q 039411            1 MKTYQLNYHLNSTGVLDGNTVAMM   24 (222)
Q Consensus         1 i~~~Q~~~~L~~TG~ld~~T~~~m   24 (222)
                      |+.||++.||++||++|.+|+++|
T Consensus        34 v~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen   34 VKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             HHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             HHHHHHHcCcCCCCccCHHHHhcC
Confidence            589999999999999999999987


No 24 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.91  E-value=8.8e-06  Score=64.07  Aligned_cols=23  Identities=35%  Similarity=0.694  Sum_probs=15.2

Q ss_pred             cchhHHHHHhhhhhcCCCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~~  191 (222)
                      .+...|++||+||-|||.|....
T Consensus        67 ~~~g~TltHEvGH~LGL~HtF~~   89 (154)
T PF05572_consen   67 YNFGKTLTHEVGHWLGLYHTFGG   89 (154)
T ss_dssp             S-SSHHHHHHHHHHTT---TT--
T ss_pred             cccccchhhhhhhhhcccccccC
Confidence            45668999999999999998753


No 25 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.90  E-value=2.3e-05  Score=63.58  Aligned_cols=115  Identities=23%  Similarity=0.345  Sum_probs=72.9

Q ss_pred             CceeEEEecC--CC---h----HHHHHHHHHHH---HHHHhcccceEEEecCCCCceEEEEEeecCCCCCC--CCCCCCC
Q 039411           74 NSHLTYAFLP--GT---P----AEAINAVGRAF---QTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSS--PFDGRGG  139 (222)
Q Consensus        74 ~~~lty~~~~--~~---~----~~~~~~i~~A~---~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~--~f~g~g~  139 (222)
                      .+.++|.+.-  +.   +    .+..++|...+   .-|.....+.|+.|... .+|++|...+...-+..  +.+- .+
T Consensus        28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~-~~  105 (203)
T PF11350_consen   28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDT-SG  105 (203)
T ss_pred             CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCc-Cc
Confidence            4788998863  21   1    23444444444   45888788999999876 48999999876543322  2221 12


Q ss_pred             eeeeecCCCCceeeecCCCCcccCCCCCc----cchhHHHHHhhhhhcCCCCCCCCC
Q 039411          140 TIAHAAPPRGGIFDYDADEQWAVGARPGA----FDLETVALHEIGHLLGLDHSLVEG  192 (222)
Q Consensus       140 ~la~a~~P~~g~i~fd~~~~w~~~~~~~~----~~~~~va~HEiGHaLGL~Hs~~~~  192 (222)
                      .. .-+.|..++|.+|. ..|..+...-+    .+..-|+-||+||+||-.|..-+.
T Consensus       106 e~-SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg  160 (203)
T PF11350_consen  106 ET-SCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPG  160 (203)
T ss_pred             ee-EeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCC
Confidence            22 22334467787776 46876654322    345569999999999999987643


No 26 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.69  E-value=1.1e-05  Score=64.75  Aligned_cols=48  Identities=27%  Similarity=0.275  Sum_probs=18.8

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCC-------------CceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411          171 LETVALHEIGHLLGLDHSLVEG-------------AIMYAYIDPGATKGLHTDDVQGIRALY  219 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~-------------siM~p~~~~~~~~~l~~~Di~~iq~LY  219 (222)
                      ...|++|||||.||+.|..+..             .||++... .....+++.-|..|+.+-
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~~-~~~~~fS~cS~~~i~~~l  171 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAGI-CNNDKFSPCSIRQIRAVL  171 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS-G-GGG---------------
T ss_pred             eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCCC-CCCccccccccccccccc
Confidence            4568999999999999987642             58975422 123467888777777653


No 27 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.60  E-value=2.9e-05  Score=61.49  Aligned_cols=30  Identities=37%  Similarity=0.516  Sum_probs=25.5

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIMYAYID  200 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM~p~~~  200 (222)
                      .+.=++||+||+|||.|=.++.=+|++..+
T Consensus       124 v~KEv~HElGH~~GL~HC~N~~CVM~FSnS  153 (181)
T COG1913         124 VVKEVLHELGHLLGLSHCPNPRCVMNFSNS  153 (181)
T ss_pred             HHHHHHHHhhhhcCcccCCCCCcEEeCCcc
Confidence            345679999999999999999999986544


No 28 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=97.50  E-value=9.7e-05  Score=61.84  Aligned_cols=47  Identities=26%  Similarity=0.309  Sum_probs=35.5

Q ss_pred             HHHHHhhhhhcCCCCCCCC-------------------------CCceeeeeCCCCCCCCCHhhHHHHHHHhc
Q 039411          173 TVALHEIGHLLGLDHSLVE-------------------------GAIMYAYIDPGATKGLHTDDVQGIRALYN  220 (222)
Q Consensus       173 ~va~HEiGHaLGL~Hs~~~-------------------------~siM~p~~~~~~~~~l~~~Di~~iq~LYg  220 (222)
                      .|++|||||.||+.|=.+.                         .=||+|.... ..+.++.--+..|+.+..
T Consensus       147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~l~  218 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSLLG  218 (228)
T ss_pred             eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-CCCccChhhHHHHHHHHh
Confidence            5999999999999995431                         1199998664 456788888888876653


No 29 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.50  E-value=3.7e-05  Score=57.69  Aligned_cols=17  Identities=53%  Similarity=0.806  Sum_probs=12.0

Q ss_pred             hHHHHHhhhhhcCCCCC
Q 039411          172 ETVALHEIGHLLGLDHS  188 (222)
Q Consensus       172 ~~va~HEiGHaLGL~Hs  188 (222)
                      ..|++|||||.|||.|+
T Consensus       108 ~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  108 VDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             TTHHHHHHHHHTT----
T ss_pred             ceEeeehhhHhcCCCCC
Confidence            37999999999999995


No 30 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.45  E-value=7.6e-05  Score=60.14  Aligned_cols=27  Identities=33%  Similarity=0.529  Sum_probs=23.9

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCceee
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIMYA  197 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM~p  197 (222)
                      +..+++||+||.|||.|-.++.-+|+.
T Consensus       125 ~~k~~~HElGH~lGL~HC~~~~CvM~~  151 (179)
T PRK13267        125 VRKEVTHELGHTLGLEHCDNPRCVMNF  151 (179)
T ss_pred             HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence            345699999999999999999999984


No 31 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.43  E-value=4.7e-05  Score=61.79  Aligned_cols=23  Identities=35%  Similarity=0.395  Sum_probs=14.8

Q ss_pred             cchhHHHHHhhhhhcCCCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~~  191 (222)
                      .....|++|||||.||+.|..+.
T Consensus       140 ~~~~~~~AHEiGH~lGa~HD~~~  162 (196)
T PF13688_consen  140 YNGAITFAHEIGHNLGAPHDGDY  162 (196)
T ss_dssp             HHHHHHHHHHHHHHTT-----SS
T ss_pred             CceehhhHHhHHHhcCCCCCCCC
Confidence            34557999999999999997765


No 32 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.42  E-value=0.00017  Score=58.56  Aligned_cols=48  Identities=21%  Similarity=0.285  Sum_probs=32.7

Q ss_pred             chhHHHHHhhhhhcCCCCCCC------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411          170 DLETVALHEIGHLLGLDHSLV------EGAIMYAYIDPGATKGLHTDDVQGIRAL  218 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~------~~siM~p~~~~~~~~~l~~~Di~~iq~L  218 (222)
                      ....|++|||||.|||.|-.+      .+-||+|.... ....+|.--..-++.+
T Consensus       130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~-~~~~fS~CS~~~~~~~  183 (194)
T cd04269         130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS-LTDAFSNCSYEDYQKF  183 (194)
T ss_pred             HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC-CCCCCChhhHHHHHHH
Confidence            345799999999999999543      35799987653 3445665555444443


No 33 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.37  E-value=3.4e-05  Score=64.40  Aligned_cols=24  Identities=46%  Similarity=0.858  Sum_probs=20.8

Q ss_pred             cchhHHHHHhhhhhcCCCCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSLVEG  192 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~~~  192 (222)
                      .++-.|++||+||.|||.|+....
T Consensus       135 ~n~g~t~~HEvGH~lGL~HtF~~~  158 (225)
T cd04275         135 YNLGDTATHEVGHWLGLYHTFQGG  158 (225)
T ss_pred             ccccceeEEeccceeeeeeeecCC
Confidence            466789999999999999988665


No 34 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=97.28  E-value=0.00012  Score=59.53  Aligned_cols=29  Identities=38%  Similarity=0.548  Sum_probs=20.0

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIMYAYID  200 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM~p~~~  200 (222)
                      +..+++||+||++||.|=.+ .-+|++..+
T Consensus       145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s~s  173 (194)
T PF07998_consen  145 VCKEAVHELGHLFGLDHCEN-RCVMNFSNS  173 (194)
T ss_dssp             HHHHHHHHHHHHTT----SS-TSTTS--SS
T ss_pred             HHHHHHHHHHHHcCCcCCCC-CCccCCCCC
Confidence            45799999999999999999 999997433


No 35 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.97  E-value=0.00095  Score=54.36  Aligned_cols=50  Identities=24%  Similarity=0.466  Sum_probs=31.6

Q ss_pred             cchhHHHHHhhhhhcCCCCCCC--------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411          169 FDLETVALHEIGHLLGLDHSLV--------EGAIMYAYIDPGATKGLHTDDVQGIRAL  218 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~--------~~siM~p~~~~~~~~~l~~~Di~~iq~L  218 (222)
                      ..+..+++|||||.||+.|-..        ..-||.|...+.....+|.--+..++.+
T Consensus       129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~fS~CS~~~~~~~  186 (199)
T PF01421_consen  129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKFSNCSRRQFEEF  186 (199)
T ss_dssp             HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSSTSSS-HHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCCCCCcCHHHHHHHHHH
Confidence            3455799999999999999665        2458998655333346777666666554


No 36 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.96  E-value=0.0012  Score=55.78  Aligned_cols=48  Identities=29%  Similarity=0.409  Sum_probs=33.4

Q ss_pred             hhHHHHHhhhhhcCCCCCCC-----C------CCceeeeeCCC---CCCCCCHhhHHHHHHH
Q 039411          171 LETVALHEIGHLLGLDHSLV-----E------GAIMYAYIDPG---ATKGLHTDDVQGIRAL  218 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~-----~------~siM~p~~~~~---~~~~l~~~Di~~iq~L  218 (222)
                      ...|++|||||.||+.|-..     +      .=||+|....+   ....++.--++.++.+
T Consensus       167 ~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~  228 (244)
T cd04270         167 SDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKV  228 (244)
T ss_pred             HHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHH
Confidence            34699999999999999554     1      24899976422   2357777666666543


No 37 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=96.92  E-value=0.0013  Score=54.61  Aligned_cols=49  Identities=24%  Similarity=0.256  Sum_probs=33.2

Q ss_pred             chhHHHHHhhhhhcCCCCCCC------------------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411          170 DLETVALHEIGHLLGLDHSLV------------------EGAIMYAYIDPGATKGLHTDDVQGIRAL  218 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~------------------~~siM~p~~~~~~~~~l~~~Di~~iq~L  218 (222)
                      ....+++|||||.||+.|-.+                  .+-||.|.........++.--+.-++.+
T Consensus       144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~fS~CS~~~~~~~  210 (220)
T cd04272         144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNGERQYRFSQCSQRQIRNV  210 (220)
T ss_pred             ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCCCCCCccCcchHHHHHHH
Confidence            345799999999999999532                  2468987644223446777666666554


No 38 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=96.23  E-value=0.0093  Score=51.58  Aligned_cols=19  Identities=42%  Similarity=0.504  Sum_probs=16.6

Q ss_pred             hHHHHHhhhhhcCCCCCCC
Q 039411          172 ETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       172 ~~va~HEiGHaLGL~Hs~~  190 (222)
                      ..|++||+||+|||.+--+
T Consensus       166 igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             eeeeehhhhcccCCCcccc
Confidence            5799999999999987654


No 39 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=95.73  E-value=0.37  Score=39.97  Aligned_cols=145  Identities=19%  Similarity=0.136  Sum_probs=80.6

Q ss_pred             CCCCCCCceeEEEecCCChH---HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCC---------CCCCC
Q 039411           68 PRWPPSNSHLTYAFLPGTPA---EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGD---------SSPFD  135 (222)
Q Consensus        68 ~~W~~~~~~lty~~~~~~~~---~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd---------~~~f~  135 (222)
                      .||.   ..|+..+....+.   ...+.+..-++.-..++++.+..++.. +|+|.|.|...+...         ..++.
T Consensus        28 ~Rw~---~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~  103 (213)
T PF11150_consen   28 RRWE---GPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV  103 (213)
T ss_pred             ccCC---CCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence            5898   4688888844332   233556666777788999999888876 899999998765410         01110


Q ss_pred             C--------CCCeeeeecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCC-CCCC-CCCceeeeeCCCCCC
Q 039411          136 G--------RGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLD-HSLV-EGAIMYAYIDPGATK  205 (222)
Q Consensus       136 g--------~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~-Hs~~-~~siM~p~~~~~~~~  205 (222)
                      -        ...-++.++....+.+....... .+........+...+..||-.+|||. .|.. +.||-+-   .....
T Consensus       104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFND---d~~~~  179 (213)
T PF11150_consen  104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFND---DNEFA  179 (213)
T ss_pred             chhhhcccccceEEEEEEecCCCceEEEEEEE-EEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeC---CCccc
Confidence            0        00112222222111111100000 01111112346678888899999995 3332 3343321   12234


Q ss_pred             CCCHhhHHHHHHHhc
Q 039411          206 GLHTDDVQGIRALYN  220 (222)
Q Consensus       206 ~l~~~Di~~iq~LYg  220 (222)
                      .|+.-|..-|+.||-
T Consensus       180 ~LT~~D~lLLr~LYd  194 (213)
T PF11150_consen  180 LLTGFDELLLRMLYD  194 (213)
T ss_pred             cccHHHHHHHHHHcC
Confidence            799999999999995


No 40 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=95.62  E-value=0.0053  Score=53.39  Aligned_cols=21  Identities=33%  Similarity=0.564  Sum_probs=15.1

Q ss_pred             cchhHHHHHhhhhhcCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      .....+++||+||++||+|-.
T Consensus       191 ~s~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  191 YSYGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             S-SHHHHHHHHHHTTT--SST
T ss_pred             cCccceeehhhhhhcCCCCCC
Confidence            345689999999999999933


No 41 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=95.58  E-value=0.0082  Score=56.20  Aligned_cols=60  Identities=23%  Similarity=0.329  Sum_probs=30.4

Q ss_pred             CceEEEEEeecCCCCCCCCCCCCCeeeeec--------CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411          116 NANLKISFQRGWHGDSSPFDGRGGTIAHAA--------PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDH  187 (222)
Q Consensus       116 ~adI~I~f~~~~~gd~~~f~g~g~~la~a~--------~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~H  187 (222)
                      ++|+.|.......+        .+++|.|.        .|..|.|.|+...   +... ....+..|++|||.|+||+.-
T Consensus       159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLGFs~  226 (521)
T PF01457_consen  159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALGFSS  226 (521)
T ss_dssp             S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT-SH
T ss_pred             CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhH---ccch-hhhcccceeeeeeeeeeeecc
Confidence            57877665543221        15666663        3567888887643   2211 123567899999999999975


No 42 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.44  E-value=0.0088  Score=52.42  Aligned_cols=20  Identities=40%  Similarity=0.489  Sum_probs=17.4

Q ss_pred             chhHHHHHhhhhhcCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ....|++||+||-|||.|+.
T Consensus       149 ~~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  149 QDWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             ccHHHHHHHhhhhccccccC
Confidence            34569999999999999996


No 43 
>PTZ00337 surface protease GP63; Provisional
Probab=95.37  E-value=0.026  Score=53.20  Aligned_cols=37  Identities=27%  Similarity=0.387  Sum_probs=26.6

Q ss_pred             CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCC
Q 039411          146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHS  188 (222)
Q Consensus       146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs  188 (222)
                      .|..|.|.|+....-.      ....+.|++|||.||||+...
T Consensus       210 RPi~G~in~np~~i~~------~~~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        210 RPFAAAVNFDPRQIAV------TNGDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             CceEEEEEECHHHccc------hhHHHHHHHHHHHHHHccCHH
Confidence            3677889988643211      235678999999999999643


No 44 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=95.02  E-value=0.015  Score=52.90  Aligned_cols=25  Identities=44%  Similarity=0.519  Sum_probs=20.2

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCceee
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIMYA  197 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM~p  197 (222)
                      .....+|||||++||.|..+  .||--
T Consensus       316 ~iGA~lHEiGH~fg~pH~~~--GIM~R  340 (423)
T PF12044_consen  316 GIGAFLHEIGHLFGCPHQED--GIMLR  340 (423)
T ss_pred             hhHHHHHHHHHhcCCCCCCC--CeeeC
Confidence            34578999999999999875  67753


No 45 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=94.23  E-value=0.059  Score=51.07  Aligned_cols=61  Identities=21%  Similarity=0.242  Sum_probs=37.6

Q ss_pred             CceEEEEEeecCCCCCCCCCCCCCeeeeec--------CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411          116 NANLKISFQRGWHGDSSPFDGRGGTIAHAA--------PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDH  187 (222)
Q Consensus       116 ~adI~I~f~~~~~gd~~~f~g~g~~la~a~--------~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~H  187 (222)
                      ++|+.|......      +  ..+++|.|.        .|..|-|.|+....   .+ ........|++|||.|+||+.-
T Consensus       205 naD~vLYVta~P------~--~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-~~~~~~~rv~~HEi~HALGFS~  272 (622)
T PTZ00257        205 NADFVLYVASVP------S--EPGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-RYDQGTTRTVTHEVAHALGFSS  272 (622)
T ss_pred             ccCEEEEEEEec------C--CCCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-ccchHHHHHHHHHHHHHhcCCH
Confidence            577766655422      1  134555543        36778898876432   11 1113467899999999999964


Q ss_pred             C
Q 039411          188 S  188 (222)
Q Consensus       188 s  188 (222)
                      +
T Consensus       273 ~  273 (622)
T PTZ00257        273 V  273 (622)
T ss_pred             H
Confidence            3


No 46 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=93.77  E-value=0.082  Score=42.92  Aligned_cols=81  Identities=16%  Similarity=0.272  Sum_probs=43.1

Q ss_pred             HHHHHHHHHHHHHhccc---ceEEEecC-----CCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCc
Q 039411           89 AINAVGRAFQTWAAISH---FKFSQTQD-----YTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQW  160 (222)
Q Consensus        89 ~~~~i~~A~~~W~~~~~---l~F~ev~~-----~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w  160 (222)
                      .+.-++.+++.|+++..   ++=.||..     ..--|+.|.-.+             |.|..-|.|..-.+.+.+.. +
T Consensus        20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~-------------G~LTDHYDP~~kvvrLSe~~-y   85 (226)
T COG2738          20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVP-------------GTLTDHYDPRRKVVRLSEAN-Y   85 (226)
T ss_pred             HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeec-------------CCcccccChhhheeeccccc-c
Confidence            45678899999999843   33233321     112344443222             34444455655555443321 1


Q ss_pred             ccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411          161 AVGARPGAFDLETVALHEIGHLLGLDH  187 (222)
Q Consensus       161 ~~~~~~~~~~~~~va~HEiGHaLGL~H  187 (222)
                      . +   ....-..|++||.|||+==.|
T Consensus        86 ~-g---~Sia~~aVAAHEVGHAiQd~~  108 (226)
T COG2738          86 Y-G---PSIAAIAVAAHEVGHAIQDQE  108 (226)
T ss_pred             C-C---ccHHHHHHHHHHhhHHHhhhc
Confidence            1 1   113456799999999985443


No 47 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=93.28  E-value=0.079  Score=44.00  Aligned_cols=76  Identities=17%  Similarity=0.272  Sum_probs=39.9

Q ss_pred             HHHHHHHHHHHHhccc---ceEEEec-----CCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcc
Q 039411           90 INAVGRAFQTWAAISH---FKFSQTQ-----DYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWA  161 (222)
Q Consensus        90 ~~~i~~A~~~W~~~~~---l~F~ev~-----~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~  161 (222)
                      +.-++..|+.|+++..   ++=.|+.     ...-.|++|.-..+.-+|             -|.|.+..+.+.++. + 
T Consensus        18 q~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtD-------------HYdP~~k~vrLS~~v-y-   82 (222)
T PF04298_consen   18 QWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTD-------------HYDPRNKVVRLSEDV-Y-   82 (222)
T ss_pred             HHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCC-------------CcCCCCCEEEeCCcc-C-
Confidence            3456777777777632   2222221     112346666654432222             244666666664432 1 


Q ss_pred             cCCCCCccchhHHHHHhhhhhc
Q 039411          162 VGARPGAFDLETVALHEIGHLL  183 (222)
Q Consensus       162 ~~~~~~~~~~~~va~HEiGHaL  183 (222)
                       +.  ....-..||+||.|||+
T Consensus        83 -~~--~SiaAvaVAAHEvGHAi  101 (222)
T PF04298_consen   83 -NG--RSIAAVAVAAHEVGHAI  101 (222)
T ss_pred             -CC--CCHHHHHHHHHHHhHHH
Confidence             10  11234579999999996


No 48 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=93.06  E-value=0.082  Score=50.15  Aligned_cols=46  Identities=33%  Similarity=0.492  Sum_probs=35.1

Q ss_pred             hHHHHHhhhhhcCCCCCCC----------CCCceeeeeCCCC---CCCCCHhhHHHHHH
Q 039411          172 ETVALHEIGHLLGLDHSLV----------EGAIMYAYIDPGA---TKGLHTDDVQGIRA  217 (222)
Q Consensus       172 ~~va~HEiGHaLGL~Hs~~----------~~siM~p~~~~~~---~~~l~~~Di~~iq~  217 (222)
                      .-|++|||||..|=.|-.+          -+=|||++...|+   ..++++--+..|.+
T Consensus       393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~  451 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISK  451 (764)
T ss_pred             heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHH
Confidence            3489999999999999664          2348999988664   34788887777754


No 49 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=92.46  E-value=0.058  Score=40.32  Aligned_cols=18  Identities=44%  Similarity=0.835  Sum_probs=15.3

Q ss_pred             chhHHHHHhhhhhcCCCC
Q 039411          170 DLETVALHEIGHLLGLDH  187 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~H  187 (222)
                      -...|++|||||-.||..
T Consensus       108 ~vthvliHEIgHhFGLsD  125 (136)
T COG3824         108 QVTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             HhhhhhhhhhhhhcCCCh
Confidence            456799999999999974


No 50 
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=91.87  E-value=0.11  Score=42.73  Aligned_cols=32  Identities=16%  Similarity=0.228  Sum_probs=26.6

Q ss_pred             ChhhhhcCCCCCccCCCHHHHHHhcCCCCCCCC
Q 039411            1 MKTYQLNYHLNSTGVLDGNTVAMMTRPRCGVAD   33 (222)
Q Consensus         1 i~~~Q~~~~L~~TG~ld~~T~~~m~~pRCg~pD   33 (222)
                      |+.||+..||+++|.+|..|...|..| |..+.
T Consensus        45 V~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~~   76 (201)
T TIGR02869        45 VRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQS   76 (201)
T ss_pred             HHHHHHHhCCCCCCCcCHHHHHHhCCc-cccCc
Confidence            578999999999999999999999544 44443


No 51 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=91.50  E-value=0.088  Score=48.21  Aligned_cols=18  Identities=39%  Similarity=0.564  Sum_probs=15.4

Q ss_pred             CccchhHHHHHhhhhhcC
Q 039411          167 GAFDLETVALHEIGHLLG  184 (222)
Q Consensus       167 ~~~~~~~va~HEiGHaLG  184 (222)
                      ....+.+|++|||||+.|
T Consensus       126 ~esElagViAHEigHv~q  143 (484)
T COG4783         126 NESELAGVIAHEIGHVAQ  143 (484)
T ss_pred             CHHHHHHHHHHHHHHHhh
Confidence            345789999999999987


No 52 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=91.06  E-value=0.15  Score=46.23  Aligned_cols=24  Identities=33%  Similarity=0.509  Sum_probs=19.3

Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCcee
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIMY  196 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM~  196 (222)
                      ....+.|||||.||+.|..  ..||-
T Consensus       301 tlGA~~HElGH~lgcpHq~--~GvMl  324 (614)
T KOG4525|consen  301 TLGAVCHELGHCLGCPHQS--EGVML  324 (614)
T ss_pred             HHHHHHHHhhhccCCCCCC--CceEe
Confidence            3567899999999999965  55774


No 53 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=91.00  E-value=0.04  Score=52.68  Aligned_cols=20  Identities=40%  Similarity=0.338  Sum_probs=16.6

Q ss_pred             hhHHHHHhhhhhcCCCCCCC
Q 039411          171 LETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~  190 (222)
                      -..|++||+||.|||...-+
T Consensus       221 giGVfaHEfGH~LGLPDlYD  240 (645)
T PF05547_consen  221 GIGVFAHEFGHDLGLPDLYD  240 (645)
T ss_pred             ceEEEEeeccccCCCCCccc
Confidence            45799999999999986544


No 54 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=90.52  E-value=0.14  Score=43.80  Aligned_cols=24  Identities=33%  Similarity=0.548  Sum_probs=13.4

Q ss_pred             chhHHHHHhhhhh-cCCCCCCCCCC
Q 039411          170 DLETVALHEIGHL-LGLDHSLVEGA  193 (222)
Q Consensus       170 ~~~~va~HEiGHa-LGL~Hs~~~~s  193 (222)
                      ....|++||+||+ .||+.+-....
T Consensus       215 ~~~~v~vHE~GHsf~~LaDEY~~~~  239 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADEYYYGE  239 (264)
T ss_dssp             THHHHHHHHHHHHTT------ECS-
T ss_pred             cccceeeeecccccccccccccCCC
Confidence            5678999999996 45677665443


No 55 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=90.28  E-value=0.14  Score=49.60  Aligned_cols=45  Identities=27%  Similarity=0.251  Sum_probs=28.3

Q ss_pred             chhHHHHHhhhhhcCCCCCCC-------CCCceeeeeCC-CCCCCCCHhhHHHH
Q 039411          170 DLETVALHEIGHLLGLDHSLV-------EGAIMYAYIDP-GATKGLHTDDVQGI  215 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~-------~~siM~p~~~~-~~~~~l~~~Di~~i  215 (222)
                      ....|++|||||.||+.|-..       ..=|| +-+.. .....++.--++.+
T Consensus       322 ~~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~  374 (716)
T KOG3607|consen  322 AFAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF  374 (716)
T ss_pred             hHHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence            455799999999999999762       22367 54441 12335555444444


No 56 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=88.54  E-value=0.27  Score=35.62  Aligned_cols=16  Identities=31%  Similarity=0.706  Sum_probs=12.6

Q ss_pred             hhHHHHHhhhhhcCCC
Q 039411          171 LETVALHEIGHLLGLD  186 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~  186 (222)
                      ...|++||+||.+|+.
T Consensus        73 I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   73 IRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHcCCC
Confidence            4579999999999985


No 57 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=87.95  E-value=0.32  Score=46.22  Aligned_cols=27  Identities=22%  Similarity=0.276  Sum_probs=24.9

Q ss_pred             ChhhhhcCCCCCccCCCHHHHHHhcCC
Q 039411            1 MKTYQLNYHLNSTGVLDGNTVAMMTRP   27 (222)
Q Consensus         1 i~~~Q~~~~L~~TG~ld~~T~~~m~~p   27 (222)
                      +|+||+..||..+|+++..|++.|+.|
T Consensus       317 VK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        317 VKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             HHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            589999999999999999999998755


No 58 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=87.77  E-value=1.1  Score=38.89  Aligned_cols=40  Identities=33%  Similarity=0.424  Sum_probs=24.8

Q ss_pred             cCCCCceeeecCCCCcc-----cCCCCCccchhHHHHHhhhhh----cCCC
Q 039411          145 APPRGGIFDYDADEQWA-----VGARPGAFDLETVALHEIGHL----LGLD  186 (222)
Q Consensus       145 ~~P~~g~i~fd~~~~w~-----~~~~~~~~~~~~va~HEiGHa----LGL~  186 (222)
                      |-|.+..|.+|... |.     ++. .+......|++||+||.    ||+.
T Consensus       141 YCp~D~tIYlD~~f-~~~L~~~~ga-~G~~a~ayVlAHEyGHHVQ~l~Gil  189 (292)
T PF04228_consen  141 YCPADQTIYLDLSF-FDELQQRFGA-SGDFAQAYVLAHEYGHHVQNLLGIL  189 (292)
T ss_pred             eCCCCCEEEechHH-HHHHHHHhCC-ccHHHHHHHHHHHHHHHHHHHhhhH
Confidence            45788899998642 21     111 12233456999999996    5664


No 59 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=87.69  E-value=0.27  Score=43.21  Aligned_cols=38  Identities=24%  Similarity=0.305  Sum_probs=28.5

Q ss_pred             cCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC
Q 039411          145 APPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       145 ~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      +.|-.++.+.|.+-        ....+..|++||++|-+|...+.+
T Consensus       178 y~PFT~EA~vN~~~--------p~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  178 YNPFTGEANVNTDL--------PPYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             EcCCcceeecCCCC--------CcccccHHHHHHHHHHhCCCCHHH
Confidence            45767777776532        224677899999999999998774


No 60 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=86.56  E-value=0.33  Score=39.69  Aligned_cols=15  Identities=40%  Similarity=0.638  Sum_probs=12.4

Q ss_pred             chhHHHHHhhhhhcC
Q 039411          170 DLETVALHEIGHLLG  184 (222)
Q Consensus       170 ~~~~va~HEiGHaLG  184 (222)
                      ....|+.||+||++-
T Consensus        21 ~~l~t~~HE~gHal~   35 (200)
T PF13398_consen   21 RLLVTFVHELGHALA   35 (200)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456799999999964


No 61 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=86.10  E-value=3.7  Score=33.09  Aligned_cols=77  Identities=18%  Similarity=0.198  Sum_probs=44.7

Q ss_pred             HHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCCCccc
Q 039411           91 NAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFD  170 (222)
Q Consensus        91 ~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~  170 (222)
                      +.+..-+..|+...++.        ...|.|.-....-|.|               +..|.|.||-    -+-..+ ..-
T Consensus       112 ~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc---------------~~~~~I~ln~----~L~~~P-~~~  163 (205)
T PF01863_consen  112 EYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC---------------SSKGNITLNW----RLVMAP-PEV  163 (205)
T ss_pred             HHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC---------------CCCCcEEeec----ccccCC-ccH
Confidence            44455555566555542        4556666544333322               3357777753    222211 235


Q ss_pred             hhHHHHHhhhhhcCCCCCCCCCCce
Q 039411          171 LETVALHEIGHLLGLDHSLVEGAIM  195 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~~~siM  195 (222)
                      +..|++||+-|..-..||..--+.|
T Consensus       164 idYVvvHEL~Hl~~~nHs~~Fw~~v  188 (205)
T PF01863_consen  164 IDYVVVHELCHLRHPNHSKRFWALV  188 (205)
T ss_pred             HHHHHHHHHHHhccCCCCHHHHHHH
Confidence            6689999999999999987643333


No 62 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=84.86  E-value=0.44  Score=39.67  Aligned_cols=14  Identities=36%  Similarity=0.610  Sum_probs=11.9

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      .+..|+.||+||+|
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46779999999994


No 63 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=84.76  E-value=0.4  Score=39.76  Aligned_cols=16  Identities=31%  Similarity=0.370  Sum_probs=11.9

Q ss_pred             hHHHHHhhhhhcCCCC
Q 039411          172 ETVALHEIGHLLGLDH  187 (222)
Q Consensus       172 ~~va~HEiGHaLGL~H  187 (222)
                      .-|++||+||+|=-.+
T Consensus        73 rFtlAHELGH~llH~~   88 (213)
T COG2856          73 RFTLAHELGHALLHTD   88 (213)
T ss_pred             HHHHHHHHhHHHhccc
Confidence            3588999999964333


No 64 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=84.76  E-value=0.41  Score=34.60  Aligned_cols=15  Identities=33%  Similarity=0.485  Sum_probs=12.0

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .....+++|||||.+
T Consensus        40 ~~~~f~laHELgH~~   54 (122)
T PF06114_consen   40 ERQRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345679999999985


No 65 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=82.54  E-value=0.73  Score=37.90  Aligned_cols=110  Identities=21%  Similarity=0.320  Sum_probs=58.5

Q ss_pred             HHHHHHHHHHHhcccce--EEEec---------CCCCceEEEEEeecCC-CCCCC-CCCC-CCeee--eecCC-------
Q 039411           91 NAVGRAFQTWAAISHFK--FSQTQ---------DYTNANLKISFQRGWH-GDSSP-FDGR-GGTIA--HAAPP-------  147 (222)
Q Consensus        91 ~~i~~A~~~W~~~~~l~--F~ev~---------~~~~adI~I~f~~~~~-gd~~~-f~g~-g~~la--~a~~P-------  147 (222)
                      .-+..|++.|++-++..  |..+.         +..++++.|.+..+-- ....| |.+. -..++  .-++|       
T Consensus        49 ~dl~~aLaq~e~sgn~gdyfsQv~tspv~~L~~S~f~v~v~irwpq~l~n~~~ap~wq~a~~tava~wa~~fpl~ive~~  128 (236)
T COG5549          49 EDLEKALAQWENSGNGGDYFSQVETSPVGYLIWSEFPVDVRIRWPQNLENVEGAPRWQGAYLTAVAGWAKTFPLIIVERF  128 (236)
T ss_pred             ccHHHHHHHHHhcCCcccccceecccccceeecccCcceeeecccccccccccchhHHHHHHHHHHHHHHhCCceeeecc
Confidence            56788999999876543  54433         2236777887765221 10101 0000 00000  01345       


Q ss_pred             CCceeeecCCCCcccCCCCCc-cchhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411          148 RGGIFDYDADEQWAVGARPGA-FDLETVALHEIGHLLGLDHSLVEGAIMYAYID  200 (222)
Q Consensus       148 ~~g~i~fd~~~~w~~~~~~~~-~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~  200 (222)
                      +..||.+...+.=..+-...+ .....+.-|++||++||+|.+.+..||+|.+.
T Consensus       129 eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         129 EEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             eeeeEEEEecCCCCCcccccchHHHHHHHHHHhhhccCcccccccccccCCccc
Confidence            234555433221111111111 23455778999999999999999999987554


No 66 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=81.42  E-value=0.77  Score=37.43  Aligned_cols=20  Identities=40%  Similarity=0.592  Sum_probs=15.8

Q ss_pred             cchhHHHHHhhhhhcCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ..+..|++||+||.. ..|..
T Consensus        87 ~el~aVlaHElgH~~-~~h~~  106 (226)
T PF01435_consen   87 DELAAVLAHELGHIK-HRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHHH-TTHCC
T ss_pred             HHHHHHHHHHHHHHH-cCCcc
Confidence            478899999999998 44544


No 67 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=81.19  E-value=14  Score=33.79  Aligned_cols=24  Identities=13%  Similarity=0.210  Sum_probs=16.7

Q ss_pred             HHHHHHHHHHHHhcccceEEEecC
Q 039411           90 INAVGRAFQTWAAISHFKFSQTQD  113 (222)
Q Consensus        90 ~~~i~~A~~~W~~~~~l~F~ev~~  113 (222)
                      ..+++.-|..-+..-+++|+++..
T Consensus       107 ~~v~~gl~~~~~~lfgi~~~~~~~  130 (422)
T cd06456         107 EKVLDGLFELAERLYGITFKERTD  130 (422)
T ss_pred             HHHHHHHHHHHHHHcCeeEEECCC
Confidence            345566666677788889988654


No 68 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=80.99  E-value=0.91  Score=42.33  Aligned_cols=26  Identities=27%  Similarity=0.342  Sum_probs=24.3

Q ss_pred             ChhhhhcCCCCCccCCCHHHHHHhcC
Q 039411            1 MKTYQLNYHLNSTGVLDGNTVAMMTR   26 (222)
Q Consensus         1 i~~~Q~~~~L~~TG~ld~~T~~~m~~   26 (222)
                      ||+||...||+..|+++++|++.|+-
T Consensus       273 vKrfQ~~~GL~~DGviG~~T~~aLn~  298 (561)
T COG2989         273 VKRFQARHGLPADGVIGPATRAALNV  298 (561)
T ss_pred             HHHHHHHhCCCCCCcccHHHHHHhcc
Confidence            58999999999999999999999973


No 69 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=80.79  E-value=13  Score=34.24  Aligned_cols=25  Identities=0%  Similarity=-0.067  Sum_probs=19.1

Q ss_pred             HHHHHHHHHHHHHhcccceEEEecC
Q 039411           89 AINAVGRAFQTWAAISHFKFSQTQD  113 (222)
Q Consensus        89 ~~~~i~~A~~~W~~~~~l~F~ev~~  113 (222)
                      +..+++--|..-+..=+|+|+++..
T Consensus       132 l~~vl~gl~~~~~~lfgi~~~~~~~  156 (458)
T cd06457         132 LGTVMEGLSRLFSRLYGIRLVPVPL  156 (458)
T ss_pred             HHHHHHHHHHHHHHHhCeEEEecCC
Confidence            4456677777778889999999753


No 70 
>PRK02870 heat shock protein HtpX; Provisional
Probab=80.16  E-value=1.3  Score=39.26  Aligned_cols=33  Identities=15%  Similarity=0.260  Sum_probs=20.6

Q ss_pred             CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411          146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL  182 (222)
Q Consensus       146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa  182 (222)
                      .|..+.|-+.+.-.-.+    ....+..|++||+||.
T Consensus       152 ~~~~~~Ivvt~GLL~~L----~~dEL~aVlAHELgHi  184 (336)
T PRK02870        152 SEKSAMVAITTGLLEKL----DRDELQAVMAHELSHI  184 (336)
T ss_pred             CCCCcEEEEehHHhhhC----CHHHHHHHHHHHHHHH
Confidence            44566666644221111    2357889999999998


No 71 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=79.71  E-value=0.94  Score=36.57  Aligned_cols=12  Identities=42%  Similarity=0.822  Sum_probs=10.3

Q ss_pred             hHHHHHhhhhhc
Q 039411          172 ETVALHEIGHLL  183 (222)
Q Consensus       172 ~~va~HEiGHaL  183 (222)
                      ..|++||+||.+
T Consensus        10 ~~v~iHElGH~~   21 (182)
T cd06163          10 ILIFVHELGHFL   21 (182)
T ss_pred             HHHHHHHHHHHH
Confidence            468999999975


No 72 
>PRK03982 heat shock protein HtpX; Provisional
Probab=79.69  E-value=0.93  Score=39.13  Aligned_cols=15  Identities=40%  Similarity=0.536  Sum_probs=13.2

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      ..+..|++||+||.-
T Consensus       123 ~El~AVlAHElgHi~  137 (288)
T PRK03982        123 DELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            478999999999975


No 73 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=79.04  E-value=1  Score=38.61  Aligned_cols=16  Identities=44%  Similarity=0.684  Sum_probs=13.6

Q ss_pred             ccchhHHHHHhhhhhc
Q 039411          168 AFDLETVALHEIGHLL  183 (222)
Q Consensus       168 ~~~~~~va~HEiGHaL  183 (222)
                      ...+..|++||+||..
T Consensus       154 ~dEl~aVlaHElgHi~  169 (302)
T COG0501         154 DDELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3578999999999975


No 74 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=78.82  E-value=1  Score=37.00  Aligned_cols=14  Identities=50%  Similarity=0.731  Sum_probs=11.6

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      -+..|+.||+||++
T Consensus        37 l~~~v~iHElgH~~   50 (208)
T cd06161          37 LFLSVLLHELGHAL   50 (208)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35679999999985


No 75 
>PRK03072 heat shock protein HtpX; Provisional
Probab=78.68  E-value=1  Score=38.91  Aligned_cols=17  Identities=29%  Similarity=0.425  Sum_probs=13.9

Q ss_pred             ccchhHHHHHhhhhhcC
Q 039411          168 AFDLETVALHEIGHLLG  184 (222)
Q Consensus       168 ~~~~~~va~HEiGHaLG  184 (222)
                      ...+..|++||+||.-.
T Consensus       124 ~~El~aVlAHElgHi~~  140 (288)
T PRK03072        124 ERELRGVLGHELSHVYN  140 (288)
T ss_pred             HHHHHHHHHHHHHHHhc
Confidence            35789999999999653


No 76 
>PRK03001 M48 family peptidase; Provisional
Probab=78.59  E-value=1.1  Score=38.69  Aligned_cols=16  Identities=25%  Similarity=0.453  Sum_probs=13.6

Q ss_pred             ccchhHHHHHhhhhhc
Q 039411          168 AFDLETVALHEIGHLL  183 (222)
Q Consensus       168 ~~~~~~va~HEiGHaL  183 (222)
                      ...+..|++||+||.-
T Consensus       121 ~~El~aVlAHElgHi~  136 (283)
T PRK03001        121 EREIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3578999999999975


No 77 
>PRK01345 heat shock protein HtpX; Provisional
Probab=78.38  E-value=1.1  Score=39.42  Aligned_cols=19  Identities=21%  Similarity=0.321  Sum_probs=15.3

Q ss_pred             cchhHHHHHhhhhhcCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHS  188 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs  188 (222)
                      ..+..|++|||||.- ..|.
T Consensus       122 dEL~aVlAHElgHi~-~~d~  140 (317)
T PRK01345        122 EEVAGVMAHELAHVK-NRDT  140 (317)
T ss_pred             HHHHHHHHHHHHHHH-cCCH
Confidence            478999999999987 3553


No 78 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=75.99  E-value=22  Score=32.93  Aligned_cols=14  Identities=36%  Similarity=0.653  Sum_probs=11.7

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      +-..|+.||+||+|
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44568999999998


No 79 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=75.86  E-value=3  Score=28.89  Aligned_cols=30  Identities=13%  Similarity=0.317  Sum_probs=19.0

Q ss_pred             CceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411          149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL  183 (222)
Q Consensus       149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL  183 (222)
                      ..+|+|.... +...    ...-..+++||+.|++
T Consensus        44 G~~I~f~~g~-~~~~----s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen   44 GNDIYFAPGK-YNPD----SPEGRALLAHELAHVV   73 (79)
T ss_pred             CCEEEEcCCC-cCCC----CCCcchhHhHHHHHHH
Confidence            6788885532 1111    1234579999999985


No 80 
>PRK04897 heat shock protein HtpX; Provisional
Probab=75.39  E-value=1.5  Score=38.18  Aligned_cols=15  Identities=47%  Similarity=0.536  Sum_probs=13.2

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      ..+..|++||+||.-
T Consensus       135 ~El~aVlAHElgHi~  149 (298)
T PRK04897        135 EELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            578899999999965


No 81 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=75.39  E-value=1.5  Score=36.70  Aligned_cols=15  Identities=47%  Similarity=0.713  Sum_probs=11.9

Q ss_pred             hhHHHHHhhhhhcCC
Q 039411          171 LETVALHEIGHLLGL  185 (222)
Q Consensus       171 ~~~va~HEiGHaLGL  185 (222)
                      +..|++||+||++=-
T Consensus        53 ~~~v~iHElgH~~~A   67 (227)
T cd06164          53 FASVLLHELGHSLVA   67 (227)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457999999998633


No 82 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=75.26  E-value=1.4  Score=39.14  Aligned_cols=15  Identities=33%  Similarity=0.512  Sum_probs=12.5

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .+-..++.||+||++
T Consensus       152 ~~~v~tl~HE~GHa~  166 (365)
T cd06258         152 HDDINTLFHEFGHAV  166 (365)
T ss_pred             HHHHHHHHHHHhHHH
Confidence            455678999999998


No 83 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=74.86  E-value=0.87  Score=45.26  Aligned_cols=31  Identities=23%  Similarity=0.303  Sum_probs=24.4

Q ss_pred             chhHHHHHhhhhhcCCCCCCCCC------CceeeeeC
Q 039411          170 DLETVALHEIGHLLGLDHSLVEG------AIMYAYID  200 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~~~------siM~p~~~  200 (222)
                      ....+++||+||.+|+.|-...+      .||.|...
T Consensus       316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~  352 (845)
T KOG3538|consen  316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTC  352 (845)
T ss_pred             ccceeeeeccccccCcccCCcCCCCCCcccccCcccc
Confidence            34569999999999999976543      58888765


No 84 
>PRK05457 heat shock protein HtpX; Provisional
Probab=74.74  E-value=1.6  Score=37.78  Aligned_cols=15  Identities=40%  Similarity=0.609  Sum_probs=12.9

Q ss_pred             ccchhHHHHHhhhhh
Q 039411          168 AFDLETVALHEIGHL  182 (222)
Q Consensus       168 ~~~~~~va~HEiGHa  182 (222)
                      ...+..|++||+||.
T Consensus       131 ~~El~aVlAHElgHi  145 (284)
T PRK05457        131 RDEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            357889999999996


No 85 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=74.51  E-value=1.4  Score=39.64  Aligned_cols=15  Identities=33%  Similarity=0.490  Sum_probs=12.0

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .+-..+++||+||++
T Consensus       220 ~~~v~tl~HE~GHa~  234 (427)
T cd06459         220 LDDVFTLAHELGHAF  234 (427)
T ss_pred             hhhHHHHHHHhhHHH
Confidence            344678999999985


No 86 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=74.50  E-value=1.5  Score=40.25  Aligned_cols=25  Identities=12%  Similarity=0.199  Sum_probs=18.0

Q ss_pred             HHHHHHHHHHHHHhcccceEEEecC
Q 039411           89 AINAVGRAFQTWAAISHFKFSQTQD  113 (222)
Q Consensus        89 ~~~~i~~A~~~W~~~~~l~F~ev~~  113 (222)
                      +.+++..-+...+.+=+|+|+++..
T Consensus       136 ~~~~~~g~~~l~~~lfgi~~~~~~~  160 (458)
T PF01432_consen  136 LEEVLEGLFELAERLFGIRFEEVPD  160 (458)
T ss_dssp             HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred             HHHHHHHHHHHHHHHhcEEEEeccc
Confidence            3456666666777888899988764


No 87 
>PRK02391 heat shock protein HtpX; Provisional
Probab=73.87  E-value=1.7  Score=37.81  Aligned_cols=15  Identities=40%  Similarity=0.629  Sum_probs=12.8

Q ss_pred             ccchhHHHHHhhhhh
Q 039411          168 AFDLETVALHEIGHL  182 (222)
Q Consensus       168 ~~~~~~va~HEiGHa  182 (222)
                      ..++..|++||+||.
T Consensus       130 ~~El~aVlaHElgHi  144 (296)
T PRK02391        130 PDELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            357889999999995


No 88 
>PRK01265 heat shock protein HtpX; Provisional
Probab=73.72  E-value=1.7  Score=38.32  Aligned_cols=15  Identities=47%  Similarity=0.724  Sum_probs=12.8

Q ss_pred             ccchhHHHHHhhhhh
Q 039411          168 AFDLETVALHEIGHL  182 (222)
Q Consensus       168 ~~~~~~va~HEiGHa  182 (222)
                      ..++..|++||+||.
T Consensus       137 ~~El~aVlAHElgHi  151 (324)
T PRK01265        137 RDEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            357899999999994


No 89 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=73.24  E-value=2.2  Score=30.89  Aligned_cols=14  Identities=36%  Similarity=0.451  Sum_probs=12.0

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      .+..|++||+.|++
T Consensus        24 ~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   24 WLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45689999999995


No 90 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=72.18  E-value=1.9  Score=37.13  Aligned_cols=13  Identities=38%  Similarity=0.603  Sum_probs=11.1

Q ss_pred             hhHHHHHhhhhhc
Q 039411          171 LETVALHEIGHLL  183 (222)
Q Consensus       171 ~~~va~HEiGHaL  183 (222)
                      +..|++||+||++
T Consensus       135 ~isvvvHElgHal  147 (277)
T cd06162         135 LISGVVHEMGHGV  147 (277)
T ss_pred             HHHHHHHHHHHHH
Confidence            4579999999996


No 91 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=72.16  E-value=2.4  Score=40.28  Aligned_cols=15  Identities=27%  Similarity=0.474  Sum_probs=12.3

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .+-..|++||+||++
T Consensus       376 ~~dv~TLaHElGHa~  390 (591)
T TIGR00181       376 LNSVFTLAHELGHSM  390 (591)
T ss_pred             cchHHHHHHHhhhHH
Confidence            355679999999996


No 92 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=71.79  E-value=2  Score=36.77  Aligned_cols=15  Identities=27%  Similarity=0.350  Sum_probs=11.7

Q ss_pred             hhHHHHHhhhhhcCC
Q 039411          171 LETVALHEIGHLLGL  185 (222)
Q Consensus       171 ~~~va~HEiGHaLGL  185 (222)
                      ...+++||+||++=-
T Consensus       118 ~isv~iHElgHa~~A  132 (263)
T cd06159         118 VVGVVVHELSHGILA  132 (263)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456999999999633


No 93 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=71.23  E-value=2.1  Score=40.32  Aligned_cols=15  Identities=33%  Similarity=0.306  Sum_probs=12.5

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .+-..|++||+||++
T Consensus       335 ~~dv~TL~HElGHa~  349 (549)
T TIGR02289       335 SGDIDVLTHEAGHAF  349 (549)
T ss_pred             hhHHHHHHHHhhHHH
Confidence            455679999999997


No 94 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=70.27  E-value=1.9  Score=41.05  Aligned_cols=14  Identities=29%  Similarity=0.399  Sum_probs=11.8

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      +-..|++||+||++
T Consensus       374 ~~v~TL~HE~GHa~  387 (587)
T TIGR02290       374 RDVSTLAHELGHAY  387 (587)
T ss_pred             hhHHHHHHHhhHHH
Confidence            44578999999998


No 95 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=69.36  E-value=2.2  Score=33.39  Aligned_cols=12  Identities=33%  Similarity=0.396  Sum_probs=10.2

Q ss_pred             hHHHHHhhhhhc
Q 039411          172 ETVALHEIGHLL  183 (222)
Q Consensus       172 ~~va~HEiGHaL  183 (222)
                      +-|++|||+|.+
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            459999999974


No 96 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=68.56  E-value=2.6  Score=33.99  Aligned_cols=20  Identities=35%  Similarity=0.594  Sum_probs=14.3

Q ss_pred             hhHHHHHhhhhhcCCCCCCC
Q 039411          171 LETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~~  190 (222)
                      +..+..||+||++--..-.+
T Consensus        41 ~~~l~iHElgH~~~A~~~G~   60 (183)
T cd06160          41 LAILGIHEMGHYLAARRHGV   60 (183)
T ss_pred             HHHHHHHHHHHHHHHHHCCC
Confidence            45688999999976654443


No 97 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=67.72  E-value=2.6  Score=28.63  Aligned_cols=24  Identities=21%  Similarity=0.326  Sum_probs=18.2

Q ss_pred             ChhhhhcCC----CCCccCCCHHHHHHh
Q 039411            1 MKTYQLNYH----LNSTGVLDGNTVAMM   24 (222)
Q Consensus         1 i~~~Q~~~~----L~~TG~ld~~T~~~m   24 (222)
                      +|.+|+..|    +.+.|++.+.|++.+
T Consensus         3 ik~LQ~alg~~~~v~~DG~iGp~Tl~Al   30 (72)
T PF09374_consen    3 IKFLQRALGLGADVAVDGIIGPKTLAAL   30 (72)
T ss_dssp             HHHHHHHHT----S--SS--SHHHHHHH
T ss_pred             HHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence            467899999    999999999999998


No 98 
>PLN02791 Nudix hydrolase homolog
Probab=65.36  E-value=3.3  Score=40.74  Aligned_cols=21  Identities=19%  Similarity=0.142  Sum_probs=17.1

Q ss_pred             chhHHHHHhhhhhcCCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      .+..|++||+||.+|=.+.-.
T Consensus       584 f~~~v~lHElgHGsG~~~~~~  604 (770)
T PLN02791        584 FFTHTICHECCHGIGPHTITL  604 (770)
T ss_pred             HHHHHHHHHhhccccccceec
Confidence            457899999999999877544


No 99 
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=64.29  E-value=4.9  Score=34.32  Aligned_cols=23  Identities=35%  Similarity=0.534  Sum_probs=18.5

Q ss_pred             ccchhHHHHHhhhhhcCCCCCCC
Q 039411          168 AFDLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       168 ~~~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      ..+...|+.||+||.-|-.+--+
T Consensus       200 a~~yyaTl~HElghwtgh~~rl~  222 (316)
T COG4227         200 AINYYATLLHELGHWTGHEARLD  222 (316)
T ss_pred             hHhHHHHHHHHhccccCchhhhh
Confidence            45788999999999998766544


No 100
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=63.36  E-value=3.5  Score=37.23  Aligned_cols=19  Identities=37%  Similarity=0.515  Sum_probs=13.9

Q ss_pred             hhHHHHHhhhhhcCCCCCC
Q 039411          171 LETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~Hs~  189 (222)
                      +..+..||+||||--.-+.
T Consensus       131 vi~~vvHElGHalAA~seg  149 (484)
T KOG2921|consen  131 VITVVVHELGHALAAASEG  149 (484)
T ss_pred             HHHHHHHHhhHHHHHHhcC
Confidence            4567899999998654443


No 101
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=61.84  E-value=4.2  Score=32.65  Aligned_cols=13  Identities=38%  Similarity=0.652  Sum_probs=11.3

Q ss_pred             hhHHHHHhhhhhc
Q 039411          171 LETVALHEIGHLL  183 (222)
Q Consensus       171 ~~~va~HEiGHaL  183 (222)
                      +.++.+||+||++
T Consensus         9 ~~~i~~HE~aHa~   21 (181)
T cd06158           9 LLAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHHH
Confidence            4578999999998


No 102
>PF14891 Peptidase_M91:  Effector protein
Probab=61.23  E-value=8.7  Score=30.55  Aligned_cols=17  Identities=24%  Similarity=0.102  Sum_probs=13.2

Q ss_pred             chhHHHHHhhhhhcCCC
Q 039411          170 DLETVALHEIGHLLGLD  186 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~  186 (222)
                      .-+.++.|||.||.-..
T Consensus       102 ~p~v~L~HEL~HA~~~~  118 (174)
T PF14891_consen  102 PPFVVLYHELIHAYDYM  118 (174)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            34679999999997543


No 103
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=61.08  E-value=5.2  Score=31.38  Aligned_cols=27  Identities=19%  Similarity=0.318  Sum_probs=23.9

Q ss_pred             ChhhhhcCCC-CCccCCCHHHHHHhcCC
Q 039411            1 MKTYQLNYHL-NSTGVLDGNTVAMMTRP   27 (222)
Q Consensus         1 i~~~Q~~~~L-~~TG~ld~~T~~~m~~p   27 (222)
                      |+.||+..+| +++|.++++|...+...
T Consensus        78 v~~fQ~~~gl~~~dG~~g~~t~~al~~~  105 (185)
T COG3409          78 VRAFQQKNGLSPVDGIVGPATRAALPSQ  105 (185)
T ss_pred             HHHHhhhcCcccccccccHHHHHHHHHH
Confidence            4789999999 89999999998888654


No 104
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=60.54  E-value=3.8  Score=37.20  Aligned_cols=15  Identities=47%  Similarity=0.705  Sum_probs=11.2

Q ss_pred             hHHHHHhhhhhc---CCC
Q 039411          172 ETVALHEIGHLL---GLD  186 (222)
Q Consensus       172 ~~va~HEiGHaL---GL~  186 (222)
                      ..+++||+||+|   ||.
T Consensus       160 l~t~iHE~GHalye~~l~  177 (396)
T cd06460         160 LFSTIHETGHALYEQGLP  177 (396)
T ss_pred             HHHHHHHhhHHHHHhcCC
Confidence            347799999995   554


No 105
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=60.54  E-value=4.7  Score=37.19  Aligned_cols=12  Identities=50%  Similarity=0.728  Sum_probs=9.0

Q ss_pred             hhHHHHHhhhhhc
Q 039411          171 LETVALHEIGHLL  183 (222)
Q Consensus       171 ~~~va~HEiGHaL  183 (222)
                      +++| +||.||||
T Consensus       260 L~g~-iHE~GHAl  271 (497)
T COG2317         260 LFGT-IHETGHAL  271 (497)
T ss_pred             HHHH-HHhhhhHH
Confidence            4444 89999995


No 106
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=60.51  E-value=24  Score=34.06  Aligned_cols=33  Identities=15%  Similarity=0.298  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHHhcccceEEEecCC--CCceEEEEE
Q 039411           91 NAVGRAFQTWAAISHFKFSQTQDY--TNANLKISF  123 (222)
Q Consensus        91 ~~i~~A~~~W~~~~~l~F~ev~~~--~~adI~I~f  123 (222)
                      .++..-|......-+|.|.|+++.  -.+|++|.-
T Consensus       392 ~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y~  426 (718)
T KOG2089|consen  392 VVLSGLFGIYQTLFGLKFEEATDAEVWHADVRVYT  426 (718)
T ss_pred             HHHHHHHHHHHHHhCceeeecCCchhcccceeEEe
Confidence            466777888888999999998863  257888643


No 107
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=60.07  E-value=4  Score=39.04  Aligned_cols=15  Identities=27%  Similarity=0.355  Sum_probs=12.1

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      ..-..|++||+||++
T Consensus       378 ~~dV~TLaHElGHs~  392 (598)
T COG1164         378 LRDVFTLAHELGHSV  392 (598)
T ss_pred             hhHHHHHHHHccHHH
Confidence            345579999999996


No 108
>PRK10911 oligopeptidase A; Provisional
Probab=58.56  E-value=4.2  Score=39.51  Aligned_cols=22  Identities=18%  Similarity=0.191  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHhcccceEEEec
Q 039411           91 NAVGRAFQTWAAISHFKFSQTQ  112 (222)
Q Consensus        91 ~~i~~A~~~W~~~~~l~F~ev~  112 (222)
                      .++..-|..-+..=+++|.++.
T Consensus       361 ~v~~gl~~~~~~Lfgi~~~e~~  382 (680)
T PRK10911        361 KAVNGLFEVVKRIYGITAKERK  382 (680)
T ss_pred             HHHHHHHHHHHHHcCeeEEecC
Confidence            4455566666677778898864


No 109
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=57.98  E-value=5.3  Score=32.88  Aligned_cols=15  Identities=33%  Similarity=0.543  Sum_probs=12.8

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      .....+|.||-||||
T Consensus        26 ~~~~~~A~HEAGhAv   40 (213)
T PF01434_consen   26 EEKRRIAYHEAGHAV   40 (213)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            466789999999995


No 110
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=57.67  E-value=5.9  Score=35.27  Aligned_cols=16  Identities=31%  Similarity=0.617  Sum_probs=13.5

Q ss_pred             ccchhHHHHHhhhhhc
Q 039411          168 AFDLETVALHEIGHLL  183 (222)
Q Consensus       168 ~~~~~~va~HEiGHaL  183 (222)
                      ...+..|+.||+||+.
T Consensus       121 ~sEvAAVl~HEmgHVt  136 (479)
T COG4784         121 SSEVAAVLAHEMGHVT  136 (479)
T ss_pred             HHHHHHHHHhhhhhee
Confidence            3578899999999985


No 111
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=57.09  E-value=5.5  Score=36.36  Aligned_cols=14  Identities=43%  Similarity=0.639  Sum_probs=12.4

Q ss_pred             cchhHHHHHhhhhh
Q 039411          169 FDLETVALHEIGHL  182 (222)
Q Consensus       169 ~~~~~va~HEiGHa  182 (222)
                      ..+.+|++||+||-
T Consensus       278 eel~AVl~HELGHW  291 (428)
T KOG2719|consen  278 EELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHHhhHH
Confidence            46889999999996


No 112
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=56.92  E-value=13  Score=33.29  Aligned_cols=47  Identities=23%  Similarity=0.115  Sum_probs=23.7

Q ss_pred             CCeeeeecCCCCceeeecCCCCcccCCC---CCccchhHHHHHhhhhhcCC
Q 039411          138 GGTIAHAAPPRGGIFDYDADEQWAVGAR---PGAFDLETVALHEIGHLLGL  185 (222)
Q Consensus       138 g~~la~a~~P~~g~i~fd~~~~w~~~~~---~~~~~~~~va~HEiGHaLGL  185 (222)
                      .|++|.+. |..+.|.+-....+.+...   -.+.|.-.+++||+-|+.+.
T Consensus       262 ~~vlAYT~-p~~~~I~~Cp~ff~~lp~~~~~C~~qDqatt~LHE~TH~~~V  311 (359)
T PF02102_consen  262 SGVLAYTL-PSQNQIVNCPIFFSDLPALSNRCHAQDQATTTLHEMTHAPAV  311 (359)
T ss_dssp             TT--EEEE-GGGTEEEE-HHHHHH--SS--STT---HHHHHHHHHHT-TTT
T ss_pred             CCeEEEEE-cCCCeEEECchhhccCCCccccccCCCccchhhhhhhccccc
Confidence            46888886 4455665432221122221   13457778999999999987


No 113
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=56.53  E-value=81  Score=25.91  Aligned_cols=38  Identities=16%  Similarity=-0.003  Sum_probs=23.5

Q ss_pred             CCCceeeecCCCCcccCC-CCCccchhHHHHHhhhhhcC
Q 039411          147 PRGGIFDYDADEQWAVGA-RPGAFDLETVALHEIGHLLG  184 (222)
Q Consensus       147 P~~g~i~fd~~~~w~~~~-~~~~~~~~~va~HEiGHaLG  184 (222)
                      +....|||..+..=.... ......+..|+.||+-|+.=
T Consensus        71 ~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q  109 (205)
T PF04450_consen   71 DDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQ  109 (205)
T ss_pred             CCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhh
Confidence            456789997643211111 11124688999999999943


No 114
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=56.21  E-value=5.6  Score=36.26  Aligned_cols=12  Identities=42%  Similarity=0.844  Sum_probs=10.0

Q ss_pred             hHHHHHhhhhhc
Q 039411          172 ETVALHEIGHLL  183 (222)
Q Consensus       172 ~~va~HEiGHaL  183 (222)
                      ..|++||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            469999999964


No 115
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=54.77  E-value=5.4  Score=36.67  Aligned_cols=12  Identities=33%  Similarity=0.744  Sum_probs=9.5

Q ss_pred             hHHHHHhhhhhc
Q 039411          172 ETVALHEIGHLL  183 (222)
Q Consensus       172 ~~va~HEiGHaL  183 (222)
                      .-|++||+||-|
T Consensus        16 ~li~vHElGHfl   27 (449)
T PRK10779         16 VLITVHEFGHFW   27 (449)
T ss_pred             HHHHHHHHHHHH
Confidence            358899999953


No 116
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=52.69  E-value=6.9  Score=36.99  Aligned_cols=15  Identities=47%  Similarity=0.689  Sum_probs=11.3

Q ss_pred             hhHHHHHhh-hhhcCC
Q 039411          171 LETVALHEI-GHLLGL  185 (222)
Q Consensus       171 ~~~va~HEi-GHaLGL  185 (222)
                      ...|++||+ ||..|=
T Consensus       293 ~~~v~lHEllGHGsGk  308 (549)
T PF03571_consen  293 FVQVGLHELLGHGSGK  308 (549)
T ss_dssp             HHHHHHHHHCCCCS--
T ss_pred             hhhhhHHhhccCcCcc
Confidence            457999998 999984


No 117
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=51.08  E-value=7.9  Score=25.28  Aligned_cols=21  Identities=29%  Similarity=0.515  Sum_probs=17.1

Q ss_pred             chhHHHHHhhhhhcCCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      ....-++||+.+.+||.|.+.
T Consensus        28 ~~eR~~vH~lA~~~gL~s~S~   48 (60)
T cd02641          28 SHDRLLVHELAEELGLRHEST   48 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEee
Confidence            445678999999999998653


No 118
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=50.15  E-value=7.5  Score=33.42  Aligned_cols=39  Identities=23%  Similarity=0.216  Sum_probs=22.7

Q ss_pred             cCCCCceeeecCCCCcccC---CCCCccchhHHHHHhhhhhc
Q 039411          145 APPRGGIFDYDADEQWAVG---ARPGAFDLETVALHEIGHLL  183 (222)
Q Consensus       145 ~~P~~g~i~fd~~~~w~~~---~~~~~~~~~~va~HEiGHaL  183 (222)
                      |-|...++.||....-.+.   ...+.....-|++||.||.+
T Consensus       138 YCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHV  179 (295)
T COG2321         138 YCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHV  179 (295)
T ss_pred             ecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHH
Confidence            5578888888864211110   01122334569999999963


No 119
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=49.58  E-value=7.3  Score=37.92  Aligned_cols=32  Identities=16%  Similarity=0.228  Sum_probs=21.5

Q ss_pred             HHHH-HHHHHHHHhcccceEEEecCC--CCceEEE
Q 039411           90 INAV-GRAFQTWAAISHFKFSQTQDY--TNANLKI  121 (222)
Q Consensus        90 ~~~i-~~A~~~W~~~~~l~F~ev~~~--~~adI~I  121 (222)
                      ..++ .-.|..-+..=+|+|+++.+.  -..|+++
T Consensus       363 ~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~  397 (681)
T PRK10280        363 NTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV  397 (681)
T ss_pred             HHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence            3566 467777888899999996532  2456554


No 120
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=46.91  E-value=10  Score=33.66  Aligned_cols=19  Identities=42%  Similarity=0.518  Sum_probs=14.9

Q ss_pred             chhHHHHHhhhhhcCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~  189 (222)
                      .+..|+.||++|++ +.|+.
T Consensus       274 glAtvLgHE~aHaV-arH~A  292 (424)
T KOG2661|consen  274 GLATVLGHEIAHAV-ARHAA  292 (424)
T ss_pred             HHHHHHHHHHHHHH-HHHHH
Confidence            46789999999997 45654


No 121
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=44.96  E-value=9.5  Score=32.45  Aligned_cols=21  Identities=29%  Similarity=0.479  Sum_probs=14.9

Q ss_pred             ccchhHHHHHhhhhhcCCCCC
Q 039411          168 AFDLETVALHEIGHLLGLDHS  188 (222)
Q Consensus       168 ~~~~~~va~HEiGHaLGL~Hs  188 (222)
                      ..+-.+|++||+.|.|=+..-
T Consensus       151 ~~dg~NVviHEfAH~LD~~~g  171 (253)
T PF06167_consen  151 PNDGHNVVIHEFAHKLDMEDG  171 (253)
T ss_dssp             SSSS--HHHHHHHHHHHCTTS
T ss_pred             CCCCcchHHHHHHHHHHhhcC
Confidence            346678999999999977543


No 122
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=42.78  E-value=13  Score=30.60  Aligned_cols=16  Identities=31%  Similarity=0.351  Sum_probs=12.3

Q ss_pred             chhHHHHHhhhhh-cCC
Q 039411          170 DLETVALHEIGHL-LGL  185 (222)
Q Consensus       170 ~~~~va~HEiGHa-LGL  185 (222)
                      .+-+|++||++|| |=|
T Consensus        92 l~gsiLAHE~mHa~Lrl  108 (212)
T PF12315_consen   92 LTGSILAHELMHAWLRL  108 (212)
T ss_pred             HHhhHHHHHHHHHHhcc
Confidence            3457999999999 444


No 123
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=40.26  E-value=16  Score=29.47  Aligned_cols=17  Identities=24%  Similarity=0.325  Sum_probs=13.4

Q ss_pred             hhHHHHHhhhhhcCCCC
Q 039411          171 LETVALHEIGHLLGLDH  187 (222)
Q Consensus       171 ~~~va~HEiGHaLGL~H  187 (222)
                      +-.+++|||+|++.-..
T Consensus        36 lG~ilahel~hafd~~g   52 (206)
T PF01431_consen   36 LGFILAHELMHAFDPEG   52 (206)
T ss_dssp             HHHHHHHHHHHCTSTTG
T ss_pred             HHHHHHHHHHHHHHHhH
Confidence            55799999999886543


No 124
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=39.87  E-value=16  Score=23.99  Aligned_cols=23  Identities=26%  Similarity=0.284  Sum_probs=18.2

Q ss_pred             chhHHHHHhhhhhcCCCCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSLVEG  192 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~~~  192 (222)
                      .-..-++|.+-|-|||.|..+..
T Consensus        28 ~~eRriih~la~~lGL~~~s~G~   50 (60)
T cd02639          28 PAERRIVHLLASRLGLNHVSDGT   50 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEEeCC
Confidence            34467799999999999987644


No 125
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=38.98  E-value=13  Score=35.86  Aligned_cols=33  Identities=9%  Similarity=0.191  Sum_probs=23.2

Q ss_pred             HHHHHHHHHHHHhcccceEEEecCC--CCceEEEE
Q 039411           90 INAVGRAFQTWAAISHFKFSQTQDY--TNANLKIS  122 (222)
Q Consensus        90 ~~~i~~A~~~W~~~~~l~F~ev~~~--~~adI~I~  122 (222)
                      -.+|.-.|..-+..=+|+|+|..+.  -.+|+++.
T Consensus       364 ~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~  398 (683)
T COG0339         364 NKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF  398 (683)
T ss_pred             hHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence            3567777888888999999997432  24566643


No 126
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=38.55  E-value=16  Score=34.02  Aligned_cols=17  Identities=24%  Similarity=0.325  Sum_probs=13.3

Q ss_pred             cchhHHHHHhhhhhcCC
Q 039411          169 FDLETVALHEIGHLLGL  185 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL  185 (222)
                      .+-+.|+.||+||+...
T Consensus       245 ~~D~~t~~HE~GH~~yy  261 (477)
T cd06461         245 MEDFVTVHHEMGHIQYY  261 (477)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            34567889999999854


No 127
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.96  E-value=14  Score=35.19  Aligned_cols=22  Identities=9%  Similarity=0.075  Sum_probs=14.2

Q ss_pred             ccCCCHHH--HHHhcCCCCCCCCC
Q 039411           13 TGVLDGNT--VAMMTRPRCGVADI   34 (222)
Q Consensus        13 TG~ld~~T--~~~m~~pRCg~pD~   34 (222)
                      .++.|.+.  +..|++.-|+.++.
T Consensus       318 ~~~~~kel~~i~~mk~k~~~~~~~  341 (704)
T KOG2090|consen  318 SEKTDKELAVIRDMKKKENNNNNA  341 (704)
T ss_pred             hHHHHHHHHHHHHHHhhhcCCCCC
Confidence            34444443  67788888887765


No 128
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.57  E-value=9.7  Score=36.07  Aligned_cols=14  Identities=43%  Similarity=0.589  Sum_probs=11.3

Q ss_pred             hhHHHHHhhhhhcC
Q 039411          171 LETVALHEIGHLLG  184 (222)
Q Consensus       171 ~~~va~HEiGHaLG  184 (222)
                      +-.|+.|||||..-
T Consensus       487 IGaVIgHEI~HgFD  500 (654)
T COG3590         487 IGAVIGHEIGHGFD  500 (654)
T ss_pred             ccceehhhhccccc
Confidence            45799999999753


No 129
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=36.32  E-value=14  Score=31.57  Aligned_cols=26  Identities=23%  Similarity=0.300  Sum_probs=18.3

Q ss_pred             hHHHHHhhhhh-cCCCCCCCCCCceeeeeC
Q 039411          172 ETVALHEIGHL-LGLDHSLVEGAIMYAYID  200 (222)
Q Consensus       172 ~~va~HEiGHa-LGL~Hs~~~~siM~p~~~  200 (222)
                      -.|++||-+|. -|+-.+.....   |+|.
T Consensus       150 grv~VhEWAhlRWGVFDEY~~d~---pfY~  176 (262)
T PF08434_consen  150 GRVFVHEWAHLRWGVFDEYNNDK---PFYI  176 (262)
T ss_pred             ceeeeehhhhhccccccccCCCC---ceeE
Confidence            36999999998 88876654332   5554


No 130
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=35.21  E-value=18  Score=30.93  Aligned_cols=20  Identities=20%  Similarity=0.298  Sum_probs=16.6

Q ss_pred             chhHHHHHhhhhhcCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~  189 (222)
                      +-.+|++|||-|.|=+.+..
T Consensus       141 dg~NVvIHEFAH~LDm~~G~  160 (260)
T PRK15410        141 SGFNLIIHEVAHKLDMRNGD  160 (260)
T ss_pred             CCcchhHhHHHhHhhhhcCC
Confidence            45789999999999887744


No 131
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=34.86  E-value=9  Score=29.99  Aligned_cols=39  Identities=23%  Similarity=0.148  Sum_probs=23.7

Q ss_pred             eeeeecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411          140 TIAHAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL  183 (222)
Q Consensus       140 ~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL  183 (222)
                      +-+-||.+.+ +|-||....-.    .+...+..|+.||+-|.+
T Consensus        35 taG~ayL~~~-~I~lNP~ll~e----n~~~f~~~vV~HELaHl~   73 (156)
T COG3091          35 TAGGAYLLKS-EIRLNPKLLEE----NGEDFIEQVVPHELAHLH   73 (156)
T ss_pred             hcchhhcccc-ccccCHHHHHH----ccHHHHHHHHHHHHHHHH
Confidence            3345677765 77776522111    122356789999999975


No 132
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=34.49  E-value=20  Score=27.59  Aligned_cols=32  Identities=19%  Similarity=0.246  Sum_probs=19.9

Q ss_pred             CCceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411          148 RGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL  183 (222)
Q Consensus       148 ~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL  183 (222)
                      ....|.|+.    .+-.......+..|++||+-|++
T Consensus        40 ~~~~I~ln~----~l~~~~~~~~l~~~l~HEm~H~~   71 (146)
T smart00731       40 KSAEIRLNP----KLLTENGRDRLRETLLHELCHAA   71 (146)
T ss_pred             CCCEEEeCH----HHHhhccHHHHHhhHHHHHHHHH
Confidence            356777753    12111122467789999999996


No 133
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=33.80  E-value=15  Score=34.30  Aligned_cols=15  Identities=47%  Similarity=0.795  Sum_probs=10.6

Q ss_pred             HHHHhhhhhc---CCCCC
Q 039411          174 VALHEIGHLL---GLDHS  188 (222)
Q Consensus       174 va~HEiGHaL---GL~Hs  188 (222)
                      -++||.||+|   ||...
T Consensus       262 s~iHE~GHalYEq~~~~~  279 (494)
T PF02074_consen  262 STIHETGHALYEQGLPPE  279 (494)
T ss_dssp             HHHHHHHHHHHHHTS-GG
T ss_pred             HHHHHHHHHHHHcCCChh
Confidence            4489999995   77543


No 134
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=33.48  E-value=23  Score=27.17  Aligned_cols=16  Identities=31%  Similarity=0.409  Sum_probs=13.6

Q ss_pred             cchhHHHHHhhhhhcC
Q 039411          169 FDLETVALHEIGHLLG  184 (222)
Q Consensus       169 ~~~~~va~HEiGHaLG  184 (222)
                      ..+..|++||+=|+.=
T Consensus        58 ~~~~~tL~HEm~H~~~   73 (157)
T PF10263_consen   58 EELIDTLLHEMAHAAA   73 (157)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            4678899999999964


No 135
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=33.38  E-value=1.2e+02  Score=24.47  Aligned_cols=19  Identities=21%  Similarity=0.258  Sum_probs=15.6

Q ss_pred             cchhHHHHHhhhhhcCCCC
Q 039411          169 FDLETVALHEIGHLLGLDH  187 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~H  187 (222)
                      ..+..+++||+-|++=+..
T Consensus        63 ~~l~~~iaHE~hH~~r~~~   81 (195)
T PF10026_consen   63 EELPALIAHEYHHNCRYEQ   81 (195)
T ss_pred             HHHHHHHHHHHHHHHHHhc
Confidence            4788999999999976554


No 136
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=32.91  E-value=21  Score=34.43  Aligned_cols=14  Identities=43%  Similarity=0.698  Sum_probs=12.0

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      ....|+.||+||||
T Consensus       407 ~~~~~a~he~gha~  420 (644)
T PRK10733        407 QKESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45679999999996


No 137
>PHA00657 crystallin beta/gamma motif-containing protein
Probab=32.33  E-value=34  Score=35.69  Aligned_cols=13  Identities=38%  Similarity=0.475  Sum_probs=11.2

Q ss_pred             chhHHHHHhhhhh
Q 039411          170 DLETVALHEIGHL  182 (222)
Q Consensus       170 ~~~~va~HEiGHa  182 (222)
                      .-+++.+||+||.
T Consensus       790 AD~STFvHE~gH~  802 (2052)
T PHA00657        790 ADLSTFLHESGHF  802 (2052)
T ss_pred             ccHHHHHHHHHHH
Confidence            4568999999998


No 138
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=31.01  E-value=21  Score=30.67  Aligned_cols=10  Identities=40%  Similarity=0.547  Sum_probs=8.4

Q ss_pred             HHHHhhhhhc
Q 039411          174 VALHEIGHLL  183 (222)
Q Consensus       174 va~HEiGHaL  183 (222)
                      -..|||||.+
T Consensus       222 G~~HE~GH~~  231 (307)
T PF13402_consen  222 GPWHELGHNH  231 (307)
T ss_dssp             HHHHHHHHHH
T ss_pred             eehhhhhhhc
Confidence            4689999985


No 139
>CHL00176 ftsH cell division protein; Validated
Probab=30.67  E-value=21  Score=34.49  Aligned_cols=14  Identities=36%  Similarity=0.752  Sum_probs=11.9

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      ....||.||.||||
T Consensus       437 ~~~~vA~hEaGhA~  450 (638)
T CHL00176        437 NKRLIAYHEVGHAI  450 (638)
T ss_pred             HHHHHHHHhhhhHH
Confidence            45679999999997


No 140
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=30.33  E-value=1e+02  Score=26.36  Aligned_cols=78  Identities=17%  Similarity=0.254  Sum_probs=44.4

Q ss_pred             HHHHHHHHHHHHHh--cccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCC
Q 039411           89 AINAVGRAFQTWAA--ISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARP  166 (222)
Q Consensus        89 ~~~~i~~A~~~W~~--~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~  166 (222)
                      +....+++...|+.  |..+.|.- ... ++.|+..++.  .|       .|+.        .|.|++     |-... .
T Consensus        81 ~ek~l~~~~~k~~~pnv~~v~f~V-~~~-dp~irlqkg~--kg-------gg~~--------~~ki~l-----~l~p~-~  135 (280)
T COG5504          81 CEKYLPELIVKWYGPNVPIVDFPV-TPG-DPQIRLQKGR--KG-------GGGI--------PGKIML-----WLVPS-S  135 (280)
T ss_pred             HHHHHHHHHHhhcCCCCceeeecc-cCC-CchHHHhhcc--cC-------CCCc--------CceEEE-----EEecC-C
Confidence            45567778888865  45566633 222 3445554443  11       1221        355554     21111 1


Q ss_pred             CccchhHHHHHhhhhhcCCCCCCCC
Q 039411          167 GAFDLETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       167 ~~~~~~~va~HEiGHaLGL~Hs~~~  191 (222)
                      .-.+..++++||+=|..-|.+-.-.
T Consensus       136 ~~~~v~aliaHE~HH~~R~~~i~~~  160 (280)
T COG5504         136 TITSVPALIAHEYHHNCRLRYIDYG  160 (280)
T ss_pred             CccchHHHHHHHHHhhheecccccC
Confidence            2236789999999999999887654


No 141
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=30.18  E-value=30  Score=22.58  Aligned_cols=20  Identities=25%  Similarity=0.380  Sum_probs=16.5

Q ss_pred             chhHHHHHhhhhhcCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ....-++|++-+.+||.|.+
T Consensus        28 ~~eR~~vH~~a~~~gL~s~S   47 (60)
T cd02640          28 KEERALIHQIAQKYGLKSRS   47 (60)
T ss_pred             HHHHHHHHHHHHHcCCceee
Confidence            45568899999999999864


No 142
>PRK04860 hypothetical protein; Provisional
Probab=29.68  E-value=15  Score=29.08  Aligned_cols=32  Identities=16%  Similarity=0.135  Sum_probs=19.4

Q ss_pred             CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411          147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL  182 (222)
Q Consensus       147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa  182 (222)
                      .....|.||...   +.. .+...+..|+.||+-|.
T Consensus        43 l~~~~I~~Np~l---l~~-~~~~~l~~~v~HEl~H~   74 (160)
T PRK04860         43 LQSNEIRLNPVL---LLE-NQQAFIDEVVPHELAHL   74 (160)
T ss_pred             HhcCCeeeCHHH---Hhh-CcHHHHHhHHHHHHHHH
Confidence            345667776421   111 12346778999999986


No 143
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.63  E-value=15  Score=34.63  Aligned_cols=18  Identities=44%  Similarity=0.560  Sum_probs=15.6

Q ss_pred             HHHHHhhhhhcCCCCCCC
Q 039411          173 TVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       173 ~va~HEiGHaLGL~Hs~~  190 (222)
                      .|.+||.||-|||..+.+
T Consensus       348 Gv~ahEyghdlglPDEyd  365 (760)
T COG4412         348 GVEAHEYGHDLGLPDEYD  365 (760)
T ss_pred             ceeeeecccccCCCcccc
Confidence            489999999999997654


No 144
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=28.56  E-value=38  Score=30.48  Aligned_cols=16  Identities=19%  Similarity=0.320  Sum_probs=13.7

Q ss_pred             cchhHHHHHhhhhhcC
Q 039411          169 FDLETVALHEIGHLLG  184 (222)
Q Consensus       169 ~~~~~va~HEiGHaLG  184 (222)
                      ....+|++||+=|.+=
T Consensus       137 ~~~~sTlAHEfQHmIn  152 (366)
T PF10460_consen  137 DTVYSTLAHEFQHMIN  152 (366)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4578999999999983


No 145
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=28.48  E-value=49  Score=26.51  Aligned_cols=35  Identities=17%  Similarity=0.030  Sum_probs=21.8

Q ss_pred             cCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCC
Q 039411          145 APPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGL  185 (222)
Q Consensus       145 ~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL  185 (222)
                      |.|....|.+-.+...      ....+..|++||+=||.--
T Consensus        51 f~p~~~~I~lC~N~~~------~~~~l~~~l~HELIHayD~   85 (173)
T PF09768_consen   51 FDPSKKGIVLCQNRIR------SQGHLEDTLTHELIHAYDH   85 (173)
T ss_pred             ccCCCCCEEEeeCCCC------CHHHHHHHHHHHHHHHHHH
Confidence            5564444555443322      2347889999999999643


No 146
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=28.23  E-value=33  Score=22.31  Aligned_cols=20  Identities=25%  Similarity=0.423  Sum_probs=16.7

Q ss_pred             chhHHHHHhhhhhcCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSL  189 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~  189 (222)
                      ....-++|++-+.+||.|.+
T Consensus        27 ~~eR~~vH~~a~~~gL~s~S   46 (59)
T cd06007          27 NHERAVIHRLCRKLGLKSKS   46 (59)
T ss_pred             HHHHHHHHHHHHHcCCCcee
Confidence            45568899999999999864


No 147
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=27.98  E-value=31  Score=32.71  Aligned_cols=20  Identities=40%  Similarity=0.391  Sum_probs=15.1

Q ss_pred             CccchhHHHHHhhhhh-cCCC
Q 039411          167 GAFDLETVALHEIGHL-LGLD  186 (222)
Q Consensus       167 ~~~~~~~va~HEiGHa-LGL~  186 (222)
                      +...+..|++|||-|+ -|..
T Consensus       284 GDrsl~~vIaHEIAHSWtGNl  304 (613)
T KOG1047|consen  284 GDRSLVDVIAHEIAHSWTGNL  304 (613)
T ss_pred             CCcchhhHHHHHhhhhhcccc
Confidence            4467789999999887 5554


No 148
>PRK04351 hypothetical protein; Provisional
Probab=26.59  E-value=27  Score=27.26  Aligned_cols=32  Identities=16%  Similarity=0.216  Sum_probs=19.6

Q ss_pred             CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411          147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL  182 (222)
Q Consensus       147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa  182 (222)
                      .....|.|+..-    -...+...+..|+.||+-|.
T Consensus        41 l~~~~I~lnp~l----l~~~~~~~l~~vv~HElcH~   72 (149)
T PRK04351         41 LKDHHIEFNPKM----LEEYGLEELIGIIKHELCHY   72 (149)
T ss_pred             cCCCeEEeCHHH----HhhccHHHHHhhHHHHHHHH
Confidence            445667775321    11112346788999999997


No 149
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=25.56  E-value=48  Score=25.59  Aligned_cols=39  Identities=13%  Similarity=0.060  Sum_probs=19.4

Q ss_pred             ceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC
Q 039411          150 GIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       150 g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      ..|.+-. ..|.... .+......|++||+=|..=.+.+.|
T Consensus        77 ~~IyLc~-~F~~~p~-~g~~Sk~~TLiHE~SHf~~~~~T~D  115 (148)
T PF14521_consen   77 YTIYLCP-AFFSAPT-TGKDSKEGTLIHEWSHFTAVGGTDD  115 (148)
T ss_dssp             TEEEE-H-HHHHS-S-SSTT-HHHHHHHHHHHSCCCT--B-
T ss_pred             eEEEECh-hhcCCCC-CCCCchHHHHHHhhhhhhhhcCCCc
Confidence            4566543 2343322 2334678999999999433444444


No 150
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=23.77  E-value=46  Score=24.68  Aligned_cols=15  Identities=27%  Similarity=0.423  Sum_probs=13.1

Q ss_pred             chhHHHHHhhhhhcC
Q 039411          170 DLETVALHEIGHLLG  184 (222)
Q Consensus       170 ~~~~va~HEiGHaLG  184 (222)
                      .+..++.||+-|++-
T Consensus        75 qvlktllhefrh~mQ   89 (126)
T PF13058_consen   75 QVLKTLLHEFRHAMQ   89 (126)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            567899999999986


No 151
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=22.71  E-value=69  Score=23.48  Aligned_cols=22  Identities=36%  Similarity=0.685  Sum_probs=18.9

Q ss_pred             chhHHHHHhhhhhcCCCCCCCC
Q 039411          170 DLETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       170 ~~~~va~HEiGHaLGL~Hs~~~  191 (222)
                      .+..+++|-+=|.||..|..+.
T Consensus        72 e~~~l~iHG~LHLlGyDH~~~~   93 (110)
T TIGR00043        72 ELAHLTVHGLLHLLGYDHETED   93 (110)
T ss_pred             HHHHHHHHHHHHHcCCCCCChH
Confidence            5677999999999999997653


No 152
>PF02130 UPF0054:  Uncharacterized protein family UPF0054;  InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=22.52  E-value=69  Score=24.65  Aligned_cols=22  Identities=36%  Similarity=0.560  Sum_probs=17.7

Q ss_pred             cchhHHHHHhhhhhcCCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      ..+...++|-+-|.||..|..+
T Consensus       106 ~el~~l~vHG~LHLlGyDH~~~  127 (145)
T PF02130_consen  106 EELARLLVHGLLHLLGYDHETE  127 (145)
T ss_dssp             HHHHHHHHHHHHHHTT-SSTTT
T ss_pred             HHHhHHHHHHHHHHcCCCCCCH
Confidence            3567799999999999999854


No 153
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=22.35  E-value=34  Score=33.94  Aligned_cols=18  Identities=6%  Similarity=0.213  Sum_probs=13.1

Q ss_pred             HHHHHHHHhcccceEEEe
Q 039411           94 GRAFQTWAAISHFKFSQT  111 (222)
Q Consensus        94 ~~A~~~W~~~~~l~F~ev  111 (222)
                      .+.|+...+.++|.|...
T Consensus       173 ~~i~~~Yd~l~Gl~~~~~  190 (775)
T PF03272_consen  173 NDIFKFYDDLTGLSDDPS  190 (775)
T ss_pred             HHHHHHHHhhhCCCCCCC
Confidence            567777778888887654


No 154
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=22.07  E-value=43  Score=31.09  Aligned_cols=14  Identities=50%  Similarity=0.726  Sum_probs=11.6

Q ss_pred             chhHHHHHhhhhhc
Q 039411          170 DLETVALHEIGHLL  183 (222)
Q Consensus       170 ~~~~va~HEiGHaL  183 (222)
                      ....++.||.||+|
T Consensus       310 ~~~~~A~hEaGhAl  323 (495)
T TIGR01241       310 EKKLVAYHEAGHAL  323 (495)
T ss_pred             HHHHHHHHHHhHHH
Confidence            34579999999996


No 155
>COG5570 Uncharacterized small protein [Function unknown]
Probab=22.04  E-value=44  Score=21.25  Aligned_cols=18  Identities=22%  Similarity=0.374  Sum_probs=15.6

Q ss_pred             ccCCCHHHHHHhcCCCCC
Q 039411           13 TGVLDGNTVAMMTRPRCG   30 (222)
Q Consensus        13 TG~ld~~T~~~m~~pRCg   30 (222)
                      -|.|+++..+.|+.|-|-
T Consensus        14 Hg~le~ei~ea~n~Ps~d   31 (57)
T COG5570          14 HGNLEREIQEAMNSPSSD   31 (57)
T ss_pred             hchHHHHHHHHhcCCCcc
Confidence            389999999999999774


No 156
>PRK00016 metal-binding heat shock protein; Provisional
Probab=22.01  E-value=63  Score=25.34  Aligned_cols=43  Identities=23%  Similarity=0.149  Sum_probs=26.6

Q ss_pred             CCceeeecCCCCcccCC---CCCccchhHHHHHhhhhhcCCCCCCC
Q 039411          148 RGGIFDYDADEQWAVGA---RPGAFDLETVALHEIGHLLGLDHSLV  190 (222)
Q Consensus       148 ~~g~i~fd~~~~w~~~~---~~~~~~~~~va~HEiGHaLGL~Hs~~  190 (222)
                      .-|+|.+..+.-.....   ..-...+..+++|-+-|.||..|..+
T Consensus        87 ~LGDI~Is~~~~~~qa~~~~~s~~~e~~~l~iHG~LHLlGYDH~~~  132 (159)
T PRK00016         87 YLGDIVICPEVAEEQAEEQGHSLERELAHLTVHGILHLLGYDHIED  132 (159)
T ss_pred             CceeEEEcHHHHHHHHHHcCCCHHHHHHHHHHHhhHHhcCCCCCCh
Confidence            35888775432111100   00123567799999999999999764


No 157
>KOG3314 consensus Ku70-binding protein [Replication, recombination and repair]
Probab=21.38  E-value=73  Score=25.42  Aligned_cols=15  Identities=27%  Similarity=0.437  Sum_probs=12.4

Q ss_pred             cchhHHHHHhhhhhc
Q 039411          169 FDLETVALHEIGHLL  183 (222)
Q Consensus       169 ~~~~~va~HEiGHaL  183 (222)
                      ..+..|++||+=|+.
T Consensus        89 ~h~n~vv~HElIH~f  103 (194)
T KOG3314|consen   89 DHVNQVVIHELIHAF  103 (194)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            356789999999985


No 158
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=21.26  E-value=43  Score=32.48  Aligned_cols=14  Identities=29%  Similarity=0.311  Sum_probs=11.0

Q ss_pred             hhHHHHHhhhhhcC
Q 039411          171 LETVALHEIGHLLG  184 (222)
Q Consensus       171 ~~~va~HEiGHaLG  184 (222)
                      +-.|+.|||+|++-
T Consensus       518 iG~vigHEl~H~FD  531 (687)
T KOG3624|consen  518 IGFVIGHELTHGFD  531 (687)
T ss_pred             HHHHHHHHHhhccc
Confidence            44699999999653


No 159
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=20.88  E-value=42  Score=27.67  Aligned_cols=19  Identities=32%  Similarity=0.427  Sum_probs=14.9

Q ss_pred             hHHHHHhhhhhcCCCCCCCC
Q 039411          172 ETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       172 ~~va~HEiGHaLGL~Hs~~~  191 (222)
                      .-++.||++| +=|+|....
T Consensus       102 ~fil~HE~~H-v~~~h~~~~  120 (206)
T PF10463_consen  102 AFILLHELAH-VVLGHEGDS  120 (206)
T ss_pred             HHHHHHHHHH-HHHcCcccc
Confidence            3488999999 558887764


No 160
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=20.41  E-value=52  Score=28.17  Aligned_cols=22  Identities=32%  Similarity=0.347  Sum_probs=16.9

Q ss_pred             cchhHHHHHhhhhhcCCCCCCCC
Q 039411          169 FDLETVALHEIGHLLGLDHSLVE  191 (222)
Q Consensus       169 ~~~~~va~HEiGHaLGL~Hs~~~  191 (222)
                      ..+..|++||+-|++ |.|..-.
T Consensus        58 ~~~~~~l~HevlH~~-~~H~~r~   79 (292)
T PF13203_consen   58 EERVGLLLHEVLHCL-LRHPWRR   79 (292)
T ss_pred             HHHHHHHHHHHHHHH-ccchhhh
Confidence            356789999999998 7776543


No 161
>PF15639 Tox-MPTase3:  Metallopeptidase toxin 3
Probab=20.23  E-value=1.8e+02  Score=22.22  Aligned_cols=36  Identities=19%  Similarity=0.395  Sum_probs=22.4

Q ss_pred             HHHHHHHHHHHhcccceEEEecC-CCCceEEEEEeec
Q 039411           91 NAVGRAFQTWAAISHFKFSQTQD-YTNANLKISFQRG  126 (222)
Q Consensus        91 ~~i~~A~~~W~~~~~l~F~ev~~-~~~adI~I~f~~~  126 (222)
                      ..+-+|+..|...+....++.=. +..+.|+|.-..+
T Consensus        23 p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~   59 (135)
T PF15639_consen   23 PAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVG   59 (135)
T ss_pred             hHHHHHHHHHcccchhhhhhccccCCCCEEEEeeccc
Confidence            46778888887765544444321 2367888876554


No 162
>PF15127 DUF4565:  Protein of unknown function (DUF4565)
Probab=20.08  E-value=89  Score=22.20  Aligned_cols=14  Identities=29%  Similarity=0.501  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHHHhc
Q 039411           90 INAVGRAFQTWAAI  103 (222)
Q Consensus        90 ~~~i~~A~~~W~~~  103 (222)
                      ++++..|++.|...
T Consensus        63 qEIl~dAlqQWA~~   76 (91)
T PF15127_consen   63 QEILSDALQQWAEN   76 (91)
T ss_pred             HHHHHHHHHHHHHh
Confidence            57899999999864


Done!