Query         043639
Match_columns 261
No_of_seqs    254 out of 1490
Neff          7.0 
Searched_HMMs 46136
Date          Fri Mar 29 06:52:54 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043639.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043639hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 3.2E-48 6.9E-53  370.5  14.3  187   12-230    79-268 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen 100.0 3.5E-38 7.7E-43  262.1  15.8  155   60-223     1-157 (157)
  3 PF00413 Peptidase_M10:  Matrix 100.0 8.7E-37 1.9E-41  251.3  15.7  151   60-223     1-154 (154)
  4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 2.9E-29 6.2E-34  208.3  14.0  150   67-223     2-156 (156)
  5 smart00235 ZnMc Zinc-dependent  99.9   1E-24 2.2E-29  177.6  10.1  136   58-224     2-140 (140)
  6 cd04277 ZnMc_serralysin_like Z  99.9 4.8E-24   1E-28  181.7  13.5  124   85-223    33-186 (186)
  7 cd04268 ZnMc_MMP_like Zinc-dep  99.9 2.2E-24 4.8E-29  179.4   9.1  142   67-222     2-165 (165)
  8 cd04327 ZnMc_MMP_like_3 Zinc-d  99.7 1.5E-17 3.3E-22  143.4  12.0  109   60-191     2-110 (198)
  9 cd00203 ZnMc Zinc-dependent me  99.7 3.1E-17 6.8E-22  136.3  11.6  120   84-222    20-167 (167)
 10 cd04276 ZnMc_MMP_like_2 Zinc-d  99.6   7E-15 1.5E-19  126.8  11.9  146   68-224     9-197 (197)
 11 cd04280 ZnMc_astacin_like Zinc  99.6 2.4E-14 5.2E-19  121.9  12.1   91   68-192     3-93  (180)
 12 PF01400 Astacin:  Astacin (Pep  99.5 2.8E-14 6.2E-19  122.5   9.6   96   60-191     2-97  (191)
 13 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.5 1.1E-13 2.3E-18  119.7  13.1   99   58-191     7-105 (200)
 14 PF12388 Peptidase_M57:  Dual-a  99.5 1.3E-13 2.9E-18  119.1  12.4  136   66-222    38-211 (211)
 15 cd04282 ZnMc_meprin Zinc-depen  99.4 1.4E-12   3E-17  114.9  12.8  103   52-192    37-139 (230)
 16 cd04283 ZnMc_hatching_enzyme Z  99.4 3.3E-12 7.1E-17  108.9  11.8   92   67-191     4-95  (182)
 17 KOG3714 Meprin A metalloprotea  99.4 5.8E-12 1.3E-16  119.9  11.7  107   50-191    71-177 (411)
 18 PF02031 Peptidase_M7:  Strepto  99.2 1.1E-10 2.5E-15   92.7   9.3  125   67-223     2-130 (132)
 19 COG5549 Predicted Zn-dependent  98.9 3.8E-09 8.2E-14   90.5   8.8  148   57-225    77-235 (236)
 20 PF11350 DUF3152:  Protein of u  98.3 4.4E-06 9.5E-11   71.8  10.0  142   67-218    29-183 (203)
 21 cd04267 ZnMc_ADAM_like Zinc-de  98.0 5.5E-05 1.2E-09   64.4   9.9  106   91-220    74-189 (192)
 22 cd04273 ZnMc_ADAMTS_like Zinc-  98.0 4.2E-05   9E-10   66.3   8.8  120   91-224    72-200 (207)
 23 PF13583 Reprolysin_4:  Metallo  97.9 1.1E-05 2.4E-10   70.1   4.2   48  173-222   137-198 (206)
 24 PF05572 Peptidase_M43:  Pregna  97.7 2.6E-05 5.7E-10   64.9   2.9   25  170-194    66-90  (154)
 25 PF13574 Reprolysin_2:  Metallo  97.5 2.6E-05 5.6E-10   65.9   0.8   48  173-222   111-171 (173)
 26 cd04275 ZnMc_pappalysin_like Z  97.4 5.4E-05 1.2E-09   66.8   1.6   26  170-195   134-159 (225)
 27 COG1913 Predicted Zn-dependent  97.4 7.2E-05 1.6E-09   62.9   1.6   27  174-200   125-151 (181)
 28 PF13688 Reprolysin_5:  Metallo  97.3   7E-05 1.5E-09   63.8   0.9   22  172-193   141-162 (196)
 29 PF13582 Reprolysin_3:  Metallo  97.2 9.8E-05 2.1E-09   58.2   0.7   17  174-190   108-124 (124)
 30 cd04271 ZnMc_ADAM_fungal Zinc-  97.2  0.0004 8.7E-09   61.3   4.1   49  175-225   147-220 (228)
 31 PRK13267 archaemetzincin-like   97.1 0.00021 4.6E-09   60.9   1.9   26  174-199   126-151 (179)
 32 cd04269 ZnMc_adamalysin_II_lik  97.1 0.00063 1.4E-08   58.0   4.2   50  173-224   131-186 (194)
 33 PF07998 Peptidase_M54:  Peptid  96.9 0.00039 8.5E-09   59.9   1.5   27  173-200   145-171 (194)
 34 PF01421 Reprolysin:  Reprolysi  96.5  0.0039 8.4E-08   53.4   4.5   52  171-223   129-188 (199)
 35 cd04270 ZnMc_TACE_like Zinc-de  96.3  0.0045 9.9E-08   55.1   4.3   50  174-223   168-230 (244)
 36 cd04272 ZnMc_salivary_gland_MP  96.0   0.007 1.5E-07   52.8   3.7   50  173-223   145-212 (220)
 37 KOG4525 Jacalin-like lectin do  95.5   0.013 2.8E-07   55.9   3.5   58  174-233   302-381 (614)
 38 PF11150 DUF2927:  Protein of u  95.0    0.69 1.5E-05   40.5  12.8  150   59-225    28-196 (213)
 39 PF04298 Zn_peptidase_2:  Putat  94.6   0.027 5.9E-07   49.5   2.9   81   88-192    18-106 (222)
 40 PF10462 Peptidase_M66:  Peptid  94.6   0.015 3.2E-07   53.6   1.2   20  171-190   191-210 (305)
 41 PF05548 Peptidase_M11:  Gameto  94.5    0.02 4.3E-07   53.0   1.8   19  173-191   150-168 (314)
 42 PF12044 Metallopep:  Putative   93.9   0.029 6.2E-07   53.9   1.6   23  174-198   317-339 (423)
 43 TIGR03296 M6dom_TIGR03296 M6 f  93.5   0.023   5E-07   51.8   0.2   19  174-192   166-184 (286)
 44 COG2738 Predicted Zn-dependent  93.3   0.063 1.4E-06   46.2   2.5   78   87-186    20-105 (226)
 45 PTZ00337 surface protease GP63  92.6     0.2 4.3E-06   50.0   5.4   36  145-190   211-246 (567)
 46 PTZ00257 Glycoprotein GP63 (le  92.5     0.3 6.5E-06   49.0   6.4   96  114-225   205-313 (622)
 47 COG3824 Predicted Zn-dependent  91.4   0.077 1.7E-06   42.2   0.8   20  170-189   106-125 (136)
 48 PF01457 Peptidase_M8:  Leishma  91.4    0.11 2.4E-06   51.3   2.0   97  114-226   159-268 (521)
 49 PF05547 Peptidase_M6:  Immune   88.8    0.11 2.3E-06   52.6  -0.5   51  173-223   221-286 (645)
 50 KOG3607 Meltrins, fertilins an  88.0    0.46   1E-05   48.7   3.4   46  173-219   323-375 (716)
 51 PF09471 Peptidase_M64:  IgA Pe  86.6    0.27 5.8E-06   44.5   0.7   23  172-194   215-238 (264)
 52 PF06262 DUF1025:  Possibl zinc  85.8    0.31 6.8E-06   37.4   0.7   16  173-188    73-88  (97)
 53 KOG3658 Tumor necrosis factor-  84.1    0.76 1.6E-05   46.3   2.6   48  174-221   393-452 (764)
 54 COG4783 Putative Zn-dependent   82.8    0.45 9.8E-06   46.2   0.5   20  170-190   127-146 (484)
 55 PF01863 DUF45:  Protein of unk  78.7       7 0.00015   33.1   6.4   73   88-192   111-183 (205)
 56 PF06114 DUF955:  Domain of unk  78.1    0.81 1.8E-05   34.6   0.4   14  172-185    41-54  (122)
 57 PF13398 Peptidase_M50B:  Pepti  77.6    0.83 1.8E-05   39.4   0.4   14  173-186    22-35  (200)
 58 PF04228 Zn_peptidase:  Putativ  77.1     1.3 2.7E-05   40.7   1.4   40  143-184   141-181 (292)
 59 COG2856 Predicted Zn peptidase  77.1     0.9   2E-05   39.8   0.4   18  173-192    72-89  (213)
 60 PF12725 DUF3810:  Protein of u  76.7     1.4   3E-05   40.9   1.6   23  170-192   193-215 (318)
 61 COG5549 Predicted Zn-dependent  75.0     1.6 3.4E-05   38.2   1.4   29  174-202   154-182 (236)
 62 PF14247 DUF4344:  Domain of un  74.8       1 2.2E-05   39.6   0.2   14  172-185    91-104 (220)
 63 KOG3538 Disintegrin metallopro  74.2     2.6 5.7E-05   44.1   3.1   48  171-218   315-368 (845)
 64 cd06456 M3A_DCP_Oligopeptidase  74.1      15 0.00033   35.3   8.0   24   88-111   107-130 (422)
 65 cd06457 M3A_MIP Peptidase M3 m  71.5      18 0.00039   35.1   7.9   25   87-111   132-156 (458)
 66 PF01435 Peptidase_M48:  Peptid  71.0     1.6 3.4E-05   37.3   0.5   21  170-191    86-106 (226)
 67 PF13699 DUF4157:  Domain of un  70.9     3.4 7.4E-05   30.3   2.2   30  147-185    44-73  (79)
 68 PF02163 Peptidase_M50:  Peptid  70.8     1.5 3.3E-05   36.9   0.3   20  174-193     8-27  (192)
 69 cd06163 S2P-M50_PDZ_RseP-like   70.3     1.7 3.7E-05   37.1   0.5   12  174-185    10-21  (182)
 70 cd05709 S2P-M50 Site-2 proteas  69.9     1.7 3.6E-05   36.4   0.4   16  174-189     9-24  (180)
 71 PRK02870 heat shock protein Ht  67.8       3 6.6E-05   39.0   1.6   15  170-184   170-184 (336)
 72 PRK03982 heat shock protein Ht  67.8     1.9 4.2E-05   39.2   0.3   16  170-185   122-137 (288)
 73 COG0501 HtpX Zn-dependent prot  67.0     2.2 4.7E-05   38.3   0.5   16  170-185   154-169 (302)
 74 cd06161 S2P-M50_SpoIVFB SpoIVF  66.2     2.2 4.8E-05   36.9   0.4   16  173-188    38-53  (208)
 75 PRK03001 M48 family peptidase;  65.8     2.2 4.8E-05   38.7   0.3   16  170-185   121-136 (283)
 76 PRK01345 heat shock protein Ht  64.9     2.4 5.1E-05   39.3   0.3   15  171-185   122-136 (317)
 77 PRK03072 heat shock protein Ht  64.9     2.4 5.1E-05   38.7   0.3   16  170-185   124-139 (288)
 78 cd06459 M3B_Oligoendopeptidase  63.8     2.6 5.7E-05   39.8   0.4   19   81-99    153-171 (427)
 79 cd06258 Peptidase_M3_like The   62.6     2.8 6.1E-05   39.0   0.4   14  172-185   153-166 (365)
 80 PF01432 Peptidase_M3:  Peptida  62.2     2.9 6.3E-05   40.2   0.4   25   87-111   136-160 (458)
 81 cd06164 S2P-M50_SpoIVFB_CBS Sp  61.6       3 6.6E-05   36.7   0.4   14  174-187    54-67  (227)
 82 PRK04897 heat shock protein Ht  61.5       3 6.5E-05   38.2   0.3   16  170-185   134-149 (298)
 83 COG4227 Antirestriction protei  60.4     4.9 0.00011   36.4   1.4   22  170-191   200-221 (316)
 84 PRK05457 heat shock protein Ht  60.4     3.2 6.9E-05   37.8   0.3   15  170-184   131-145 (284)
 85 TIGR02289 M3_not_pepF oligoend  60.1     3.3 7.2E-05   41.1   0.4   13  173-185   337-349 (549)
 86 cd06455 M3A_TOP Peptidase M3 T  59.9     3.4 7.3E-05   40.2   0.4   12  174-185   264-275 (472)
 87 PRK02391 heat shock protein Ht  59.5     3.4 7.3E-05   37.9   0.3   15  170-184   130-144 (296)
 88 PRK01265 heat shock protein Ht  59.1     3.5 7.5E-05   38.4   0.3   15  170-184   137-151 (324)
 89 cd06162 S2P-M50_PDZ_SREBP Ster  58.7     3.6 7.9E-05   37.5   0.4   13  174-186   136-148 (277)
 90 cd06159 S2P-M50_PDZ_Arch Uncha  58.4     3.9 8.4E-05   37.0   0.5   17  174-190   119-135 (263)
 91 TIGR00181 pepF oligoendopeptid  58.1     3.7 8.1E-05   41.0   0.4   13  173-185   378-390 (591)
 92 TIGR02290 M3_fam_3 oligoendope  58.1     3.8 8.2E-05   41.0   0.4   12  174-185   376-387 (587)
 93 PF13485 Peptidase_MA_2:  Pepti  57.3       4 8.7E-05   31.0   0.4   14  172-185    24-37  (128)
 94 PF01447 Peptidase_M4:  Thermol  56.8     4.2 9.1E-05   33.6   0.4   12  174-185   136-147 (150)
 95 cd06160 S2P-M50_like_2 Unchara  53.7     4.9 0.00011   34.3   0.4   20  174-193    42-61  (183)
 96 KOG2921 Intramembrane metallop  53.5       5 0.00011   38.3   0.4   17  174-190   132-148 (484)
 97 PF08434 CLCA_N:  Calcium-activ  52.1      12 0.00025   33.9   2.5   15  174-188   150-165 (262)
 98 COG1164 Oligoendopeptidase F [  47.9     6.8 0.00015   39.5   0.4   12  174-185   381-392 (598)
 99 PF03272 Enhancin:  Viral enhan  47.9      27 0.00059   36.4   4.7   49   59-110   128-191 (775)
100 PLN02791 Nudix hydrolase homol  47.5     7.6 0.00017   40.3   0.7   20  173-192   585-604 (770)
101 cd06158 S2P-M50_like_1 Unchara  47.3     7.2 0.00016   33.1   0.4   12  174-185    10-21  (181)
102 cd06460 M32_Taq Peptidase fami  46.9     7.4 0.00016   37.3   0.4   12  174-185   160-171 (396)
103 TIGR00054 RIP metalloprotease   46.7     7.5 0.00016   37.3   0.4   12  174-185    15-26  (420)
104 PRK10779 zinc metallopeptidase  45.6     7.9 0.00017   37.5   0.4   11  174-184    16-26  (449)
105 COG0339 Dcp Zn-dependent oligo  44.3      68  0.0015   32.9   6.7   33   88-120   364-398 (683)
106 KOG2719 Metalloprotease [Gener  43.1     9.1  0.0002   36.9   0.4   15  170-184   277-291 (428)
107 PF01434 Peptidase_M41:  Peptid  43.1     8.4 0.00018   33.4   0.1   16  170-185    25-40  (213)
108 PRK10911 oligopeptidase A; Pro  41.8     9.7 0.00021   39.0   0.4   22   89-110   361-382 (680)
109 KOG2089 Metalloendopeptidase f  39.8      79  0.0017   32.4   6.3   32   89-120   392-425 (718)
110 COG3590 PepO Predicted metallo  37.4      20 0.00044   35.9   1.8   12  174-185   488-499 (654)
111 cd02641 R3H_Smubp-2_like R3H d  36.2      15 0.00034   25.4   0.6   20  173-192    29-48  (60)
112 PF03571 Peptidase_M49:  Peptid  35.8      14  0.0003   36.9   0.4   14  174-187   294-308 (549)
113 PF06167 Peptidase_M90:  Glucos  34.6      17 0.00037   32.7   0.7   20  170-189   151-170 (253)
114 KOG2661 Peptidase family M48 [  31.4      18  0.0004   34.0   0.4   20  172-192   274-293 (424)
115 PRK10280 dipeptidyl carboxypep  30.8      19 0.00041   37.0   0.4   34   87-120   362-398 (681)
116 COG2321 Predicted metalloprote  29.1      22 0.00047   32.5   0.5   41  143-185   138-179 (295)
117 cd02639 R3H_RRM R3H domain of   28.6      26 0.00055   24.4   0.7   21  174-194    30-50  (60)
118 PHA02150 hypothetical protein   27.8      34 0.00074   24.3   1.2   33   70-102     2-34  (77)
119 PF14891 Peptidase_M91:  Effect  25.7      29 0.00063   29.0   0.6   15  173-187   103-117 (174)
120 PF02102 Peptidase_M35:  Deuter  25.6      30 0.00066   32.7   0.8   50  136-187   262-311 (359)
121 PF12315 DUF3633:  Protein of u  25.2      30 0.00065   30.3   0.6   12  174-185    94-105 (212)
122 COG3091 SprT Zn-dependent meta  24.4      17 0.00037   30.2  -1.0   39  138-185    35-73  (156)
123 PF01431 Peptidase_M13:  Peptid  24.0      32  0.0007   29.1   0.6   15  174-188    37-51  (206)
124 PF02128 Peptidase_M36:  Fungal  23.4      47   0.001   31.7   1.6   15  174-188   186-200 (378)
125 PRK10733 hflB ATP-dependent me  23.4      29 0.00064   35.3   0.3   13  173-185   408-420 (644)
126 cd06461 M2_ACE Peptidase famil  23.0      34 0.00074   33.6   0.6   14  174-187   248-261 (477)
127 PRK15410 DgsA anti-repressor M  22.9      36 0.00077   30.8   0.7   20  172-191   141-160 (260)
128 PF13402 M60-like:  Peptidase M  22.4      33 0.00071   31.0   0.3   11  175-185   221-231 (307)
129 COG2317 Zn-dependent carboxype  22.0      34 0.00073   33.6   0.3   11  175-185   261-271 (497)
130 CHL00176 ftsH cell division pr  21.9      34 0.00074   34.9   0.4   14  172-185   437-450 (638)
131 COG4784 Putative Zn-dependent   21.3      33 0.00072   32.5   0.1   16  170-185   121-136 (479)
132 smart00731 SprT SprT homologue  21.2      34 0.00075   27.7   0.2   15  171-185    57-71  (146)
133 cd02640 R3H_NRF R3H domain of   20.8      48   0.001   23.0   0.8   19  173-191    29-47  (60)
134 PF02074 Peptidase_M32:  Carbox  20.1      38 0.00083   33.4   0.3   16  174-189   260-278 (494)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=3.2e-48  Score=370.50  Aligned_cols=187  Identities=43%  Similarity=0.787  Sum_probs=172.2

Q ss_pred             HHHHHHhcCCCCCCCCCCCcccccCCCCCChhhhhccccceeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHH
Q 043639           12 IFIIFIHESPQCIAVRSTPKFKLKQSDKYDPAEATRHKKDYVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVF   91 (261)
Q Consensus        12 ~~~~~~m~~PRCGvpD~~~~~~~~~~~~~~~~~~~~~~kry~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i   91 (261)
                      -.|+..|++|||||||.                   |   |++++++.||.+.   +|||+|.  |++++++..+++.++
T Consensus        79 ~~Tl~~m~~prCgvpd~-------------------~---~~~~~~~~kW~k~---~lT~ri~--n~~~~l~~~~vr~~~  131 (469)
T KOG1565|consen   79 NATLALMNKPRCGVPDG-------------------R---YRYFPGKPKWNKE---HLTYRIK--NYTPYLPQAEVRCAK  131 (469)
T ss_pred             hhhhhhccCCCcCCCCC-------------------c---cccCcccCccccc---ccceecc--ccCCCCCHHHHHHHH
Confidence            46889999999999991                   2   6778899999999   9999999  678899999999999


Q ss_pred             HHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC---CceeeecccccccccCCCCCCC
Q 043639           92 KRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE---SGKFHLDAAETWAVDFGSEKSP  168 (261)
Q Consensus        92 ~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~---~G~vhfd~~e~wt~~~~~~~~~  168 (261)
                      ++||+.|+++++|+|+|+.....|||+|.|..+.|||++||||++|+|||||+|+   .|++|||++|.|+++.     .
T Consensus       132 ~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~~-----~  206 (469)
T KOG1565|consen  132 SEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLHFDKDETWTYGD-----S  206 (469)
T ss_pred             HHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCCCCccccCcccceeccC-----C
Confidence            9999999999999999999778999999999999999999999999999999995   6899999999999862     2


Q ss_pred             ccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCCCCCCCC
Q 043639          169 VAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQGVQALYGSNPNFTI  230 (261)
Q Consensus       169 ~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~~~~~~~  230 (261)
                      .+++|+.||+|||||+|||.||.++++||||+|++...++.|++|||.+||.|||.+++...
T Consensus       207 ~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~~~~~~~  268 (469)
T KOG1565|consen  207 NGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGGPPRDYF  268 (469)
T ss_pred             ccchhHHHhhhhcccccccCCCCCcccccccccccCCCCcccChhhhhhhHHHhCCCccccc
Confidence            48999999999999999999999999999999997667889999999999999999887543


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00  E-value=3.5e-38  Score=262.12  Aligned_cols=155  Identities=44%  Similarity=0.761  Sum_probs=138.0

Q ss_pred             cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639           60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL  139 (261)
Q Consensus        60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l  139 (261)
                      ||+++   +|||+|..  ..+.++...++++|++||+.|+++++|+|+|+.+...|||+|.|....++++.+|+|.+|++
T Consensus         1 kW~~~---~itY~i~~--~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l   75 (157)
T cd04278           1 KWSKT---NLTYRILN--YPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTL   75 (157)
T ss_pred             CCCCC---ceeEEEEC--CCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccc
Confidence            79999   99999974  34446678999999999999999999999999875589999999998888889999999999


Q ss_pred             eeecCC--CCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHHH
Q 043639          140 AHSFSP--ESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQG  217 (261)
Q Consensus       140 AhA~~P--~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~~  217 (261)
                      +||++|  ..|+++||.++.|.....    ..+.+++.|++|||||+|||.|+.++++||||+|.+......|+++||.+
T Consensus        76 ~~a~~p~~~~g~i~~~~~~~~~~~~~----~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~  151 (157)
T cd04278          76 AHAFFPGGIGGDIHFDDDEQWTLGSD----SGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRG  151 (157)
T ss_pred             ccccCCCCcceeEEECCCcccccCCC----CccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHH
Confidence            999999  679999999999987541    23678999999999999999999999999999998643467899999999


Q ss_pred             HHHHhC
Q 043639          218 VQALYG  223 (261)
Q Consensus       218 iq~LYG  223 (261)
                      ||+|||
T Consensus       152 ~~~lYg  157 (157)
T cd04278         152 IQALYG  157 (157)
T ss_pred             HHHhcC
Confidence            999998


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00  E-value=8.7e-37  Score=251.27  Aligned_cols=151  Identities=44%  Similarity=0.810  Sum_probs=134.9

Q ss_pred             cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639           60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL  139 (261)
Q Consensus        60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l  139 (261)
                      ||+++   +|||+|...  .+.++.+++++++++||+.|+++++|+|+|+.+ ..+||+|.|....+++...|++.++++
T Consensus         1 ~W~~~---~ity~i~~~--~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~   74 (154)
T PF00413_consen    1 KWPKK---TITYSISNT--TPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTL   74 (154)
T ss_dssp             SSSSS---EEEEEESSB--CTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSES
T ss_pred             CCCCC---cEEEEEECC--CCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeee
Confidence            79999   999999854  466778899999999999999999999999985 579999999998888888999999999


Q ss_pred             eeecCCC---CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCCCCCCHHHHH
Q 043639          140 AHSFSPE---SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEAVMYPTLKPRDKKVDLALDDIQ  216 (261)
Q Consensus       140 AhA~~P~---~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~svM~p~y~~~~~~~~l~~dDI~  216 (261)
                      ++++.|.   .|+++|+.++.|....      ...+++.|++|||||+|||.|++++++||||.+.+.. ...|+.+||+
T Consensus        75 ~~~~~~~~~~~~~i~~~~~~~~~~~~------~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~-~~~l~~~Di~  147 (154)
T PF00413_consen   75 AHAYFPNNIVSGDIHFNDDESWTIDD------SGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPD-NKTLSEDDID  147 (154)
T ss_dssp             EEEEESSSTTTTEEEEETTSHEESSS------SSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSS-STSTTHHHHH
T ss_pred             eccccccccccccccccccccchhhh------hhhhhhhhhhhccccccCcCcCCCcccceeeecccCC-CCCCCHHHHH
Confidence            9999997   6999999999998652      3678899999999999999999999999999998654 2489999999


Q ss_pred             HHHHHhC
Q 043639          217 GVQALYG  223 (261)
Q Consensus       217 ~iq~LYG  223 (261)
                      +||+|||
T Consensus       148 ~i~~lYg  154 (154)
T PF00413_consen  148 GIQYLYG  154 (154)
T ss_dssp             HHHHHHS
T ss_pred             HHHHHhC
Confidence            9999998


No 4  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.96  E-value=2.9e-29  Score=208.26  Aligned_cols=150  Identities=25%  Similarity=0.307  Sum_probs=114.8

Q ss_pred             eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCC-CCcceEEEeeecCCCCCCCccCCCCceeeecCC
Q 043639           67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDY-DFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSP  145 (261)
Q Consensus        67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P  145 (261)
                      ..|+|++...+..++-....++++|++||+.|+++++|+|+++... ..+||+|.|....     +|+|.|+++|||++|
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p   76 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP   76 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence            3799999865332222356799999999999999999999998753 3899999998632     788999999999999


Q ss_pred             CCceeeecccccccccCCCCC--CCccccchhhhhhhhhhhcCCCCCCCCC-CccccccCCCC-CCCCCCHHHHHHHHHH
Q 043639          146 ESGKFHLDAAETWAVDFGSEK--SPVAVDLESVAVHEIGHLLGLTHSPVKE-AVMYPTLKPRD-KKVDLALDDIQGVQAL  221 (261)
Q Consensus       146 ~~G~vhfd~~e~wt~~~~~~~--~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-svM~p~y~~~~-~~~~l~~dDI~~iq~L  221 (261)
                      ..|.  ++....|........  ...+.+++.|++|||||+|||.|++++. ++|||++.... .+..|+++|+++||.|
T Consensus        77 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~l  154 (156)
T cd04279          77 LISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRL  154 (156)
T ss_pred             ccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence            7543  333333322211000  0125678999999999999999999998 99999988642 3468999999999999


Q ss_pred             hC
Q 043639          222 YG  223 (261)
Q Consensus       222 YG  223 (261)
                      ||
T Consensus       155 Y~  156 (156)
T cd04279         155 YG  156 (156)
T ss_pred             hC
Confidence            97


No 5  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.92  E-value=1e-24  Score=177.55  Aligned_cols=136  Identities=32%  Similarity=0.548  Sum_probs=109.8

Q ss_pred             CCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCC
Q 043639           58 KPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLG  137 (261)
Q Consensus        58 ~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg  137 (261)
                      ..+|++.   +|+|+|..    +.++..+++++|++||+.|+++++|+|+|+.+  .+||.|.|...+++        |.
T Consensus         2 ~~~W~~~---~v~Y~i~~----~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~   64 (140)
T smart00235        2 SKKWPKG---TVPYVIDS----SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GC   64 (140)
T ss_pred             CCcCCCC---cEEEEEcC----CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Cc
Confidence            4689998   99999962    23445556999999999999999999999874  79999999986541        11


Q ss_pred             ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCCCC---ccccccCCCCCCCCCCHHH
Q 043639          138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVKEA---VMYPTLKPRDKKVDLALDD  214 (261)
Q Consensus       138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---vM~p~y~~~~~~~~l~~dD  214 (261)
                      .+++++.| .|+.+++. +.|..            ...|++|||||+|||.|++++.+   +|++.|......+.++++|
T Consensus        65 ~~a~~g~~-~g~~~~~~-~~~~~------------~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~  130 (140)
T smart00235       65 TLSHAGRP-GGDQHFSL-GNGCI------------NTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDD  130 (140)
T ss_pred             ceeeeecC-CCceEEEc-cCCcC------------CcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhcccccccc
Confidence            26888888 78888876 55542            13599999999999999998887   9999987654567888889


Q ss_pred             HHHHHHHhCC
Q 043639          215 IQGVQALYGS  224 (261)
Q Consensus       215 I~~iq~LYG~  224 (261)
                      ..+++++||+
T Consensus       131 ~~~~~~~yg~  140 (140)
T smart00235      131 SLGIPYDYGS  140 (140)
T ss_pred             CCCchhccCc
Confidence            9999999984


No 6  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.91  E-value=4.8e-24  Score=181.72  Aligned_cols=124  Identities=27%  Similarity=0.331  Sum_probs=97.0

Q ss_pred             HHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCC-------ceeeeccccc
Q 043639           85 TDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES-------GKFHLDAAET  157 (261)
Q Consensus        85 ~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-------G~vhfd~~e~  157 (261)
                      ..+++++++||+.|+++++|+|+|+.+...+||+|.+.....         +++.+.|++|..       |+|+++.+..
T Consensus        33 ~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~~  103 (186)
T cd04277          33 AAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSYD  103 (186)
T ss_pred             HHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCcc
Confidence            578999999999999999999999988668999998865321         356778888853       6899998877


Q ss_pred             ccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCC----------------CCccccccCCC-------CCCCCCCHHH
Q 043639          158 WAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVK----------------EAVMYPTLKPR-------DKKVDLALDD  214 (261)
Q Consensus       158 wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~svM~p~y~~~-------~~~~~l~~dD  214 (261)
                      +...      ..+...+.|++|||||+|||.|+++.                .+||.......       .....++..|
T Consensus       104 ~~~~------~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~D  177 (186)
T cd04277         104 TNSD------SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLD  177 (186)
T ss_pred             cccC------CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHH
Confidence            6532      12556789999999999999999863                35885432221       2346799999


Q ss_pred             HHHHHHHhC
Q 043639          215 IQGVQALYG  223 (261)
Q Consensus       215 I~~iq~LYG  223 (261)
                      |++||+|||
T Consensus       178 I~AlQ~lYG  186 (186)
T cd04277         178 IAALQYLYG  186 (186)
T ss_pred             HHHHHHhhC
Confidence            999999998


No 7  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.91  E-value=2.2e-24  Score=179.41  Aligned_cols=142  Identities=23%  Similarity=0.338  Sum_probs=103.5

Q ss_pred             eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC
Q 043639           67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE  146 (261)
Q Consensus        67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~  146 (261)
                      +.|+|+|... ++     ..+|+++++||+.|+++++|+|+|+.....+||+|.+.+..+..    +|..+...+...|.
T Consensus         2 ~~I~y~i~~~-~~-----~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~   71 (165)
T cd04268           2 KPITYYIDDS-VP-----DKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPL   71 (165)
T ss_pred             CCEEEEEcCC-CC-----HHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCC
Confidence            3899999742 22     57899999999999999999999998755899999998643111    11111222222466


Q ss_pred             CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC----------------CCCccccccCCC------
Q 043639          147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV----------------KEAVMYPTLKPR------  204 (261)
Q Consensus       147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~svM~p~y~~~------  204 (261)
                      .|+|+++....|....    ...+..++.|++|||||+|||.|+++                ..|||.+.....      
T Consensus        72 ~g~i~~~~~~~~~~~~----~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~  147 (165)
T cd04268          72 TGEILLARVYLYSSFV----EYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGD  147 (165)
T ss_pred             CccEEeeEEEEchhHH----HHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCcccccccccc
Confidence            7899998766543110    01245678999999999999999998                899998654421      


Q ss_pred             CCCCCCCHHHHHHHHHHh
Q 043639          205 DKKVDLALDDIQGVQALY  222 (261)
Q Consensus       205 ~~~~~l~~dDI~~iq~LY  222 (261)
                      .....|+.+||++||.||
T Consensus       148 ~~~~~~~~~Di~ai~~lY  165 (165)
T cd04268         148 GQKYTIGPYDIAAIKKLY  165 (165)
T ss_pred             ccCCCCCHHHHHHHHhcC
Confidence            134689999999999999


No 8  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74  E-value=1.5e-17  Score=143.36  Aligned_cols=109  Identities=28%  Similarity=0.440  Sum_probs=75.6

Q ss_pred             cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639           60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL  139 (261)
Q Consensus        60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l  139 (261)
                      +|+.+.  .|+|.|... .     .+.+++.|++||+.|+++++|+|+|++. ..+||+|.|..+. | ++++.|..+..
T Consensus         2 ~W~~~~--~~~~~f~~~-~-----~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-G-c~S~vG~~~~~   70 (198)
T cd04327           2 LWRNGT--VLRIAFLGG-P-----DAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-G-YWSYVGTDALL   70 (198)
T ss_pred             CCCCCC--eEEEEeCCC-C-----cHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-C-CCCCcCCcccc
Confidence            688873  688888742 1     3678999999999999999999999976 5799999998743 3 56666654321


Q ss_pred             eeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639          140 AHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       140 AhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                         ..+..-.+.+.    |....     ..+.+...|++|||||||||.|++
T Consensus        71 ---~~~~~~t~~l~----~~~~~-----~~~~~~~~~i~HElgHaLG~~HEh  110 (198)
T cd04327          71 ---IGADAPTMNLG----WFTDD-----TPDPEFSRVVLHEFGHALGFIHEH  110 (198)
T ss_pred             ---cCCCCceeeee----eecCC-----CchhhHHHHHHHHHHHHhcCcccc
Confidence               11112234342    21110     113456789999999999999974


No 9  
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.73  E-value=3.1e-17  Score=136.29  Aligned_cols=120  Identities=23%  Similarity=0.269  Sum_probs=89.9

Q ss_pred             HHHHHHHHHHHHHHhHhcCCceEEEcCCC-CCcceEEEeeecCCCCCCCccCCCCceeeecCCCC-----ceeeeccccc
Q 043639           84 ITDIKAVFKRAFSKWASVIPVSFIETDDY-DFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES-----GKFHLDAAET  157 (261)
Q Consensus        84 ~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-----G~vhfd~~e~  157 (261)
                      ...++++++.|++.|+++++++|+++... ..+||.+.+...+.        .++.+++||.|..     |.+++.....
T Consensus        20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~~~~c~~~~~~~~~~~~~~   91 (167)
T cd00203          20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYLGRVCDSLRGVGVLQDNQS   91 (167)
T ss_pred             HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEecCCccCCCCCcEEEecCCc
Confidence            46799999999999999999999999863 38999998875432        2245567776653     4555433222


Q ss_pred             ccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC--------------------CCCccccccCCC--CCCCCCCHHHH
Q 043639          158 WAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV--------------------KEAVMYPTLKPR--DKKVDLALDDI  215 (261)
Q Consensus       158 wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~svM~p~y~~~--~~~~~l~~dDI  215 (261)
                      +           +.....|++|||||+|||.|.++                    ..+||+|.....  .....++..||
T Consensus        92 ~-----------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~  160 (167)
T cd00203          92 G-----------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDI  160 (167)
T ss_pred             c-----------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHH
Confidence            1           33567999999999999999876                    358998765531  24568999999


Q ss_pred             HHHHHHh
Q 043639          216 QGVQALY  222 (261)
Q Consensus       216 ~~iq~LY  222 (261)
                      +.||++|
T Consensus       161 ~~i~~~Y  167 (167)
T cd00203         161 DQINKLY  167 (167)
T ss_pred             HHHHhhC
Confidence            9999998


No 10 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.61  E-value=7e-15  Score=126.76  Aligned_cols=146  Identities=18%  Similarity=0.236  Sum_probs=88.5

Q ss_pred             eEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhc------CCceEEEcCCC--CCcceEEEeeecCCCCCCCccCCCCce
Q 043639           68 TLSYAFSKENMINYLHITDIKAVFKRAFSKWASV------IPVSFIETDDY--DFADIRIGFYNGDHGDGEPFDGVLGVL  139 (261)
Q Consensus        68 ~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~v------t~l~F~ev~~~--~~aDI~I~f~~~~hgd~~~fdG~gg~l  139 (261)
                      .|||+|..  +++    .+++++|++||+.|+++      .++.+.++...  ...||++...+-.+.+    ++..+..
T Consensus         9 pItyyI~~--~~p----~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~~~g   78 (197)
T cd04276           9 PIVYYLDN--TFP----EKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGWAYG   78 (197)
T ss_pred             CEEEEecC--CCc----HHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcceec
Confidence            89999984  333    78999999999999997      46666665421  2468985444333322    1222333


Q ss_pred             eeecCCCCc-----eeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC---------------------C
Q 043639          140 AHSFSPESG-----KFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV---------------------K  193 (261)
Q Consensus       140 AhA~~P~~G-----~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~---------------------~  193 (261)
                      .+...|..|     +|.++........... ....+..+..+++||+||+|||.|...                     .
T Consensus        79 ps~~dPrTGeIl~a~V~l~~~~~~~~~~~~-~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~  157 (197)
T cd04276          79 PSVVDPRTGEILKADVILYSGFLRQDQLWY-EDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGAT  157 (197)
T ss_pred             ccccCCCCCCeEEEEEEeCchhhccchhHH-HHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCc
Confidence            444456554     5566553321110000 000123467899999999999999752                     2


Q ss_pred             CCcc-ccccCC--C------CCCCCCCHHHHHHHHHHhCC
Q 043639          194 EAVM-YPTLKP--R------DKKVDLALDDIQGVQALYGS  224 (261)
Q Consensus       194 ~svM-~p~y~~--~------~~~~~l~~dDI~~iq~LYG~  224 (261)
                      .||| |+.+..  .      .....+...||.+||.+||.
T Consensus       158 ~SVMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~  197 (197)
T cd04276         158 SSVMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP  197 (197)
T ss_pred             ceeecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence            4677 322111  0      12346899999999999984


No 11 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.57  E-value=2.4e-14  Score=121.87  Aligned_cols=91  Identities=21%  Similarity=0.274  Sum_probs=62.9

Q ss_pred             eEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCC
Q 043639           68 TLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPES  147 (261)
Q Consensus        68 ~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~  147 (261)
                      .|.|.|... ++     ...++.|++||+.|+++++|+|+|...   ++..|.|..+ .| ++++.|..+        +.
T Consensus         3 ~VpY~i~~~-~~-----~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~~--------~~   63 (180)
T cd04280           3 TVPYVIDGS-FD-----ESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRVG--------GR   63 (180)
T ss_pred             EEEEEECCC-CC-----HHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCccC--------Cc
Confidence            799999842 32     578999999999999999999999875   4455666554 23 444444311        11


Q ss_pred             ceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639          148 GKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       148 G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      -.+.+..               ++....+++|||||||||.|++.
T Consensus        64 q~i~l~~---------------~c~~~g~v~HE~~HalG~~HEh~   93 (180)
T cd04280          64 QVVSLGS---------------GCFSLGTIVHELMHALGFYHEQS   93 (180)
T ss_pred             eeEEeCC---------------CcCcCchhHHHHHHHhcCcchhc
Confidence            2232221               23336899999999999999874


No 12 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.54  E-value=2.8e-14  Score=122.49  Aligned_cols=96  Identities=24%  Similarity=0.436  Sum_probs=60.5

Q ss_pred             cccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCce
Q 043639           60 RWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVL  139 (261)
Q Consensus        60 kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~l  139 (261)
                      ||+..   +|.|.|.. +++     +..++.|++||+.|++.++|+|+++. . .....|.|..+ .| ++++.|..+  
T Consensus         2 ~Wp~~---~IpY~~~~-~~~-----~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~g-C~S~vG~~~--   66 (191)
T PF01400_consen    2 KWPNG---TIPYYIDP-SFS-----SSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SG-CWSYVGRQG--   66 (191)
T ss_dssp             S-GGG---EEEEEEET-TS------HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SS-EEEESS--S--
T ss_pred             cCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-cc-ccchhhhcC--
Confidence            89999   99999985 332     57889999999999999999999998 2 33344555442 22 344444311  


Q ss_pred             eeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639          140 AHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       140 AhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                            +.=.+.+.               .+.....+++|||||||||.|.+
T Consensus        67 ------g~q~i~l~---------------~~c~~~~~i~HEl~HaLG~~HEh   97 (191)
T PF01400_consen   67 ------GEQTINLG---------------DGCFSVGTILHELGHALGFWHEH   97 (191)
T ss_dssp             ------SEEEEEE----------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred             ------cceeEEec---------------ceeCCccchHHHHHHHHhhhhhh
Confidence                  00012221               02234679999999999999986


No 13 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.54  E-value=1.1e-13  Score=119.70  Aligned_cols=99  Identities=24%  Similarity=0.434  Sum_probs=66.7

Q ss_pred             CCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCC
Q 043639           58 KPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLG  137 (261)
Q Consensus        58 ~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg  137 (261)
                      ..+|++.   .|.|.|.. +++     ...++.|++||+.|++.|+|+|++-.. ...  .|.|..+..| ++++.|.-+
T Consensus         7 ~~~Wp~~---~VpY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~--yi~f~~~~~G-C~S~vG~~~   73 (200)
T cd04281           7 ERIWPGG---VIPYVIDG-NFT-----GSQRAMFKQAMRHWENFTCVTFVERTP-EEN--YIVFTYRPCG-CCSYVGRRG   73 (200)
T ss_pred             cCcCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceEEEECCC-CCC--EEEEEECCCC-eeEcCCCcC
Confidence            5699999   99999964 332     467889999999999999999998764 333  3444443333 556655422


Q ss_pred             ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639          138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      .-.+       .+.+        +       .+.....+++|||||||||-|++
T Consensus        74 ~g~q-------~isl--------~-------~~C~~~Gti~HEl~HaLGf~HEh  105 (200)
T cd04281          74 NGPQ-------AISI--------G-------KNCDKFGIVVHELGHVIGFWHEH  105 (200)
T ss_pred             CCce-------eeec--------C-------CCcCcCchHHHHHHHHhcCcchh
Confidence            0011       1111        1       12334689999999999999986


No 14 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.52  E-value=1.3e-13  Score=119.15  Aligned_cols=136  Identities=20%  Similarity=0.352  Sum_probs=89.0

Q ss_pred             CeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhc-CCceEEEc--CCCCCcceEEEeeecCCCCCCCccCCCCceeee
Q 043639           66 PMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASV-IPVSFIET--DDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHS  142 (261)
Q Consensus        66 ~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~v-t~l~F~ev--~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA  142 (261)
                      +++|+..+.+.+. ..++ +.++.++++|++.|+++ ..|+|+-+  .....+||.|.-...        +.+.|+.+-|
T Consensus        38 ~~~I~i~~~~~~~-~~l~-s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~ggsA  107 (211)
T PF12388_consen   38 PRTITIIGYTGGS-QSLN-SAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAGGSA  107 (211)
T ss_pred             CCEEEEEeCCCcc-cccc-HHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcceec
Confidence            3488888875321 1233 67999999999999997 35677422  233466776632211        1223456678


Q ss_pred             cCC-CCc----eeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC--------------------------
Q 043639          143 FSP-ESG----KFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP--------------------------  191 (261)
Q Consensus       143 ~~P-~~G----~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~--------------------------  191 (261)
                      .|| ..|    .|.+     +.+.     .........|++|||||+|||.|++                          
T Consensus       108 GFP~s~G~P~~~I~I-----~~~~-----~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGT  177 (211)
T PF12388_consen  108 GFPTSNGNPYKFIQI-----YGLS-----NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGT  177 (211)
T ss_pred             cCCCCCCCCCceEEE-----EecC-----CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCC
Confidence            889 444    3555     1111     1112335789999999999999986                          


Q ss_pred             ----CCCCccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639          192 ----VKEAVMYPTLKPRDKKVDLALDDIQGVQALY  222 (261)
Q Consensus       192 ----~~~svM~p~y~~~~~~~~l~~dDI~~iq~LY  222 (261)
                          |++|||...+... ....++..||.+|++||
T Consensus       178 Pt~~d~~SiM~ac~~~~-~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  178 PTGADPNSIMNACFSSG-EDGEFTSNDITALNYLY  211 (211)
T ss_pred             CCCCCCchhhhccccCC-CCCCcChhhHHHHHhhC
Confidence                3457887766543 35579999999999999


No 15 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.44  E-value=1.4e-12  Score=114.88  Aligned_cols=103  Identities=21%  Similarity=0.332  Sum_probs=67.9

Q ss_pred             eeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCC
Q 043639           52 YVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEP  131 (261)
Q Consensus        52 y~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~  131 (261)
                      -++.....+|++    .|-|.|.. +++     ...+..|++||+.|++.|+|+|++... .. | .|.|..+. | +++
T Consensus        37 ~a~~~~~~~Wp~----~VPY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~rt~-e~-~-yi~i~~~~-G-C~S  101 (230)
T cd04282          37 NGLIGDTYRWPF----PIPYILDD-SLD-----LNAKGVILKAFEMYRLKSCVDFKPYEG-ES-N-YIFFFKGS-G-CWS  101 (230)
T ss_pred             ccccCcccCCCc----ceeEEECC-CCC-----HHHHHHHHHHHHHHHhCCCeeEEECCC-CC-c-EEEEEcCC-C-eee
Confidence            344445679987    49999964 343     467899999999999999999998764 33 3 24444432 3 555


Q ss_pred             ccCCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639          132 FDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       132 fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      +.|..|        +.-.+.+.               .++....+++|||||||||-|++.
T Consensus       102 ~vG~~g--------g~q~isl~---------------~~C~~~Gti~HEl~HalGf~HEqs  139 (230)
T cd04282         102 MVGDQQ--------GGQNLSIG---------------AGCDYKATVEHEFLHALGFYHEQS  139 (230)
T ss_pred             ccCccC--------CeEEEEEC---------------CCcCCCchHHHHHHHHhCCccccc
Confidence            555421        01112221               133446899999999999999863


No 16 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.40  E-value=3.3e-12  Score=108.89  Aligned_cols=92  Identities=20%  Similarity=0.309  Sum_probs=59.6

Q ss_pred             eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCC
Q 043639           67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPE  146 (261)
Q Consensus        67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~  146 (261)
                      ..|-|.|.. +++     ...++.|++|++.|++.|+|+|++... ....|+|  ..+ .| ++++.|..|-        
T Consensus         4 ~~IpY~i~~-~~~-----~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~--~~~-~g-C~S~vG~~gg--------   64 (182)
T cd04283           4 VYVPYVISP-QYS-----ENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNI--ESR-SG-CWSYIGRQGG--------   64 (182)
T ss_pred             EEEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEE--EcC-CC-ceEecCccCC--------
Confidence            378899964 332     467899999999999999999998753 3334444  332 22 4555443210        


Q ss_pred             CceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639          147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      .-.+.+.              ..+.....+++|||||||||.|++
T Consensus        65 ~q~i~l~--------------~~~C~~~G~i~HEl~HaLG~~HEh   95 (182)
T cd04283          65 RQTVSLQ--------------KQGCMYKGIIQHELLHALGFYHEQ   95 (182)
T ss_pred             ceeEecC--------------CCCcCccchHHHHHHHHhCCcccc
Confidence            0011111              013334689999999999999986


No 17 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.35  E-value=5.8e-12  Score=119.92  Aligned_cols=107  Identities=21%  Similarity=0.291  Sum_probs=75.4

Q ss_pred             cceeeCCCCCcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCC
Q 043639           50 KDYVLFPGKPRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDG  129 (261)
Q Consensus        50 kry~~~~~~~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~  129 (261)
                      +|........+|+..   .|.|.|... +.     ...+.+|++||+.|++.|+|+|.|..+ .+.+..+.+...  | +
T Consensus        71 ~r~~~~~~~~~Wp~~---~ipY~i~~~-~~-----~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--g-C  137 (411)
T KOG3714|consen   71 RRNGTSNPERRWPNG---VIPYYIDGS-FT-----SSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--G-C  137 (411)
T ss_pred             hhhcccChhhcCCCC---eeeeEECCC-CC-----HHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--c-c
Confidence            345555667899998   999999853 43     589999999999999999999999874 445555554432  3 5


Q ss_pred             CCccCCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCC
Q 043639          130 EPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       130 ~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      +++.|..|.-         .      ..-+++       .+...+.|++|||+||||+-|.+
T Consensus       138 ~S~VGr~gg~---------~------q~~sl~-------~~C~~~G~i~HEl~HaLGf~Heh  177 (411)
T KOG3714|consen  138 YSYVGRRGGG---------Q------QLLSLG-------DGCDRFGTIVHELMHALGFWHEH  177 (411)
T ss_pred             eeeeCccCCC---------c------cceecC-------CCcCcCchhHHHHHHHhhhhhcc
Confidence            5665643311         0      011111       24455899999999999999986


No 18 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.19  E-value=1.1e-10  Score=92.72  Aligned_cols=125  Identities=26%  Similarity=0.313  Sum_probs=75.9

Q ss_pred             eeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHh-cCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCC
Q 043639           67 MTLSYAFSKENMINYLHITDIKAVFKRAFSKWAS-VIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSP  145 (261)
Q Consensus        67 ~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~-vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P  145 (261)
                      ++|+|.-+.        ..+.+..|.+|.+.|+. |.+++|++...   ++|++.... +         +++..+..--.
T Consensus         2 vtv~Ydas~--------A~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~-~---------~~~sya~~~g~   60 (132)
T PF02031_consen    2 VTVYYDASR--------APEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN-D---------PRGSYASTDGL   60 (132)
T ss_dssp             EEEEEEEEE---------GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE------------TT--EEEE-SS
T ss_pred             EEEEEeCCC--------CchhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec-C---------CCCcccccCCC
Confidence            378888763        24688999999999987 78999999863   888876432 1         12333333334


Q ss_pred             CCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCC--CCCCCCccccccCCCC-CCCCCCHHHHHHHHHHh
Q 043639          146 ESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTH--SPVKEAVMYPTLKPRD-KKVDLALDDIQGVQALY  222 (261)
Q Consensus       146 ~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~H--s~~~~svM~p~y~~~~-~~~~l~~dDI~~iq~LY  222 (261)
                      +.|.|.+|..  +.         .+++-..|++|||||.|||..  +...+.+|...-.+.. .+..+...++..+++||
T Consensus        61 G~G~I~l~~~--~~---------qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~  129 (132)
T PF02031_consen   61 GSGYIFLDYQ--QN---------QGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLF  129 (132)
T ss_dssp             S-EEEEEEHH--HH---------HHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHC
T ss_pred             CcEEEEechH--Hh---------hCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHh
Confidence            5688888752  21         156667899999999999953  3445789975544322 34467889999999998


Q ss_pred             C
Q 043639          223 G  223 (261)
Q Consensus       223 G  223 (261)
                      .
T Consensus       130 a  130 (132)
T PF02031_consen  130 A  130 (132)
T ss_dssp             T
T ss_pred             c
Confidence            5


No 19 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.94  E-value=3.8e-09  Score=90.53  Aligned_cols=148  Identities=22%  Similarity=0.320  Sum_probs=86.3

Q ss_pred             CCCcccCCCCeeEEEEEecC-CcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCC
Q 043639           57 GKPRWTRHVPMTLSYAFSKE-NMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGV  135 (261)
Q Consensus        57 ~~~kW~~~~~~~LTY~~~~~-~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~  135 (261)
                      +..-|.+.   .++.++.-. +...--....+++++..|++-|++.-|+..+|-.  ..|||+|...+.. |  .+.++.
T Consensus        77 ~~L~~S~f---~v~v~irwpq~l~n~~~ap~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~p-g--tg~~~~  148 (236)
T COG5549          77 GYLIWSEF---PVDVRIRWPQNLENVEGAPRWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNPP-G--TGWRQY  148 (236)
T ss_pred             ceeecccC---cceeeecccccccccccchhHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCCC-C--Cccccc
Confidence            34568776   555555411 1111111235789999999999999777666654  6899999875432 2  112222


Q ss_pred             CC---------ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCC-CCCCCCCccccccCCCC
Q 043639          136 LG---------VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLT-HSPVKEAVMYPTLKPRD  205 (261)
Q Consensus       136 gg---------~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~svM~p~y~~~~  205 (261)
                      |.         .+.|+..|+    |++..  ..+...  .+....++..+|.||+|||||+. |+..+ ++|.+..+   
T Consensus       149 grArtal~~~efl~~~~~~~----~lnhr--~~i~~~--pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s---  216 (236)
T COG5549         149 GRARTALIAYEFLGHALGLG----HLNHR--GDIMYP--PGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS---  216 (236)
T ss_pred             chHHHHHHHHHHhhhccCcc----ccccc--ccccCC--cccchhhhhHHHHHhhcchheecccccch-hhhhcchh---
Confidence            11         112222221    12110  011111  12335678899999999999997 98765 45543333   


Q ss_pred             CCCCCCHHHHHHHHHHhCCC
Q 043639          206 KKVDLALDDIQGVQALYGSN  225 (261)
Q Consensus       206 ~~~~l~~dDI~~iq~LYG~~  225 (261)
                       ...++..|+..+.++|-.+
T Consensus       217 -~~~is~rdv~tL~r~Yeqp  235 (236)
T COG5549         217 -WQGISRRDVKTLERKYEQP  235 (236)
T ss_pred             -hcccCHHHHHHHHHHhcCC
Confidence             2238999999999999754


No 20 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=98.33  E-value=4.4e-06  Score=71.79  Aligned_cols=142  Identities=20%  Similarity=0.270  Sum_probs=89.9

Q ss_pred             eeEEEEEecCCcCCcC---CHHHHHHHHHHHH---HHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCC--CccCCCCc
Q 043639           67 MTLSYAFSKENMINYL---HITDIKAVFKRAF---SKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGE--PFDGVLGV  138 (261)
Q Consensus        67 ~~LTY~~~~~~~~~~l---~~~~~~~~i~~Af---~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~--~fdG~gg~  138 (261)
                      +.++|++.-++-....   ..+++.++|.+.+   +-|.....+.|++|.. +.+|++|...+...-+..  +++-.+. 
T Consensus        29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e-  106 (203)
T PF11350_consen   29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGE-  106 (203)
T ss_pred             ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCce-
Confidence            5899999866422211   1356667777766   4799888999999987 559999999876543321  2222211 


Q ss_pred             eeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCCCCC-----CCccccccCCCCCCCCCCHH
Q 043639          139 LAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPVK-----EAVMYPTLKPRDKKVDLALD  213 (261)
Q Consensus       139 lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~-----~svM~p~y~~~~~~~~l~~d  213 (261)
                      . .-+.|..++|.+|. ..|..+.......-+.+...|+=||+||+||-.|..=+     --||-      +..+.|.-+
T Consensus       107 ~-SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg~G~~APVMm------QQT~gL~g~  178 (203)
T PF11350_consen  107 T-SCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPGPGRPAPVMM------QQTLGLDGC  178 (203)
T ss_pred             e-EeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCCCCCcCCcch------hhhcccCCc
Confidence            1 12334456777765 46877655433333455678999999999999887643     33553      223455556


Q ss_pred             HHHHH
Q 043639          214 DIQGV  218 (261)
Q Consensus       214 DI~~i  218 (261)
                      +|..|
T Consensus       179 ~~~~~  183 (203)
T PF11350_consen  179 ELAKL  183 (203)
T ss_pred             ccccc
Confidence            66655


No 21 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.98  E-value=5.5e-05  Score=64.40  Aligned_cols=106  Identities=16%  Similarity=0.083  Sum_probs=59.7

Q ss_pred             HHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCCCcc
Q 043639           91 FKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVA  170 (261)
Q Consensus        91 i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~  170 (261)
                      ..++|..|....         ....|+.+.+...+-       +.+++++.||.++.-...+...  .....     ...
T Consensus        74 ~l~~f~~~~~~~---------~~~~d~~~l~t~~~~-------~~~~~~G~a~~~~~C~~~~~~~--v~~~~-----~~~  130 (192)
T cd04267          74 TLNSFSFWRAEG---------PIRHDNAVLLTAQDF-------IEGDILGLAYVGSMCNPYSSVG--VVEDT-----GFT  130 (192)
T ss_pred             HHHHHHHHHhcC---------CCCCceEEEEeeecc-------CCCCeeeeecccCCCCCCCCeE--EEecC-----Ccc
Confidence            356788997644         145666666553321       2345667777665211000000  00000     001


Q ss_pred             ccchhhhhhhhhhhcCCCCCCCC----------CCccccccCCCCCCCCCCHHHHHHHHH
Q 043639          171 VDLESVAVHEIGHLLGLTHSPVK----------EAVMYPTLKPRDKKVDLALDDIQGVQA  220 (261)
Q Consensus       171 ~~l~~Va~HEiGHaLGL~Hs~~~----------~svM~p~y~~~~~~~~l~~dDI~~iq~  220 (261)
                      .....+++||+||.|||.|....          .-||.|.... .....++...+..|+.
T Consensus       131 ~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~  189 (192)
T cd04267         131 LLTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE  189 (192)
T ss_pred             eeehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence            23457999999999999998753          2389876543 2345678877777764


No 22 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.96  E-value=4.2e-05  Score=66.26  Aligned_cols=120  Identities=16%  Similarity=0.212  Sum_probs=71.7

Q ss_pred             HHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCCCcc
Q 043639           91 FKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVA  170 (261)
Q Consensus        91 i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~  170 (261)
                      .-+.|..|.......-.  .+....|+.+.+...+....   ++++++++.||..+-=.    ......+..     ..+
T Consensus        72 ~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~ggiC~----~~~s~~i~~-----~~~  137 (207)
T cd04273          72 SLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNCDTLGLAPVGGMCS----PSRSCSINE-----DTG  137 (207)
T ss_pred             HHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCCCceEEeccccCCC----CCcceEEEc-----CCC
Confidence            35789999764332211  11246777777765443221   46677777777532100    000001110     113


Q ss_pred             ccchhhhhhhhhhhcCCCCCCC---------CCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 043639          171 VDLESVAVHEIGHLLGLTHSPV---------KEAVMYPTLKPRDKKVDLALDDIQGVQALYGS  224 (261)
Q Consensus       171 ~~l~~Va~HEiGHaLGL~Hs~~---------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~  224 (261)
                      .....|++|||||.||+.|-.+         .+-||+|..........++.--+..++.+...
T Consensus       138 ~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~~  200 (207)
T cd04273         138 LSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLDT  200 (207)
T ss_pred             ceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHcc
Confidence            3445799999999999999875         26799987765334667888888888776653


No 23 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.91  E-value=1.1e-05  Score=70.13  Aligned_cols=48  Identities=25%  Similarity=0.306  Sum_probs=32.2

Q ss_pred             chhhhhhhhhhhcCCCCCCCC--------------CCccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639          173 LESVAVHEIGHLLGLTHSPVK--------------EAVMYPTLKPRDKKVDLALDDIQGVQALY  222 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~~--------------~svM~p~y~~~~~~~~l~~dDI~~iq~LY  222 (261)
                      ...+++|||||+|||.|..+.              .+||.  |........++.-+|..|+...
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMs--Y~~~~~~~~FS~~~i~~i~~~~  198 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMS--YASRPSRPYFSPPSIEYIREVV  198 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeC--CCCCCCCCCCCchhHHHHhCCC
Confidence            357899999999999998752              25785  2211112237788888776543


No 24 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.68  E-value=2.6e-05  Score=64.86  Aligned_cols=25  Identities=32%  Similarity=0.492  Sum_probs=15.9

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHSPVKE  194 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs~~~~  194 (261)
                      ..+...|++||+||-|||.|.....
T Consensus        66 ~~~~g~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT---TT---
T ss_pred             ccccccchhhhhhhhhcccccccCC
Confidence            4456789999999999999998654


No 25 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.54  E-value=2.6e-05  Score=65.94  Aligned_cols=48  Identities=21%  Similarity=0.197  Sum_probs=19.3

Q ss_pred             chhhhhhhhhhhcCCCCCCCCC-------------CccccccCCCCCCCCCCHHHHHHHHHHh
Q 043639          173 LESVAVHEIGHLLGLTHSPVKE-------------AVMYPTLKPRDKKVDLALDDIQGVQALY  222 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~~~-------------svM~p~y~~~~~~~~l~~dDI~~iq~LY  222 (261)
                      ...|++|||||.||+.|..+..             .||++.  +......++..-|..|+.+-
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~--~~~~~~~fS~cS~~~i~~~l  171 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYA--GICNNDKFSPCSIRQIRAVL  171 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG---------------
T ss_pred             eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeC--CCCCCccccccccccccccc
Confidence            3578999999999999987642             588743  32344567777777776543


No 26 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.44  E-value=5.4e-05  Score=66.77  Aligned_cols=26  Identities=38%  Similarity=0.593  Sum_probs=22.4

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHSPVKEA  195 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs~~~~s  195 (261)
                      ..++-.|++|||||.|||.|++...+
T Consensus       134 ~~n~g~t~~HEvGH~lGL~HtF~~~~  159 (225)
T cd04275         134 PYNLGDTATHEVGHWLGLYHTFQGGS  159 (225)
T ss_pred             cccccceeEEeccceeeeeeeecCCC
Confidence            45677899999999999999998764


No 27 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.38  E-value=7.2e-05  Score=62.91  Aligned_cols=27  Identities=44%  Similarity=0.730  Sum_probs=24.2

Q ss_pred             hhhhhhhhhhhcCCCCCCCCCCccccc
Q 043639          174 ESVAVHEIGHLLGLTHSPVKEAVMYPT  200 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~svM~p~  200 (261)
                      ..=++||+||+|||.|=.++.=||++.
T Consensus       125 ~KEv~HElGH~~GL~HC~N~~CVM~FS  151 (181)
T COG1913         125 VKEVLHELGHLLGLSHCPNPRCVMNFS  151 (181)
T ss_pred             HHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence            456899999999999999999999854


No 28 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.32  E-value=7e-05  Score=63.84  Aligned_cols=22  Identities=36%  Similarity=0.401  Sum_probs=14.5

Q ss_pred             cchhhhhhhhhhhcCCCCCCCC
Q 043639          172 DLESVAVHEIGHLLGLTHSPVK  193 (261)
Q Consensus       172 ~l~~Va~HEiGHaLGL~Hs~~~  193 (261)
                      ....|++|||||.||+.|..+.
T Consensus       141 ~~~~~~AHEiGH~lGa~HD~~~  162 (196)
T PF13688_consen  141 NGAITFAHEIGHNLGAPHDGDY  162 (196)
T ss_dssp             HHHHHHHHHHHHHTT-----SS
T ss_pred             ceehhhHHhHHHhcCCCCCCCC
Confidence            3468999999999999998775


No 29 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.23  E-value=9.8e-05  Score=58.23  Aligned_cols=17  Identities=53%  Similarity=0.808  Sum_probs=12.3

Q ss_pred             hhhhhhhhhhhcCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs  190 (261)
                      ..+++|||||.|||.|+
T Consensus       108 ~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  108 VDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             TTHHHHHHHHHTT----
T ss_pred             ceEeeehhhHhcCCCCC
Confidence            48999999999999995


No 30 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=97.18  E-value=0.0004  Score=61.34  Aligned_cols=49  Identities=29%  Similarity=0.455  Sum_probs=38.5

Q ss_pred             hhhhhhhhhhcCCCCCCCC-------------------------CCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 043639          175 SVAVHEIGHLLGLTHSPVK-------------------------EAVMYPTLKPRDKKVDLALDDIQGVQALYGSN  225 (261)
Q Consensus       175 ~Va~HEiGHaLGL~Hs~~~-------------------------~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~~  225 (261)
                      .|++|||||.||+.|-.+.                         .=||+|....  ....++.--+..|+.+...+
T Consensus       147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~  220 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN  220 (228)
T ss_pred             eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence            6999999999999996431                         1189987543  45789999999999888754


No 31 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.14  E-value=0.00021  Score=60.90  Aligned_cols=26  Identities=38%  Similarity=0.551  Sum_probs=23.5

Q ss_pred             hhhhhhhhhhhcCCCCCCCCCCcccc
Q 043639          174 ESVAVHEIGHLLGLTHSPVKEAVMYP  199 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~svM~p  199 (261)
                      ..+++||+||.|||.|-.++.-+|+.
T Consensus       126 ~k~~~HElGH~lGL~HC~~~~CvM~~  151 (179)
T PRK13267        126 RKEVTHELGHTLGLEHCDNPRCVMNF  151 (179)
T ss_pred             HHHHHHHHHHHcCCccCCCCCccCCC
Confidence            45699999999999999999999984


No 32 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.07  E-value=0.00063  Score=58.04  Aligned_cols=50  Identities=20%  Similarity=0.321  Sum_probs=36.5

Q ss_pred             chhhhhhhhhhhcCCCCCCC------CCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 043639          173 LESVAVHEIGHLLGLTHSPV------KEAVMYPTLKPRDKKVDLALDDIQGVQALYGS  224 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG~  224 (261)
                      .-.+++|||||.||+.|-.+      ..-||+|....  ....++.--..-++++...
T Consensus       131 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~~  186 (194)
T cd04269         131 FAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLSR  186 (194)
T ss_pred             HHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHhc
Confidence            35799999999999999653      35689886543  3456777777777766654


No 33 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.91  E-value=0.00039  Score=59.91  Aligned_cols=27  Identities=44%  Similarity=0.651  Sum_probs=19.2

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccccc
Q 043639          173 LESVAVHEIGHLLGLTHSPVKEAVMYPT  200 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~~~svM~p~  200 (261)
                      +..+++||+||++||.|=.+ .-+|++.
T Consensus       145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s  171 (194)
T PF07998_consen  145 VCKEAVHELGHLFGLDHCEN-RCVMNFS  171 (194)
T ss_dssp             HHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred             HHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence            45799999999999999999 9999864


No 34 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.46  E-value=0.0039  Score=53.42  Aligned_cols=52  Identities=19%  Similarity=0.415  Sum_probs=33.4

Q ss_pred             ccchhhhhhhhhhhcCCCCCCC--------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 043639          171 VDLESVAVHEIGHLLGLTHSPV--------KEAVMYPTLKPRDKKVDLALDDIQGVQALYG  223 (261)
Q Consensus       171 ~~l~~Va~HEiGHaLGL~Hs~~--------~~svM~p~y~~~~~~~~l~~dDI~~iq~LYG  223 (261)
                      ..+-.+++|||||.||+.|-..        ..-||.|...+. ....++.--+..++++..
T Consensus       129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~-~~~~fS~CS~~~~~~~l~  188 (199)
T PF01421_consen  129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPS-SSDKFSNCSRRQFEEFLS  188 (199)
T ss_dssp             HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSS-STSSS-HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCC-CCCCcCHHHHHHHHHHHc
Confidence            3445789999999999999776        256899855432 233788877777776643


No 35 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.33  E-value=0.0045  Score=55.12  Aligned_cols=50  Identities=20%  Similarity=0.408  Sum_probs=36.1

Q ss_pred             hhhhhhhhhhhcCCCCCCC-----C------CCccccccCCC--CCCCCCCHHHHHHHHHHhC
Q 043639          174 ESVAVHEIGHLLGLTHSPV-----K------EAVMYPTLKPR--DKKVDLALDDIQGVQALYG  223 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~-----~------~svM~p~y~~~--~~~~~l~~dDI~~iq~LYG  223 (261)
                      ..|++|||||.||..|-..     +      .=||+|.-...  .....++.--++.++...-
T Consensus       168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~  230 (244)
T cd04270         168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE  230 (244)
T ss_pred             HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence            4799999999999999665     1      24899864321  2356788888888876654


No 36 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.98  E-value=0.007  Score=52.83  Aligned_cols=50  Identities=16%  Similarity=0.274  Sum_probs=35.5

Q ss_pred             chhhhhhhhhhhcCCCCCCCC------------------CCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 043639          173 LESVAVHEIGHLLGLTHSPVK------------------EAVMYPTLKPRDKKVDLALDDIQGVQALYG  223 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~~------------------~svM~p~y~~~~~~~~l~~dDI~~iq~LYG  223 (261)
                      ...+++|||||.||+.|-.+.                  .-||.+... ....+.++.--+.-++++..
T Consensus       145 ~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~-~~~~~~fS~CS~~~~~~~l~  212 (220)
T cd04272         145 GVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVN-GERQYRFSQCSQRQIRNVFR  212 (220)
T ss_pred             cHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccC-CCCCCccCcchHHHHHHHHc
Confidence            368999999999999995332                  357886432 23455788877777777654


No 37 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=95.48  E-value=0.013  Score=55.88  Aligned_cols=58  Identities=24%  Similarity=0.336  Sum_probs=40.6

Q ss_pred             hhhhhhhhhhhcCCCCCCCCCCcc---ccccCC----CC------------C---CCCCCHHHHHHHHHHhCCCCCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHSPVKEAVM---YPTLKP----RD------------K---KVDLALDDIQGVQALYGSNPNFTIG  231 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~svM---~p~y~~----~~------------~---~~~l~~dDI~~iq~LYG~~~~~~~~  231 (261)
                      ...+.|||||.||+.|..  +.||   |+++.-    .+            +   +-+.++.-+++++-+|-+..+-..+
T Consensus       302 lGA~~HElGH~lgcpHq~--~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~d  379 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQS--EGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPGD  379 (614)
T ss_pred             HHHHHHHhhhccCCCCCC--CceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCCC
Confidence            468899999999999975  5688   443321    00            1   2346778899999999877666555


Q ss_pred             CC
Q 043639          232 SL  233 (261)
Q Consensus       232 ~~  233 (261)
                      ++
T Consensus       380 pt  381 (614)
T KOG4525|consen  380 PT  381 (614)
T ss_pred             CC
Confidence            44


No 38 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=95.03  E-value=0.69  Score=40.50  Aligned_cols=150  Identities=15%  Similarity=0.127  Sum_probs=83.3

Q ss_pred             CcccCCCCeeEEEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCC---------C
Q 043639           59 PRWTRHVPMTLSYAFSKENMINYLHITDIKAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGD---------G  129 (261)
Q Consensus        59 ~kW~~~~~~~LTY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd---------~  129 (261)
                      .||.+.    |+..+..  .+. -......+.+..-++.-..++.+.+..+... .|+|.|.|...+...         .
T Consensus        28 ~Rw~~P----Vrv~v~~--~~~-~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~   99 (213)
T PF11150_consen   28 RRWEGP----VRVRVEG--VPP-ADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGA   99 (213)
T ss_pred             ccCCCC----eEEEEec--cCh-hhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccc
Confidence            589976    8888872  111 1123445566666788888999999998864 999999998765411         0


Q ss_pred             CCcc--------CCCCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCC-CC-CCCCcccc
Q 043639          130 EPFD--------GVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTH-SP-VKEAVMYP  199 (261)
Q Consensus       130 ~~fd--------G~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~H-s~-~~~svM~p  199 (261)
                      ..+.        -....+..++....+.+.....-.. +...    ...--+...+..||-.+|||.- +. .+.||-+ 
T Consensus       100 ~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi~-i~~~----~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFN-  173 (213)
T PF11150_consen  100 ACFVVPNFSSLPRYARCLVFARSNWSGEITRARAVIF-IRAD----HPPQERRDCIHEELAQGLGLPNDSPRARPSIFN-  173 (213)
T ss_pred             ccccchhhhcccccceEEEEEEecCCCceEEEEEEEE-Eecc----CCchHHHHHHHHHHHHHcCCccCCCcCCCceee-
Confidence            0010        0011122222221111111000000 0000    0122356788889999999952 22 2233322 


Q ss_pred             ccCCCCCCCCCCHHHHHHHHHHhCCC
Q 043639          200 TLKPRDKKVDLALDDIQGVQALYGSN  225 (261)
Q Consensus       200 ~y~~~~~~~~l~~dDI~~iq~LYG~~  225 (261)
                         -+.....|+..|..-|+.||-+.
T Consensus       174 ---Dd~~~~~LT~~D~lLLr~LYdP~  196 (213)
T PF11150_consen  174 ---DDNEFALLTGFDELLLRMLYDPR  196 (213)
T ss_pred             ---CCCccccccHHHHHHHHHHcCcc
Confidence               22345578999999999999864


No 39 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.63  E-value=0.027  Score=49.46  Aligned_cols=81  Identities=22%  Similarity=0.339  Sum_probs=43.8

Q ss_pred             HHHHHHHHHHhHhcC---CceEEEcC-----CCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeeccccccc
Q 043639           88 KAVFKRAFSKWASVI---PVSFIETD-----DYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWA  159 (261)
Q Consensus        88 ~~~i~~Af~~Ws~vt---~l~F~ev~-----~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt  159 (261)
                      +--++..|++|+++.   +++=.|+.     +.+-.|++|.-..|.-+|.             |-|.+-.+.+.++.. .
T Consensus        18 q~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtDH-------------YdP~~k~vrLS~~vy-~   83 (222)
T PF04298_consen   18 QWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTDH-------------YDPRNKVVRLSEDVY-N   83 (222)
T ss_pred             HHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCCC-------------cCCCCCEEEeCCccC-C
Confidence            345677788888763   23222332     2345677887666544442             344444444433211 0


Q ss_pred             ccCCCCCCCccccchhhhhhhhhhhcCCCCCCC
Q 043639          160 VDFGSEKSPVAVDLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       160 ~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                              ....--..|++||.|||+  .|...
T Consensus        84 --------~~SiaAvaVAAHEvGHAi--Q~a~~  106 (222)
T PF04298_consen   84 --------GRSIAAVAVAAHEVGHAI--QHAEG  106 (222)
T ss_pred             --------CCCHHHHHHHHHHHhHHH--hcccc
Confidence                    012223579999999995  66654


No 40 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.59  E-value=0.015  Score=53.59  Aligned_cols=20  Identities=35%  Similarity=0.532  Sum_probs=14.8

Q ss_pred             ccchhhhhhhhhhhcCCCCC
Q 043639          171 VDLESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       171 ~~l~~Va~HEiGHaLGL~Hs  190 (261)
                      .....+++||+||++||.|-
T Consensus       191 ~s~~~~f~HE~GH~~GL~H~  210 (305)
T PF10462_consen  191 YSYGNEFSHELGHNFGLGHY  210 (305)
T ss_dssp             S-SHHHHHHHHHHTTT--SS
T ss_pred             cCccceeehhhhhhcCCCCC
Confidence            34568999999999999994


No 41 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.47  E-value=0.02  Score=53.02  Aligned_cols=19  Identities=42%  Similarity=0.566  Sum_probs=17.0

Q ss_pred             chhhhhhhhhhhcCCCCCC
Q 043639          173 LESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~  191 (261)
                      -..|++||+||-|||.|+.
T Consensus       150 ~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  150 DWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             cHHHHHHHhhhhccccccC
Confidence            3579999999999999995


No 42 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=93.87  E-value=0.029  Score=53.90  Aligned_cols=23  Identities=39%  Similarity=0.658  Sum_probs=19.5

Q ss_pred             hhhhhhhhhhhcCCCCCCCCCCccc
Q 043639          174 ESVAVHEIGHLLGLTHSPVKEAVMY  198 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~svM~  198 (261)
                      ....+|||||++||.|..+  .||-
T Consensus       317 iGA~lHEiGH~fg~pH~~~--GIM~  339 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQED--GIML  339 (423)
T ss_pred             hHHHHHHHHHhcCCCCCCC--Ceee
Confidence            4579999999999999965  7773


No 43 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.46  E-value=0.023  Score=51.80  Aligned_cols=19  Identities=42%  Similarity=0.513  Sum_probs=16.6

Q ss_pred             hhhhhhhhhhhcCCCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~  192 (261)
                      ..|++||+||.|||.+--+
T Consensus       166 igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             eeeeehhhhcccCCCcccc
Confidence            6899999999999987654


No 44 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=93.27  E-value=0.063  Score=46.20  Aligned_cols=78  Identities=19%  Similarity=0.334  Sum_probs=41.1

Q ss_pred             HHHHHHHHHHHhHhcC---CceEEEcC-----CCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccc
Q 043639           87 IKAVFKRAFSKWASVI---PVSFIETD-----DYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETW  158 (261)
Q Consensus        87 ~~~~i~~Af~~Ws~vt---~l~F~ev~-----~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~w  158 (261)
                      .+.-++.++++|++|.   +++=.||.     ..+--|+.+.-.+|...|             -|-|..-.+.+.+ .++
T Consensus        20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTD-------------HYDP~~kvvrLSe-~~y   85 (226)
T COG2738          20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTD-------------HYDPRRKVVRLSE-ANY   85 (226)
T ss_pred             HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCccc-------------ccChhhheeeccc-ccc
Confidence            4567789999999984   34434443     124456666654443222             2333322222211 111


Q ss_pred             cccCCCCCCCccccchhhhhhhhhhhcC
Q 043639          159 AVDFGSEKSPVAVDLESVAVHEIGHLLG  186 (261)
Q Consensus       159 t~~~~~~~~~~~~~l~~Va~HEiGHaLG  186 (261)
                      .        ....--..|++||.|||+=
T Consensus        86 ~--------g~Sia~~aVAAHEVGHAiQ  105 (226)
T COG2738          86 Y--------GPSIAAIAVAAHEVGHAIQ  105 (226)
T ss_pred             C--------CccHHHHHHHHHHhhHHHh
Confidence            0        0122236899999999964


No 45 
>PTZ00337 surface protease GP63; Provisional
Probab=92.63  E-value=0.2  Score=49.98  Aligned_cols=36  Identities=31%  Similarity=0.488  Sum_probs=24.3

Q ss_pred             CCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCCCCC
Q 043639          145 PESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       145 P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs  190 (261)
                      |-.|.|.|+....   .       .......|++|||.||||+...
T Consensus       211 Pi~G~in~np~~i---~-------~~~~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        211 PFAAAVNFDPRQI---A-------VTNGDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             ceEEEEEECHHHc---c-------chhHHHHHHHHHHHHHHccCHH
Confidence            4447788876432   0       0223578999999999999654


No 46 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=92.46  E-value=0.3  Score=49.02  Aligned_cols=96  Identities=21%  Similarity=0.230  Sum_probs=49.9

Q ss_pred             CcceEEEeeecCCCCCCCccCCCCceeeec--------CCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639          114 FADIRIGFYNGDHGDGEPFDGVLGVLAHSF--------SPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL  185 (261)
Q Consensus       114 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL  185 (261)
                      ++|+.|......      +.  .+++|.|-        .|..|.+.|+....   ..     ........|++|||.|+|
T Consensus       205 naD~vLYVta~P------~~--~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-----~~~~~~~rv~~HEi~HAL  268 (622)
T PTZ00257        205 NADFVLYVASVP------SE--PGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-----RYDQGTTRTVTHEVAHAL  268 (622)
T ss_pred             ccCEEEEEEEec------CC--CCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-----ccchHHHHHHHHHHHHHh
Confidence            566665554332      21  25666553        24457788876432   00     012235789999999999


Q ss_pred             CCCCCCCCCCccccccCC-CCCCC----CCCHHHHHHHHHHhCCC
Q 043639          186 GLTHSPVKEAVMYPTLKP-RDKKV----DLALDDIQGVQALYGSN  225 (261)
Q Consensus       186 GL~Hs~~~~svM~p~y~~-~~~~~----~l~~dDI~~iq~LYG~~  225 (261)
                      |+.-+....--|.-.... ....+    ..++.-++..++-||.+
T Consensus       269 GFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~  313 (622)
T PTZ00257        269 GFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP  313 (622)
T ss_pred             cCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence            996443222111111110 01111    12566677778888854


No 47 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=91.42  E-value=0.077  Score=42.20  Aligned_cols=20  Identities=40%  Similarity=0.735  Sum_probs=16.2

Q ss_pred             cccchhhhhhhhhhhcCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTH  189 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~H  189 (261)
                      +.....|++|||||-.||+.
T Consensus       106 gd~vthvliHEIgHhFGLsD  125 (136)
T COG3824         106 GDQVTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             hhHhhhhhhhhhhhhcCCCh
Confidence            33456899999999999974


No 48 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.41  E-value=0.11  Score=51.29  Aligned_cols=97  Identities=16%  Similarity=0.168  Sum_probs=40.8

Q ss_pred             CcceEEEeeecCCCCCCCccCCCCceeeec--------CCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639          114 FADIRIGFYNGDHGDGEPFDGVLGVLAHSF--------SPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL  185 (261)
Q Consensus       114 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL  185 (261)
                      ++|+.|.......+        .+++|.|.        -|-.|.|.|+....   ..     .....+..|++|||.|+|
T Consensus       159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~i---~~-----~~~~~~~~~~~HEi~HaL  222 (521)
T PF01457_consen  159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSYI---PS-----FYFQEFFRTVIHEIAHAL  222 (521)
T ss_dssp             S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG-----------S--HHHHHHHHHHHHHHT
T ss_pred             CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhHc---cc-----hhhhcccceeeeeeeeee
Confidence            57776655433221        16777764        13337777766431   10     012345789999999999


Q ss_pred             CCCCCCCCCCccc---cccCCCC--CCCCCCHHHHHHHHHHhCCCC
Q 043639          186 GLTHSPVKEAVMY---PTLKPRD--KKVDLALDDIQGVQALYGSNP  226 (261)
Q Consensus       186 GL~Hs~~~~svM~---p~y~~~~--~~~~l~~dDI~~iq~LYG~~~  226 (261)
                      |+.-......-|.   +...+..  ..+--++.=+.-.+.-||.+.
T Consensus       223 GFs~~~~~~~~~~~~~~~~~~~~~~~~~l~tp~v~~~ar~hf~C~~  268 (521)
T PF01457_consen  223 GFSSSFFRYYGMPRTVSNVRGKPTTVTVLVTPNVVEFARKHFGCPT  268 (521)
T ss_dssp             T-SHHHHHHTT-EEEES-GGG-SS-EEEE--HHHHHHHHHHHT-TT
T ss_pred             eecccccccccccccccceeecCceeEEEeCcHHHHHHHHHhCCCc
Confidence            9975211111110   0111100  112235566667788998643


No 49 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=88.80  E-value=0.11  Score=52.62  Aligned_cols=51  Identities=20%  Similarity=0.149  Sum_probs=33.4

Q ss_pred             chhhhhhhhhhhcCCCCCCCC------C-----Ccccc-ccCCC---CCCCCCCHHHHHHHHHHhC
Q 043639          173 LESVAVHEIGHLLGLTHSPVK------E-----AVMYP-TLKPR---DKKVDLALDDIQGVQALYG  223 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~~------~-----svM~p-~y~~~---~~~~~l~~dDI~~iq~LYG  223 (261)
                      -..|++||+||.|||.---+.      +     ++|.. .|.+.   .....++..-.+-+|..+|
T Consensus       221 giGVfaHEfGH~LGLPDlYDT~~~g~~~~vg~WsLMs~GSw~g~~~g~~p~~~~~~~~~w~k~~lG  286 (645)
T PF05547_consen  221 GIGVFAHEFGHDLGLPDLYDTDYSGGGNGVGFWSLMSSGSWNGKGSGTEPGGFPPYMKAWLKLQLG  286 (645)
T ss_pred             ceEEEEeeccccCCCCCcccCCCCCCCCCcccceecCCCCCCCCCCCCCCCccChhHhHHHhhhcc
Confidence            368999999999999754432      1     57853 45432   1234566677777777777


No 50 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=87.96  E-value=0.46  Score=48.69  Aligned_cols=46  Identities=24%  Similarity=0.304  Sum_probs=30.2

Q ss_pred             chhhhhhhhhhhcCCCCCCC-------CCCccccccCCCCCCCCCCHHHHHHHH
Q 043639          173 LESVAVHEIGHLLGLTHSPV-------KEAVMYPTLKPRDKKVDLALDDIQGVQ  219 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~-------~~svM~p~y~~~~~~~~l~~dDI~~iq  219 (261)
                      ...|++|||||.||+.|-..       ..=|| +.+.....+..++.--++.+.
T Consensus       323 ~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~~  375 (716)
T KOG3607|consen  323 FAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDFE  375 (716)
T ss_pred             HHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHHH
Confidence            45799999999999999762       22478 655533334456655555553


No 51 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=86.61  E-value=0.27  Score=44.50  Aligned_cols=23  Identities=35%  Similarity=0.560  Sum_probs=12.9

Q ss_pred             cchhhhhhhhhhhc-CCCCCCCCC
Q 043639          172 DLESVAVHEIGHLL-GLTHSPVKE  194 (261)
Q Consensus       172 ~l~~Va~HEiGHaL-GL~Hs~~~~  194 (261)
                      ....|++||+||++ ||+-.-..+
T Consensus       215 ~~~~v~vHE~GHsf~~LaDEY~~~  238 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADEYYYG  238 (264)
T ss_dssp             THHHHHHHHHHHHTT------ECS
T ss_pred             cccceeeeecccccccccccccCC
Confidence            45689999999965 566665443


No 52 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=85.79  E-value=0.31  Score=37.41  Aligned_cols=16  Identities=38%  Similarity=0.771  Sum_probs=12.7

Q ss_pred             chhhhhhhhhhhcCCC
Q 043639          173 LESVAVHEIGHLLGLT  188 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~  188 (261)
                      +..|++|||||.+|+.
T Consensus        73 I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   73 IRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHcCCC
Confidence            4679999999999985


No 53 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=84.10  E-value=0.76  Score=46.34  Aligned_cols=48  Identities=23%  Similarity=0.439  Sum_probs=33.5

Q ss_pred             hhhhhhhhhhhcCCCCCCC----C------CCccccccCCCC--CCCCCCHHHHHHHHHH
Q 043639          174 ESVAVHEIGHLLGLTHSPV----K------EAVMYPTLKPRD--KKVDLALDDIQGVQAL  221 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~----~------~svM~p~y~~~~--~~~~l~~dDI~~iq~L  221 (261)
                      .-|++|||||..|=.|-.+    |      +=|||++-...+  .+-+++.--+..|...
T Consensus       393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~V  452 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKV  452 (764)
T ss_pred             heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHH
Confidence            4599999999999999765    1      238998755432  3456777777766543


No 54 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=82.85  E-value=0.45  Score=46.16  Aligned_cols=20  Identities=40%  Similarity=0.639  Sum_probs=15.9

Q ss_pred             cccchhhhhhhhhhhcCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs  190 (261)
                      ...+-+|++|||||..| .|-
T Consensus       127 esElagViAHEigHv~q-rH~  146 (484)
T COG4783         127 ESELAGVIAHEIGHVAQ-RHL  146 (484)
T ss_pred             HHHHHHHHHHHHHHHhh-hhH
Confidence            34678999999999977 554


No 55 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=78.73  E-value=7  Score=33.09  Aligned_cols=73  Identities=23%  Similarity=0.370  Sum_probs=42.6

Q ss_pred             HHHHHHHHHHhHhcCCceEEEcCCCCCcceEEEeeecCCCCCCCccCCCCceeeecCCCCceeeecccccccccCCCCCC
Q 043639           88 KAVFKRAFSKWASVIPVSFIETDDYDFADIRIGFYNGDHGDGEPFDGVLGVLAHSFSPESGKFHLDAAETWAVDFGSEKS  167 (261)
Q Consensus        88 ~~~i~~Af~~Ws~vt~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~  167 (261)
                      ++.+..-+..|++..++.        ...|+|.-....-|.+               +..|.|.|+    |-+-.-+   
T Consensus       111 ~~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc---------------~~~~~I~ln----~~L~~~P---  160 (205)
T PF01863_consen  111 KEYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC---------------SSKGNITLN----WRLVMAP---  160 (205)
T ss_pred             HHHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC---------------CCCCcEEee----cccccCC---
Confidence            344445555666655553        3456665444332221               335667774    3332221   


Q ss_pred             CccccchhhhhhhhhhhcCCCCCCC
Q 043639          168 PVAVDLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       168 ~~~~~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                        ..-+..|++||+-|..-..||..
T Consensus       161 --~~~idYVvvHEL~Hl~~~nHs~~  183 (205)
T PF01863_consen  161 --PEVIDYVVVHELCHLRHPNHSKR  183 (205)
T ss_pred             --ccHHHHHHHHHHHHhccCCCCHH
Confidence              33457899999999999999865


No 56 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=78.11  E-value=0.81  Score=34.60  Aligned_cols=14  Identities=36%  Similarity=0.629  Sum_probs=11.7

Q ss_pred             cchhhhhhhhhhhc
Q 043639          172 DLESVAVHEIGHLL  185 (261)
Q Consensus       172 ~l~~Va~HEiGHaL  185 (261)
                      ....+++|||||.+
T Consensus        41 ~~~f~laHELgH~~   54 (122)
T PF06114_consen   41 RQRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHH
Confidence            34689999999985


No 57 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=77.63  E-value=0.83  Score=39.42  Aligned_cols=14  Identities=50%  Similarity=0.705  Sum_probs=12.0

Q ss_pred             chhhhhhhhhhhcC
Q 043639          173 LESVAVHEIGHLLG  186 (261)
Q Consensus       173 l~~Va~HEiGHaLG  186 (261)
                      +..|+.||+||++-
T Consensus        22 ~l~t~~HE~gHal~   35 (200)
T PF13398_consen   22 LLVTFVHELGHALA   35 (200)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46799999999975


No 58 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.15  E-value=1.3  Score=40.75  Aligned_cols=40  Identities=20%  Similarity=0.247  Sum_probs=21.3

Q ss_pred             cCCCCceeeeccccccc-ccCCCCCCCccccchhhhhhhhhhh
Q 043639          143 FSPESGKFHLDAAETWA-VDFGSEKSPVAVDLESVAVHEIGHL  184 (261)
Q Consensus       143 ~~P~~G~vhfd~~e~wt-~~~~~~~~~~~~~l~~Va~HEiGHa  184 (261)
                      |-|.+..|.+|... |. +... ....+..-...|++||+||.
T Consensus       141 YCp~D~tIYlD~~f-~~~L~~~-~ga~G~~a~ayVlAHEyGHH  181 (292)
T PF04228_consen  141 YCPADQTIYLDLSF-FDELQQR-FGASGDFAQAYVLAHEYGHH  181 (292)
T ss_pred             eCCCCCEEEechHH-HHHHHHH-hCCccHHHHHHHHHHHHHHH
Confidence            34667788888642 11 1100 00011122356999999996


No 59 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=77.09  E-value=0.9  Score=39.80  Aligned_cols=18  Identities=33%  Similarity=0.547  Sum_probs=13.8

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 043639          173 LESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      -.-|++||+||+|  -|...
T Consensus        72 ~rFtlAHELGH~l--lH~~~   89 (213)
T COG2856          72 KRFTLAHELGHAL--LHTDL   89 (213)
T ss_pred             HHHHHHHHHhHHH--hcccc
Confidence            3579999999995  56554


No 60 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=76.69  E-value=1.4  Score=40.92  Aligned_cols=23  Identities=26%  Similarity=0.400  Sum_probs=19.6

Q ss_pred             cccchhhhhhhhhhhcCCCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      ...+..|++||++|-+|...+.+
T Consensus       193 ~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  193 PYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             cccccHHHHHHHHHHhCCCCHHH
Confidence            45567899999999999998865


No 61 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=75.03  E-value=1.6  Score=38.15  Aligned_cols=29  Identities=34%  Similarity=0.561  Sum_probs=25.3

Q ss_pred             hhhhhhhhhhhcCCCCCCCCCCccccccC
Q 043639          174 ESVAVHEIGHLLGLTHSPVKEAVMYPTLK  202 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~svM~p~y~  202 (261)
                      ..+.-|++||++||+|.+.+..||+|.+.
T Consensus       154 al~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         154 ALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             HHHHHHHhhhccCcccccccccccCCccc
Confidence            46788999999999999999999986554


No 62 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=74.75  E-value=1  Score=39.65  Aligned_cols=14  Identities=36%  Similarity=0.570  Sum_probs=11.9

Q ss_pred             cchhhhhhhhhhhc
Q 043639          172 DLESVAVHEIGHLL  185 (261)
Q Consensus       172 ~l~~Va~HEiGHaL  185 (261)
                      .+..|+.||+||+|
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999985


No 63 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=74.22  E-value=2.6  Score=44.10  Aligned_cols=48  Identities=17%  Similarity=0.196  Sum_probs=31.1

Q ss_pred             ccchhhhhhhhhhhcCCCCCCCCC------CccccccCCCCCCCCCCHHHHHHH
Q 043639          171 VDLESVAVHEIGHLLGLTHSPVKE------AVMYPTLKPRDKKVDLALDDIQGV  218 (261)
Q Consensus       171 ~~l~~Va~HEiGHaLGL~Hs~~~~------svM~p~y~~~~~~~~l~~dDI~~i  218 (261)
                      ...-.+++||+||.+|+.|-...+      .||.|...........+..+..-+
T Consensus       315 l~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~~~~~~~WS~CS~~~~  368 (845)
T KOG3538|consen  315 LPSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCGGNHELTWSECSREYL  368 (845)
T ss_pred             CccceeeeeccccccCcccCCcCCCCCCcccccCcccccCCCCccCCcchHHHH
Confidence            334579999999999999986543      588886654333333444433333


No 64 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=74.06  E-value=15  Score=35.29  Aligned_cols=24  Identities=21%  Similarity=0.319  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHhHhcCCceEEEcCC
Q 043639           88 KAVFKRAFSKWASVIPVSFIETDD  111 (261)
Q Consensus        88 ~~~i~~Af~~Ws~vt~l~F~ev~~  111 (261)
                      ..+++.-|..-+.+=+++|+++..
T Consensus       107 ~~v~~gl~~~~~~lfgi~~~~~~~  130 (422)
T cd06456         107 EKVLDGLFELAERLYGITFKERTD  130 (422)
T ss_pred             HHHHHHHHHHHHHHcCeeEEECCC
Confidence            345556667777788899988764


No 65 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=71.49  E-value=18  Score=35.11  Aligned_cols=25  Identities=4%  Similarity=0.109  Sum_probs=18.9

Q ss_pred             HHHHHHHHHHHhHhcCCceEEEcCC
Q 043639           87 IKAVFKRAFSKWASVIPVSFIETDD  111 (261)
Q Consensus        87 ~~~~i~~Af~~Ws~vt~l~F~ev~~  111 (261)
                      +..+++--|..-+.+=+|+|+++..
T Consensus       132 l~~vl~gl~~~~~~lfgi~~~~~~~  156 (458)
T cd06457         132 LGTVMEGLSRLFSRLYGIRLVPVPL  156 (458)
T ss_pred             HHHHHHHHHHHHHHHhCeEEEecCC
Confidence            3456666777788888999999863


No 66 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=70.98  E-value=1.6  Score=37.30  Aligned_cols=21  Identities=33%  Similarity=0.520  Sum_probs=16.0

Q ss_pred             cccchhhhhhhhhhhcCCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      ...+..|++||+||.. .+|.-
T Consensus        86 ~~el~aVlaHElgH~~-~~h~~  106 (226)
T PF01435_consen   86 EDELAAVLAHELGHIK-HRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHHHH-TTHCC
T ss_pred             HHHHHHHHHHHHHHHH-cCCcc
Confidence            3457899999999997 35554


No 67 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=70.89  E-value=3.4  Score=30.32  Aligned_cols=30  Identities=13%  Similarity=0.319  Sum_probs=18.9

Q ss_pred             CceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639          147 SGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL  185 (261)
Q Consensus       147 ~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL  185 (261)
                      ..+|+|...+. .        .....-..+++||+.|++
T Consensus        44 G~~I~f~~g~~-~--------~~s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen   44 GNDIYFAPGKY-N--------PDSPEGRALLAHELAHVV   73 (79)
T ss_pred             CCEEEEcCCCc-C--------CCCCCcchhHhHHHHHHH
Confidence            46788865442 1        012223689999999985


No 68 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=70.77  E-value=1.5  Score=36.87  Aligned_cols=20  Identities=35%  Similarity=0.439  Sum_probs=14.5

Q ss_pred             hhhhhhhhhhhcCCCCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHSPVK  193 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~  193 (261)
                      ..+++||+||++=-....++
T Consensus         8 i~i~~HE~gH~~~a~~~G~~   27 (192)
T PF02163_consen    8 ISIVLHELGHALAARLYGDK   27 (192)
T ss_dssp             HHHHHHHHHHHHHHHTTT--
T ss_pred             cccccccccccccccccccc
Confidence            57899999999866655544


No 69 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=70.32  E-value=1.7  Score=37.07  Aligned_cols=12  Identities=50%  Similarity=0.844  Sum_probs=10.4

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..|++||+||.+
T Consensus        10 ~~v~iHElGH~~   21 (182)
T cd06163          10 ILIFVHELGHFL   21 (182)
T ss_pred             HHHHHHHHHHHH
Confidence            478999999985


No 70 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=69.91  E-value=1.7  Score=36.40  Aligned_cols=16  Identities=50%  Similarity=0.638  Sum_probs=12.4

Q ss_pred             hhhhhhhhhhhcCCCC
Q 043639          174 ESVAVHEIGHLLGLTH  189 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~H  189 (261)
                      ..+.+||+||++=-..
T Consensus         9 i~i~iHE~gH~~~A~~   24 (180)
T cd05709           9 ISVTVHELGHALVARR   24 (180)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5789999999984433


No 71 
>PRK02870 heat shock protein HtpX; Provisional
Probab=67.80  E-value=3  Score=39.01  Aligned_cols=15  Identities=33%  Similarity=0.676  Sum_probs=12.8

Q ss_pred             cccchhhhhhhhhhh
Q 043639          170 AVDLESVAVHEIGHL  184 (261)
Q Consensus       170 ~~~l~~Va~HEiGHa  184 (261)
                      ...+..|++||+||.
T Consensus       170 ~dEL~aVlAHELgHi  184 (336)
T PRK02870        170 RDELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345789999999998


No 72 
>PRK03982 heat shock protein HtpX; Provisional
Probab=67.77  E-value=1.9  Score=39.18  Aligned_cols=16  Identities=38%  Similarity=0.594  Sum_probs=13.1

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+..|++||+||.-
T Consensus       122 ~~El~AVlAHElgHi~  137 (288)
T PRK03982        122 EDELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3457899999999974


No 73 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=66.95  E-value=2.2  Score=38.34  Aligned_cols=16  Identities=44%  Similarity=0.754  Sum_probs=13.2

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+..|++||+||..
T Consensus       154 ~dEl~aVlaHElgHi~  169 (302)
T COG0501         154 DDELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3457899999999983


No 74 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=66.16  E-value=2.2  Score=36.94  Aligned_cols=16  Identities=44%  Similarity=0.594  Sum_probs=12.4

Q ss_pred             chhhhhhhhhhhcCCC
Q 043639          173 LESVAVHEIGHLLGLT  188 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~  188 (261)
                      +..|+.||+||++=-.
T Consensus        38 ~~~v~iHElgH~~~A~   53 (208)
T cd06161          38 FLSVLLHELGHALVAR   53 (208)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4589999999986433


No 75 
>PRK03001 M48 family peptidase; Provisional
Probab=65.84  E-value=2.2  Score=38.67  Aligned_cols=16  Identities=25%  Similarity=0.536  Sum_probs=13.1

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+..|++||+||.-
T Consensus       121 ~~El~aVlAHElgHi~  136 (283)
T PRK03001        121 EREIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3457899999999973


No 76 
>PRK01345 heat shock protein HtpX; Provisional
Probab=64.90  E-value=2.4  Score=39.33  Aligned_cols=15  Identities=27%  Similarity=0.543  Sum_probs=13.0

Q ss_pred             ccchhhhhhhhhhhc
Q 043639          171 VDLESVAVHEIGHLL  185 (261)
Q Consensus       171 ~~l~~Va~HEiGHaL  185 (261)
                      ..+..|++|||||.-
T Consensus       122 dEL~aVlAHElgHi~  136 (317)
T PRK01345        122 EEVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457899999999985


No 77 
>PRK03072 heat shock protein HtpX; Provisional
Probab=64.88  E-value=2.4  Score=38.71  Aligned_cols=16  Identities=31%  Similarity=0.522  Sum_probs=13.2

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+..|++||+||.-
T Consensus       124 ~~El~aVlAHElgHi~  139 (288)
T PRK03072        124 ERELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3457899999999974


No 78 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=63.79  E-value=2.6  Score=39.82  Aligned_cols=19  Identities=16%  Similarity=0.141  Sum_probs=13.3

Q ss_pred             cCCHHHHHHHHHHHHHHhH
Q 043639           81 YLHITDIKAVFKRAFSKWA   99 (261)
Q Consensus        81 ~l~~~~~~~~i~~Af~~Ws   99 (261)
                      ..+.++..+.+.++++.|.
T Consensus       153 ~~~~~~~~~~v~~~~~~l~  171 (427)
T cd06459         153 KYTYEEAKELVLEALSPLG  171 (427)
T ss_pred             cCcHHHHHHHHHHHHHccC
Confidence            3455677788888887763


No 79 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=62.64  E-value=2.8  Score=38.98  Aligned_cols=14  Identities=36%  Similarity=0.515  Sum_probs=11.7

Q ss_pred             cchhhhhhhhhhhc
Q 043639          172 DLESVAVHEIGHLL  185 (261)
Q Consensus       172 ~l~~Va~HEiGHaL  185 (261)
                      +-..++.||+||++
T Consensus       153 ~~v~tl~HE~GHa~  166 (365)
T cd06258         153 DDINTLFHEFGHAV  166 (365)
T ss_pred             HHHHHHHHHHhHHH
Confidence            34579999999998


No 80 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=62.22  E-value=2.9  Score=40.24  Aligned_cols=25  Identities=20%  Similarity=0.344  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHHhHhcCCceEEEcCC
Q 043639           87 IKAVFKRAFSKWASVIPVSFIETDD  111 (261)
Q Consensus        87 ~~~~i~~Af~~Ws~vt~l~F~ev~~  111 (261)
                      +.++++.-+...+.+=+|+|+++..
T Consensus       136 ~~~~~~g~~~l~~~lfgi~~~~~~~  160 (458)
T PF01432_consen  136 LEEVLEGLFELAERLFGIRFEEVPD  160 (458)
T ss_dssp             HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred             HHHHHHHHHHHHHHHhcEEEEeccc
Confidence            4456666677888888899998865


No 81 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=61.56  E-value=3  Score=36.72  Aligned_cols=14  Identities=50%  Similarity=0.681  Sum_probs=11.5

Q ss_pred             hhhhhhhhhhhcCC
Q 043639          174 ESVAVHEIGHLLGL  187 (261)
Q Consensus       174 ~~Va~HEiGHaLGL  187 (261)
                      ..|++||+||++=-
T Consensus        54 ~~v~iHElgH~~~A   67 (227)
T cd06164          54 ASVLLHELGHSLVA   67 (227)
T ss_pred             HHHHHHHHHHHHHH
Confidence            57999999998643


No 82 
>PRK04897 heat shock protein HtpX; Provisional
Probab=61.46  E-value=3  Score=38.23  Aligned_cols=16  Identities=44%  Similarity=0.586  Sum_probs=13.2

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+..|++||+||.-
T Consensus       134 ~~El~aVlAHElgHi~  149 (298)
T PRK04897        134 REELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            3457899999999974


No 83 
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=60.43  E-value=4.9  Score=36.40  Aligned_cols=22  Identities=27%  Similarity=0.471  Sum_probs=17.0

Q ss_pred             cccchhhhhhhhhhhcCCCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      ..+.+.++.||+||.-|=.+--
T Consensus       200 a~~yyaTl~HElghwtgh~~rl  221 (316)
T COG4227         200 AINYYATLLHELGHWTGHEARL  221 (316)
T ss_pred             hHhHHHHHHHHhccccCchhhh
Confidence            4567899999999998755443


No 84 
>PRK05457 heat shock protein HtpX; Provisional
Probab=60.39  E-value=3.2  Score=37.83  Aligned_cols=15  Identities=40%  Similarity=0.685  Sum_probs=12.7

Q ss_pred             cccchhhhhhhhhhh
Q 043639          170 AVDLESVAVHEIGHL  184 (261)
Q Consensus       170 ~~~l~~Va~HEiGHa  184 (261)
                      ...+..|++||+||.
T Consensus       131 ~~El~aVlAHElgHi  145 (284)
T PRK05457        131 RDEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            346789999999996


No 85 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=60.07  E-value=3.3  Score=41.07  Aligned_cols=13  Identities=38%  Similarity=0.468  Sum_probs=11.3

Q ss_pred             chhhhhhhhhhhc
Q 043639          173 LESVAVHEIGHLL  185 (261)
Q Consensus       173 l~~Va~HEiGHaL  185 (261)
                      =..|++||+||++
T Consensus       337 dv~TL~HElGHa~  349 (549)
T TIGR02289       337 DIDVLTHEAGHAF  349 (549)
T ss_pred             HHHHHHHHhhHHH
Confidence            3579999999997


No 86 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=59.95  E-value=3.4  Score=40.25  Aligned_cols=12  Identities=33%  Similarity=0.664  Sum_probs=10.7

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..|+.||+||+|
T Consensus       264 V~TLfHEfGHal  275 (472)
T cd06455         264 VETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHH
Confidence            568999999998


No 87 
>PRK02391 heat shock protein HtpX; Provisional
Probab=59.50  E-value=3.4  Score=37.92  Aligned_cols=15  Identities=40%  Similarity=0.705  Sum_probs=12.6

Q ss_pred             cccchhhhhhhhhhh
Q 043639          170 AVDLESVAVHEIGHL  184 (261)
Q Consensus       170 ~~~l~~Va~HEiGHa  184 (261)
                      ...+..|++||+||.
T Consensus       130 ~~El~aVlaHElgHi  144 (296)
T PRK02391        130 PDELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345789999999995


No 88 
>PRK01265 heat shock protein HtpX; Provisional
Probab=59.07  E-value=3.5  Score=38.43  Aligned_cols=15  Identities=47%  Similarity=0.795  Sum_probs=12.6

Q ss_pred             cccchhhhhhhhhhh
Q 043639          170 AVDLESVAVHEIGHL  184 (261)
Q Consensus       170 ~~~l~~Va~HEiGHa  184 (261)
                      ...+..|++||+||.
T Consensus       137 ~~El~aVlAHElgHi  151 (324)
T PRK01265        137 RDEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            345789999999995


No 89 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=58.69  E-value=3.6  Score=37.49  Aligned_cols=13  Identities=46%  Similarity=0.572  Sum_probs=11.1

Q ss_pred             hhhhhhhhhhhcC
Q 043639          174 ESVAVHEIGHLLG  186 (261)
Q Consensus       174 ~~Va~HEiGHaLG  186 (261)
                      ..|++||+||++=
T Consensus       136 isvvvHElgHal~  148 (277)
T cd06162         136 ISGVVHEMGHGVA  148 (277)
T ss_pred             HHHHHHHHHHHHH
Confidence            5799999999963


No 90 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=58.41  E-value=3.9  Score=36.98  Aligned_cols=17  Identities=29%  Similarity=0.296  Sum_probs=12.8

Q ss_pred             hhhhhhhhhhhcCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs  190 (261)
                      ..+++||+||++=-.+.
T Consensus       119 isv~iHElgHa~~Ar~~  135 (263)
T cd06159         119 VGVVVHELSHGILARVE  135 (263)
T ss_pred             HHHHHHHHHHHHHHHHc
Confidence            57999999999644433


No 91 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=58.09  E-value=3.7  Score=40.97  Aligned_cols=13  Identities=31%  Similarity=0.457  Sum_probs=11.3

Q ss_pred             chhhhhhhhhhhc
Q 043639          173 LESVAVHEIGHLL  185 (261)
Q Consensus       173 l~~Va~HEiGHaL  185 (261)
                      -..|++||+||++
T Consensus       378 dv~TLaHElGHa~  390 (591)
T TIGR00181       378 SVFTLAHELGHSM  390 (591)
T ss_pred             hHHHHHHHhhhHH
Confidence            3689999999996


No 92 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=58.05  E-value=3.8  Score=40.99  Aligned_cols=12  Identities=42%  Similarity=0.639  Sum_probs=10.9

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..|++||+||++
T Consensus       376 v~TL~HE~GHa~  387 (587)
T TIGR02290       376 VSTLAHELGHAY  387 (587)
T ss_pred             HHHHHHHhhHHH
Confidence            579999999998


No 93 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=57.34  E-value=4  Score=30.96  Aligned_cols=14  Identities=36%  Similarity=0.482  Sum_probs=12.0

Q ss_pred             cchhhhhhhhhhhc
Q 043639          172 DLESVAVHEIGHLL  185 (261)
Q Consensus       172 ~l~~Va~HEiGHaL  185 (261)
                      .+..|++||+.|.+
T Consensus        24 ~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   24 WLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35699999999996


No 94 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=56.82  E-value=4.2  Score=33.60  Aligned_cols=12  Identities=33%  Similarity=0.445  Sum_probs=10.2

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      .-|++|||+|.+
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            469999999974


No 95 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=53.73  E-value=4.9  Score=34.25  Aligned_cols=20  Identities=35%  Similarity=0.614  Sum_probs=15.0

Q ss_pred             hhhhhhhhhhhcCCCCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHSPVK  193 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~  193 (261)
                      ..+.+||+||++--..-.++
T Consensus        42 ~~l~iHElgH~~~A~~~G~~   61 (183)
T cd06160          42 AILGIHEMGHYLAARRHGVK   61 (183)
T ss_pred             HHHHHHHHHHHHHHHHCCCC
Confidence            46889999999876554444


No 96 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=53.54  E-value=5  Score=38.33  Aligned_cols=17  Identities=41%  Similarity=0.655  Sum_probs=12.8

Q ss_pred             hhhhhhhhhhhcCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHS  190 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs  190 (261)
                      ..+.+||+||||--.-.
T Consensus       132 i~~vvHElGHalAA~se  148 (484)
T KOG2921|consen  132 ITVVVHELGHALAAASE  148 (484)
T ss_pred             HHHHHHHhhHHHHHHhc
Confidence            56889999999854433


No 97 
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=52.05  E-value=12  Score=33.94  Aligned_cols=15  Identities=47%  Similarity=0.607  Sum_probs=11.4

Q ss_pred             hhhhhhhhhhh-cCCC
Q 043639          174 ESVAVHEIGHL-LGLT  188 (261)
Q Consensus       174 ~~Va~HEiGHa-LGL~  188 (261)
                      -.|++||.+|. =|+-
T Consensus       150 grv~VhEWAhlRWGVF  165 (262)
T PF08434_consen  150 GRVFVHEWAHLRWGVF  165 (262)
T ss_pred             ceeeeehhhhhccccc
Confidence            47999999996 3553


No 98 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=47.93  E-value=6.8  Score=39.50  Aligned_cols=12  Identities=33%  Similarity=0.526  Sum_probs=10.7

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..|++||+||++
T Consensus       381 V~TLaHElGHs~  392 (598)
T COG1164         381 VFTLAHELGHSV  392 (598)
T ss_pred             HHHHHHHccHHH
Confidence            589999999994


No 99 
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=47.92  E-value=27  Score=36.44  Aligned_cols=49  Identities=14%  Similarity=0.281  Sum_probs=31.5

Q ss_pred             CcccCCCCeeEEEEEecCCcC----Cc-----------CCHHHHHHHHHHHHHHhHhcCCceEEEcC
Q 043639           59 PRWTRHVPMTLSYAFSKENMI----NY-----------LHITDIKAVFKRAFSKWASVIPVSFIETD  110 (261)
Q Consensus        59 ~kW~~~~~~~LTY~~~~~~~~----~~-----------l~~~~~~~~i~~Af~~Ws~vt~l~F~ev~  110 (261)
                      ..|.+.   +-.|.+......    |.           .+..+..+--.+.|+...+.+.|.|....
T Consensus       128 ~~w~~~---~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~  191 (775)
T PF03272_consen  128 SEWDDS---DSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSD  191 (775)
T ss_pred             hhhhhc---CCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCc
Confidence            478877   788887754321    11           12233445557888999999999987543


No 100
>PLN02791 Nudix hydrolase homolog
Probab=47.55  E-value=7.6  Score=40.32  Aligned_cols=20  Identities=20%  Similarity=0.262  Sum_probs=16.4

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 043639          173 LESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      ...|++||+||.+|=.+.-.
T Consensus       585 ~~~v~lHElgHGsG~~~~~~  604 (770)
T PLN02791        585 FTHTICHECCHGIGPHTITL  604 (770)
T ss_pred             HHHHHHHHhhccccccceec
Confidence            36899999999999877543


No 101
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=47.35  E-value=7.2  Score=33.09  Aligned_cols=12  Identities=25%  Similarity=0.548  Sum_probs=10.8

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      .++.+||+||++
T Consensus        10 ~~i~~HE~aHa~   21 (181)
T cd06158          10 LAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHH
Confidence            578999999997


No 102
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=46.91  E-value=7.4  Score=37.30  Aligned_cols=12  Identities=42%  Similarity=0.598  Sum_probs=10.1

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..+++||+||+|
T Consensus       160 l~t~iHE~GHal  171 (396)
T cd06460         160 LFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHhhHHH
Confidence            458999999985


No 103
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=46.68  E-value=7.5  Score=37.35  Aligned_cols=12  Identities=50%  Similarity=0.866  Sum_probs=10.1

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      ..|++||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            469999999964


No 104
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=45.65  E-value=7.9  Score=37.45  Aligned_cols=11  Identities=45%  Similarity=0.857  Sum_probs=9.2

Q ss_pred             hhhhhhhhhhh
Q 043639          174 ESVAVHEIGHL  184 (261)
Q Consensus       174 ~~Va~HEiGHa  184 (261)
                      .-|++||+||-
T Consensus        16 ~li~vHElGHf   26 (449)
T PRK10779         16 VLITVHEFGHF   26 (449)
T ss_pred             HHHHHHHHHHH
Confidence            35889999995


No 105
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=44.25  E-value=68  Score=32.91  Aligned_cols=33  Identities=21%  Similarity=0.358  Sum_probs=23.1

Q ss_pred             HHHHHHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639           88 KAVFKRAFSKWASVIPVSFIETDDY--DFADIRIG  120 (261)
Q Consensus        88 ~~~i~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~  120 (261)
                      ..+|.-.|..-+..=+|+|+|..+.  -..|+++.
T Consensus       364 ~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~  398 (683)
T COG0339         364 NKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF  398 (683)
T ss_pred             hHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence            3566677788888889999997431  35677654


No 106
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=43.14  E-value=9.1  Score=36.93  Aligned_cols=15  Identities=40%  Similarity=0.674  Sum_probs=12.8

Q ss_pred             cccchhhhhhhhhhh
Q 043639          170 AVDLESVAVHEIGHL  184 (261)
Q Consensus       170 ~~~l~~Va~HEiGHa  184 (261)
                      ...+..|+.||+||.
T Consensus       277 ~eel~AVl~HELGHW  291 (428)
T KOG2719|consen  277 NEELVAVLAHELGHW  291 (428)
T ss_pred             cHHHHHHHHHHhhHH
Confidence            446789999999997


No 107
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.12  E-value=8.4  Score=33.41  Aligned_cols=16  Identities=31%  Similarity=0.528  Sum_probs=13.1

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ......+|+||-||||
T Consensus        25 ~~~~~~~A~HEAGhAv   40 (213)
T PF01434_consen   25 EEEKRRIAYHEAGHAV   40 (213)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3456789999999986


No 108
>PRK10911 oligopeptidase A; Provisional
Probab=41.81  E-value=9.7  Score=39.00  Aligned_cols=22  Identities=9%  Similarity=0.087  Sum_probs=14.9

Q ss_pred             HHHHHHHHHhHhcCCceEEEcC
Q 043639           89 AVFKRAFSKWASVIPVSFIETD  110 (261)
Q Consensus        89 ~~i~~Af~~Ws~vt~l~F~ev~  110 (261)
                      .+++.-|..-+..=+++|.|+.
T Consensus       361 ~v~~gl~~~~~~Lfgi~~~e~~  382 (680)
T PRK10911        361 KAVNGLFEVVKRIYGITAKERK  382 (680)
T ss_pred             HHHHHHHHHHHHHcCeeEEecC
Confidence            4445556666677788998864


No 109
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=39.78  E-value=79  Score=32.37  Aligned_cols=32  Identities=25%  Similarity=0.386  Sum_probs=25.4

Q ss_pred             HHHHHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639           89 AVFKRAFSKWASVIPVSFIETDDY--DFADIRIG  120 (261)
Q Consensus        89 ~~i~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~  120 (261)
                      .++..-|...+..-+|+|.|+++.  -.+|+++.
T Consensus       392 ~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y  425 (718)
T KOG2089|consen  392 VVLSGLFGIYQTLFGLKFEEATDAEVWHADVRVY  425 (718)
T ss_pred             HHHHHHHHHHHHHhCceeeecCCchhcccceeEE
Confidence            466777888899999999999873  36788854


No 110
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.45  E-value=20  Score=35.91  Aligned_cols=12  Identities=50%  Similarity=0.650  Sum_probs=10.4

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      -.|+.|||||+.
T Consensus       488 GaVIgHEI~HgF  499 (654)
T COG3590         488 GAVIGHEIGHGF  499 (654)
T ss_pred             cceehhhhcccc
Confidence            489999999974


No 111
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=36.18  E-value=15  Score=25.38  Aligned_cols=20  Identities=35%  Similarity=0.602  Sum_probs=16.8

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 043639          173 LESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      ...-++||+.+.+||.|.+.
T Consensus        29 ~eR~~vH~lA~~~gL~s~S~   48 (60)
T cd02641          29 HDRLLVHELAEELGLRHEST   48 (60)
T ss_pred             HHHHHHHHHHHHcCCceEee
Confidence            35789999999999998653


No 112
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=35.82  E-value=14  Score=36.94  Aligned_cols=14  Identities=43%  Similarity=0.643  Sum_probs=10.7

Q ss_pred             hhhhhhhh-hhhcCC
Q 043639          174 ESVAVHEI-GHLLGL  187 (261)
Q Consensus       174 ~~Va~HEi-GHaLGL  187 (261)
                      ..|++||+ ||..|=
T Consensus       294 ~~v~lHEllGHGsGk  308 (549)
T PF03571_consen  294 VQVGLHELLGHGSGK  308 (549)
T ss_dssp             HHHHHHHHCCCCS--
T ss_pred             hhhhHHhhccCcCcc
Confidence            46999998 999984


No 113
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=34.59  E-value=17  Score=32.66  Aligned_cols=20  Identities=30%  Similarity=0.476  Sum_probs=14.2

Q ss_pred             cccchhhhhhhhhhhcCCCC
Q 043639          170 AVDLESVAVHEIGHLLGLTH  189 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaLGL~H  189 (261)
                      ..|-.+|++||+.|.|=+..
T Consensus       151 ~~dg~NVviHEfAH~LD~~~  170 (253)
T PF06167_consen  151 PNDGHNVVIHEFAHKLDMED  170 (253)
T ss_dssp             SSSS--HHHHHHHHHHHCTT
T ss_pred             CCCCcchHHHHHHHHHHhhc
Confidence            44567999999999996653


No 114
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=31.39  E-value=18  Score=33.96  Aligned_cols=20  Identities=35%  Similarity=0.489  Sum_probs=15.2

Q ss_pred             cchhhhhhhhhhhcCCCCCCC
Q 043639          172 DLESVAVHEIGHLLGLTHSPV  192 (261)
Q Consensus       172 ~l~~Va~HEiGHaLGL~Hs~~  192 (261)
                      .+..|+.||++|++ +.|+.+
T Consensus       274 glAtvLgHE~aHaV-arH~AE  293 (424)
T KOG2661|consen  274 GLATVLGHEIAHAV-ARHAAE  293 (424)
T ss_pred             HHHHHHHHHHHHHH-HHHHHH
Confidence            35789999999996 466543


No 115
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=30.75  E-value=19  Score=36.98  Aligned_cols=34  Identities=21%  Similarity=0.358  Sum_probs=22.0

Q ss_pred             HHHHH-HHHHHHhHhcCCceEEEcCCC--CCcceEEE
Q 043639           87 IKAVF-KRAFSKWASVIPVSFIETDDY--DFADIRIG  120 (261)
Q Consensus        87 ~~~~i-~~Af~~Ws~vt~l~F~ev~~~--~~aDI~I~  120 (261)
                      +..++ +-.|..-+..=+|+|+|+.+.  -..|++..
T Consensus       362 l~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~~  398 (681)
T PRK10280        362 LNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVW  398 (681)
T ss_pred             HHHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeEE
Confidence            33555 356777788889999997531  34566543


No 116
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=29.14  E-value=22  Score=32.49  Aligned_cols=41  Identities=22%  Similarity=0.250  Sum_probs=21.9

Q ss_pred             cCCCCceeeecccccccccCCCCCCCcc-ccchhhhhhhhhhhc
Q 043639          143 FSPESGKFHLDAAETWAVDFGSEKSPVA-VDLESVAVHEIGHLL  185 (261)
Q Consensus       143 ~~P~~G~vhfd~~e~wt~~~~~~~~~~~-~~l~~Va~HEiGHaL  185 (261)
                      |-|.+.++.||....=.+...  ....+ .-...|++||+||.+
T Consensus       138 YCP~D~kvYlDlsFf~~m~~~--fga~GdfAqaYViAHEVGHHV  179 (295)
T COG2321         138 YCPADTKVYLDLSFFDEMKTK--FGASGDFAQAYVIAHEVGHHV  179 (295)
T ss_pred             ecCCCceEEEehhHHHHHHHH--hcCCccHHHHHHHHhhhhHHH
Confidence            457777888876321111100  01112 223579999999963


No 117
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=28.59  E-value=26  Score=24.45  Aligned_cols=21  Identities=33%  Similarity=0.390  Sum_probs=17.9

Q ss_pred             hhhhhhhhhhhcCCCCCCCCC
Q 043639          174 ESVAVHEIGHLLGLTHSPVKE  194 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~Hs~~~~  194 (261)
                      ..-++|.+-|-|||.|.++.+
T Consensus        30 eRriih~la~~lGL~~~s~G~   50 (60)
T cd02639          30 ERRIVHLLASRLGLNHVSDGT   50 (60)
T ss_pred             HHHHHHHHHHHcCCceEEeCC
Confidence            568999999999999987654


No 118
>PHA02150 hypothetical protein
Probab=27.75  E-value=34  Score=24.35  Aligned_cols=33  Identities=27%  Similarity=0.516  Sum_probs=21.3

Q ss_pred             EEEEecCCcCCcCCHHHHHHHHHHHHHHhHhcC
Q 043639           70 SYAFSKENMINYLHITDIKAVFKRAFSKWASVI  102 (261)
Q Consensus        70 TY~~~~~~~~~~l~~~~~~~~i~~Af~~Ws~vt  102 (261)
                      .|+|.++..-...+.+.....+..||+.|.++-
T Consensus         2 ~yrfvseeqfnaw~~qnya~efa~~f~~wd~if   34 (77)
T PHA02150          2 LYRFVSEEQFNAWSSQNYAKEFAKHFKKWDDVF   34 (77)
T ss_pred             ceeeccHHHhhHHHhhhHHHHHHHHhhhHhHhh
Confidence            477765432222334566778899999999873


No 119
>PF14891 Peptidase_M91:  Effector protein
Probab=25.73  E-value=29  Score=29.02  Aligned_cols=15  Identities=27%  Similarity=0.151  Sum_probs=12.2

Q ss_pred             chhhhhhhhhhhcCC
Q 043639          173 LESVAVHEIGHLLGL  187 (261)
Q Consensus       173 l~~Va~HEiGHaLGL  187 (261)
                      -+.++.||+.||.--
T Consensus       103 p~v~L~HEL~HA~~~  117 (174)
T PF14891_consen  103 PFVVLYHELIHAYDY  117 (174)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            368999999999644


No 120
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=25.63  E-value=30  Score=32.72  Aligned_cols=50  Identities=22%  Similarity=0.161  Sum_probs=23.6

Q ss_pred             CCceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhcCC
Q 043639          136 LGVLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLLGL  187 (261)
Q Consensus       136 gg~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaLGL  187 (261)
                      .|++|++. |..+.|.+=....+.+.... ....+.|.-.+++||+-|+.+.
T Consensus       262 ~~vlAYT~-p~~~~I~~Cp~ff~~lp~~~-~~C~~qDqatt~LHE~TH~~~V  311 (359)
T PF02102_consen  262 SGVLAYTL-PSQNQIVNCPIFFSDLPALS-NRCHAQDQATTTLHEMTHAPAV  311 (359)
T ss_dssp             TT--EEEE-GGGTEEEE-HHHHHH--SS---STT---HHHHHHHHHHT-TTT
T ss_pred             CCeEEEEE-cCCCeEEECchhhccCCCcc-ccccCCCccchhhhhhhccccc
Confidence            47899886 44455543221111122111 1344567789999999999886


No 121
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=25.19  E-value=30  Score=30.29  Aligned_cols=12  Identities=33%  Similarity=0.457  Sum_probs=10.3

Q ss_pred             hhhhhhhhhhhc
Q 043639          174 ESVAVHEIGHLL  185 (261)
Q Consensus       174 ~~Va~HEiGHaL  185 (261)
                      -+|++||++||-
T Consensus        94 gsiLAHE~mHa~  105 (212)
T PF12315_consen   94 GSILAHELMHAW  105 (212)
T ss_pred             hhHHHHHHHHHH
Confidence            579999999983


No 122
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=24.41  E-value=17  Score=30.23  Aligned_cols=39  Identities=21%  Similarity=0.207  Sum_probs=24.4

Q ss_pred             ceeeecCCCCceeeecccccccccCCCCCCCccccchhhhhhhhhhhc
Q 043639          138 VLAHSFSPESGKFHLDAAETWAVDFGSEKSPVAVDLESVAVHEIGHLL  185 (261)
Q Consensus       138 ~lAhA~~P~~G~vhfd~~e~wt~~~~~~~~~~~~~l~~Va~HEiGHaL  185 (261)
                      +.+-||++.+ +|-||....-  .      .+...+..|+.||+-|.+
T Consensus        35 taG~ayL~~~-~I~lNP~ll~--e------n~~~f~~~vV~HELaHl~   73 (156)
T COG3091          35 TAGGAYLLKS-EIRLNPKLLE--E------NGEDFIEQVVPHELAHLH   73 (156)
T ss_pred             hcchhhcccc-ccccCHHHHH--H------ccHHHHHHHHHHHHHHHH
Confidence            3444677766 7777653210  0      123457899999999974


No 123
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=24.02  E-value=32  Score=29.12  Aligned_cols=15  Identities=20%  Similarity=0.264  Sum_probs=12.1

Q ss_pred             hhhhhhhhhhhcCCC
Q 043639          174 ESVAVHEIGHLLGLT  188 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~  188 (261)
                      -.+++|||.|++.-.
T Consensus        37 G~ilahel~hafd~~   51 (206)
T PF01431_consen   37 GFILAHELMHAFDPE   51 (206)
T ss_dssp             HHHHHHHHHHCTSTT
T ss_pred             HHHHHHHHHHHHHHh
Confidence            479999999987553


No 124
>PF02128 Peptidase_M36:  Fungalysin metallopeptidase (M36);  InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=23.42  E-value=47  Score=31.71  Aligned_cols=15  Identities=27%  Similarity=0.463  Sum_probs=12.2

Q ss_pred             hhhhhhhhhhhcCCC
Q 043639          174 ESVAVHEIGHLLGLT  188 (261)
Q Consensus       174 ~~Va~HEiGHaLGL~  188 (261)
                      -.|++||.||.|--.
T Consensus       186 ~~Ii~HEy~HGiSnR  200 (378)
T PF02128_consen  186 NGIIAHEYGHGISNR  200 (378)
T ss_pred             cCeeEEeeccccccc
Confidence            579999999987543


No 125
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=23.37  E-value=29  Score=35.25  Aligned_cols=13  Identities=54%  Similarity=0.718  Sum_probs=11.4

Q ss_pred             chhhhhhhhhhhc
Q 043639          173 LESVAVHEIGHLL  185 (261)
Q Consensus       173 l~~Va~HEiGHaL  185 (261)
                      ...|+.||+||||
T Consensus       408 ~~~~a~he~gha~  420 (644)
T PRK10733        408 KESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHH
Confidence            4679999999997


No 126
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=23.04  E-value=34  Score=33.60  Aligned_cols=14  Identities=29%  Similarity=0.361  Sum_probs=12.0

Q ss_pred             hhhhhhhhhhhcCC
Q 043639          174 ESVAVHEIGHLLGL  187 (261)
Q Consensus       174 ~~Va~HEiGHaLGL  187 (261)
                      +.|+.||+||+...
T Consensus       248 ~~t~~HE~GH~~yy  261 (477)
T cd06461         248 FVTVHHEMGHIQYY  261 (477)
T ss_pred             HHHHHHHHHHHHHH
Confidence            67899999999854


No 127
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=22.88  E-value=36  Score=30.81  Aligned_cols=20  Identities=20%  Similarity=0.366  Sum_probs=16.5

Q ss_pred             cchhhhhhhhhhhcCCCCCC
Q 043639          172 DLESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       172 ~l~~Va~HEiGHaLGL~Hs~  191 (261)
                      +-.+|++||+-|.|=+.+..
T Consensus       141 dg~NVvIHEFAH~LDm~~G~  160 (260)
T PRK15410        141 SGFNLIIHEVAHKLDMRNGD  160 (260)
T ss_pred             CCcchhHhHHHhHhhhhcCC
Confidence            45799999999999887643


No 128
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=22.37  E-value=33  Score=30.96  Aligned_cols=11  Identities=36%  Similarity=0.341  Sum_probs=9.0

Q ss_pred             hhhhhhhhhhc
Q 043639          175 SVAVHEIGHLL  185 (261)
Q Consensus       175 ~Va~HEiGHaL  185 (261)
                      .-..|||||.+
T Consensus       221 WG~~HE~GH~~  231 (307)
T PF13402_consen  221 WGPWHELGHNH  231 (307)
T ss_dssp             HHHHHHHHHHH
T ss_pred             eeehhhhhhhc
Confidence            35899999985


No 129
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=22.03  E-value=34  Score=33.57  Aligned_cols=11  Identities=45%  Similarity=0.667  Sum_probs=9.0

Q ss_pred             hhhhhhhhhhc
Q 043639          175 SVAVHEIGHLL  185 (261)
Q Consensus       175 ~Va~HEiGHaL  185 (261)
                      .=++||.||||
T Consensus       261 ~g~iHE~GHAl  271 (497)
T COG2317         261 FGTIHETGHAL  271 (497)
T ss_pred             HHHHHhhhhHH
Confidence            34689999996


No 130
>CHL00176 ftsH cell division protein; Validated
Probab=21.89  E-value=34  Score=34.86  Aligned_cols=14  Identities=36%  Similarity=0.707  Sum_probs=11.9

Q ss_pred             cchhhhhhhhhhhc
Q 043639          172 DLESVAVHEIGHLL  185 (261)
Q Consensus       172 ~l~~Va~HEiGHaL  185 (261)
                      ....||.||.||||
T Consensus       437 ~~~~vA~hEaGhA~  450 (638)
T CHL00176        437 NKRLIAYHEVGHAI  450 (638)
T ss_pred             HHHHHHHHhhhhHH
Confidence            34679999999998


No 131
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=21.30  E-value=33  Score=32.48  Aligned_cols=16  Identities=31%  Similarity=0.692  Sum_probs=13.0

Q ss_pred             cccchhhhhhhhhhhc
Q 043639          170 AVDLESVAVHEIGHLL  185 (261)
Q Consensus       170 ~~~l~~Va~HEiGHaL  185 (261)
                      ...+-.|+.||+||..
T Consensus       121 ~sEvAAVl~HEmgHVt  136 (479)
T COG4784         121 SSEVAAVLAHEMGHVT  136 (479)
T ss_pred             HHHHHHHHHhhhhhee
Confidence            3456799999999985


No 132
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=21.24  E-value=34  Score=27.68  Aligned_cols=15  Identities=27%  Similarity=0.357  Sum_probs=12.8

Q ss_pred             ccchhhhhhhhhhhc
Q 043639          171 VDLESVAVHEIGHLL  185 (261)
Q Consensus       171 ~~l~~Va~HEiGHaL  185 (261)
                      ..+..|++||+-|++
T Consensus        57 ~~l~~~l~HEm~H~~   71 (146)
T smart00731       57 DRLRETLLHELCHAA   71 (146)
T ss_pred             HHHHhhHHHHHHHHH
Confidence            456789999999986


No 133
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.79  E-value=48  Score=23.00  Aligned_cols=19  Identities=26%  Similarity=0.433  Sum_probs=16.2

Q ss_pred             chhhhhhhhhhhcCCCCCC
Q 043639          173 LESVAVHEIGHLLGLTHSP  191 (261)
Q Consensus       173 l~~Va~HEiGHaLGL~Hs~  191 (261)
                      ...-++|++-+.+||.|.+
T Consensus        29 ~eR~~vH~~a~~~gL~s~S   47 (60)
T cd02640          29 EERALIHQIAQKYGLKSRS   47 (60)
T ss_pred             HHHHHHHHHHHHcCCceee
Confidence            3578999999999998864


No 134
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=20.14  E-value=38  Score=33.44  Aligned_cols=16  Identities=44%  Similarity=0.648  Sum_probs=11.5

Q ss_pred             hhhhhhhhhhhc---CCCC
Q 043639          174 ESVAVHEIGHLL---GLTH  189 (261)
Q Consensus       174 ~~Va~HEiGHaL---GL~H  189 (261)
                      ..-++||.||+|   ||..
T Consensus       260 l~s~iHE~GHalYEq~~~~  278 (494)
T PF02074_consen  260 LFSTIHETGHALYEQGLPP  278 (494)
T ss_dssp             HHHHHHHHHHHHHHHTS-G
T ss_pred             HHHHHHHHHHHHHHcCCCh
Confidence            346789999996   7753


Done!