Query         026597
Match_columns 236
No_of_seqs    107 out of 262
Neff          6.0 
Searched_HMMs 46136
Date          Fri Mar 29 10:05:17 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/026597.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/026597hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13398 Peptidase_M50B:  Pepti 100.0 9.2E-44   2E-48  306.2  21.2  195   25-226     2-199 (200)
  2 cd06161 S2P-M50_SpoIVFB SpoIVF  98.1 6.4E-06 1.4E-10   71.4   6.3   65   43-108    36-105 (208)
  3 cd06164 S2P-M50_SpoIVFB_CBS Sp  97.8 4.3E-05 9.3E-10   67.3   6.1   64   44-108    52-120 (227)
  4 cd06163 S2P-M50_PDZ_RseP-like   97.5 0.00026 5.6E-09   60.5   6.1   26   45-70      9-34  (182)
  5 cd06160 S2P-M50_like_2 Unchara  97.3 0.00047   1E-08   59.0   5.5   64   43-106    39-112 (183)
  6 PF02163 Peptidase_M50:  Peptid  97.2 0.00022 4.7E-09   60.2   2.9   28   45-72      7-34  (192)
  7 cd06159 S2P-M50_PDZ_Arch Uncha  97.2 0.00034 7.3E-09   63.1   4.3   63   45-107   118-191 (263)
  8 cd05709 S2P-M50 Site-2 proteas  97.1 0.00029 6.2E-09   59.1   2.8   29   45-73      8-36  (180)
  9 cd06162 S2P-M50_PDZ_SREBP Ster  96.9  0.0033 7.2E-08   57.2   7.6   62   45-106   135-207 (277)
 10 TIGR00054 RIP metalloprotease   96.3  0.0063 1.4E-07   58.1   5.2   63   45-108    14-118 (420)
 11 PRK10779 zinc metallopeptidase  95.7    0.02 4.3E-07   55.0   5.7   28   46-73     16-43  (449)
 12 KOG2921 Intramembrane metallop  93.7   0.068 1.5E-06   51.0   3.8   29   44-72    130-158 (484)
 13 PF01434 Peptidase_M41:  Peptid  93.6   0.064 1.4E-06   46.6   3.2   36   46-81     29-68  (213)
 14 PF00413 Peptidase_M10:  Matrix  91.0    0.12 2.7E-06   41.4   1.6   16   42-57    102-117 (154)
 15 cd06158 S2P-M50_like_1 Unchara  90.5    0.21 4.5E-06   42.5   2.7   29   45-73      9-41  (181)
 16 cd04279 ZnMc_MMP_like_1 Zinc-d  90.3    0.17 3.7E-06   41.3   1.9   17   42-58    101-117 (156)
 17 cd04268 ZnMc_MMP_like Zinc-dep  88.8    0.27 5.9E-06   39.9   2.0   15   44-58     93-107 (165)
 18 PF01432 Peptidase_M3:  Peptida  88.4    0.36 7.9E-06   46.1   2.8   24   43-66    240-263 (458)
 19 cd04278 ZnMc_MMP Zinc-dependen  88.3     0.2 4.3E-06   41.0   0.9   17   42-58    104-120 (157)
 20 cd04277 ZnMc_serralysin_like Z  86.7    0.34 7.5E-06   40.6   1.4   14   45-58    113-126 (186)
 21 CHL00176 ftsH cell division pr  86.2     0.9   2E-05   46.0   4.3   53   46-98    439-507 (638)
 22 cd06457 M3A_MIP Peptidase M3 m  85.9    0.58 1.2E-05   45.2   2.7   26   40-65    243-268 (458)
 23 PRK10280 dipeptidyl carboxypep  85.7    0.57 1.2E-05   47.7   2.6   26   40-65    459-484 (681)
 24 smart00235 ZnMc Zinc-dependent  85.7    0.39 8.5E-06   38.2   1.2   13   47-59     88-100 (140)
 25 cd06459 M3B_Oligoendopeptidase  85.3    0.49 1.1E-05   44.3   1.8   23   41-63    218-240 (427)
 26 COG0339 Dcp Zn-dependent oligo  85.0    0.64 1.4E-05   47.1   2.6   26   40-65    462-487 (683)
 27 PRK10733 hflB ATP-dependent me  84.9     1.2 2.7E-05   44.9   4.6   32   46-77    409-442 (644)
 28 PF14247 DUF4344:  Domain of un  83.9     0.9   2E-05   40.1   2.8   18   44-61     91-108 (220)
 29 cd04327 ZnMc_MMP_like_3 Zinc-d  83.1    0.79 1.7E-05   39.2   2.0   14   45-58     92-105 (198)
 30 cd06258 Peptidase_M3_like The   82.9     1.1 2.4E-05   41.3   3.1   29   40-68    149-177 (365)
 31 TIGR01241 FtsH_fam ATP-depende  82.9     1.7 3.8E-05   42.1   4.6   39   43-81    309-351 (495)
 32 cd06455 M3A_TOP Peptidase M3 T  82.8     1.1 2.4E-05   43.3   3.2   23   43-65    261-283 (472)
 33 cd06456 M3A_DCP_Oligopeptidase  82.8     1.1 2.4E-05   43.0   3.1   27   42-68    205-231 (422)
 34 cd00203 ZnMc Zinc-dependent me  82.5    0.65 1.4E-05   37.6   1.3   17   44-60     95-111 (167)
 35 PF13582 Reprolysin_3:  Metallo  81.7     0.7 1.5E-05   35.8   1.1   12   46-57    108-119 (124)
 36 PRK10911 oligopeptidase A; Pro  81.4     1.2 2.6E-05   45.4   2.9   21   43-63    461-481 (680)
 37 TIGR02290 M3_fam_3 oligoendope  81.4    0.85 1.8E-05   45.3   1.8   22   41-62    371-392 (587)
 38 PF01447 Peptidase_M4:  Thermol  76.8       2 4.4E-05   35.6   2.4   19   40-58    129-148 (150)
 39 TIGR02289 M3_not_pepF oligoend  76.6     1.9 4.2E-05   42.4   2.7   24   41-64    333-356 (549)
 40 COG2856 Predicted Zn peptidase  74.9     1.6 3.5E-05   38.3   1.5   18   41-58     68-85  (213)
 41 cd04280 ZnMc_astacin_like Zinc  74.8     1.6 3.4E-05   37.0   1.3   28   46-73     75-105 (180)
 42 TIGR00181 pepF oligoendopeptid  74.3     2.3 4.9E-05   42.2   2.5   23   41-63    374-396 (591)
 43 PF13583 Reprolysin_4:  Metallo  73.5     1.8 3.8E-05   37.5   1.3   21   44-64    136-156 (206)
 44 PF09471 Peptidase_M64:  IgA Pe  73.2     1.5 3.2E-05   39.8   0.8   18   44-61    215-232 (264)
 45 PF13574 Reprolysin_2:  Metallo  73.2     2.4 5.1E-05   35.6   2.0   20   45-64    111-130 (173)
 46 PF05572 Peptidase_M43:  Pregna  73.0     1.9   4E-05   35.8   1.3   16   42-57     66-81  (154)
 47 cd04283 ZnMc_hatching_enzyme Z  72.2     2.1 4.5E-05   36.7   1.5   15   46-60     78-92  (182)
 48 PF13688 Reprolysin_5:  Metallo  70.9     2.7 5.9E-05   35.2   1.8   20   41-60    138-157 (196)
 49 PF01400 Astacin:  Astacin (Pep  70.7     2.7 5.8E-05   35.8   1.8   30   45-74     79-111 (191)
 50 PF06114 DUF955:  Domain of unk  69.5       4 8.7E-05   30.4   2.4   21   40-60     37-57  (122)
 51 KOG2090 Metalloendopeptidase f  68.9     4.2 9.1E-05   41.0   2.9   28   34-61    468-495 (704)
 52 PF12388 Peptidase_M57:  Dual-a  68.8     2.4 5.1E-05   37.3   1.1   10   48-57    136-145 (211)
 53 PF10263 SprT-like:  SprT-like   68.6     4.4 9.4E-05   32.8   2.6   21   42-62     57-77  (157)
 54 cd04281 ZnMc_BMP1_TLD Zinc-dep  68.5     2.7   6E-05   36.4   1.4   14   46-59     88-101 (200)
 55 COG1164 Oligoendopeptidase F [  67.9     3.8 8.2E-05   41.2   2.5   28   41-68    376-403 (598)
 56 cd04269 ZnMc_adamalysin_II_lik  66.6     3.7   8E-05   34.5   1.8   17   42-58    128-144 (194)
 57 cd04267 ZnMc_ADAM_like Zinc-de  65.8     2.4 5.2E-05   35.5   0.5   22   42-63    130-151 (192)
 58 cd04270 ZnMc_TACE_like Zinc-de  65.5     3.9 8.5E-05   36.2   1.8   15   45-59    167-181 (244)
 59 PF02128 Peptidase_M36:  Fungal  63.5       1 2.2E-05   42.8  -2.4   30   47-76    187-216 (378)
 60 PF02031 Peptidase_M7:  Strepto  63.4     4.5 9.8E-05   33.0   1.6   10   48-57     80-89  (132)
 61 cd04282 ZnMc_meprin Zinc-depen  62.3     4.1 8.9E-05   36.2   1.3   13   46-58    121-133 (230)
 62 PF13485 Peptidase_MA_2:  Pepti  61.2     8.7 0.00019   28.8   2.8   22   43-64     23-44  (128)
 63 COG0465 HflB ATP-dependent Zn   60.3     7.8 0.00017   39.1   3.0   36   47-82    408-447 (596)
 64 cd06460 M32_Taq Peptidase fami  59.6     6.2 0.00013   37.8   2.1   20   41-60    155-174 (396)
 65 PF04298 Zn_peptidase_2:  Putat  58.9     5.9 0.00013   35.1   1.7   15   43-57     87-101 (222)
 66 cd04276 ZnMc_MMP_like_2 Zinc-d  58.0       7 0.00015   33.8   2.0   15   44-58    115-129 (197)
 67 KOG3854 SPRT-like metalloprote  57.8     7.4 0.00016   38.1   2.3   18   44-61    350-367 (505)
 68 TIGR03296 M6dom_TIGR03296 M6 f  57.8       2 4.3E-05   39.0  -1.5   12   46-57    166-177 (286)
 69 COG0750 Predicted membrane-ass  57.2      12 0.00026   34.4   3.6   30   46-75     14-43  (375)
 70 smart00731 SprT SprT homologue  56.7      10 0.00022   30.8   2.7   18   44-61     58-75  (146)
 71 cd04272 ZnMc_salivary_gland_MP  56.1     7.1 0.00015   33.7   1.8   17   43-59    143-159 (220)
 72 KOG0734 AAA+-type ATPase conta  55.3     2.7 5.9E-05   42.2  -1.1   41   42-82    554-596 (752)
 73 PF14891 Peptidase_M91:  Effect  55.3     9.4  0.0002   32.0   2.3   24   41-65     99-122 (174)
 74 PF01421 Reprolysin:  Reprolysi  55.0       8 0.00017   32.6   1.9   17   41-57    127-143 (199)
 75 cd04271 ZnMc_ADAM_fungal Zinc-  54.7     4.2 9.1E-05   35.8   0.1   12   47-58    147-158 (228)
 76 KOG3714 Meprin A metalloprotea  54.5     6.2 0.00014   37.8   1.2   29   46-74    160-191 (411)
 77 cd06461 M2_ACE Peptidase famil  52.5     8.4 0.00018   37.8   1.8   19   43-62    245-263 (477)
 78 PF01435 Peptidase_M48:  Peptid  50.3      11 0.00024   31.7   2.0   17   41-57     85-101 (226)
 79 PRK03982 heat shock protein Ht  47.0      12 0.00026   33.9   1.7   17   41-57    121-137 (288)
 80 PF04228 Zn_peptidase:  Putativ  45.6      19 0.00041   33.1   2.8   21   41-61    166-186 (292)
 81 COG5549 Predicted Zn-dependent  45.5      11 0.00024   33.4   1.3   19   40-58    182-200 (236)
 82 PRK03001 M48 family peptidase;  45.1      13 0.00029   33.5   1.7   17   41-57    120-136 (283)
 83 PF14521 Aspzincin_M35:  Lysine  44.4      13 0.00028   30.4   1.5   23   33-55     84-106 (148)
 84 COG0501 HtpX Zn-dependent prot  44.0      15 0.00033   32.5   1.9   17   41-57    153-169 (302)
 85 PF05548 Peptidase_M11:  Gameto  43.5      12 0.00026   34.7   1.3   15   44-58    149-163 (314)
 86 KOG2089 Metalloendopeptidase f  43.5      22 0.00047   36.4   3.1   30   40-69    490-519 (718)
 87 PRK03072 heat shock protein Ht  43.4      15 0.00031   33.5   1.7   16   41-56    123-138 (288)
 88 PF12725 DUF3810:  Protein of u  42.7      12 0.00026   34.7   1.1   18   40-57    191-208 (318)
 89 PRK01345 heat shock protein Ht  42.3      15 0.00033   33.9   1.7   16   42-57    121-136 (317)
 90 PF07998 Peptidase_M54:  Peptid  41.5      17 0.00036   31.6   1.7   16   42-57    142-157 (194)
 91 PRK04897 heat shock protein Ht  41.0      17 0.00036   33.2   1.7   16   41-56    133-148 (298)
 92 TIGR02185 Trep_Strep conserved  41.0 2.3E+02  0.0049   24.1  13.8   65  170-236   119-184 (189)
 93 PF10462 Peptidase_M66:  Peptid  40.5      12 0.00026   34.7   0.8   14   45-58    193-206 (305)
 94 PRK05457 heat shock protein Ht  39.4      18  0.0004   32.8   1.7   16   41-56    130-145 (284)
 95 cd04273 ZnMc_ADAMTS_like Zinc-  39.3     5.5 0.00012   34.0  -1.6   16   43-58    138-153 (207)
 96 PRK13267 archaemetzincin-like   38.9      20 0.00044   30.5   1.8   15   43-57    123-137 (179)
 97 KOG1565 Gelatinase A and relat  38.3      12 0.00026   36.6   0.4   33   42-76    208-240 (469)
 98 cd04275 ZnMc_pappalysin_like Z  37.5     3.6 7.8E-05   36.3  -3.1   17   42-58    134-150 (225)
 99 PRK02391 heat shock protein Ht  36.9      21 0.00046   32.6   1.7   16   41-56    129-144 (296)
100 PF12315 DUF3633:  Protein of u  35.2      23 0.00051   31.2   1.7   17   46-63     94-110 (212)
101 PRK02870 heat shock protein Ht  35.0      25 0.00054   33.0   1.9   16   41-56    169-184 (336)
102 PRK01265 heat shock protein Ht  34.6      24 0.00052   32.9   1.7   16   41-56    136-151 (324)
103 COG2738 Predicted Zn-dependent  34.2      25 0.00054   30.9   1.6   13   45-57     92-104 (226)
104 PRK04860 hypothetical protein;  30.3      50  0.0011   27.7   2.8   21   43-63     61-81  (160)
105 KOG2661 Peptidase family M48 [  30.1      38 0.00082   32.2   2.2   16   44-59    274-289 (424)
106 PHA02456 zinc metallopeptidase  29.6      35 0.00075   27.5   1.6   12   47-58     81-92  (141)
107 PF09527 ATPase_gene1:  Putativ  29.2 1.8E+02  0.0039   19.4   5.8   34  150-183    20-54  (55)
108 COG4783 Putative Zn-dependent   28.9      35 0.00076   33.6   1.9   17   41-57    126-142 (484)
109 PF02074 Peptidase_M32:  Carbox  28.5      42 0.00092   33.1   2.4   20   41-60    255-274 (494)
110 COG4784 Putative Zn-dependent   27.3      27 0.00058   33.4   0.7   16   44-59    123-138 (479)
111 PF05547 Peptidase_M6:  Immune   27.2     8.5 0.00019   39.2  -2.7   12   46-57    222-233 (645)
112 PF01457 Peptidase_M8:  Leishma  26.7      38 0.00083   33.4   1.7   15   44-58    209-223 (521)
113 PF13402 M60-like:  Peptidase M  26.3      33 0.00071   30.7   1.1   10   49-58    223-232 (307)
114 KOG2719 Metalloprotease [Gener  26.3      37  0.0008   33.0   1.5   13   44-56    279-291 (428)
115 PRK11383 hypothetical protein;  26.1 2.2E+02  0.0047   23.7   5.8   41  145-186    88-128 (145)
116 PF08325 WLM:  WLM domain;  Int  25.9      42 0.00091   28.8   1.6   14   44-57     81-94  (186)
117 PRK04351 hypothetical protein;  24.5      70  0.0015   26.5   2.7   22   42-64     58-79  (149)
118 PF13699 DUF4157:  Domain of un  24.1      47   0.001   24.4   1.4   12   46-57     62-73  (79)
119 COG4227 Antirestriction protei  23.6      40 0.00088   31.0   1.2   36   40-83    198-233 (316)
120 COG2317 Zn-dependent carboxype  23.4      52  0.0011   32.5   1.9   20   41-60    255-274 (497)
121 PF01863 DUF45:  Protein of unk  23.2      44 0.00094   28.0   1.2   10   48-57    167-176 (205)
122 PF06262 DUF1025:  Possibl zinc  22.2      54  0.0012   25.3   1.5   13   45-57     73-85  (97)
123 PF11667 DUF3267:  Protein of u  21.6 1.4E+02  0.0031   22.7   3.8   20   47-66      6-25  (111)
124 KOG3607 Meltrins, fertilins an  21.3      54  0.0012   33.9   1.7   16   42-57    320-335 (716)
125 PF07607 DUF1570:  Protein of u  21.3      57  0.0012   26.4   1.5   15   46-60      2-16  (128)
126 PTZ00337 surface protease GP63  20.9      65  0.0014   32.4   2.1   14   44-57    228-241 (567)
127 PHA00657 crystallin beta/gamma  20.8      73  0.0016   35.4   2.5   17   45-61    791-807 (2052)
128 PRK00523 hypothetical protein;  20.1 1.5E+02  0.0033   21.8   3.3   24  211-234     2-25  (72)
129 PF11350 DUF3152:  Protein of u  20.0      40 0.00087   29.6   0.4    9   49-57    143-151 (203)

No 1  
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=100.00  E-value=9.2e-44  Score=306.19  Aligned_cols=195  Identities=37%  Similarity=0.634  Sum_probs=170.1

Q ss_pred             HHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHhCCceeeEEEeeCCCcceEeec--Cc-eeeeeccccchHHHHH
Q 026597           25 GVFTVVILALWRTILLTPFKLITVFLHEASHAIACKLTCGDVEGMQVHANEGGATQTRG--GI-YWLILPAGYLGSSFWG  101 (236)
Q Consensus        25 ~~~~v~~~~lW~~~~~~p~r~l~t~~HE~GHalaalltGg~v~~I~l~~d~sG~t~s~G--g~-~~~i~~AGY~gp~l~G  101 (236)
                      +++.++...+|+.++++|+|+++|++||+|||++|+++|||+++++++||+||.|.++|  ++ ++++++|||++|+++|
T Consensus         2 i~~~~~l~~~~~~~~~~~~~~l~t~~HE~gHal~a~l~G~~v~~i~l~~~~~G~~~~~~~~~~~~~~i~~aGyl~~~l~g   81 (200)
T PF13398_consen    2 IALIVALLLLWNPPIWYPFRLLVTFVHELGHALAALLTGGRVKGIVLFPDGSGVTVSSGPSGIGRFLIALAGYLGPALFG   81 (200)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcceEEEEeCCCceEEEecCCCcchhHHhcccchHHHHHH
Confidence            34566788999999999999999999999999999999999999999999999999998  54 6899999999999999


Q ss_pred             HHHHHHhcchhhHHHHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHH
Q 026597          102 MALVLASTNLTTARIAAGCLALALFIVLFIAQNWTLRGLCIGFIVFLAVIWVLQEETSVRILRYVILFIGVMNSLFSVYD  181 (236)
Q Consensus       102 ~~l~~~~~~~~~~~~~~~~l~~ll~~~l~~vRn~~g~~~~l~~~~l~~~~~~~~~~~~~~~l~~~~~flGv~~~l~sv~d  181 (236)
                      ..+++++++++. +.........+..+++++||++++..++.++++..+.++.++   ++..++..+|+|++.++|+++|
T Consensus        82 ~~~~~~~~~~~~-~~~l~~~~~~l~~~l~~~r~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ig~~~~l~~~~~  157 (200)
T PF13398_consen   82 LLLLWLLFSGWA-RAVLFLGLLILLVLLLFVRNWFGILVVILFGALLIALWFFAP---PWILRFILLFIGVFLLLYSVRD  157 (200)
T ss_pred             HHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHHHHHHcC
Confidence            999999875555 444334444445556678999999999887777766665553   6888899999999999999999


Q ss_pred             HHHHHhcCCCCCChHHHHHhhCCCCCCchhHHHHHHHHHHHHHHH
Q 026597          182 IYDDLISRRVHSSDAEKFAEVCPCPCNGVGWGVIWGIISFAFLCA  226 (236)
Q Consensus       182 ~~~~~~~~~~~~SDa~~La~lt~~~~P~~~W~~lw~~~sl~~l~~  226 (236)
                      + ||+++|+.++|||+++||+||+  |+++|+.+|.++|++++.+
T Consensus       158 i-~~l~~~~~~~sDa~~la~lt~~--p~~~W~~~~~~~s~~~l~~  199 (200)
T PF13398_consen  158 I-DDLFRRRVNASDADQLARLTGL--PAWVWGLLWLAISLACLAA  199 (200)
T ss_pred             H-HHHhcCCCCCChHHHHHHHHCC--CHHHHHHHHHHHHHHHHHh
Confidence            8 8889999999999999999998  9999999999999999865


No 2  
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=98.10  E-value=6.4e-06  Score=71.43  Aligned_cols=65  Identities=22%  Similarity=0.221  Sum_probs=52.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHhCCceeeEEEeeCCCcceEeec-C----ceeeeeccccchHHHHHHHHHHHh
Q 026597           43 FKLITVFLHEASHAIACKLTCGDVEGMQVHANEGGATQTRG-G----IYWLILPAGYLGSSFWGMALVLAS  108 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltGg~v~~I~l~~d~sG~t~s~G-g----~~~~i~~AGY~gp~l~G~~l~~~~  108 (236)
                      .-...+..||+||+++|...|.|++++++.|-++ .+..+. .    .+..++.||+....+.+.....+.
T Consensus        36 ~l~~~v~iHElgH~~~A~~~G~~v~~i~l~p~Gg-~~~~~~~~~~~~~~~lIalAGPl~n~~la~~~~~l~  105 (208)
T cd06161          36 LLFLSVLLHELGHALVARRYGIRVRSITLLPFGG-VAELEEEPETPKEEFVIALAGPLVSLLLAGLFYLLY  105 (208)
T ss_pred             HHHHHHHHHHHHHHHHHHHcCCCccceEEEeeee-eeeeccCCCChhHheeeeeehHHHHHHHHHHHHHHH
Confidence            4567889999999999999999999999999754 444443 1    269999999999998887765543


No 3  
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=97.79  E-value=4.3e-05  Score=67.33  Aligned_cols=64  Identities=23%  Similarity=0.287  Sum_probs=52.3

Q ss_pred             hHHHHHHHHHHHHHHHHHhCCceeeEEEeeCCCcceEeecCc-----eeeeeccccchHHHHHHHHHHHh
Q 026597           44 KLITVFLHEASHAIACKLTCGDVEGMQVHANEGGATQTRGGI-----YWLILPAGYLGSSFWGMALVLAS  108 (236)
Q Consensus        44 r~l~t~~HE~GHalaalltGg~v~~I~l~~d~sG~t~s~Gg~-----~~~i~~AGY~gp~l~G~~l~~~~  108 (236)
                      -.+.++.||+||+++|...|.|++++++.|= +|.+..++..     +..++.||.....+.+.......
T Consensus        52 l~~~v~iHElgH~~~A~~~G~~v~~i~l~p~-Gg~~~~~~~~~~~~~~~~IalAGPl~Nllla~i~~~l~  120 (227)
T cd06164          52 LFASVLLHELGHSLVARRYGIPVRSITLFLF-GGVARLEREPETPGQEFVIAIAGPLVSLVLALLFLLLS  120 (227)
T ss_pred             HHHHHHHHHHHHHHHHHHcCCeECeEEEEee-eEEEEecCCCCCHHHHhhhhhhHHHHHHHHHHHHHHHH
Confidence            3467889999999999999999999999985 5787776521     58999999999988887655443


No 4  
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=97.47  E-value=0.00026  Score=60.53  Aligned_cols=26  Identities=35%  Similarity=0.441  Sum_probs=22.7

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceeeEE
Q 026597           45 LITVFLHEASHAIACKLTCGDVEGMQ   70 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~I~   70 (236)
                      .+.++.||+||.++|...|-|++++.
T Consensus         9 ~~~v~iHElGH~~~Ar~~Gv~v~~f~   34 (182)
T cd06163           9 GILIFVHELGHFLVAKLFGVKVEEFS   34 (182)
T ss_pred             HHHHHHHHHHHHHHHHHcCCeeeEee
Confidence            35688999999999999999999843


No 5  
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=97.29  E-value=0.00047  Score=59.01  Aligned_cols=64  Identities=17%  Similarity=0.110  Sum_probs=50.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHhCCceeeEEEeeC-----CCcceEeecC--c---eeeeeccccchHHHHHHHHHH
Q 026597           43 FKLITVFLHEASHAIACKLTCGDVEGMQVHAN-----EGGATQTRGG--I---YWLILPAGYLGSSFWGMALVL  106 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltGg~v~~I~l~~d-----~sG~t~s~Gg--~---~~~i~~AGY~gp~l~G~~l~~  106 (236)
                      .-......||+||+++|...|-|.++.++.|+     -++.+..+..  .   +..++.||.....+.+.....
T Consensus        39 ~l~~~l~iHElgH~~~A~~~G~~~~~~~l~P~~~~G~~G~~~~~~~~~~~~~~~~~IalAGPl~nl~lali~~~  112 (183)
T cd06160          39 ALLAILGIHEMGHYLAARRHGVKASLPYFIPFPFIGTFGAFIRMRSPIPNRKALFDIALAGPLAGLLLALPVLI  112 (183)
T ss_pred             HHHHHHHHHHHHHHHHHHHCCCCccceeeeehHhcCcEEEEEEecCCCCChhHhehhhhhHHHHHHHHHHHHHH
Confidence            34567789999999999999999999999998     6676665542  1   588999998888776655443


No 6  
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=97.23  E-value=0.00022  Score=60.15  Aligned_cols=28  Identities=36%  Similarity=0.519  Sum_probs=20.7

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceeeEEEe
Q 026597           45 LITVFLHEASHAIACKLTCGDVEGMQVH   72 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~I~l~   72 (236)
                      .+.+.+||+||+++|...|.|+++++..
T Consensus         7 ~i~i~~HE~gH~~~a~~~G~~~~~~~~~   34 (192)
T PF02163_consen    7 LISIVLHELGHALAARLYGDKVPRFEGG   34 (192)
T ss_dssp             HHHHHHHHHHHHHHHHTTT--B--EEE-
T ss_pred             cccccccccccccccccccccccccccc
Confidence            5678999999999999999999998654


No 7  
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=97.23  E-value=0.00034  Score=63.14  Aligned_cols=63  Identities=19%  Similarity=0.221  Sum_probs=44.1

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceeeEEE-----------eeCCCcceEeecCceeeeeccccchHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIACKLTCGDVEGMQV-----------HANEGGATQTRGGIYWLILPAGYLGSSFWGMALVLA  107 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~I~l-----------~~d~sG~t~s~Gg~~~~i~~AGY~gp~l~G~~l~~~  107 (236)
                      .++..+||+||+++|...|.+++++.+           ++|+......+-+.+..++.||.....+.+......
T Consensus       118 ~isv~iHElgHa~~Ar~~G~~V~~iGl~l~~ip~Gg~v~~~~~~~~~~~~~~~~~Ia~AGP~~Nlvla~i~~~l  191 (263)
T cd06159         118 VVGVVVHELSHGILARVEGIKVKSGGLLLLIIPPGAFVEPDEEELNKADRRIRLRIFAAGVTANFVVALIAFAL  191 (263)
T ss_pred             HHHHHHHHHHHHHHHHHcCCEECchhHHHHhhhcEEEEEecchhhccCChhheeeeeeehHHHHHHHHHHHHHH
Confidence            456889999999999999999999754           333322111111237899999999888777554433


No 8  
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=97.15  E-value=0.00029  Score=59.07  Aligned_cols=29  Identities=28%  Similarity=0.285  Sum_probs=24.6

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceeeEEEee
Q 026597           45 LITVFLHEASHAIACKLTCGDVEGMQVHA   73 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~I~l~~   73 (236)
                      .+.+..||+||+++|...|.|++++.+..
T Consensus         8 ~i~i~iHE~gH~~~A~~~G~~~~~~~~~~   36 (180)
T cd05709           8 LISVTVHELGHALVARRLGVKVARFSGGF   36 (180)
T ss_pred             HHHHHHHHHHHHHHHHHcCCCchheeeeE
Confidence            45788999999999999999988766644


No 9  
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=96.92  E-value=0.0033  Score=57.21  Aligned_cols=62  Identities=23%  Similarity=0.177  Sum_probs=46.5

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceeeEEEeeC---CCcceEee--------cCceeeeeccccchHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIACKLTCGDVEGMQVHAN---EGGATQTR--------GGIYWLILPAGYLGSSFWGMALVL  106 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~I~l~~d---~sG~t~s~--------Gg~~~~i~~AGY~gp~l~G~~l~~  106 (236)
                      .+...+||+|||++|..-|-+++++.+.+=   .+|.+.-.        .+.+..++.||.....+.+.....
T Consensus       135 ~isvvvHElgHal~A~~~gi~V~~iGl~l~~~~pGa~ve~~~e~~~~~~~~~~l~Ia~AGp~~NlvLa~i~~~  207 (277)
T cd06162         135 LISGVVHEMGHGVAAVREQVRVNGFGIFFFIIYPGAYVDLFTDHLNLISPVQQLRIFCAGVWHNFVLGLVGYL  207 (277)
T ss_pred             HHHHHHHHHHHHHHHHHcCCeeceEEEeeeeccCeeEEeecccccccCChhhhhheehhhHHHHHHHHHHHHH
Confidence            557889999999999999999999876432   24555431        122688999999999888766543


No 10 
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=96.27  E-value=0.0063  Score=58.08  Aligned_cols=63  Identities=21%  Similarity=0.293  Sum_probs=43.5

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceee--------------------EEEeeCCCcceEee---------------------c
Q 026597           45 LITVFLHEASHAIACKLTCGDVEG--------------------MQVHANEGGATQTR---------------------G   83 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~--------------------I~l~~d~sG~t~s~---------------------G   83 (236)
                      -+.+++||+||.++|..+|-|+++                    ++..|=++ .+...                     .
T Consensus        14 ~~~v~~HE~gH~~~a~~~g~~v~~FsiGfGp~l~~~~~~~tey~i~~~plGg-~v~~~g~~~~~~~~~~~~~~~~f~~~~   92 (420)
T TIGR00054        14 AVLIFVHELGHFLAARLCGIKVERFSIGFGPKILKFKKNGTEYAISLIPLGG-YVKMKGLDKEMEVKPPETDGDLFNNKS   92 (420)
T ss_pred             HHHHHHHhHHHHHHHHHcCCEEEEEEEccCchheEEecCCeEEEEEEecCcc-eEeeccCCcccccCCcchhhhhhccCC
Confidence            356889999999999999999994                    55555322 11111                     0


Q ss_pred             Cc-eeeeeccccchHHHHHHHHHHHh
Q 026597           84 GI-YWLILPAGYLGSSFWGMALVLAS  108 (236)
Q Consensus        84 g~-~~~i~~AGY~gp~l~G~~l~~~~  108 (236)
                      -+ |.++.+||.+.-.+++.+++...
T Consensus        93 ~~~r~~i~~aGp~~N~~~a~~~~~~~  118 (420)
T TIGR00054        93 VFQKAIIIFAGPLANFIFAIFVYIFI  118 (420)
T ss_pred             HHHHHHhhhcccHHHHHHHHHHHHHH
Confidence            11 57888888888888876655543


No 11 
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=95.65  E-value=0.02  Score=55.05  Aligned_cols=28  Identities=18%  Similarity=0.196  Sum_probs=25.0

Q ss_pred             HHHHHHHHHHHHHHHHhCCceeeEEEee
Q 026597           46 ITVFLHEASHAIACKLTCGDVEGMQVHA   73 (236)
Q Consensus        46 l~t~~HE~GHalaalltGg~v~~I~l~~   73 (236)
                      +.+++||+||-++|.++|=||+++.+-.
T Consensus        16 ~li~vHElGHfl~Ar~~gv~V~~FsiGf   43 (449)
T PRK10779         16 VLITVHEFGHFWVARRCGVRVERFSIGF   43 (449)
T ss_pred             HHHHHHHHHHHHHHHHcCCeeeEEEeec
Confidence            3578999999999999999999988765


No 12 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=93.75  E-value=0.068  Score=50.96  Aligned_cols=29  Identities=34%  Similarity=0.483  Sum_probs=24.3

Q ss_pred             hHHHHHHHHHHHHHHHHHhCCceeeEEEe
Q 026597           44 KLITVFLHEASHAIACKLTCGDVEGMQVH   72 (236)
Q Consensus        44 r~l~t~~HE~GHalaalltGg~v~~I~l~   72 (236)
                      -++..-+||+|||+||.--|=+++++-|+
T Consensus       130 lvi~~vvHElGHalAA~segV~vngfgIf  158 (484)
T KOG2921|consen  130 LVITVVVHELGHALAAASEGVQVNGFGIF  158 (484)
T ss_pred             HHHHHHHHHhhHHHHHHhcCceeeeeEEE
Confidence            34566799999999999999999887665


No 13 
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=93.60  E-value=0.064  Score=46.57  Aligned_cols=36  Identities=28%  Similarity=0.302  Sum_probs=27.2

Q ss_pred             HHHHHHHHHHHHHHHHhC--CceeeEEEeeCCC--cceEe
Q 026597           46 ITVFLHEASHAIACKLTC--GDVEGMQVHANEG--GATQT   81 (236)
Q Consensus        46 l~t~~HE~GHalaalltG--g~v~~I~l~~d~s--G~t~s   81 (236)
                      =-+..||.|||+++.+++  .++..+.|.|.++  |.|..
T Consensus        29 ~~~A~HEAGhAvva~~l~~~~~v~~vsi~prg~~~G~~~~   68 (213)
T PF01434_consen   29 RRIAYHEAGHAVVAYLLPPADPVSKVSIVPRGSALGFTQF   68 (213)
T ss_dssp             HHHHHHHHHHHHHHHHSSS---EEEEESSTTCCCCHCCEE
T ss_pred             HHHHHHHHHHHHHHHHhcccccEEEEEEecCCCcceeEEe
Confidence            346789999999999998  4789999988754  44444


No 14 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=90.99  E-value=0.12  Score=41.39  Aligned_cols=16  Identities=38%  Similarity=0.495  Sum_probs=13.5

Q ss_pred             hhhHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAI   57 (236)
Q Consensus        42 p~r~l~t~~HE~GHal   57 (236)
                      +..+..|+.||+||++
T Consensus       102 ~~~~~~v~~HEiGHaL  117 (154)
T PF00413_consen  102 GNDLQSVAIHEIGHAL  117 (154)
T ss_dssp             SEEHHHHHHHHHHHHT
T ss_pred             hhhhhhhhhhcccccc
Confidence            3467789999999997


No 15 
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=90.54  E-value=0.21  Score=42.48  Aligned_cols=29  Identities=21%  Similarity=0.299  Sum_probs=23.4

Q ss_pred             HHHHHHHHHHHHHHHHHhCCcee----eEEEee
Q 026597           45 LITVFLHEASHAIACKLTCGDVE----GMQVHA   73 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~----~I~l~~   73 (236)
                      .++...||.+||++|...|-+-.    +++++|
T Consensus         9 ~~~i~~HE~aHa~~A~~~Gd~t~~~~Grltlnp   41 (181)
T cd06158           9 LLAITLHEFAHAYVAYRLGDPTARRAGRLTLNP   41 (181)
T ss_pred             HHHHHHHHHHHHHHHHHcCCcHHHHcCceecCc
Confidence            45778999999999999998765    355555


No 16 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=90.34  E-value=0.17  Score=41.30  Aligned_cols=17  Identities=35%  Similarity=0.495  Sum_probs=14.5

Q ss_pred             hhhHHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAIA   58 (236)
Q Consensus        42 p~r~l~t~~HE~GHala   58 (236)
                      ...+..|+.||+||++-
T Consensus       101 ~~~~~~~~~HEiGHaLG  117 (156)
T cd04279         101 AENLQAIALHELGHALG  117 (156)
T ss_pred             chHHHHHHHHHhhhhhc
Confidence            35788999999999984


No 17 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=88.84  E-value=0.27  Score=39.94  Aligned_cols=15  Identities=33%  Similarity=0.450  Sum_probs=12.9

Q ss_pred             hHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIA   58 (236)
Q Consensus        44 r~l~t~~HE~GHala   58 (236)
                      ....|+.||+||++-
T Consensus        93 ~~~~~~~HEiGHaLG  107 (165)
T cd04268          93 RLRNTAEHELGHALG  107 (165)
T ss_pred             HHHHHHHHHHHHHhc
Confidence            467999999999974


No 18 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=88.40  E-value=0.36  Score=46.07  Aligned_cols=24  Identities=21%  Similarity=0.290  Sum_probs=20.5

Q ss_pred             hhHHHHHHHHHHHHHHHHHhCCce
Q 026597           43 FKLITVFLHEASHAIACKLTCGDV   66 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltGg~v   66 (236)
                      .+-+.|++||+|||+=.+++..+.
T Consensus       240 ~~~v~tLfHE~GHa~H~~ls~~~~  263 (458)
T PF01432_consen  240 HDDVETLFHEFGHAMHSLLSRTKY  263 (458)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCCCSS
T ss_pred             hhhHHHHHHHHhHHHHHHHhcccc
Confidence            456899999999999999987653


No 19 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=88.33  E-value=0.2  Score=41.00  Aligned_cols=17  Identities=35%  Similarity=0.471  Sum_probs=14.4

Q ss_pred             hhhHHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAIA   58 (236)
Q Consensus        42 p~r~l~t~~HE~GHala   58 (236)
                      ......|+.||+|||+-
T Consensus       104 ~~~~~~~~~HEiGHaLG  120 (157)
T cd04278         104 GTDLFSVAAHEIGHALG  120 (157)
T ss_pred             cchHHHHHHHHhccccc
Confidence            45789999999999963


No 20 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=86.74  E-value=0.34  Score=40.64  Aligned_cols=14  Identities=29%  Similarity=0.468  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIA   58 (236)
Q Consensus        45 ~l~t~~HE~GHala   58 (236)
                      ...|+.||+||++-
T Consensus       113 ~~~t~~HEiGHaLG  126 (186)
T cd04277         113 GYQTIIHEIGHALG  126 (186)
T ss_pred             hHHHHHHHHHHHhc
Confidence            57899999999985


No 21 
>CHL00176 ftsH cell division protein; Validated
Probab=86.20  E-value=0.9  Score=45.96  Aligned_cols=53  Identities=19%  Similarity=0.100  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHHHHHHHhC--CceeeEEEeeCC--CcceEeecC------------ceeeeeccccchHH
Q 026597           46 ITVFLHEASHAIACKLTC--GDVEGMQVHANE--GGATQTRGG------------IYWLILPAGYLGSS   98 (236)
Q Consensus        46 l~t~~HE~GHalaalltG--g~v~~I~l~~d~--sG~t~s~Gg------------~~~~i~~AGY~gp~   98 (236)
                      -.+..||.|||+++.+.+  ..++.+.|.|.+  .|.|...-.            .++.++.||..+=-
T Consensus       439 ~~vA~hEaGhA~v~~~l~~~~~v~kvtI~prg~~~G~~~~~p~~~~~~~t~~~l~~~i~~~LgGraAE~  507 (638)
T CHL00176        439 RLIAYHEVGHAIVGTLLPNHDPVQKVTLIPRGQAKGLTWFTPEEDQSLVSRSQILARIVGALGGRAAEE  507 (638)
T ss_pred             HHHHHHhhhhHHHHhhccCCCceEEEEEeecCCCCCceEecCCcccccccHHHHHHHHHHHhhhHHHHH
Confidence            457999999999998764  568999998884  466655421            03556666666654


No 22 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=85.94  E-value=0.58  Score=45.21  Aligned_cols=26  Identities=23%  Similarity=0.316  Sum_probs=22.2

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCc
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGD   65 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~   65 (236)
                      +--..-+.|++||+||++=.+++-.+
T Consensus       243 lL~~~~v~TLfHEfGHalH~~ls~~~  268 (458)
T cd06457         243 LLSPHEVETLFHEMGHAMHSMLGRTE  268 (458)
T ss_pred             CcCHHHHHHHHHHHhHHHHHHHcCCC
Confidence            44567899999999999999999655


No 23 
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=85.72  E-value=0.57  Score=47.72  Aligned_cols=26  Identities=15%  Similarity=0.236  Sum_probs=21.3

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCc
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGD   65 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~   65 (236)
                      +--..-+.|+|||||||+=.+++--+
T Consensus       459 LL~~~eV~TlFHEfGHalH~lls~~~  484 (681)
T PRK10280        459 LLLWDDVITLFHEFGHTLHGLFARQR  484 (681)
T ss_pred             ccCHHHHHHHHHHHHHHHHHHHhCCC
Confidence            44467899999999999999998533


No 24 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=85.70  E-value=0.39  Score=38.23  Aligned_cols=13  Identities=38%  Similarity=0.457  Sum_probs=10.9

Q ss_pred             HHHHHHHHHHHHH
Q 026597           47 TVFLHEASHAIAC   59 (236)
Q Consensus        47 ~t~~HE~GHalaa   59 (236)
                      .|+.||+|||+--
T Consensus        88 ~~~~HEigHaLGl  100 (140)
T smart00235       88 GVAAHELGHALGL  100 (140)
T ss_pred             ccHHHHHHHHhcC
Confidence            4899999999843


No 25 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=85.28  E-value=0.49  Score=44.26  Aligned_cols=23  Identities=22%  Similarity=0.261  Sum_probs=19.9

Q ss_pred             chhhHHHHHHHHHHHHHHHHHhC
Q 026597           41 TPFKLITVFLHEASHAIACKLTC   63 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaalltG   63 (236)
                      ..++-+.|++||+|||+=..++-
T Consensus       218 ~~~~~v~tl~HE~GHa~h~~~~~  240 (427)
T cd06459         218 GTLDDVFTLAHELGHAFHSYLSR  240 (427)
T ss_pred             CChhhHHHHHHHhhHHHHHHHHc
Confidence            35788999999999999888874


No 26 
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=84.99  E-value=0.64  Score=47.13  Aligned_cols=26  Identities=27%  Similarity=0.317  Sum_probs=21.0

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCc
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGD   65 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~   65 (236)
                      +.-+.=+.|+||||||++=.+++--+
T Consensus       462 Lls~dEV~TLFHEfGHgLH~mlt~v~  487 (683)
T COG0339         462 LLSHDEVTTLFHEFGHGLHHLLTRVK  487 (683)
T ss_pred             eeeHHHHHHHHHHhhhHHHHHhhcCC
Confidence            34467789999999999999987544


No 27 
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=84.93  E-value=1.2  Score=44.90  Aligned_cols=32  Identities=25%  Similarity=0.272  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHHHHHHHhC--CceeeEEEeeCCCc
Q 026597           46 ITVFLHEASHAIACKLTC--GDVEGMQVHANEGG   77 (236)
Q Consensus        46 l~t~~HE~GHalaalltG--g~v~~I~l~~d~sG   77 (236)
                      -.+..||.|||+++.+..  ..++.++|-|+++.
T Consensus       409 ~~~a~he~gha~~~~~~~~~~~~~~v~i~prg~~  442 (644)
T PRK10733        409 ESTAYHEAGHAIIGRLVPEHDPVHKVTIIPRGRA  442 (644)
T ss_pred             HHHHHHHHHHHHHHHHccCCCceeEEEEeccCCC
Confidence            568999999999998875  66889999877643


No 28 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=83.90  E-value=0.9  Score=40.13  Aligned_cols=18  Identities=28%  Similarity=0.477  Sum_probs=15.6

Q ss_pred             hHHHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIACKL   61 (236)
Q Consensus        44 r~l~t~~HE~GHalaall   61 (236)
                      ..+-|+.||+|||+...+
T Consensus        91 ~~~~~l~HE~GHAlI~~~  108 (220)
T PF14247_consen   91 NVLFTLYHELGHALIDDL  108 (220)
T ss_pred             HHHHHHHHHHHHHHHHHh
Confidence            567899999999999874


No 29 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=83.07  E-value=0.79  Score=39.16  Aligned_cols=14  Identities=43%  Similarity=0.641  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIA   58 (236)
Q Consensus        45 ~l~t~~HE~GHala   58 (236)
                      ...|+.||+|||+-
T Consensus        92 ~~~~i~HElgHaLG  105 (198)
T cd04327          92 FSRVVLHEFGHALG  105 (198)
T ss_pred             HHHHHHHHHHHHhc
Confidence            45799999999984


No 30 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=82.93  E-value=1.1  Score=41.33  Aligned_cols=29  Identities=24%  Similarity=0.167  Sum_probs=23.0

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCceee
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGDVEG   68 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~v~~   68 (236)
                      +....-+.|++||+|||+=.+++-.+...
T Consensus       149 ll~~~~v~tl~HE~GHa~h~~l~~~~~~~  177 (365)
T cd06258         149 LLGHDDINTLFHEFGHAVHFLLIQQRYPF  177 (365)
T ss_pred             cCCHHHHHHHHHHHhHHHHHHHhcCCCCc
Confidence            34678899999999999998888655443


No 31 
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=82.90  E-value=1.7  Score=42.13  Aligned_cols=39  Identities=28%  Similarity=0.204  Sum_probs=29.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHh--CCceeeEEEeeCC--CcceEe
Q 026597           43 FKLITVFLHEASHAIACKLT--CGDVEGMQVHANE--GGATQT   81 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaallt--Gg~v~~I~l~~d~--sG~t~s   81 (236)
                      -..--+..||.|||+++.+.  +.+++++.|.|.+  .|.+..
T Consensus       309 ~~~~~~A~hEaGhAlv~~~l~~~~~v~~vsi~prg~~~G~~~~  351 (495)
T TIGR01241       309 KEKKLVAYHEAGHALVGLLLKDADPVHKVTIIPRGQALGYTQF  351 (495)
T ss_pred             HHHHHHHHHHHhHHHHHHhcCCCCceEEEEEeecCCccceEEe
Confidence            33446899999999999988  5789999998764  455533


No 32 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=82.83  E-value=1.1  Score=43.34  Aligned_cols=23  Identities=26%  Similarity=0.339  Sum_probs=20.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHhCCc
Q 026597           43 FKLITVFLHEASHAIACKLTCGD   65 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltGg~   65 (236)
                      ++-+.|++||+||++=.+++..+
T Consensus       261 ~~~V~TLfHEfGHalH~~ls~~~  283 (472)
T cd06455         261 HDEVETFFHEFGHVIHHLLGRTK  283 (472)
T ss_pred             HHHHHHHHHHHHHHHHHHhcCCC
Confidence            79999999999999999998754


No 33 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=82.80  E-value=1.1  Score=42.97  Aligned_cols=27  Identities=26%  Similarity=0.292  Sum_probs=22.4

Q ss_pred             hhhHHHHHHHHHHHHHHHHHhCCceee
Q 026597           42 PFKLITVFLHEASHAIACKLTCGDVEG   68 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaalltGg~v~~   68 (236)
                      -+.-+.|++||+|||+=.+++..+...
T Consensus       205 ~~~~v~tLfHEfGHalH~~ls~~~~~~  231 (422)
T cd06456         205 THDEVTTLFHEFGHALHHLLTDVEYPS  231 (422)
T ss_pred             CHHHHHHHHHHHHHHHHHHHhcCCccc
Confidence            578899999999999999998644443


No 34 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=82.54  E-value=0.65  Score=37.59  Aligned_cols=17  Identities=24%  Similarity=0.264  Sum_probs=14.2

Q ss_pred             hHHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIACK   60 (236)
Q Consensus        44 r~l~t~~HE~GHalaal   60 (236)
                      ....|+.||+||++--.
T Consensus        95 ~~~~~~~HElGH~LGl~  111 (167)
T cd00203          95 EGAQTIAHELGHALGFY  111 (167)
T ss_pred             cchhhHHHHHHHHhCCC
Confidence            67889999999998643


No 35 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=81.73  E-value=0.7  Score=35.78  Aligned_cols=12  Identities=33%  Similarity=0.739  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAI   57 (236)
Q Consensus        46 l~t~~HE~GHal   57 (236)
                      ..|+.||+||.+
T Consensus       108 ~~~~~HEiGH~l  119 (124)
T PF13582_consen  108 VDTFAHEIGHNL  119 (124)
T ss_dssp             TTHHHHHHHHHT
T ss_pred             ceEeeehhhHhc
Confidence            489999999976


No 36 
>PRK10911 oligopeptidase A; Provisional
Probab=81.38  E-value=1.2  Score=45.39  Aligned_cols=21  Identities=24%  Similarity=0.349  Sum_probs=19.1

Q ss_pred             hhHHHHHHHHHHHHHHHHHhC
Q 026597           43 FKLITVFLHEASHAIACKLTC   63 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltG   63 (236)
                      +.-+.|++||||||+=.+++-
T Consensus       461 ~~~v~tlfHEfGHalH~~ls~  481 (680)
T PRK10911        461 HDEVITLFHEFGHGLHHMLTR  481 (680)
T ss_pred             HHHHHHHHHHHhHHHHHHHhC
Confidence            488999999999999999884


No 37 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=81.36  E-value=0.85  Score=45.26  Aligned_cols=22  Identities=23%  Similarity=0.332  Sum_probs=19.2

Q ss_pred             chhhHHHHHHHHHHHHHHHHHh
Q 026597           41 TPFKLITVFLHEASHAIACKLT   62 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaallt   62 (236)
                      ..++-+.|++||+|||+=..++
T Consensus       371 ~~~~~v~TL~HE~GHa~H~~ls  392 (587)
T TIGR02290       371 GSRRDVSTLAHELGHAYHSELA  392 (587)
T ss_pred             CCchhHHHHHHHhhHHHHHHHH
Confidence            3478899999999999988877


No 38 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=76.81  E-value=2  Score=35.62  Aligned_cols=19  Identities=32%  Similarity=0.691  Sum_probs=15.7

Q ss_pred             cchhhH-HHHHHHHHHHHHH
Q 026597           40 LTPFKL-ITVFLHEASHAIA   58 (236)
Q Consensus        40 ~~p~r~-l~t~~HE~GHala   58 (236)
                      .+||.- +.+..||++|++.
T Consensus       129 f~~~~~~lDVvaHEltHGVt  148 (150)
T PF01447_consen  129 FKPFASSLDVVAHELTHGVT  148 (150)
T ss_dssp             BS-GGG-HHHHHHHHHHHHH
T ss_pred             cccCccccceeeeccccccc
Confidence            467887 9999999999985


No 39 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=76.64  E-value=1.9  Score=42.44  Aligned_cols=24  Identities=29%  Similarity=0.189  Sum_probs=20.1

Q ss_pred             chhhHHHHHHHHHHHHHHHHHhCC
Q 026597           41 TPFKLITVFLHEASHAIACKLTCG   64 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaalltGg   64 (236)
                      .-++-+.|++||+|||.=..++-.
T Consensus       333 ~t~~dv~TL~HElGHa~H~~~s~~  356 (549)
T TIGR02289       333 GTSGDIDVLTHEAGHAFHVYESRK  356 (549)
T ss_pred             CChhHHHHHHHHhhHHHHHHHhcC
Confidence            457889999999999998887753


No 40 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=74.91  E-value=1.6  Score=38.32  Aligned_cols=18  Identities=22%  Similarity=0.204  Sum_probs=13.6

Q ss_pred             chhhHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIA   58 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHala   58 (236)
                      .+-|=.=|+.||+||++.
T Consensus        68 ~~~r~rFtlAHELGH~ll   85 (213)
T COG2856          68 SLERKRFTLAHELGHALL   85 (213)
T ss_pred             CHHHHHHHHHHHHhHHHh
Confidence            344556689999999974


No 41 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=74.76  E-value=1.6  Score=36.95  Aligned_cols=28  Identities=14%  Similarity=0.219  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHHHHHHhCCceee---EEEee
Q 026597           46 ITVFLHEASHAIACKLTCGDVEG---MQVHA   73 (236)
Q Consensus        46 l~t~~HE~GHalaalltGg~v~~---I~l~~   73 (236)
                      ..|..||+|||+--.=.--|-++   |.++.
T Consensus        75 ~g~v~HE~~HalG~~HEh~R~DRD~yv~i~~  105 (180)
T cd04280          75 LGTIVHELMHALGFYHEQSRPDRDDYVTINW  105 (180)
T ss_pred             CchhHHHHHHHhcCcchhcccccCCeEEEee
Confidence            58999999999765444444433   55554


No 42 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=74.26  E-value=2.3  Score=42.17  Aligned_cols=23  Identities=13%  Similarity=0.300  Sum_probs=19.5

Q ss_pred             chhhHHHHHHHHHHHHHHHHHhC
Q 026597           41 TPFKLITVFLHEASHAIACKLTC   63 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaalltG   63 (236)
                      .-++-+.|++||+|||.=..++.
T Consensus       374 ~~~~dv~TLaHElGHa~H~~~~~  396 (591)
T TIGR00181       374 GTLNSVFTLAHELGHSMHSYFSS  396 (591)
T ss_pred             CCcchHHHHHHHhhhHHHHHHHc
Confidence            34688999999999999888775


No 43 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=73.46  E-value=1.8  Score=37.46  Aligned_cols=21  Identities=19%  Similarity=0.292  Sum_probs=15.8

Q ss_pred             hHHHHHHHHHHHHHHHHHhCC
Q 026597           44 KLITVFLHEASHAIACKLTCG   64 (236)
Q Consensus        44 r~l~t~~HE~GHalaalltGg   64 (236)
                      +...|+.||+||.+-+-=.+.
T Consensus       136 ~~~~~~aHEiGH~lGl~H~~~  156 (206)
T PF13583_consen  136 NGYQTFAHEIGHNLGLRHDFD  156 (206)
T ss_pred             ccchHHHHHHHHHhcCCCCcc
Confidence            345779999999997765554


No 44 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=73.21  E-value=1.5  Score=39.76  Aligned_cols=18  Identities=22%  Similarity=0.303  Sum_probs=12.9

Q ss_pred             hHHHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIACKL   61 (236)
Q Consensus        44 r~l~t~~HE~GHalaall   61 (236)
                      .--.+++||+||.++.+.
T Consensus       215 ~~~~v~vHE~GHsf~~La  232 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLA  232 (264)
T ss_dssp             THHHHHHHHHHHHTT---
T ss_pred             cccceeeeeccccccccc
Confidence            456789999999988763


No 45 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=73.17  E-value=2.4  Score=35.56  Aligned_cols=20  Identities=25%  Similarity=0.345  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHHHHHHhCC
Q 026597           45 LITVFLHEASHAIACKLTCG   64 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg   64 (236)
                      -..|++||+||.+-+-=++.
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~  130 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFD  130 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSS
T ss_pred             eeeeehhhhHhhcCCCCCCC
Confidence            56679999999998876666


No 46 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=73.05  E-value=1.9  Score=35.83  Aligned_cols=16  Identities=19%  Similarity=0.403  Sum_probs=11.9

Q ss_pred             hhhHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAI   57 (236)
Q Consensus        42 p~r~l~t~~HE~GHal   57 (236)
                      +...-.|++||+||-+
T Consensus        66 ~~~~g~TltHEvGH~L   81 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWL   81 (154)
T ss_dssp             TS-SSHHHHHHHHHHT
T ss_pred             ccccccchhhhhhhhh
Confidence            3445689999999965


No 47 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=72.18  E-value=2.1  Score=36.66  Aligned_cols=15  Identities=27%  Similarity=0.321  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAIACK   60 (236)
Q Consensus        46 l~t~~HE~GHalaal   60 (236)
                      ..|+.||+|||+--.
T Consensus        78 ~G~i~HEl~HaLG~~   92 (182)
T cd04283          78 KGIIQHELLHALGFY   92 (182)
T ss_pred             cchHHHHHHHHhCCc
Confidence            579999999998544


No 48 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=70.86  E-value=2.7  Score=35.18  Aligned_cols=20  Identities=20%  Similarity=0.404  Sum_probs=14.7

Q ss_pred             chhhHHHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIACK   60 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaal   60 (236)
                      .+..-..|+.||+||.+-+.
T Consensus       138 ~~~~~~~~~AHEiGH~lGa~  157 (196)
T PF13688_consen  138 PTYNGAITFAHEIGHNLGAP  157 (196)
T ss_dssp             -HHHHHHHHHHHHHHHTT--
T ss_pred             CCCceehhhHHhHHHhcCCC
Confidence            45677789999999998775


No 49 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=70.73  E-value=2.7  Score=35.83  Aligned_cols=30  Identities=17%  Similarity=0.244  Sum_probs=18.4

Q ss_pred             HHHHHHHHHHHHHHHHHhCCceee---EEEeeC
Q 026597           45 LITVFLHEASHAIACKLTCGDVEG---MQVHAN   74 (236)
Q Consensus        45 ~l~t~~HE~GHalaalltGg~v~~---I~l~~d   74 (236)
                      ...|+.||+|||+--.=.=.|-++   |+++.+
T Consensus        79 ~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~  111 (191)
T PF01400_consen   79 SVGTILHELGHALGFWHEHQRPDRDNYITINWD  111 (191)
T ss_dssp             SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GG
T ss_pred             CccchHHHHHHHHhhhhhhhccccccEEEEehh
Confidence            357999999999854444334333   777764


No 50 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=69.46  E-value=4  Score=30.36  Aligned_cols=21  Identities=24%  Similarity=0.341  Sum_probs=16.1

Q ss_pred             cchhhHHHHHHHHHHHHHHHH
Q 026597           40 LTPFKLITVFLHEASHAIACK   60 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaal   60 (236)
                      ..+.+---|+.||+||.+.--
T Consensus        37 ~~~~~~~f~laHELgH~~~~~   57 (122)
T PF06114_consen   37 LSPERQRFTLAHELGHILLHH   57 (122)
T ss_dssp             S-HHHHHHHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHHhhh
Confidence            456777889999999988654


No 51 
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=68.90  E-value=4.2  Score=41.04  Aligned_cols=28  Identities=18%  Similarity=0.265  Sum_probs=21.0

Q ss_pred             hhcccccchhhHHHHHHHHHHHHHHHHH
Q 026597           34 LWRTILLTPFKLITVFLHEASHAIACKL   61 (236)
Q Consensus        34 lW~~~~~~p~r~l~t~~HE~GHalaall   61 (236)
                      .|..|-..-+.-+.|+|||||||.=.++
T Consensus       468 s~~s~t~L~~~~vetLFHEmGHAMHSmL  495 (704)
T KOG2090|consen  468 SQSSPTFLSLSEVETLFHEMGHAMHSML  495 (704)
T ss_pred             ccCCCcccCHHHHHHHHHHHHHHHHHHh
Confidence            4444444455678999999999998886


No 52 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=68.83  E-value=2.4  Score=37.34  Aligned_cols=10  Identities=50%  Similarity=0.760  Sum_probs=9.3

Q ss_pred             HHHHHHHHHH
Q 026597           48 VFLHEASHAI   57 (236)
Q Consensus        48 t~~HE~GHal   57 (236)
                      ++.||+||.+
T Consensus       136 vi~HEiGH~I  145 (211)
T PF12388_consen  136 VITHEIGHCI  145 (211)
T ss_pred             HHHHHhhhhc
Confidence            8999999986


No 53 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=68.58  E-value=4.4  Score=32.79  Aligned_cols=21  Identities=38%  Similarity=0.469  Sum_probs=17.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHh
Q 026597           42 PFKLITVFLHEASHAIACKLT   62 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaallt   62 (236)
                      +-.+..|+.|||.|+.+-...
T Consensus        57 ~~~~~~tL~HEm~H~~~~~~~   77 (157)
T PF10263_consen   57 EEELIDTLLHEMAHAAAYVFG   77 (157)
T ss_pred             HHHHHHHHHHHHHHHHhhhcc
Confidence            357899999999999997764


No 54 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=68.50  E-value=2.7  Score=36.44  Aligned_cols=14  Identities=29%  Similarity=0.560  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAIAC   59 (236)
Q Consensus        46 l~t~~HE~GHalaa   59 (236)
                      ..|+.||++||+-=
T Consensus        88 ~Gti~HEl~HaLGf  101 (200)
T cd04281          88 FGIVVHELGHVIGF  101 (200)
T ss_pred             CchHHHHHHHHhcC
Confidence            57999999999853


No 55 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=67.93  E-value=3.8  Score=41.19  Aligned_cols=28  Identities=11%  Similarity=0.147  Sum_probs=23.5

Q ss_pred             chhhHHHHHHHHHHHHHHHHHhCCceee
Q 026597           41 TPFKLITVFLHEASHAIACKLTCGDVEG   68 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaalltGg~v~~   68 (236)
                      .-.+-+-|++||+||+.-..+++.+-..
T Consensus       376 gt~~dV~TLaHElGHs~Hs~~s~~~qp~  403 (598)
T COG1164         376 GTLRDVFTLAHELGHSVHSYFSRKHQPY  403 (598)
T ss_pred             CchhHHHHHHHHccHHHHHHHHhccCCc
Confidence            4568899999999999999998876543


No 56 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=66.60  E-value=3.7  Score=34.46  Aligned_cols=17  Identities=18%  Similarity=0.335  Sum_probs=14.3

Q ss_pred             hhhHHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAIA   58 (236)
Q Consensus        42 p~r~l~t~~HE~GHala   58 (236)
                      ++....+++||+||-+-
T Consensus       128 ~~~~a~~~AHElGH~lG  144 (194)
T cd04269         128 LLLFAVTMAHELGHNLG  144 (194)
T ss_pred             hHHHHHHHHHHHHhhcC
Confidence            56778999999999874


No 57 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=65.75  E-value=2.4  Score=35.50  Aligned_cols=22  Identities=14%  Similarity=0.193  Sum_probs=17.1

Q ss_pred             hhhHHHHHHHHHHHHHHHHHhC
Q 026597           42 PFKLITVFLHEASHAIACKLTC   63 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaalltG   63 (236)
                      |+....|++||+||.+-.-=.+
T Consensus       130 ~~~~~~~~aHElGH~lG~~HD~  151 (192)
T cd04267         130 TLLTALTMAHELGHNLGAEHDG  151 (192)
T ss_pred             ceeehhhhhhhHHhhcCCcCCC
Confidence            5677889999999998654444


No 58 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=65.47  E-value=3.9  Score=36.21  Aligned_cols=15  Identities=27%  Similarity=0.200  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIAC   59 (236)
Q Consensus        45 ~l~t~~HE~GHalaa   59 (236)
                      ...|++||+||-+-.
T Consensus       167 ~a~t~AHElGHnlGm  181 (244)
T cd04270         167 SDLVTAHELGHNFGS  181 (244)
T ss_pred             HHHHHHHHHHHhcCC
Confidence            458999999999754


No 59 
>PF02128 Peptidase_M36:  Fungalysin metallopeptidase (M36);  InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=63.52  E-value=1  Score=42.82  Aligned_cols=30  Identities=23%  Similarity=0.349  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHHHHHhCCceeeEEEeeCCC
Q 026597           47 TVFLHEASHAIACKLTCGDVEGMQVHANEG   76 (236)
Q Consensus        47 ~t~~HE~GHalaalltGg~v~~I~l~~d~s   76 (236)
                      .++.||.||++..-|+||.-.+=+|...|+
T Consensus       187 ~Ii~HEy~HGiSnRLvgG~~~s~cL~~~e~  216 (378)
T PF02128_consen  187 GIIAHEYGHGISNRLVGGPANSSCLQNLES  216 (378)
T ss_pred             CeeEEeecccccccccCCCccccccccccc
Confidence            468999999999999999877655654443


No 60 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=63.42  E-value=4.5  Score=33.03  Aligned_cols=10  Identities=30%  Similarity=0.547  Sum_probs=9.0

Q ss_pred             HHHHHHHHHH
Q 026597           48 VFLHEASHAI   57 (236)
Q Consensus        48 t~~HE~GHal   57 (236)
                      +..||+||.+
T Consensus        80 IaaHE~GHiL   89 (132)
T PF02031_consen   80 IAAHELGHIL   89 (132)
T ss_dssp             HHHHHHHHHH
T ss_pred             eeeehhcccc
Confidence            6789999987


No 61 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=62.34  E-value=4.1  Score=36.20  Aligned_cols=13  Identities=31%  Similarity=0.327  Sum_probs=11.4

Q ss_pred             HHHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAIA   58 (236)
Q Consensus        46 l~t~~HE~GHala   58 (236)
                      ..|+.||++||+-
T Consensus       121 ~Gti~HEl~HalG  133 (230)
T cd04282         121 KATVEHEFLHALG  133 (230)
T ss_pred             CchHHHHHHHHhC
Confidence            5899999999984


No 62 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=61.16  E-value=8.7  Score=28.75  Aligned_cols=22  Identities=27%  Similarity=0.133  Sum_probs=18.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHhCC
Q 026597           43 FKLITVFLHEASHAIACKLTCG   64 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltGg   64 (236)
                      -....++.||+.|.+..-..|.
T Consensus        23 ~~~~~~l~HE~~H~~~~~~~~~   44 (128)
T PF13485_consen   23 DWLDRVLAHELAHQWFGNYFGG   44 (128)
T ss_pred             HHHHHHHHHHHHHHHHHHHcCC
Confidence            3456899999999999888765


No 63 
>COG0465 HflB ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones]
Probab=60.34  E-value=7.8  Score=39.10  Aligned_cols=36  Identities=25%  Similarity=0.161  Sum_probs=29.2

Q ss_pred             HHHHHHHHHHHHHHHhCC--ceeeEEEeeCC--CcceEee
Q 026597           47 TVFLHEASHAIACKLTCG--DVEGMQVHANE--GGATQTR   82 (236)
Q Consensus        47 ~t~~HE~GHalaalltGg--~v~~I~l~~d~--sG~t~s~   82 (236)
                      .|.-||.|||+++.++-+  .++++++-|.+  .|.|...
T Consensus       408 ~~AYhEaghalv~~~l~~~d~v~KvtIiPrG~alG~t~~~  447 (596)
T COG0465         408 ITAYHEAGHALVGLLLPDADPVHKVTIIPRGRALGYTLFL  447 (596)
T ss_pred             chHHHHHHHHHHHHhCCCCcccceeeeccCchhhcchhcC
Confidence            478899999999999866  89999999988  3444443


No 64 
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=59.60  E-value=6.2  Score=37.80  Aligned_cols=20  Identities=20%  Similarity=0.172  Sum_probs=16.7

Q ss_pred             chhhHHHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIACK   60 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaal   60 (236)
                      .+.+-+-|.+||+|||+=..
T Consensus       155 d~~~~l~t~iHE~GHalye~  174 (396)
T cd06460         155 DFRSALFSTIHETGHALYEQ  174 (396)
T ss_pred             chHHHHHHHHHHhhHHHHHh
Confidence            35777889999999999776


No 65 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=58.85  E-value=5.9  Score=35.12  Aligned_cols=15  Identities=40%  Similarity=0.567  Sum_probs=11.9

Q ss_pred             hhHHHHHHHHHHHHH
Q 026597           43 FKLITVFLHEASHAI   57 (236)
Q Consensus        43 ~r~l~t~~HE~GHal   57 (236)
                      ..-+.+..||-|||+
T Consensus        87 iaAvaVAAHEvGHAi  101 (222)
T PF04298_consen   87 IAAVAVAAHEVGHAI  101 (222)
T ss_pred             HHHHHHHHHHHhHHH
Confidence            345678899999997


No 66 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=58.04  E-value=7  Score=33.83  Aligned_cols=15  Identities=27%  Similarity=0.499  Sum_probs=12.2

Q ss_pred             hHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIA   58 (236)
Q Consensus        44 r~l~t~~HE~GHala   58 (236)
                      .+-.++.||.||++-
T Consensus       115 ~~~~~~~he~gh~lG  129 (197)
T cd04276         115 SLRYLLAHEVGHTLG  129 (197)
T ss_pred             HHHHHHHHHHHHHhc
Confidence            356799999999973


No 67 
>KOG3854 consensus SPRT-like metalloprotease [Function unknown]
Probab=57.83  E-value=7.4  Score=38.14  Aligned_cols=18  Identities=28%  Similarity=0.486  Sum_probs=14.3

Q ss_pred             hHHHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIACKL   61 (236)
Q Consensus        44 r~l~t~~HE~GHalaall   61 (236)
                      |+-.|+.||++||-+=+.
T Consensus       350 Rir~TLiHEmCHaAaWv~  367 (505)
T KOG3854|consen  350 RIRDTLIHEMCHAAAWVF  367 (505)
T ss_pred             HHHHHHHHHHHHHHHhhc
Confidence            677899999999865443


No 68 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=57.80  E-value=2  Score=39.02  Aligned_cols=12  Identities=33%  Similarity=0.625  Sum_probs=10.3

Q ss_pred             HHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAI   57 (236)
Q Consensus        46 l~t~~HE~GHal   57 (236)
                      +.|++||+||.+
T Consensus       166 igv~~HE~gH~l  177 (286)
T TIGR03296       166 VGVIAHELGHDL  177 (286)
T ss_pred             eeeeehhhhccc
Confidence            789999999943


No 69 
>COG0750 Predicted membrane-associated Zn-dependent proteases 1 [Cell envelope biogenesis, outer membrane]
Probab=57.22  E-value=12  Score=34.45  Aligned_cols=30  Identities=23%  Similarity=0.308  Sum_probs=24.9

Q ss_pred             HHHHHHHHHHHHHHHHhCCceeeEEEeeCC
Q 026597           46 ITVFLHEASHAIACKLTCGDVEGMQVHANE   75 (236)
Q Consensus        46 l~t~~HE~GHalaalltGg~v~~I~l~~d~   75 (236)
                      .....||.||-+.+..+|.+++++.+-.-.
T Consensus        14 ~lv~~he~gh~~~a~~~~~~v~~f~ig~g~   43 (375)
T COG0750          14 VLVFVHELGHFWVARRCGVKVERFSIGFGP   43 (375)
T ss_pred             HHHHHHHHhhHHHHHhcCceeEEEEeccCc
Confidence            346799999999999999999997665443


No 70 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=56.65  E-value=10  Score=30.79  Aligned_cols=18  Identities=33%  Similarity=0.455  Sum_probs=15.6

Q ss_pred             hHHHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIACKL   61 (236)
Q Consensus        44 r~l~t~~HE~GHalaall   61 (236)
                      .+..|+.||+.|+.+-..
T Consensus        58 ~l~~~l~HEm~H~~~~~~   75 (146)
T smart00731       58 RLRETLLHELCHAALYLF   75 (146)
T ss_pred             HHHhhHHHHHHHHHHHHh
Confidence            566699999999999885


No 71 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=56.05  E-value=7.1  Score=33.65  Aligned_cols=17  Identities=18%  Similarity=0.358  Sum_probs=13.5

Q ss_pred             hhHHHHHHHHHHHHHHH
Q 026597           43 FKLITVFLHEASHAIAC   59 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaa   59 (236)
                      +.-..|++||+||.+-.
T Consensus       143 ~~~~~~~AHElGH~lG~  159 (220)
T cd04272         143 YYGVYTMTHELAHLLGA  159 (220)
T ss_pred             cccHHHHHHHHHHHhCC
Confidence            35579999999998743


No 72 
>KOG0734 consensus AAA+-type ATPase containing the peptidase M41 domain [Posttranslational modification, protein turnover, chaperones]
Probab=55.31  E-value=2.7  Score=42.19  Aligned_cols=41  Identities=20%  Similarity=0.141  Sum_probs=31.6

Q ss_pred             hhhHHHHHHHHHHHHHHHHHhCC--ceeeEEEeeCCCcceEee
Q 026597           42 PFKLITVFLHEASHAIACKLTCG--DVEGMQVHANEGGATQTR   82 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaalltGg--~v~~I~l~~d~sG~t~s~   82 (236)
                      +=..-.|..||.|||++|+-|-+  .+++-+|-|.++-.-.+.
T Consensus       554 ~eak~~TAyHE~GHAivA~yTk~A~PlhKaTImPRG~sLG~t~  596 (752)
T KOG0734|consen  554 EEAKKITAYHEGGHAIVALYTKGAMPLHKATIMPRGPSLGHTS  596 (752)
T ss_pred             hhhhhhhhhhccCceEEEeecCCCccccceeeccCCcccccee
Confidence            33455789999999999999976  578888999887544433


No 73 
>PF14891 Peptidase_M91:  Effector protein
Probab=55.30  E-value=9.4  Score=31.98  Aligned_cols=24  Identities=25%  Similarity=0.223  Sum_probs=18.4

Q ss_pred             chhhHHHHHHHHHHHHHHHHHhCCc
Q 026597           41 TPFKLITVFLHEASHAIACKLTCGD   65 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaalltGg~   65 (236)
                      .+..=++++.|||.||.=.+ .|..
T Consensus        99 ~~~~p~v~L~HEL~HA~~~~-~Gt~  122 (174)
T PF14891_consen   99 SPRPPFVVLYHELIHAYDYM-NGTM  122 (174)
T ss_pred             hhHHHHHHHHHHHHHHHHHH-CCCC
Confidence            45667899999999999765 4444


No 74 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=55.00  E-value=8  Score=32.60  Aligned_cols=17  Identities=18%  Similarity=0.460  Sum_probs=14.2

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .++....+++||+||.+
T Consensus       127 ~~~~~a~~~AHelGH~l  143 (199)
T PF01421_consen  127 SGLSFAVIIAHELGHNL  143 (199)
T ss_dssp             SHHHHHHHHHHHHHHHT
T ss_pred             hhHHHHHHHHHHHHHhc
Confidence            46778899999999964


No 75 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=54.70  E-value=4.2  Score=35.76  Aligned_cols=12  Identities=42%  Similarity=0.736  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHH
Q 026597           47 TVFLHEASHAIA   58 (236)
Q Consensus        47 ~t~~HE~GHala   58 (236)
                      .|++||+||-+-
T Consensus       147 ~t~AHElGHnLG  158 (228)
T cd04271         147 QVFAHEIGHTFG  158 (228)
T ss_pred             eehhhhhhhhcC
Confidence            599999999874


No 76 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=54.50  E-value=6.2  Score=37.79  Aligned_cols=29  Identities=14%  Similarity=0.215  Sum_probs=20.8

Q ss_pred             HHHHHHHHHHHHHHHHhCCceee---EEEeeC
Q 026597           46 ITVFLHEASHAIACKLTCGDVEG---MQVHAN   74 (236)
Q Consensus        46 l~t~~HE~GHalaalltGg~v~~---I~l~~d   74 (236)
                      ..|++||++||+--.=.=-|-++   |+|+.|
T Consensus       160 ~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~  191 (411)
T KOG3714|consen  160 FGTIVHELMHALGFWHEHSRPDRDNYVSINWD  191 (411)
T ss_pred             CchhHHHHHHHhhhhhccCcccccCceEEeec
Confidence            78999999999976655555444   666655


No 77 
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=52.48  E-value=8.4  Score=37.77  Aligned_cols=19  Identities=21%  Similarity=0.212  Sum_probs=15.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHh
Q 026597           43 FKLITVFLHEASHAIACKLT   62 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaallt   62 (236)
                      ..-+.|+.||+||+. --++
T Consensus       245 ~~D~~t~~HE~GH~~-yy~~  263 (477)
T cd06461         245 MEDFVTVHHEMGHIQ-YYLQ  263 (477)
T ss_pred             HHHHHHHHHHHHHHH-HHHH
Confidence            567899999999999 4444


No 78 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=50.28  E-value=11  Score=31.71  Aligned_cols=17  Identities=29%  Similarity=0.346  Sum_probs=13.7

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .+=-+..++.||+||..
T Consensus        85 ~~~el~aVlaHElgH~~  101 (226)
T PF01435_consen   85 SEDELAAVLAHELGHIK  101 (226)
T ss_dssp             SHHHHHHHHHHHHHHHH
T ss_pred             cHHHHHHHHHHHHHHHH
Confidence            34467889999999985


No 79 
>PRK03982 heat shock protein HtpX; Provisional
Probab=46.97  E-value=12  Score=33.87  Aligned_cols=17  Identities=29%  Similarity=0.258  Sum_probs=14.0

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .+=-+..++.||+||.-
T Consensus       121 ~~~El~AVlAHElgHi~  137 (288)
T PRK03982        121 NEDELEGVIAHELTHIK  137 (288)
T ss_pred             CHHHHHHHHHHHHHHHH
Confidence            44568899999999974


No 80 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=45.57  E-value=19  Score=33.14  Aligned_cols=21  Identities=29%  Similarity=0.330  Sum_probs=18.0

Q ss_pred             chhhHHHHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIACKL   61 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaall   61 (236)
                      -||-..-++.||.||.+=.++
T Consensus       166 G~~a~ayVlAHEyGHHVQ~l~  186 (292)
T PF04228_consen  166 GDFAQAYVLAHEYGHHVQNLL  186 (292)
T ss_pred             cHHHHHHHHHHHHHHHHHHHh
Confidence            477888899999999998875


No 81 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=45.54  E-value=11  Score=33.37  Aligned_cols=19  Identities=26%  Similarity=0.163  Sum_probs=16.7

Q ss_pred             cchhhHHHHHHHHHHHHHH
Q 026597           40 LTPFKLITVFLHEASHAIA   58 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHala   58 (236)
                      -.+-++..|.-||.|||+-
T Consensus       182 ~~~e~L~~tarhElGhaLg  200 (236)
T COG5549         182 ELRENLNPTARHELGHALG  200 (236)
T ss_pred             cchhhhhHHHHHhhcchhe
Confidence            5678999999999999973


No 82 
>PRK03001 M48 family peptidase; Provisional
Probab=45.11  E-value=13  Score=33.48  Aligned_cols=17  Identities=24%  Similarity=0.307  Sum_probs=13.7

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .+=-+..++.||+||.-
T Consensus       120 ~~~El~aVlAHElgHi~  136 (283)
T PRK03001        120 SEREIRGVMAHELAHVK  136 (283)
T ss_pred             CHHHHHHHHHHHHHHHh
Confidence            34567899999999974


No 83 
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=44.35  E-value=13  Score=30.43  Aligned_cols=23  Identities=22%  Similarity=0.150  Sum_probs=15.4

Q ss_pred             HhhcccccchhhHHHHHHHHHHH
Q 026597           33 ALWRTILLTPFKLITVFLHEASH   55 (236)
Q Consensus        33 ~lW~~~~~~p~r~l~t~~HE~GH   55 (236)
                      .+|+.|....-.-..|++||+.|
T Consensus        84 ~F~~~p~~g~~Sk~~TLiHE~SH  106 (148)
T PF14521_consen   84 AFFSAPTTGKDSKEGTLIHEWSH  106 (148)
T ss_dssp             HHHHS-SSSTT-HHHHHHHHHHH
T ss_pred             hhcCCCCCCCCchHHHHHHhhhh
Confidence            35566554444568999999999


No 84 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=44.00  E-value=15  Score=32.54  Aligned_cols=17  Identities=29%  Similarity=0.286  Sum_probs=13.7

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .+=-+-.++.||+||..
T Consensus       153 ~~dEl~aVlaHElgHi~  169 (302)
T COG0501         153 NDDELEAVLAHELGHIK  169 (302)
T ss_pred             CHHHHHHHHHHHHHHHh
Confidence            44467889999999974


No 85 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=43.49  E-value=12  Score=34.66  Aligned_cols=15  Identities=20%  Similarity=0.399  Sum_probs=12.1

Q ss_pred             hHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIA   58 (236)
Q Consensus        44 r~l~t~~HE~GHala   58 (236)
                      +-..|++||+||=+-
T Consensus       149 ~~~~~~~HElgHN~G  163 (314)
T PF05548_consen  149 QDWATIMHELGHNLG  163 (314)
T ss_pred             ccHHHHHHHhhhhcc
Confidence            345799999999884


No 86 
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=43.45  E-value=22  Score=36.44  Aligned_cols=30  Identities=23%  Similarity=0.320  Sum_probs=24.1

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCceeeE
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGDVEGM   69 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~v~~I   69 (236)
                      +--+.-+.|.|||+||++==+++-....++
T Consensus       490 ll~~~ev~t~FheFGh~~q~ll~Qa~~~~f  519 (718)
T KOG2089|consen  490 LLGHDEVETLFHEFGHVLQHLLTQADFARF  519 (718)
T ss_pred             ccchHHHHHHHHHHhHHHHHHHhcCccccc
Confidence            455678999999999999999887655544


No 87 
>PRK03072 heat shock protein HtpX; Provisional
Probab=43.44  E-value=15  Score=33.49  Aligned_cols=16  Identities=38%  Similarity=0.339  Sum_probs=13.3

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+..++.||+||.
T Consensus       123 ~~~El~aVlAHElgHi  138 (288)
T PRK03072        123 NERELRGVLGHELSHV  138 (288)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            4557889999999995


No 88 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=42.65  E-value=12  Score=34.68  Aligned_cols=18  Identities=33%  Similarity=0.624  Sum_probs=14.9

Q ss_pred             cchhhHHHHHHHHHHHHH
Q 026597           40 LTPFKLITVFLHEASHAI   57 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHal   57 (236)
                      .-|++.-.|..||++|-.
T Consensus       191 ~p~~~~P~T~~HElAHq~  208 (318)
T PF12725_consen  191 LPPYSLPFTICHELAHQL  208 (318)
T ss_pred             CCcccccHHHHHHHHHHh
Confidence            567788889999999953


No 89 
>PRK01345 heat shock protein HtpX; Provisional
Probab=42.33  E-value=15  Score=33.89  Aligned_cols=16  Identities=31%  Similarity=0.461  Sum_probs=13.0

Q ss_pred             hhhHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAI   57 (236)
Q Consensus        42 p~r~l~t~~HE~GHal   57 (236)
                      +=-+..++.||+||.-
T Consensus       121 ~dEL~aVlAHElgHi~  136 (317)
T PRK01345        121 PEEVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3467889999999963


No 90 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=41.52  E-value=17  Score=31.58  Aligned_cols=16  Identities=19%  Similarity=0.200  Sum_probs=13.7

Q ss_pred             hhhHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAI   57 (236)
Q Consensus        42 p~r~l~t~~HE~GHal   57 (236)
                      --|..-+.+||.||.+
T Consensus       142 ~~R~~Kea~HElGH~~  157 (194)
T PF07998_consen  142 LERVCKEAVHELGHLF  157 (194)
T ss_dssp             HHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            4688899999999975


No 91 
>PRK04897 heat shock protein HtpX; Provisional
Probab=40.96  E-value=17  Score=33.24  Aligned_cols=16  Identities=38%  Similarity=0.279  Sum_probs=12.9

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+..++.||+||.
T Consensus       133 ~~~El~aVlAHElgHi  148 (298)
T PRK04897        133 NREELEGVIGHEISHI  148 (298)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            3456788999999995


No 92 
>TIGR02185 Trep_Strep conserved hypothetical integral membrane protein TIGR02185. This family consists of strongly hydrophobic proteins about 190 amino acids in length with a strongly basic motif near the C-terminus. If is found in rather few species, but in paralogous families of 12 members in the oral pathogenic spirochaete Treponema denticola and 2 in Streptococcus pneumoniae R6.
Probab=40.95  E-value=2.3e+02  Score=24.12  Aligned_cols=65  Identities=17%  Similarity=0.261  Sum_probs=39.3

Q ss_pred             HHHHHHHHHHHHHHHHHhcC-CCCCChHHHHHhhCCCCCCchhHHHHHHHHHHHHHHHHHHHHhhhcC
Q 026597          170 IGVMNSLFSVYDIYDDLISR-RVHSSDAEKFAEVCPCPCNGVGWGVIWGIISFAFLCAAMYLGLVILS  236 (236)
Q Consensus       170 lGv~~~l~sv~d~~~~~~~~-~~~~SDa~~La~lt~~~~P~~~W~~lw~~~sl~~l~~~~~~~~~~~~  236 (236)
                      ++..+...+.|=..|+...+ .++..|.+..++....  -..-|..++.+..+.+=..|.+.|-+.+|
T Consensus       119 ~~~~g~~~p~~~~~d~y~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~~~~t~v~~~iG~~iG~kllk  184 (189)
T TIGR02185       119 LVAMGPILPIWLFKDEYIAFFAARGDSAEYIDQYIKY--VSAIWAVIMIVLTAVAGIAGVLIGKKLLK  184 (189)
T ss_pred             HHHHHHHHHHHHhHHHHHHHHHHcCCcHHHHHHHHHh--cchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44555555666665555432 2233354555555443  23347888888888888888888876654


No 93 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=40.55  E-value=12  Score=34.67  Aligned_cols=14  Identities=29%  Similarity=0.358  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIA   58 (236)
Q Consensus        45 ~l~t~~HE~GHala   58 (236)
                      .=.++.||.||+.-
T Consensus       193 ~~~~f~HE~GH~~G  206 (305)
T PF10462_consen  193 YGNEFSHELGHNFG  206 (305)
T ss_dssp             SHHHHHHHHHHTTT
T ss_pred             ccceeehhhhhhcC
Confidence            34689999999863


No 94 
>PRK05457 heat shock protein HtpX; Provisional
Probab=39.40  E-value=18  Score=32.83  Aligned_cols=16  Identities=31%  Similarity=0.364  Sum_probs=12.9

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+..++.||+||.
T Consensus       130 ~~~El~aVlAHElgHi  145 (284)
T PRK05457        130 SRDEVEAVLAHEISHI  145 (284)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            4456788999999995


No 95 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.30  E-value=5.5  Score=34.04  Aligned_cols=16  Identities=19%  Similarity=0.314  Sum_probs=12.4

Q ss_pred             hhHHHHHHHHHHHHHH
Q 026597           43 FKLITVFLHEASHAIA   58 (236)
Q Consensus        43 ~r~l~t~~HE~GHala   58 (236)
                      +....|++||+||-+-
T Consensus       138 ~~~a~~~aHElGH~LG  153 (207)
T cd04273         138 LSSAFTIAHELGHVLG  153 (207)
T ss_pred             ceeEEeeeeechhhcC
Confidence            3456789999999873


No 96 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=38.90  E-value=20  Score=30.55  Aligned_cols=15  Identities=20%  Similarity=0.299  Sum_probs=12.2

Q ss_pred             hhHHHHHHHHHHHHH
Q 026597           43 FKLITVFLHEASHAI   57 (236)
Q Consensus        43 ~r~l~t~~HE~GHal   57 (236)
                      -|..-+..||+||.+
T Consensus       123 ~R~~k~~~HElGH~l  137 (179)
T PRK13267        123 ERVRKEVTHELGHTL  137 (179)
T ss_pred             HHHHHHHHHHHHHHc
Confidence            377777899999974


No 97 
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=38.26  E-value=12  Score=36.65  Aligned_cols=33  Identities=21%  Similarity=0.133  Sum_probs=21.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHhCCceeeEEEeeCCC
Q 026597           42 PFKLITVFLHEASHAIACKLTCGDVEGMQVHANEG   76 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaalltGg~v~~I~l~~d~s   76 (236)
                      ...+.-|.+||.||++-  |...+..+.-++|+=.
T Consensus       208 g~~l~~Va~HEiGH~LG--L~HS~~~~aiM~P~y~  240 (469)
T KOG1565|consen  208 GVDLFLVAAHEIGHALG--LGHSSDPDAIMYPFYQ  240 (469)
T ss_pred             cchhHHHhhhhcccccc--cCCCCCcccccccccc
Confidence            46788999999999974  3333555555555433


No 98 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=37.51  E-value=3.6  Score=36.30  Aligned_cols=17  Identities=29%  Similarity=0.495  Sum_probs=13.7

Q ss_pred             hhhHHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAIA   58 (236)
Q Consensus        42 p~r~l~t~~HE~GHala   58 (236)
                      |...-.|++||.||.+-
T Consensus       134 ~~n~g~t~~HEvGH~lG  150 (225)
T cd04275         134 PYNLGDTATHEVGHWLG  150 (225)
T ss_pred             cccccceeEEeccceee
Confidence            55667899999999863


No 99 
>PRK02391 heat shock protein HtpX; Provisional
Probab=36.85  E-value=21  Score=32.65  Aligned_cols=16  Identities=44%  Similarity=0.565  Sum_probs=13.1

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+..++.||+||.
T Consensus       129 ~~~El~aVlaHElgHi  144 (296)
T PRK02391        129 DPDELEAVLAHELSHV  144 (296)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            4456889999999995


No 100
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=35.24  E-value=23  Score=31.18  Aligned_cols=17  Identities=24%  Similarity=0.149  Sum_probs=13.3

Q ss_pred             HHHHHHHHHHHHHHHHhC
Q 026597           46 ITVFLHEASHAIACKLTC   63 (236)
Q Consensus        46 l~t~~HE~GHalaalltG   63 (236)
                      -.++.||++||+.-+ .|
T Consensus        94 gsiLAHE~mHa~Lrl-~g  110 (212)
T PF12315_consen   94 GSILAHELMHAWLRL-NG  110 (212)
T ss_pred             hhHHHHHHHHHHhcc-cC
Confidence            567899999999833 44


No 101
>PRK02870 heat shock protein HtpX; Provisional
Probab=34.99  E-value=25  Score=32.96  Aligned_cols=16  Identities=38%  Similarity=0.378  Sum_probs=13.0

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+-.++.||+||.
T Consensus       169 ~~dEL~aVlAHELgHi  184 (336)
T PRK02870        169 DRDELQAVMAHELSHI  184 (336)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            3446788999999997


No 102
>PRK01265 heat shock protein HtpX; Provisional
Probab=34.61  E-value=24  Score=32.91  Aligned_cols=16  Identities=25%  Similarity=0.166  Sum_probs=12.8

Q ss_pred             chhhHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHA   56 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHa   56 (236)
                      .+=-+..++.||+||.
T Consensus       136 ~~~El~aVlAHElgHi  151 (324)
T PRK01265        136 NRDEIKAVAGHELGHL  151 (324)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            3445788999999995


No 103
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=34.20  E-value=25  Score=30.94  Aligned_cols=13  Identities=54%  Similarity=0.662  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAI   57 (236)
Q Consensus        45 ~l~t~~HE~GHal   57 (236)
                      -+.+..||-|||+
T Consensus        92 ~~aVAAHEVGHAi  104 (226)
T COG2738          92 AIAVAAHEVGHAI  104 (226)
T ss_pred             HHHHHHHHhhHHH
Confidence            4567899999997


No 104
>PRK04860 hypothetical protein; Provisional
Probab=30.25  E-value=50  Score=27.68  Aligned_cols=21  Identities=24%  Similarity=0.200  Sum_probs=18.1

Q ss_pred             hhHHHHHHHHHHHHHHHHHhC
Q 026597           43 FKLITVFLHEASHAIACKLTC   63 (236)
Q Consensus        43 ~r~l~t~~HE~GHalaalltG   63 (236)
                      -.+..|+.||+.|..+=...|
T Consensus        61 ~~l~~~v~HEl~H~~~~~~~g   81 (160)
T PRK04860         61 AFIDEVVPHELAHLLVYQLFG   81 (160)
T ss_pred             HHHHhHHHHHHHHHHHHHHcC
Confidence            356788899999999999887


No 105
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=30.05  E-value=38  Score=32.23  Aligned_cols=16  Identities=50%  Similarity=0.567  Sum_probs=13.6

Q ss_pred             hHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIAC   59 (236)
Q Consensus        44 r~l~t~~HE~GHalaa   59 (236)
                      .+..++.||.+|+++-
T Consensus       274 glAtvLgHE~aHaVar  289 (424)
T KOG2661|consen  274 GLATVLGHEIAHAVAR  289 (424)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4678899999999975


No 106
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=29.60  E-value=35  Score=27.50  Aligned_cols=12  Identities=33%  Similarity=0.459  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHH
Q 026597           47 TVFLHEASHAIA   58 (236)
Q Consensus        47 ~t~~HE~GHala   58 (236)
                      .|+.||+.|++-
T Consensus        81 ~TL~HEL~H~WQ   92 (141)
T PHA02456         81 DTLAHELNHAWQ   92 (141)
T ss_pred             HHHHHHHHHHHh
Confidence            699999999985


No 107
>PF09527 ATPase_gene1:  Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=29.19  E-value=1.8e+02  Score=19.40  Aligned_cols=34  Identities=15%  Similarity=0.344  Sum_probs=22.7

Q ss_pred             HHHHhcchhhH-HHHHHHHHHHHHHHHHHHHHHHH
Q 026597          150 VIWVLQEETSV-RILRYVILFIGVMNSLFSVYDIY  183 (236)
Q Consensus       150 ~~~~~~~~~~~-~~l~~~~~flGv~~~l~sv~d~~  183 (236)
                      +-|++|++.+. +.......++|+..++|+++...
T Consensus        20 ~G~~lD~~~~t~p~~~~~g~llG~~~g~~~~~~~~   54 (55)
T PF09527_consen   20 LGYWLDKWFGTSPWFTLIGLLLGIAAGFYNVYRLV   54 (55)
T ss_pred             HHHHHHHHcCCChHHHHHHHHHHHHHHHHHHHHHh
Confidence            34667765543 55566667788888888887654


No 108
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=28.91  E-value=35  Score=33.63  Aligned_cols=17  Identities=29%  Similarity=0.358  Sum_probs=13.8

Q ss_pred             chhhHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAI   57 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHal   57 (236)
                      .|--+..++.||.||..
T Consensus       126 ~esElagViAHEigHv~  142 (484)
T COG4783         126 NESELAGVIAHEIGHVA  142 (484)
T ss_pred             CHHHHHHHHHHHHHHHh
Confidence            45678899999999963


No 109
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=28.47  E-value=42  Score=33.14  Aligned_cols=20  Identities=20%  Similarity=0.258  Sum_probs=15.6

Q ss_pred             chhhHHHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIACK   60 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaal   60 (236)
                      .+..-+-...||.|||+=-.
T Consensus       255 d~~~~l~s~iHE~GHalYEq  274 (494)
T PF02074_consen  255 DFLSALFSTIHETGHALYEQ  274 (494)
T ss_dssp             BTHHHHHHHHHHHHHHHHHH
T ss_pred             cHHHHHHHHHHHHHHHHHHc
Confidence            45666777789999999766


No 110
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=27.30  E-value=27  Score=33.42  Aligned_cols=16  Identities=31%  Similarity=0.468  Sum_probs=12.7

Q ss_pred             hHHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIAC   59 (236)
Q Consensus        44 r~l~t~~HE~GHalaa   59 (236)
                      ....++.||+||..+.
T Consensus       123 EvAAVl~HEmgHVtAn  138 (479)
T COG4784         123 EVAAVLAHEMGHVTAN  138 (479)
T ss_pred             HHHHHHHhhhhheecc
Confidence            4568999999997653


No 111
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=27.23  E-value=8.5  Score=39.18  Aligned_cols=12  Identities=50%  Similarity=0.786  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAI   57 (236)
Q Consensus        46 l~t~~HE~GHal   57 (236)
                      +.|++||+||.+
T Consensus       222 iGVfaHEfGH~L  233 (645)
T PF05547_consen  222 IGVFAHEFGHDL  233 (645)
T ss_pred             eEEEEeeccccC
Confidence            789999999965


No 112
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=26.69  E-value=38  Score=33.37  Aligned_cols=15  Identities=27%  Similarity=0.634  Sum_probs=12.6

Q ss_pred             hHHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAIA   58 (236)
Q Consensus        44 r~l~t~~HE~GHala   58 (236)
                      ..+-|..||+.||+-
T Consensus       209 ~~~~~~~HEi~HaLG  223 (521)
T PF01457_consen  209 EFFRTVIHEIAHALG  223 (521)
T ss_dssp             HHHHHHHHHHHHHTT
T ss_pred             cccceeeeeeeeeee
Confidence            556799999999983


No 113
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=26.30  E-value=33  Score=30.70  Aligned_cols=10  Identities=30%  Similarity=0.255  Sum_probs=7.6

Q ss_pred             HHHHHHHHHH
Q 026597           49 FLHEASHAIA   58 (236)
Q Consensus        49 ~~HE~GHala   58 (236)
                      ..||+||-.=
T Consensus       223 ~~HE~GH~~Q  232 (307)
T PF13402_consen  223 PWHELGHNHQ  232 (307)
T ss_dssp             HHHHHHHHH-
T ss_pred             ehhhhhhhcC
Confidence            6899999753


No 114
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=26.27  E-value=37  Score=32.99  Aligned_cols=13  Identities=38%  Similarity=0.623  Sum_probs=10.2

Q ss_pred             hHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHA   56 (236)
Q Consensus        44 r~l~t~~HE~GHa   56 (236)
                      -+.++..||.||=
T Consensus       279 el~AVl~HELGHW  291 (428)
T KOG2719|consen  279 ELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHhhHH
Confidence            3567889999994


No 115
>PRK11383 hypothetical protein; Provisional
Probab=26.06  E-value=2.2e+02  Score=23.72  Aligned_cols=41  Identities=12%  Similarity=0.220  Sum_probs=27.9

Q ss_pred             HHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 026597          145 IVFLAVIWVLQEETSVRILRYVILFIGVMNSLFSVYDIYDDL  186 (236)
Q Consensus       145 ~~l~~~~~~~~~~~~~~~l~~~~~flGv~~~l~sv~d~~~~~  186 (236)
                      ..+.+.+|-.+...+.+-.+....++++++++ ++-.-.||+
T Consensus        88 ~LL~iGLwNA~l~lsEKGfY~~af~lsLFgai-~vQKn~RD~  128 (145)
T PRK11383         88 ALLMVGLWNATLLLSEKGFYGLAFFLSLFGAV-AVQKNIRDA  128 (145)
T ss_pred             HHHHHHHhcCCcchhhhhHHHHHHHHHHHHHH-HHHhhhhhh
Confidence            34556678777666777777777888777777 555555555


No 116
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=25.86  E-value=42  Score=28.77  Aligned_cols=14  Identities=29%  Similarity=0.679  Sum_probs=11.8

Q ss_pred             hHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAI   57 (236)
Q Consensus        44 r~l~t~~HE~GHal   57 (236)
                      .++.|+.||+.|-.
T Consensus        81 ~i~~t~lHELaH~~   94 (186)
T PF08325_consen   81 TILGTMLHELAHNV   94 (186)
T ss_pred             HHHHHHHHHHHhcc
Confidence            46789999999964


No 117
>PRK04351 hypothetical protein; Provisional
Probab=24.45  E-value=70  Score=26.46  Aligned_cols=22  Identities=23%  Similarity=0.055  Sum_probs=16.7

Q ss_pred             hhhHHHHHHHHHHHHHHHHHhCC
Q 026597           42 PFKLITVFLHEASHAIACKLTCG   64 (236)
Q Consensus        42 p~r~l~t~~HE~GHalaalltGg   64 (236)
                      +-++..|+-||+.|...=+. |+
T Consensus        58 ~~~l~~vv~HElcH~~~~~~-g~   79 (149)
T PRK04351         58 LEELIGIIKHELCHYHLHLE-GK   79 (149)
T ss_pred             HHHHHhhHHHHHHHHHHHHH-CC
Confidence            56788999999999754443 54


No 118
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=24.15  E-value=47  Score=24.42  Aligned_cols=12  Identities=25%  Similarity=0.542  Sum_probs=9.9

Q ss_pred             HHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAI   57 (236)
Q Consensus        46 l~t~~HE~GHal   57 (236)
                      ...+.||+.|.+
T Consensus        62 ~~llaHEl~Hv~   73 (79)
T PF13699_consen   62 RALLAHELAHVV   73 (79)
T ss_pred             chhHhHHHHHHH
Confidence            567899999975


No 119
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=23.64  E-value=40  Score=30.96  Aligned_cols=36  Identities=19%  Similarity=0.197  Sum_probs=25.7

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHhCCceeeEEEeeCCCcceEeec
Q 026597           40 LTPFKLITVFLHEASHAIACKLTCGDVEGMQVHANEGGATQTRG   83 (236)
Q Consensus        40 ~~p~r~l~t~~HE~GHalaalltGg~v~~I~l~~d~sG~t~s~G   83 (236)
                      -.+-....|..||+||.     ||+   +-+++.|.|+.-.+++
T Consensus       198 ~da~~yyaTl~HElghw-----tgh---~~rl~rdLs~~~~sr~  233 (316)
T COG4227         198 RDAINYYATLLHELGHW-----TGH---EARLDRDLSRAFGSRE  233 (316)
T ss_pred             cchHhHHHHHHHHhccc-----cCc---hhhhhhhhccccchhH
Confidence            35567889999999997     343   3467778887765543


No 120
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=23.35  E-value=52  Score=32.46  Aligned_cols=20  Identities=20%  Similarity=0.134  Sum_probs=14.8

Q ss_pred             chhhHHHHHHHHHHHHHHHH
Q 026597           41 TPFKLITVFLHEASHAIACK   60 (236)
Q Consensus        41 ~p~r~l~t~~HE~GHalaal   60 (236)
                      .|..-+=-..||.|||+=-.
T Consensus       255 df~~aL~g~iHE~GHAlYEq  274 (497)
T COG2317         255 DFRSALFGTIHETGHALYEQ  274 (497)
T ss_pred             cHHHHHHHHHHhhhhHHHHc
Confidence            45566667789999998544


No 121
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=23.21  E-value=44  Score=27.95  Aligned_cols=10  Identities=40%  Similarity=0.541  Sum_probs=8.5

Q ss_pred             HHHHHHHHHH
Q 026597           48 VFLHEASHAI   57 (236)
Q Consensus        48 t~~HE~GHal   57 (236)
                      +++||++|..
T Consensus       167 VvvHEL~Hl~  176 (205)
T PF01863_consen  167 VVVHELCHLR  176 (205)
T ss_pred             HHHHHHHHhc
Confidence            6899999964


No 122
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=22.18  E-value=54  Score=25.25  Aligned_cols=13  Identities=23%  Similarity=0.521  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAI   57 (236)
Q Consensus        45 ~l~t~~HE~GHal   57 (236)
                      +-.|.+||.||-+
T Consensus        73 I~~tlvhEiah~f   85 (97)
T PF06262_consen   73 IRDTLVHEIAHHF   85 (97)
T ss_dssp             HHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHc
Confidence            4579999999953


No 123
>PF11667 DUF3267:  Protein of unknown function (DUF3267);  InterPro: IPR021683  This family of proteins has no known function. 
Probab=21.55  E-value=1.4e+02  Score=22.74  Aligned_cols=20  Identities=30%  Similarity=0.258  Sum_probs=17.5

Q ss_pred             HHHHHHHHHHHHHHHhCCce
Q 026597           47 TVFLHEASHAIACKLTCGDV   66 (236)
Q Consensus        47 ~t~~HE~GHalaalltGg~v   66 (236)
                      ....||+=|++.-...|.+-
T Consensus         6 ~~~~HEliH~l~~~~~~~~~   25 (111)
T PF11667_consen    6 LIPLHELIHGLFFKLFGKKP   25 (111)
T ss_pred             eHHHHHHHHHHHHHHhCCCC
Confidence            45799999999999999965


No 124
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=21.32  E-value=54  Score=33.93  Aligned_cols=16  Identities=25%  Similarity=0.459  Sum_probs=13.4

Q ss_pred             hhhHHHHHHHHHHHHH
Q 026597           42 PFKLITVFLHEASHAI   57 (236)
Q Consensus        42 p~r~l~t~~HE~GHal   57 (236)
                      +....+|++||+||.+
T Consensus       320 ~~~~a~v~AhelgH~l  335 (716)
T KOG3607|consen  320 LLAFAVVLAHELGHNL  335 (716)
T ss_pred             chhHHHHHHHHHHhhc
Confidence            6778889999999976


No 125
>PF07607 DUF1570:  Protein of unknown function (DUF1570);  InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=21.29  E-value=57  Score=26.39  Aligned_cols=15  Identities=40%  Similarity=0.643  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHHHH
Q 026597           46 ITVFLHEASHAIACK   60 (236)
Q Consensus        46 l~t~~HE~GHalaal   60 (236)
                      ..|++||.-|=++-=
T Consensus         2 ~~T~~HEa~HQl~~N   16 (128)
T PF07607_consen    2 IATIAHEATHQLAFN   16 (128)
T ss_pred             chHHHHHHHHHHHHH
Confidence            579999999988753


No 126
>PTZ00337 surface protease GP63; Provisional
Probab=20.93  E-value=65  Score=32.43  Aligned_cols=14  Identities=36%  Similarity=0.423  Sum_probs=12.4

Q ss_pred             hHHHHHHHHHHHHH
Q 026597           44 KLITVFLHEASHAI   57 (236)
Q Consensus        44 r~l~t~~HE~GHal   57 (236)
                      +.+-|..||+.||+
T Consensus       228 ~~~~v~~HEi~HAL  241 (567)
T PTZ00337        228 GDVRVAAHELGHAL  241 (567)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45779999999999


No 127
>PHA00657 crystallin beta/gamma motif-containing protein
Probab=20.80  E-value=73  Score=35.40  Aligned_cols=17  Identities=29%  Similarity=0.385  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 026597           45 LITVFLHEASHAIACKL   61 (236)
Q Consensus        45 ~l~t~~HE~GHalaall   61 (236)
                      -++|++||+||-+--++
T Consensus       791 D~STFvHE~gH~fLE~~  807 (2052)
T PHA00657        791 DLSTFLHESGHFFLEVQ  807 (2052)
T ss_pred             cHHHHHHHHHHHHHHHH
Confidence            37899999999855544


No 128
>PRK00523 hypothetical protein; Provisional
Probab=20.14  E-value=1.5e+02  Score=21.81  Aligned_cols=24  Identities=8%  Similarity=-0.096  Sum_probs=18.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHhhh
Q 026597          211 GWGVIWGIISFAFLCAAMYLGLVI  234 (236)
Q Consensus       211 ~W~~lw~~~sl~~l~~~~~~~~~~  234 (236)
                      .|..+|.++...++.+|+..|++.
T Consensus         2 ~~~~l~I~l~i~~li~G~~~Gffi   25 (72)
T PRK00523          2 LAIGLALGLGIPLLIVGGIIGYFV   25 (72)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHH
Confidence            477788888888888888887653


No 129
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=20.03  E-value=40  Score=29.56  Aligned_cols=9  Identities=44%  Similarity=0.715  Sum_probs=7.8

Q ss_pred             HHHHHHHHH
Q 026597           49 FLHEASHAI   57 (236)
Q Consensus        49 ~~HE~GHal   57 (236)
                      +=||.||++
T Consensus       143 INHEVGH~L  151 (203)
T PF11350_consen  143 INHEVGHAL  151 (203)
T ss_pred             hhhhhhhhc
Confidence            469999997


Done!