Query         042321
Match_columns 169
No_of_seqs    154 out of 901
Neff          7.3 
Searched_HMMs 46136
Date          Fri Mar 29 05:05:50 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/042321.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/042321hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 3.3E-39 7.2E-44  281.5  12.4  161    1-168    37-230 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen  99.9 7.3E-24 1.6E-28  161.0   9.9   99   68-168     1-126 (157)
  3 PF00413 Peptidase_M10:  Matrix  99.9   2E-23 4.2E-28  157.0   9.4   97   68-167     1-123 (154)
  4 smart00235 ZnMc Zinc-dependent  99.8 1.6E-19 3.6E-24  134.3   9.1   98   66-167     2-104 (140)
  5 cd04327 ZnMc_MMP_like_3 Zinc-d  99.7 1.3E-17 2.7E-22  131.5   9.7   95   68-166     2-109 (198)
  6 cd04268 ZnMc_MMP_like Zinc-dep  99.7 7.1E-18 1.5E-22  128.1   8.0   95   73-168     1-113 (165)
  7 cd04279 ZnMc_MMP_like_1 Zinc-d  99.7 6.6E-17 1.4E-21  122.5   8.7   95   74-168     2-123 (156)
  8 cd04277 ZnMc_serralysin_like Z  99.6   1E-15 2.2E-20  119.1   8.1   82   87-168    34-132 (186)
  9 cd04280 ZnMc_astacin_like Zinc  99.6 1.4E-14 3.1E-19  112.7   8.1   89   74-166     2-91  (180)
 10 PF01400 Astacin:  Astacin (Pep  99.4 1.2E-13 2.5E-18  108.5   5.8   93   68-166     2-96  (191)
 11 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.4 6.8E-13 1.5E-17  104.9   8.5   95   66-166     7-104 (200)
 12 cd00203 ZnMc Zinc-dependent me  99.4 1.7E-12 3.6E-17   98.5   7.3   94   75-168     2-115 (167)
 13 cd04282 ZnMc_meprin Zinc-depen  99.4 2.6E-12 5.6E-17  103.5   8.6   94   66-166    43-137 (230)
 14 cd04276 ZnMc_MMP_like_2 Zinc-d  99.3 1.8E-12 3.8E-17  102.4   6.2   96   73-168     7-135 (197)
 15 cd04283 ZnMc_hatching_enzyme Z  99.2 3.5E-11 7.7E-16   93.9   7.9   88   74-166     4-94  (182)
 16 KOG3714 Meprin A metalloprotea  99.2 5.9E-11 1.3E-15  103.1   7.1   96   66-166    79-176 (411)
 17 PF02031 Peptidase_M7:  Strepto  99.1 1.1E-10 2.5E-15   85.2   5.7   87   75-168     3-96  (132)
 18 PF01471 PG_binding_1:  Putativ  99.1 9.2E-11   2E-15   74.4   3.5   46    1-54     12-57  (57)
 19 PF12388 Peptidase_M57:  Dual-a  98.7 1.1E-07 2.4E-12   75.6   8.9   94   73-166    38-150 (211)
 20 COG5549 Predicted Zn-dependent  97.9 1.1E-05 2.5E-10   63.7   3.9   76   88-166   105-205 (236)
 21 PF13583 Reprolysin_4:  Metallo  97.8 1.3E-05 2.9E-10   63.6   2.4   20  149-168   137-156 (206)
 22 PF13582 Reprolysin_3:  Metallo  97.7 1.8E-05 3.8E-10   57.1   1.2   18  149-166   107-124 (124)
 23 TIGR02869 spore_SleB spore cor  97.4 0.00018 3.8E-09   57.1   3.8   36   22-57     36-71  (201)
 24 PF13688 Reprolysin_5:  Metallo  97.3 0.00011 2.4E-09   57.1   1.7   20  149-168   142-161 (196)
 25 PF13574 Reprolysin_2:  Metallo  97.3 0.00012 2.6E-09   56.5   1.8   20  149-168   111-130 (173)
 26 PF05572 Peptidase_M43:  Pregna  97.2 0.00013 2.9E-09   55.4   1.6   19  149-167    69-87  (154)
 27 cd04269 ZnMc_adamalysin_II_lik  97.1 0.00027 5.9E-09   55.0   1.8   19  149-167   131-149 (194)
 28 cd04267 ZnMc_ADAM_like Zinc-de  97.0 0.00026 5.6E-09   55.0   1.3   21  147-167   131-151 (192)
 29 PF11350 DUF3152:  Protein of u  97.0  0.0036 7.7E-08   49.5   7.2   93   73-166    28-156 (203)
 30 PRK13267 archaemetzincin-like   96.9 0.00054 1.2E-08   53.4   1.9   20  149-168   125-144 (179)
 31 cd04275 ZnMc_pappalysin_like Z  96.8 0.00013 2.9E-09   58.7  -2.0   21  147-167   135-155 (225)
 32 PF07998 Peptidase_M54:  Peptid  96.8 0.00063 1.4E-08   53.7   1.6   18  149-166   145-162 (194)
 33 cd04270 ZnMc_TACE_like Zinc-de  96.7  0.0008 1.7E-08   54.7   1.8   19  150-168   168-186 (244)
 34 COG1913 Predicted Zn-dependent  96.7 0.00072 1.6E-08   52.3   1.4   20  149-168   124-143 (181)
 35 cd04273 ZnMc_ADAMTS_like Zinc-  96.7 0.00033 7.2E-09   55.3  -0.5   75   92-168    72-159 (207)
 36 cd04271 ZnMc_ADAM_fungal Zinc-  96.7 0.00062 1.3E-08   54.9   0.8   17  151-167   147-163 (228)
 37 PF01421 Reprolysin:  Reprolysi  96.6 0.00091   2E-08   52.3   1.5   20  149-168   131-150 (199)
 38 cd04272 ZnMc_salivary_gland_MP  96.5  0.0014   3E-08   52.3   1.7   20  148-167   144-163 (220)
 39 PRK10594 murein L,D-transpepti  96.5  0.0029 6.4E-08   57.6   3.9   37   21-57    307-343 (608)
 40 TIGR03296 M6dom_TIGR03296 M6 f  96.2  0.0013 2.8E-08   54.7  -0.0   19  150-168   166-184 (286)
 41 COG2989 Uncharacterized protei  95.6   0.012 2.7E-07   52.6   3.7   37   21-57    263-299 (561)
 42 PF10462 Peptidase_M66:  Peptid  95.3  0.0083 1.8E-07   50.5   1.4   17  149-165   193-209 (305)
 43 PF05548 Peptidase_M11:  Gameto  95.2  0.0094   2E-07   50.4   1.4   19  149-167   150-168 (314)
 44 PF05547 Peptidase_M6:  Immune   94.5  0.0067 1.5E-07   55.8  -1.2   19  150-168   222-240 (645)
 45 PF04298 Zn_peptidase_2:  Putat  94.4   0.048   1E-06   43.9   3.5   69   92-162    21-102 (222)
 46 PF12044 Metallopep:  Putative   93.2   0.049 1.1E-06   47.8   1.8   18  150-167   317-334 (423)
 47 COG3409 Putative peptidoglycan  93.2   0.067 1.5E-06   40.6   2.3   35   23-57     70-105 (185)
 48 PF09471 Peptidase_M64:  IgA Pe  92.1   0.081 1.7E-06   43.7   1.6   20  148-167   215-235 (264)
 49 KOG3607 Meltrins, fertilins an  92.1   0.081 1.8E-06   49.4   1.7   20  148-167   322-341 (716)
 50 COG3824 Predicted Zn-dependent  92.1   0.062 1.3E-06   39.3   0.8   17  149-165   109-125 (136)
 51 PF01457 Peptidase_M8:  Leishma  91.7   0.094   2E-06   47.2   1.7   17  149-165   210-226 (521)
 52 TIGR02290 M3_fam_3 oligoendope  91.3    0.11 2.4E-06   47.2   1.8   16  146-161   372-387 (587)
 53 PF06262 DUF1025:  Possibl zinc  90.4    0.17 3.6E-06   35.6   1.6   17  149-165    73-89  (97)
 54 PTZ00337 surface protease GP63  90.0    0.16 3.4E-06   46.3   1.5   17  149-165   229-245 (567)
 55 KOG3658 Tumor necrosis factor-  88.7    0.16 3.5E-06   46.8   0.6   21  149-169   392-412 (764)
 56 KOG4525 Jacalin-like lectin do  88.4    0.24 5.1E-06   43.7   1.4   18  150-167   302-319 (614)
 57 PF09374 PG_binding_3:  Predict  88.0    0.53 1.1E-05   31.1   2.6   29   29-57      1-33  (72)
 58 PF14247 DUF4344:  Domain of un  87.6    0.29 6.3E-06   39.4   1.4   13  149-161    92-104 (220)
 59 COG4783 Putative Zn-dependent   87.3     0.3 6.4E-06   43.4   1.4   15  148-162   129-143 (484)
 60 PTZ00257 Glycoprotein GP63 (le  86.8    0.37 7.9E-06   44.3   1.7   16  149-164   256-271 (622)
 61 COG2738 Predicted Zn-dependent  86.0    0.69 1.5E-05   36.7   2.6   78   89-168    21-111 (226)
 62 PF06114 DUF955:  Domain of unk  85.4    0.46 9.9E-06   32.9   1.3   13  149-161    42-54  (122)
 63 COG2856 Predicted Zn peptidase  84.5    0.43 9.4E-06   38.2   0.9   13  149-161    72-84  (213)
 64 TIGR02289 M3_not_pepF oligoend  83.5     1.2 2.6E-05   40.3   3.4   17  145-161   333-349 (549)
 65 PF01435 Peptidase_M48:  Peptid  82.7    0.71 1.5E-05   36.0   1.5   15  148-162    88-102 (226)
 66 cd06161 S2P-M50_SpoIVFB SpoIVF  81.8    0.77 1.7E-05   36.3   1.4   13  149-161    38-50  (208)
 67 PRK03982 heat shock protein Ht  81.0    0.82 1.8E-05   37.9   1.3   14  148-161   124-137 (288)
 68 cd06459 M3B_Oligoendopeptidase  80.6     1.7 3.8E-05   37.3   3.3   16  146-161   219-234 (427)
 69 PRK03001 M48 family peptidase;  79.8    0.95 2.1E-05   37.4   1.3   14  148-161   123-136 (283)
 70 COG0501 HtpX Zn-dependent prot  78.9     1.1 2.3E-05   36.7   1.4   14  148-161   156-169 (302)
 71 PRK03072 heat shock protein Ht  78.9       1 2.3E-05   37.4   1.3   14  148-161   126-139 (288)
 72 cd06258 Peptidase_M3_like The   78.8    0.99 2.2E-05   38.3   1.2   16  146-161   151-166 (365)
 73 PRK01345 heat shock protein Ht  78.8     1.1 2.3E-05   38.0   1.3   14  148-161   123-136 (317)
 74 TIGR00181 pepF oligoendopeptid  78.8       2 4.4E-05   39.0   3.2   17  145-161   374-390 (591)
 75 PLN02791 Nudix hydrolase homol  78.2     1.1 2.4E-05   42.4   1.3   17  149-165   585-601 (770)
 76 cd06164 S2P-M50_SpoIVFB_CBS Sp  77.0     1.3 2.9E-05   35.6   1.4   13  149-161    53-65  (227)
 77 PRK04897 heat shock protein Ht  76.9     1.3 2.8E-05   37.0   1.3   14  148-161   136-149 (298)
 78 PF01432 Peptidase_M3:  Peptida  76.6     1.3 2.8E-05   38.9   1.3   15  147-161   240-254 (458)
 79 cd06162 S2P-M50_PDZ_SREBP Ster  76.0     1.5 3.2E-05   36.6   1.4   13  149-161   135-147 (277)
 80 PRK01265 heat shock protein Ht  75.2     1.5 3.3E-05   37.2   1.3   13  148-160   139-151 (324)
 81 PRK05457 heat shock protein Ht  75.0     1.6 3.4E-05   36.4   1.3   13  148-160   133-145 (284)
 82 PF01447 Peptidase_M4:  Thermol  75.0     1.6 3.5E-05   33.0   1.3   12  150-161   136-147 (150)
 83 cd06455 M3A_TOP Peptidase M3 T  74.8     1.5 3.3E-05   38.9   1.2   14  148-161   262-275 (472)
 84 PF01863 DUF45:  Protein of unk  74.5     9.3  0.0002   29.4   5.5   71   88-167   110-182 (205)
 85 PF08823 PG_binding_2:  Putativ  74.4     3.4 7.4E-05   27.5   2.6   34   23-57     17-50  (74)
 86 PRK02391 heat shock protein Ht  74.4     1.7 3.6E-05   36.5   1.3   13  148-160   132-144 (296)
 87 cd06159 S2P-M50_PDZ_Arch Uncha  74.3     1.7 3.7E-05   35.8   1.4   13  149-161   118-130 (263)
 88 PRK02870 heat shock protein Ht  73.8     1.8 3.8E-05   37.0   1.4   13  148-160   172-184 (336)
 89 cd06457 M3A_MIP Peptidase M3 m  73.5     1.7 3.7E-05   38.5   1.2   24   88-111   132-155 (458)
 90 KOG3538 Disintegrin metallopro  73.2    0.83 1.8E-05   43.5  -0.8   23  145-167   313-335 (845)
 91 PF13485 Peptidase_MA_2:  Pepti  73.2       2 4.4E-05   29.8   1.4   13  149-161    25-37  (128)
 92 cd06160 S2P-M50_like_2 Unchara  71.7     2.2 4.8E-05   33.2   1.4   14  149-162    41-54  (183)
 93 KOG2921 Intramembrane metallop  71.6       2 4.3E-05   37.6   1.2   13  149-161   131-143 (484)
 94 cd06456 M3A_DCP_Oligopeptidase  70.6     2.2 4.7E-05   37.5   1.2   16  146-161   205-220 (422)
 95 PF03571 Peptidase_M49:  Peptid  68.2     2.4 5.2E-05   38.6   1.0   14  149-162   293-307 (549)
 96 PF08823 PG_binding_2:  Putativ  67.1     9.1  0.0002   25.4   3.4   46    1-54     25-74  (74)
 97 PF04228 Zn_peptidase:  Putativ  67.1     2.6 5.7E-05   35.3   1.0   13  149-161   170-182 (292)
 98 PF03272 Enhancin:  Viral enhan  64.4     3.3 7.2E-05   39.3   1.2   66   93-161   173-248 (775)
 99 PF11150 DUF2927:  Protein of u  64.3      34 0.00074   27.3   6.8   57   67-127    28-87  (213)
100 cd06460 M32_Taq Peptidase fami  64.1     3.5 7.6E-05   36.1   1.2   13  149-161   159-171 (396)
101 PF12725 DUF3810:  Protein of u  63.3     3.8 8.2E-05   34.7   1.2   18  147-164   194-211 (318)
102 PRK10911 oligopeptidase A; Pro  61.8       4 8.6E-05   38.1   1.2   14  148-161   462-475 (680)
103 COG1164 Oligoendopeptidase F [  60.2     4.2 9.1E-05   37.4   1.0   16  146-161   377-392 (598)
104 COG4412 Uncharacterized protei  60.1     2.4 5.2E-05   38.6  -0.5   17  152-168   349-365 (760)
105 COG3023 ampD N-acetyl-anhydrom  59.2      14 0.00031   30.4   3.8   34   21-54    217-251 (257)
106 KOG1047 Bifunctional leukotrie  59.1     6.8 0.00015   35.8   2.1   30  131-160   264-299 (613)
107 KOG2719 Metalloprotease [Gener  56.3     5.9 0.00013   35.0   1.2   13  148-160   279-291 (428)
108 PF08434 CLCA_N:  Calcium-activ  52.8     9.9 0.00022   31.4   2.0   21  113-134    98-118 (262)
109 PRK10280 dipeptidyl carboxypep  48.7       9 0.00019   35.9   1.2   34   88-121   362-398 (681)
110 PRK10733 hflB ATP-dependent me  48.5     9.1  0.0002   35.5   1.2   13  149-161   408-420 (644)
111 KOG2661 Peptidase family M48 [  48.4     9.9 0.00022   32.8   1.3   13  149-161   275-287 (424)
112 CHL00176 ftsH cell division pr  45.8      10 0.00022   35.2   1.1   13  149-161   438-450 (638)
113 PF06167 Peptidase_M90:  Glucos  45.6     9.6 0.00021   31.3   0.8   18  147-164   152-169 (253)
114 COG4227 Antirestriction protei  45.2      11 0.00024   31.4   1.1   18  148-165   202-219 (316)
115 COG3227 LasB Zinc metalloprote  44.5     8.9 0.00019   34.4   0.5   12  150-161   338-349 (507)
116 cd02641 R3H_Smubp-2_like R3H d  42.0      15 0.00033   23.2   1.2   18  149-166    29-46  (60)
117 PF13699 DUF4157:  Domain of un  41.8      13 0.00028   24.9   0.8   13  149-161    61-73  (79)
118 PF02074 Peptidase_M32:  Carbox  41.2      12 0.00026   33.7   0.8   12  150-161   260-271 (494)
119 PF06540 GMAP:  Galanin message  39.9      14 0.00031   23.7   0.8   25   24-48     22-46  (62)
120 COG2317 Zn-dependent carboxype  39.5      13 0.00028   33.4   0.7    9  153-161   263-271 (497)
121 COG3590 PepO Predicted metallo  39.0     8.7 0.00019   35.3  -0.4   13  149-161   487-499 (654)
122 cd06461 M2_ACE Peptidase famil  38.2      17 0.00037   32.6   1.3   17  145-161   243-259 (477)
123 PF04315 DUF462:  Protein of un  36.1      21 0.00045   27.5   1.3   15  147-161    39-53  (164)
124 PF12315 DUF3633:  Protein of u  36.0      21 0.00045   28.6   1.3   13  149-161    93-105 (212)
125 PF14891 Peptidase_M91:  Effect  35.7      21 0.00046   27.3   1.3   13  149-161   103-115 (174)
126 TIGR01241 FtsH_fam ATP-depende  35.5      19 0.00041   32.1   1.1   13  149-161   311-323 (495)
127 TIGR02412 pepN_strep_liv amino  34.0      29 0.00062   33.2   2.2   12  149-160   287-298 (831)
128 smart00731 SprT SprT homologue  33.9      22 0.00049   26.2   1.2   13  149-161    59-71  (146)
129 PF13402 M60-like:  Peptidase M  33.5      19 0.00042   29.5   0.9   12  150-161   220-231 (307)
130 COG0339 Dcp Zn-dependent oligo  33.0      20 0.00043   33.6   0.9   35   87-121   362-398 (683)
131 cd02639 R3H_RRM R3H domain of   32.8      25 0.00053   22.4   1.1   19  149-167    29-47  (60)
132 COG2321 Predicted metalloprote  30.8      23 0.00049   29.7   0.8   12  149-160   167-178 (295)
133 TIGR02411 leuko_A4_hydro leuko  30.5      31 0.00067   31.8   1.7   12  149-160   279-290 (601)
134 KOG2987 Fatty acid desaturase   29.3      26 0.00056   29.2   0.9   20  149-168    83-102 (324)
135 PRK15410 DgsA anti-repressor M  29.1      22 0.00047   29.4   0.5   19  147-165   140-158 (260)
136 PF15639 Tox-MPTase3:  Metallop  27.7 1.3E+02  0.0029   22.3   4.2   69   90-159    23-110 (135)
137 COG4784 Putative Zn-dependent   27.5      31 0.00067   30.0   1.1   15  147-161   122-136 (479)
138 PF10263 SprT-like:  SprT-like   27.4      37  0.0008   24.9   1.4   14  148-161    59-72  (157)
139 TIGR00868 hCaCC calcium-activa  27.2      67  0.0014   31.1   3.3   20  149-168   149-169 (863)
140 COG3926 zliS Lysozyme family p  25.2      73  0.0016   26.0   2.7   30   28-57     94-127 (252)
141 PF08571 Yos1:  Yos1-like;  Int  25.0 1.2E+02  0.0026   20.5   3.4   41    1-41     24-64  (80)
142 cd02640 R3H_NRF R3H domain of   23.1      51  0.0011   20.9   1.3   18  149-166    29-46  (60)
143 KOG3624 M13 family peptidase [  22.4      40 0.00087   31.3   0.9   13  149-161   518-530 (687)
144 KOG2090 Metalloendopeptidase f  22.3      42 0.00091   31.2   1.0   13  149-161   479-491 (704)
145 PF08634 Pet127:  Mitochondrial  21.9      89  0.0019   26.1   2.7   55   38-101   154-214 (274)
146 COG5192 BMS1 GTP-binding prote  21.3      53  0.0012   30.7   1.4   22   36-57    775-796 (1077)
147 PF07236 Phytoreo_S7:  Phytoreo  21.2      32 0.00069   30.7   0.0   33   25-57    204-236 (506)
148 PF15127 DUF4565:  Protein of u  21.1      63  0.0014   22.4   1.5   15   88-102    62-76  (91)
149 PF13590 DUF4136:  Domain of un  20.9 1.8E+02  0.0038   20.8   4.0   41   77-125    19-69  (151)
150 cd06007 R3H_DEXH_helicase R3H   20.4      63  0.0014   20.4   1.3   18  149-166    28-45  (59)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=3.3e-39  Score=281.55  Aligned_cols=161  Identities=29%  Similarity=0.374  Sum_probs=142.3

Q ss_pred             CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC-----Ccccc-ccCCCCCCCCCc
Q 042321            1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH-----DAELQ-ILGSPRWPASKF   74 (169)
Q Consensus         1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p-----D~~~~-~~g~~kW~~~k~   74 (169)
                      |++|||+.+.+......     .++.+++||+.||++++|++||++|.+|++.|++|     |-+|+ +++.+||+  +.
T Consensus        37 l~~~~y~~~~~~~~~~~-----~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~~~~~~~~~~kW~--k~  109 (469)
T KOG1565|consen   37 LECYGYLPPTDLTATRA-----SQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDGRYRYFPGKPKWN--KE  109 (469)
T ss_pred             hhhcccCCCcccccccc-----CchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCCccccCcccCccc--cc
Confidence            67899999887543211     36889999999999999999999999999999999     65677 77889999  99


Q ss_pred             ceEEEeccCCch----hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCC--------------
Q 042321           75 SLTYAFLPGTRG----DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAV--------------  136 (169)
Q Consensus        75 ~LTy~~~~~~~~----~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~--------------  136 (169)
                      +|||+|.+++++    +++.++++||++|++|++|+|.|+.....|||+|.|..+.|||++||||.              
T Consensus       110 ~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~  189 (469)
T KOG1565|consen  110 HLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGI  189 (469)
T ss_pred             ccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCC
Confidence            999999999755    79999999999999999999999998569999999999999999999953              


Q ss_pred             --------CCceee-cccccchHHHHHHhhhhhcCCCCccC
Q 042321          137 --------VESWLM-LQLLLESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       137 --------~e~w~~-~~~~~~~~~va~HEiGHaLGL~h~~~  168 (169)
                              +|.|+. ....+|++.||+|||||+|||+|+.+
T Consensus       190 ~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS~~  230 (469)
T KOG1565|consen  190 GGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHSSD  230 (469)
T ss_pred             CCccccCcccceeccCCccchhHHHhhhhcccccccCCCCC
Confidence                    577876 45677899999999999999999965


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.90  E-value=7.3e-24  Score=161.03  Aligned_cols=99  Identities=32%  Similarity=0.460  Sum_probs=81.7

Q ss_pred             CCCCCCcceEEEeccCCc----hhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC------
Q 042321           68 RWPASKFSLTYAFLPGTR----GDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV------  137 (169)
Q Consensus        68 kW~~~k~~LTy~~~~~~~----~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~------  137 (169)
                      ||+  +++|||+|.+..+    .+++++|++||+.|+++++|+|+|+.....|||+|.|.+..|+|+++|++.+      
T Consensus         1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a   78 (157)
T cd04278           1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA   78 (157)
T ss_pred             CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence            799  9999999987653    3789999999999999999999999876689999999999999877887532      


Q ss_pred             ---------------Cceeec--ccccchHHHHHHhhhhhcCCCCccC
Q 042321          138 ---------------ESWLML--QLLLESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       138 ---------------e~w~~~--~~~~~~~~va~HEiGHaLGL~h~~~  168 (169)
                                     +.|+..  ....+++.|++|||||||||.|+.+
T Consensus        79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~  126 (157)
T cd04278          79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSD  126 (157)
T ss_pred             cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCC
Confidence                           123321  2345699999999999999999754


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.90  E-value=2e-23  Score=156.97  Aligned_cols=97  Identities=33%  Similarity=0.496  Sum_probs=78.6

Q ss_pred             CCCCCCcceEEEeccCCch----hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCC-----
Q 042321           68 RWPASKFSLTYAFLPGTRG----DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVE-----  138 (169)
Q Consensus        68 kW~~~k~~LTy~~~~~~~~----~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e-----  138 (169)
                      ||+  +++|||+|.+.++.    ++++++++||+.|+++++|+|.++... .|||+|.|....+++..+|++...     
T Consensus         1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~~~~~~~   77 (154)
T PF00413_consen    1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSGGTLAHA   77 (154)
T ss_dssp             SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred             CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccceeeeecc
Confidence            799  99999999986643    689999999999999999999999864 699999999999988877774321     


Q ss_pred             -----------------ceeecccccchHHHHHHhhhhhcCCCCcc
Q 042321          139 -----------------SWLMLQLLLESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       139 -----------------~w~~~~~~~~~~~va~HEiGHaLGL~h~~  167 (169)
                                       .|.......++..|++|||||||||.|+-
T Consensus        78 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~  123 (154)
T PF00413_consen   78 YFPNNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN  123 (154)
T ss_dssp             EESSSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred             ccccccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence                             12111223458999999999999999974


No 4  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.80  E-value=1.6e-19  Score=134.28  Aligned_cols=98  Identities=27%  Similarity=0.392  Sum_probs=72.9

Q ss_pred             CCCCCCCCcceEEEec-c-CCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCC-CCCCCCCC--CCCce
Q 042321           66 SPRWPASKFSLTYAFL-P-GTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDH-GAGALSTA--VVESW  140 (169)
Q Consensus        66 ~~kW~~~k~~LTy~~~-~-~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~H-gd~~pfd~--~~e~w  140 (169)
                      ..+|+  +.+|+|.|. + .++.+++++|++||+.|+++++|+|+|+..  .|||+|.|....| |+..++.+  .+..+
T Consensus         2 ~~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~g~~~a~~g~~~g~~~   77 (140)
T smart00235        2 SKKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGSGCTLSHAGRPGGDQH   77 (140)
T ss_pred             CCcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCCCcceeeeecCCCceE
Confidence            46999  899999996 3 333356999999999999999999999865  7999999999988 55212222  12222


Q ss_pred             eecccccchHHHHHHhhhhhcCCCCcc
Q 042321          141 LMLQLLLESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       141 ~~~~~~~~~~~va~HEiGHaLGL~h~~  167 (169)
                      ...+..+....|++|||||||||.|.-
T Consensus        78 ~~~~~~~~~~~~~~HEigHaLGl~H~~  104 (140)
T smart00235       78 FSLGNGCINTGVAAHELGHALGLYHEQ  104 (140)
T ss_pred             EEccCCcCCcccHHHHHHHHhcCCcCC
Confidence            112333334569999999999999974


No 5  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74  E-value=1.3e-17  Score=131.46  Aligned_cols=95  Identities=28%  Similarity=0.334  Sum_probs=73.2

Q ss_pred             CCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCc---e---e
Q 042321           68 RWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVES---W---L  141 (169)
Q Consensus        68 kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~---w---~  141 (169)
                      +|++ .+.|+|.|.+..+.+.+++|++||+.|+.+++|+|+++... .+||+|.|..+. |+ +++.|....   .   +
T Consensus         2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-Gc-~S~vG~~~~~~~~~~~t   77 (198)
T cd04327           2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-GY-WSYVGTDALLIGADAPT   77 (198)
T ss_pred             CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-CC-CCCcCCcccccCCCCce
Confidence            7986 67899999887666789999999999999999999999864 799999999854 55 445543210   1   1


Q ss_pred             e-ccc------ccchHHHHHHhhhhhcCCCCc
Q 042321          142 M-LQL------LLESQSVCMELIFKELGIFIN  166 (169)
Q Consensus       142 ~-~~~------~~~~~~va~HEiGHaLGL~h~  166 (169)
                      + ++.      .++++.|++|||||||||.|.
T Consensus        78 ~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HE  109 (198)
T cd04327          78 MNLGWFTDDTPDPEFSRVVLHEFGHALGFIHE  109 (198)
T ss_pred             eeeeeecCCCchhhHHHHHHHHHHHHhcCccc
Confidence            1 221      345678999999999999985


No 6  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.74  E-value=7.1e-18  Score=128.13  Aligned_cols=95  Identities=19%  Similarity=0.173  Sum_probs=71.0

Q ss_pred             CcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCC---CCCCCCCCC------CCCceee-
Q 042321           73 KFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGD---HGAGALSTA------VVESWLM-  142 (169)
Q Consensus        73 k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~---Hgd~~pfd~------~~e~w~~-  142 (169)
                      ++.|+|.|.+..+...+++|++||+.|+++++++|+|+.....+||+|.|.+..   .|+ +.+.+      ..+-|.. 
T Consensus         1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~-~~~~~~~~~~~~g~i~~~~   79 (165)
T cd04268           1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGT-WSYGPSQVDPLTGEILLAR   79 (165)
T ss_pred             CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCc-cccCCccCCCCCccEEeeE
Confidence            467999999887778899999999999999999999998755799999998743   232 22222      1222211 


Q ss_pred             ccc--------ccchHHHHHHhhhhhcCCCCccC
Q 042321          143 LQL--------LLESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       143 ~~~--------~~~~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...        +..++.|++|||||||||.|+.+
T Consensus        80 ~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~  113 (165)
T cd04268          80 VYLYSSFVEYSGARLRNTAEHELGHALGLRHNFA  113 (165)
T ss_pred             EEEchhHHHHHHHHHHHHHHHHHHHHhcccccCc
Confidence            111        12378999999999999999864


No 7  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.70  E-value=6.6e-17  Score=122.54  Aligned_cols=95  Identities=16%  Similarity=0.169  Sum_probs=67.2

Q ss_pred             cceEEEeccCC------chhhHHHHHHHHHhhccCCCcceEeccCC-CcccEEEEeecC----CCCCC-----CCCC---
Q 042321           74 FSLTYAFLPGT------RGDAINPVARAFQTWAPNTQFQFAESQDY-RNADIKISFESG----DHGAG-----ALST---  134 (169)
Q Consensus        74 ~~LTy~~~~~~------~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~-~~ADI~I~F~~~----~Hgd~-----~pfd---  134 (169)
                      ..|+|++.+.+      ..+.+++|++||+.|+++++|+|+++... .+|||+|.|...    .-|+.     .|..   
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~   81 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDG   81 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCC
Confidence            46888888754      22789999999999999999999998753 279999999963    11221     1111   


Q ss_pred             CCCCceee-c--c-----cccchHHHHHHhhhhhcCCCCccC
Q 042321          135 AVVESWLM-L--Q-----LLLESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       135 ~~~e~w~~-~--~-----~~~~~~~va~HEiGHaLGL~h~~~  168 (169)
                      +....|.. .  .     ...+++.||+|||||+|||.|+.+
T Consensus        82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~  123 (156)
T cd04279          82 NRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSD  123 (156)
T ss_pred             cccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCC
Confidence            11122221 1  1     145799999999999999999875


No 8  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.63  E-value=1e-15  Score=119.15  Aligned_cols=82  Identities=18%  Similarity=0.219  Sum_probs=58.7

Q ss_pred             hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCC--CC---CCCC-----CCCceeec-------ccccch
Q 042321           87 DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGA--GA---LSTA-----VVESWLML-------QLLLES  149 (169)
Q Consensus        87 ~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd--~~---pfd~-----~~e~w~~~-------~~~~~~  149 (169)
                      +.++++++||+.|+.+++|+|+|+.....|||+|.+....++.  ++   |-..     .+.-|.-.       ..+...
T Consensus        34 ~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~g~~~  113 (186)
T cd04277          34 AQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSPGSYG  113 (186)
T ss_pred             HHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCCChhh
Confidence            5789999999999999999999998866899999998864221  11   2111     11222110       013347


Q ss_pred             HHHHHHhhhhhcCCCCccC
Q 042321          150 QSVCMELIFKELGIFINYN  168 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~~  168 (169)
                      ..|++|||||||||.|..+
T Consensus       114 ~~t~~HEiGHaLGL~H~~~  132 (186)
T cd04277         114 YQTIIHEIGHALGLEHPGD  132 (186)
T ss_pred             HHHHHHHHHHHhcCCCCCc
Confidence            8899999999999999753


No 9  
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.55  E-value=1.4e-14  Score=112.70  Aligned_cols=89  Identities=16%  Similarity=0.131  Sum_probs=66.0

Q ss_pred             cceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee-cccccchHHH
Q 042321           74 FSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM-LQLLLESQSV  152 (169)
Q Consensus        74 ~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-~~~~~~~~~v  152 (169)
                      ..|.|.|.+.-+...++.|++|++.|+++|+|+|+|+..   ++..|.|..+ .||.+........-.+ ++.+|....+
T Consensus         2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc~S~vG~~~~~q~i~l~~~c~~~g~   77 (180)
T cd04280           2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGCWSYVGRVGGRQVVSLGSGCFSLGT   77 (180)
T ss_pred             CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCcceecCccCCceeEEeCCCcCcCch
Confidence            468999988545578999999999999999999999864   5567777776 5664433322211112 4445666789


Q ss_pred             HHHhhhhhcCCCCc
Q 042321          153 CMELIFKELGIFIN  166 (169)
Q Consensus       153 a~HEiGHaLGL~h~  166 (169)
                      ++|||||||||.|.
T Consensus        78 v~HE~~HalG~~HE   91 (180)
T cd04280          78 IVHELMHALGFYHE   91 (180)
T ss_pred             hHHHHHHHhcCcch
Confidence            99999999999985


No 10 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.45  E-value=1.2e-13  Score=108.50  Aligned_cols=93  Identities=23%  Similarity=0.300  Sum_probs=64.6

Q ss_pred             CCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee--ccc
Q 042321           68 RWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM--LQL  145 (169)
Q Consensus        68 kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~--~~~  145 (169)
                      ||+  +.+|.|.|.+..+...++.|++|+..|++.|+++|+++. . .....|.|..+ .|| +++-|..+....  ++.
T Consensus         2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~gC-~S~vG~~~g~q~i~l~~   75 (191)
T PF01400_consen    2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SGC-WSYVGRQGGEQTINLGD   75 (191)
T ss_dssp             S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SSE-EEESS--SSEEEEEE-T
T ss_pred             cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-ccc-cchhhhcCcceeEEecc
Confidence            899  889999999865557789999999999999999999998 3 44556677654 355 344454333322  445


Q ss_pred             ccchHHHHHHhhhhhcCCCCc
Q 042321          146 LLESQSVCMELIFKELGIFIN  166 (169)
Q Consensus       146 ~~~~~~va~HEiGHaLGL~h~  166 (169)
                      ++....|++|||||||||.|.
T Consensus        76 ~c~~~~~i~HEl~HaLG~~HE   96 (191)
T PF01400_consen   76 GCFSVGTILHELGHALGFWHE   96 (191)
T ss_dssp             TC-SHHHHHHHHHHHHTB--G
T ss_pred             eeCCccchHHHHHHHHhhhhh
Confidence            555678999999999999985


No 11 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.42  E-value=6.8e-13  Score=104.92  Aligned_cols=95  Identities=14%  Similarity=0.176  Sum_probs=69.1

Q ss_pred             CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC--Cceee-
Q 042321           66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV--ESWLM-  142 (169)
Q Consensus        66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~--e~w~~-  142 (169)
                      ..+|+  +..|-|.|...-+...+++|++|+..|++.|+++|++...   ..-.|.|..+..||. ++-|..  ..-.+ 
T Consensus         7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~---~~~yi~f~~~~~GC~-S~vG~~~~g~q~is   80 (200)
T cd04281           7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTP---EENYIVFTYRPCGCC-SYVGRRGNGPQAIS   80 (200)
T ss_pred             cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCC---CCCEEEEEECCCCee-EcCCCcCCCceeee
Confidence            36999  8999999976544456789999999999999999999764   234666766555664 333321  11122 


Q ss_pred             cccccchHHHHHHhhhhhcCCCCc
Q 042321          143 LQLLLESQSVCMELIFKELGIFIN  166 (169)
Q Consensus       143 ~~~~~~~~~va~HEiGHaLGL~h~  166 (169)
                      ++.+|....|++|||||||||.|.
T Consensus        81 l~~~C~~~Gti~HEl~HaLGf~HE  104 (200)
T cd04281          81 IGKNCDKFGIVVHELGHVIGFWHE  104 (200)
T ss_pred             cCCCcCcCchHHHHHHHHhcCcch
Confidence            445565678999999999999984


No 12 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.37  E-value=1.7e-12  Score=98.51  Aligned_cols=94  Identities=17%  Similarity=0.140  Sum_probs=64.1

Q ss_pred             ceEEEeccC--------CchhhHHHHHHHHHhhccCCCcceEeccCC-CcccEEEEeecCCCC---CCCCCCCC------
Q 042321           75 SLTYAFLPG--------TRGDAINPVARAFQTWAPNTQFQFAESQDY-RNADIKISFESGDHG---AGALSTAV------  136 (169)
Q Consensus        75 ~LTy~~~~~--------~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~-~~ADI~I~F~~~~Hg---d~~pfd~~------  136 (169)
                      .|+|.|...        .+..++++|+.|++.|+++++++|+++... ..+||.|.+....++   .++++-+.      
T Consensus         2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~   81 (167)
T cd00203           2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR   81 (167)
T ss_pred             EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence            355666544        223688999999999999999999999864 379999999875432   12233222      


Q ss_pred             CCceeec-c-cccchHHHHHHhhhhhcCCCCccC
Q 042321          137 VESWLML-Q-LLLESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       137 ~e~w~~~-~-~~~~~~~va~HEiGHaLGL~h~~~  168 (169)
                      ..-|... . ....+..|++|||||+|||.|+.+
T Consensus        82 ~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~  115 (167)
T cd00203          82 GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHD  115 (167)
T ss_pred             CcEEEecCCcccccchhhHHHHHHHHhCCCccCc
Confidence            1122111 1 112478899999999999999753


No 13 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.36  E-value=2.6e-12  Score=103.52  Aligned_cols=94  Identities=15%  Similarity=0.236  Sum_probs=68.1

Q ss_pred             CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCceee-cc
Q 042321           66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVESWLM-LQ  144 (169)
Q Consensus        66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-~~  144 (169)
                      ..+|+  + .+-|.|...-+...++.|++|++.|++.|+++|++.... ..  .|.|..+. ||.+...-.+..-.+ ++
T Consensus        43 ~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~~-GC~S~vG~~gg~q~isl~  115 (230)
T cd04282          43 TYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKGS-GCWSMVGDQQGGQNLSIG  115 (230)
T ss_pred             ccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcCC-CeeeccCccCCeEEEEEC
Confidence            36999  7 899999775455678999999999999999999987642 22  56777653 664433221112122 44


Q ss_pred             cccchHHHHHHhhhhhcCCCCc
Q 042321          145 LLLESQSVCMELIFKELGIFIN  166 (169)
Q Consensus       145 ~~~~~~~va~HEiGHaLGL~h~  166 (169)
                      .+|....+++|||||||||.|.
T Consensus       116 ~~C~~~Gti~HEl~HalGf~HE  137 (230)
T cd04282         116 AGCDYKATVEHEFLHALGFYHE  137 (230)
T ss_pred             CCcCCCchHHHHHHHHhCCccc
Confidence            5566678999999999999985


No 14 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.34  E-value=1.8e-12  Score=102.36  Aligned_cols=96  Identities=14%  Similarity=0.095  Sum_probs=64.0

Q ss_pred             CcceEEEeccCCchhhHHHHHHHHHhhccC------CCcceEeccCC--CcccEEEEeecCCCCC-C----CC--CCC-C
Q 042321           73 KFSLTYAFLPGTRGDAINPVARAFQTWAPN------TQFQFAESQDY--RNADIKISFESGDHGA-G----AL--STA-V  136 (169)
Q Consensus        73 k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~v------t~L~F~ev~~~--~~ADI~I~F~~~~Hgd-~----~p--fd~-~  136 (169)
                      |+.|||.|.++++.+++++|++||+.|+++      .+..+.++...  ..+||++.+-+.-|.+ +    .|  .|. .
T Consensus         7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~~~~~~~gps~~dPrT   86 (197)
T cd04276           7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSVVDPRT   86 (197)
T ss_pred             CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecCCCcceecccccCCCC
Confidence            889999999998889999999999999988      35555555432  1369996655544443 1    11  221 1


Q ss_pred             CCceee---cc--------c-c-----cchHHHHHHhhhhhcCCCCccC
Q 042321          137 VESWLM---LQ--------L-L-----LESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       137 ~e~w~~---~~--------~-~-----~~~~~va~HEiGHaLGL~h~~~  168 (169)
                      +|-...   +.        . .     -.+..+++||+||+|||.||++
T Consensus        87 GeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~  135 (197)
T cd04276          87 GEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFK  135 (197)
T ss_pred             CCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccccc
Confidence            222110   00        0 0     1167899999999999999974


No 15 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.23  E-value=3.5e-11  Score=93.86  Aligned_cols=88  Identities=15%  Similarity=0.160  Sum_probs=60.2

Q ss_pred             cceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCC-CCceee-cc-cccchH
Q 042321           74 FSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAV-VESWLM-LQ-LLLESQ  150 (169)
Q Consensus        74 ~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~-~e~w~~-~~-~~~~~~  150 (169)
                      -.|-|.|...-+...++.|++|+..|++.|+++|++.... ..  .|.|..+. ||+ +.-|. +....+ ++ .++...
T Consensus         4 ~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~--yi~~~~~~-gC~-S~vG~~gg~q~i~l~~~~C~~~   78 (182)
T cd04283           4 VYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RD--YLNIESRS-GCW-SYIGRQGGRQTVSLQKQGCMYK   78 (182)
T ss_pred             EEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-Cc--EEEEEcCC-Cce-EecCccCCceeEecCCCCcCcc
Confidence            3578888765444678999999999999999999987542 33  35555543 553 33332 122212 32 345567


Q ss_pred             HHHHHhhhhhcCCCCc
Q 042321          151 SVCMELIFKELGIFIN  166 (169)
Q Consensus       151 ~va~HEiGHaLGL~h~  166 (169)
                      .+++|||||||||.|.
T Consensus        79 G~i~HEl~HaLG~~HE   94 (182)
T cd04283          79 GIIQHELLHALGFYHE   94 (182)
T ss_pred             chHHHHHHHHhCCccc
Confidence            8999999999999984


No 16 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.17  E-value=5.9e-11  Score=103.10  Aligned_cols=96  Identities=18%  Similarity=0.226  Sum_probs=72.0

Q ss_pred             CCCCCCCCcceEEEeccCCchhhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCCCc--eeec
Q 042321           66 SPRWPASKFSLTYAFLPGTRGDAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVVES--WLML  143 (169)
Q Consensus        66 ~~kW~~~k~~LTy~~~~~~~~~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~e~--w~~~  143 (169)
                      ..+|+  ...|-|.|.+.-....+.+|++|++.|++.|+++|.|.... ..+..|-+...  |+.+...-.+..  -..+
T Consensus        79 ~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC~S~VGr~gg~~q~~sl  153 (411)
T KOG3714|consen   79 ERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GCYSYVGRRGGGQQLLSL  153 (411)
T ss_pred             hhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cceeeeCccCCCccceec
Confidence            46999  88999999987555789999999999999999999998753 44555555444  775444322222  1225


Q ss_pred             ccccchHHHHHHhhhhhcCCCCc
Q 042321          144 QLLLESQSVCMELIFKELGIFIN  166 (169)
Q Consensus       144 ~~~~~~~~va~HEiGHaLGL~h~  166 (169)
                      +.++.-+.|++||++||||+.|.
T Consensus       154 ~~~C~~~G~i~HEl~HaLGf~He  176 (411)
T KOG3714|consen  154 GDGCDRFGTIVHELMHALGFWHE  176 (411)
T ss_pred             CCCcCcCchhHHHHHHHhhhhhc
Confidence            66677788999999999999984


No 17 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.12  E-value=1.1e-10  Score=85.23  Aligned_cols=87  Identities=18%  Similarity=0.235  Sum_probs=49.3

Q ss_pred             ceEEEeccCCchhhHHHHHHHHHhh-ccCCCcceEeccCCCcccEEEEeecCCCCCCCC-CCCCCCceee-cc----ccc
Q 042321           75 SLTYAFLPGTRGDAINPVARAFQTW-APNTQFQFAESQDYRNADIKISFESGDHGAGAL-STAVVESWLM-LQ----LLL  147 (169)
Q Consensus        75 ~LTy~~~~~~~~~~~~~i~~Af~~W-s~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~p-fd~~~e~w~~-~~----~~~  147 (169)
                      +|+|.-...+  +.+.+|.+|.+.| +.|++++|.+..   .|+|+|.-....-| .+. -++-+ ...+ ++    -..
T Consensus         3 tv~Ydas~A~--~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~~~~-sya~~~g~G-~G~I~l~~~~~qgy   75 (132)
T PF02031_consen    3 TVYYDASRAP--EFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGNDPRG-SYASTDGLG-SGYIFLDYQQNQGY   75 (132)
T ss_dssp             EEEEEEEE-G--GGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE--TT---EEEE-SSS--EEEEEEHHHHHHS
T ss_pred             EEEEeCCCCc--hhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecCCCC-cccccCCCC-cEEEEechHHhhCC
Confidence            5666554432  6889999999999 889999999975   38888774331111 111 12211 1111 11    233


Q ss_pred             chHHHHHHhhhhhcCCCCccC
Q 042321          148 ESQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       148 ~~~~va~HEiGHaLGL~h~~~  168 (169)
                      +-..|++|||||.|||.+.||
T Consensus        76 ~~~RIaaHE~GHiLGLPD~y~   96 (132)
T PF02031_consen   76 NSTRIAAHELGHILGLPDHYP   96 (132)
T ss_dssp             -HHHHHHHHHHHHHT----TT
T ss_pred             ccceeeeehhccccCCCCCCC
Confidence            467799999999999999997


No 18 
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.09  E-value=9.2e-11  Score=74.39  Aligned_cols=46  Identities=28%  Similarity=0.358  Sum_probs=37.5

Q ss_pred             CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCCccccCChHHhhhh
Q 042321            1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKM   54 (169)
Q Consensus         1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m   54 (169)
                      |.++||++...        +..+++.+.+||+.||+++|||+||++|.+|+++|
T Consensus        12 L~~~gy~~~~~--------~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen   12 LNRLGYYPGPV--------DGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             HHHTTTT-SST--------TSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             HHHcCCCCCCC--------CCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence            56789993321        24568999999999999999999999999999987


No 19 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=98.69  E-value=1.1e-07  Score=75.58  Aligned_cols=94  Identities=17%  Similarity=0.187  Sum_probs=58.2

Q ss_pred             CcceEEEecc----CCchhhHHHHHHHHHhhccC-CCcceEec--cCCCcccEEEEeecCCC----C--CCCC-CCCCCC
Q 042321           73 KFSLTYAFLP----GTRGDAINPVARAFQTWAPN-TQFQFAES--QDYRNADIKISFESGDH----G--AGAL-STAVVE  138 (169)
Q Consensus        73 k~~LTy~~~~----~~~~~~~~~i~~Af~~Ws~v-t~L~F~ev--~~~~~ADI~I~F~~~~H----g--d~~p-fd~~~e  138 (169)
                      .++|+-.+..    ......+.++.+|++.|..+ ..|+|+-.  .....+||.|.-....-    |  .++| ..|.+-
T Consensus        38 ~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsAGFP~s~G~P~  117 (211)
T PF12388_consen   38 PRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSAGFPTSNGNPY  117 (211)
T ss_pred             CCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceeccCCCCCCCCC
Confidence            4667766655    12236789999999999886 35677422  22235678764222111    1  2456 666665


Q ss_pred             ceeec----ccccc-hHHHHHHhhhhhcCCCCc
Q 042321          139 SWLML----QLLLE-SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       139 ~w~~~----~~~~~-~~~va~HEiGHaLGL~h~  166 (169)
                      .|-.+    ....+ ..+|++|||||++||.|+
T Consensus       118 ~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHT  150 (211)
T PF12388_consen  118 KFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHT  150 (211)
T ss_pred             ceEEEEecCCCchhHHHHHHHHHhhhhcccccc
Confidence            55322    11222 678999999999999995


No 20 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.91  E-value=1.1e-05  Score=63.75  Aligned_cols=76  Identities=18%  Similarity=0.183  Sum_probs=48.0

Q ss_pred             hHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCC-CC------------CCC------Cce-ee----c
Q 042321           88 AINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGAL-ST------------AVV------ESW-LM----L  143 (169)
Q Consensus        88 ~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~p-fd------------~~~------e~w-~~----~  143 (169)
                      ..+++..|.+.|++.-||..+|-  ..+|||+|.-.... |-|.. +.            +..      .+- .+    .
T Consensus       105 wq~a~~tava~wa~~fpl~ive~--~eeaDItie~~n~p-gtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg  181 (236)
T COG5549         105 WQGAYLTAVAGWAKTFPLIIVER--FEEADITIEVGNPP-GTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPG  181 (236)
T ss_pred             HHHHHHHHHHHHHHhCCceeeec--ceeeeEEEEecCCC-CCcccccchHHHHHHHHHHhhhccCcccccccccccCCcc
Confidence            46889999999999855555554  45899999866542 22211 11            000      000 00    0


Q ss_pred             ccccchHHHHHHhhhhhcCCC-Cc
Q 042321          144 QLLLESQSVCMELIFKELGIF-IN  166 (169)
Q Consensus       144 ~~~~~~~~va~HEiGHaLGL~-h~  166 (169)
                      .....+.++|-||+|||||+. |+
T Consensus       182 ~~~e~L~~tarhElGhaLgi~ghs  205 (236)
T COG5549         182 ELRENLNPTARHELGHALGIWGHS  205 (236)
T ss_pred             cchhhhhHHHHHhhcchheecccc
Confidence            122238999999999999998 65


No 21 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.78  E-value=1.3e-05  Score=63.58  Aligned_cols=20  Identities=15%  Similarity=0.215  Sum_probs=17.3

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...+++|||||+|||.|+..
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~  156 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFD  156 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcc
Confidence            45789999999999999864


No 22 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.65  E-value=1.8e-05  Score=57.15  Aligned_cols=18  Identities=17%  Similarity=0.115  Sum_probs=12.6

Q ss_pred             hHHHHHHhhhhhcCCCCc
Q 042321          149 SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~  166 (169)
                      ...|++|||||.|||.|+
T Consensus       107 ~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  107 GVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             STTHHHHHHHHHTT----
T ss_pred             cceEeeehhhHhcCCCCC
Confidence            357999999999999995


No 23 
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.39  E-value=0.00018  Score=57.06  Aligned_cols=36  Identities=17%  Similarity=0.160  Sum_probs=33.4

Q ss_pred             chhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321           22 DFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        22 ~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      .|+..+.+||+.||+.+||+++|.+|..|...|..|
T Consensus        36 ~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~   71 (201)
T TIGR02869        36 VFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA   71 (201)
T ss_pred             ccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc
Confidence            467999999999999999999999999999999666


No 24 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.30  E-value=0.00011  Score=57.14  Aligned_cols=20  Identities=15%  Similarity=-0.014  Sum_probs=13.4

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...|++|||||.||+.|+.+
T Consensus       142 ~~~~~AHEiGH~lGa~HD~~  161 (196)
T PF13688_consen  142 GAITFAHEIGHNLGAPHDGD  161 (196)
T ss_dssp             HHHHHHHHHHHHTT-----S
T ss_pred             eehhhHHhHHHhcCCCCCCC
Confidence            67799999999999999753


No 25 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.29  E-value=0.00012  Score=56.51  Aligned_cols=20  Identities=10%  Similarity=0.140  Sum_probs=14.9

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...|++|||||.||+.|+.+
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~  130 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFD  130 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSS
T ss_pred             eeeeehhhhHhhcCCCCCCC
Confidence            56789999999999999875


No 26 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.24  E-value=0.00013  Score=55.40  Aligned_cols=19  Identities=11%  Similarity=0.141  Sum_probs=14.3

Q ss_pred             hHHHHHHhhhhhcCCCCcc
Q 042321          149 SQSVCMELIFKELGIFINY  167 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~  167 (169)
                      ...|++||+||-|||.|..
T Consensus        69 ~g~TltHEvGH~LGL~HtF   87 (154)
T PF05572_consen   69 FGKTLTHEVGHWLGLYHTF   87 (154)
T ss_dssp             SSHHHHHHHHHHTT---TT
T ss_pred             cccchhhhhhhhhcccccc
Confidence            5679999999999999975


No 27 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.07  E-value=0.00027  Score=55.01  Aligned_cols=19  Identities=11%  Similarity=-0.041  Sum_probs=16.9

Q ss_pred             hHHHHHHhhhhhcCCCCcc
Q 042321          149 SQSVCMELIFKELGIFINY  167 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~  167 (169)
                      .-.|++|||||.||+.|+.
T Consensus       131 ~a~~~AHElGH~lG~~HD~  149 (194)
T cd04269         131 FAVTMAHELGHNLGMEHDD  149 (194)
T ss_pred             HHHHHHHHHHhhcCCCcCC
Confidence            5679999999999999975


No 28 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.03  E-value=0.00026  Score=54.98  Aligned_cols=21  Identities=14%  Similarity=-0.039  Sum_probs=17.9

Q ss_pred             cchHHHHHHhhhhhcCCCCcc
Q 042321          147 LESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       147 ~~~~~va~HEiGHaLGL~h~~  167 (169)
                      .....|++||+||.|||.|+.
T Consensus       131 ~~~~~~~aHElGH~lG~~HD~  151 (192)
T cd04267         131 LLTALTMAHELGHNLGAEHDG  151 (192)
T ss_pred             eeehhhhhhhHHhhcCCcCCC
Confidence            346679999999999999975


No 29 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=96.97  E-value=0.0036  Score=49.54  Aligned_cols=93  Identities=16%  Similarity=0.137  Sum_probs=57.1

Q ss_pred             CcceEEEecc--CC---c----hhhHHHHHHHH---HhhccCCCcceEeccCCCcccEEEEeecCCCCCC-C-CCCCC--
Q 042321           73 KFSLTYAFLP--GT---R----GDAINPVARAF---QTWAPNTQFQFAESQDYRNADIKISFESGDHGAG-A-LSTAV--  136 (169)
Q Consensus        73 k~~LTy~~~~--~~---~----~~~~~~i~~Af---~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~-~-pfd~~--  136 (169)
                      .+.++|.+.-  ..   .    +++-.+|.+.+   +-|...-.+.|+.|... .+|++|...+..--+. + +.+-.  
T Consensus        28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~~~e  106 (203)
T PF11350_consen   28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDTSGE  106 (203)
T ss_pred             CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCcCce
Confidence            3678998763  11   1    23344444444   47999889999999986 5999999988753221 1 22211  


Q ss_pred             --------------CCceeec--c--cccc--hHHHHHHhhhhhcCCCCc
Q 042321          137 --------------VESWLML--Q--LLLE--SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       137 --------------~e~w~~~--~--~~~~--~~~va~HEiGHaLGL~h~  166 (169)
                                    ...|...  .  ..+.  -..|+=||.||+||-.|.
T Consensus       107 ~SC~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~  156 (203)
T PF11350_consen  107 TSCRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHE  156 (203)
T ss_pred             eEeecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCC
Confidence                          1233220  0  1111  355999999999998874


No 30 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=96.87  E-value=0.00054  Score=53.40  Aligned_cols=20  Identities=15%  Similarity=0.008  Sum_probs=16.5

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...+++||+||.|||.|--|
T Consensus       125 ~~k~~~HElGH~lGL~HC~~  144 (179)
T PRK13267        125 VRKEVTHELGHTLGLEHCDN  144 (179)
T ss_pred             HHHHHHHHHHHHcCCccCCC
Confidence            45579999999999999643


No 31 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.81  E-value=0.00013  Score=58.74  Aligned_cols=21  Identities=10%  Similarity=0.191  Sum_probs=18.4

Q ss_pred             cchHHHHHHhhhhhcCCCCcc
Q 042321          147 LESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       147 ~~~~~va~HEiGHaLGL~h~~  167 (169)
                      .++..|++|||||.|||.|..
T Consensus       135 ~n~g~t~~HEvGH~lGL~HtF  155 (225)
T cd04275         135 YNLGDTATHEVGHWLGLYHTF  155 (225)
T ss_pred             ccccceeEEeccceeeeeeee
Confidence            357789999999999999975


No 32 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.78  E-value=0.00063  Score=53.67  Aligned_cols=18  Identities=6%  Similarity=-0.130  Sum_probs=12.8

Q ss_pred             hHHHHHHhhhhhcCCCCc
Q 042321          149 SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~  166 (169)
                      ...+|+||+||.+||.|=
T Consensus       145 ~~Kea~HElGH~~GL~HC  162 (194)
T PF07998_consen  145 VCKEAVHELGHLFGLDHC  162 (194)
T ss_dssp             HHHHHHHHHHHHTT----
T ss_pred             HHHHHHHHHHHHcCCcCC
Confidence            567999999999999983


No 33 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.71  E-value=0.0008  Score=54.73  Aligned_cols=19  Identities=11%  Similarity=-0.001  Sum_probs=16.7

Q ss_pred             HHHHHHhhhhhcCCCCccC
Q 042321          150 QSVCMELIFKELGIFINYN  168 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~~  168 (169)
                      ..|++||+||.||+.|+.+
T Consensus       168 a~t~AHElGHnlGm~HD~~  186 (244)
T cd04270         168 DLVTAHELGHNFGSPHDPD  186 (244)
T ss_pred             HHHHHHHHHHhcCCCCCCC
Confidence            4689999999999999864


No 34 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=96.71  E-value=0.00072  Score=52.27  Aligned_cols=20  Identities=15%  Similarity=-0.017  Sum_probs=16.3

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      .+.=++||+||.|||.|=-|
T Consensus       124 v~KEv~HElGH~~GL~HC~N  143 (181)
T COG1913         124 VVKEVLHELGHLLGLSHCPN  143 (181)
T ss_pred             HHHHHHHHhhhhcCcccCCC
Confidence            34467999999999999655


No 35 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.70  E-value=0.00033  Score=55.31  Aligned_cols=75  Identities=16%  Similarity=0.104  Sum_probs=40.6

Q ss_pred             HHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCC------C--CCCCCCC--CCcee--ecc-cccchHHHHHHhhh
Q 042321           92 VARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHG------A--GALSTAV--VESWL--MLQ-LLLESQSVCMELIF  158 (169)
Q Consensus        92 i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hg------d--~~pfd~~--~e~w~--~~~-~~~~~~~va~HEiG  158 (169)
                      .-.+|..|....... .. .+....|+.+-|...+..      +  |.++-|.  ...++  +.+ ..+....|++||||
T Consensus        72 ~L~~F~~w~~~~~~~-~~-~~~~~~D~a~llt~~d~~~~~~~~~~~G~A~~ggiC~~~~s~~i~~~~~~~~a~~~aHElG  149 (207)
T cd04273          72 SLKSFCRWQKKLNPP-ND-SDPEHHDHAILLTRQDICRSNGNCDTLGLAPVGGMCSPSRSCSINEDTGLSSAFTIAHELG  149 (207)
T ss_pred             HHHHHHHHHHHcCCc-cc-ccccccceEEEEeeecccccCCCCCceEEeccccCCCCCcceEEEcCCCceeEEeeeeech
Confidence            468899996633221 10 011357777777665432      1  1222211  11222  111 23335668999999


Q ss_pred             hhcCCCCccC
Q 042321          159 KELGIFINYN  168 (169)
Q Consensus       159 HaLGL~h~~~  168 (169)
                      |.||+.|+..
T Consensus       150 H~LG~~HD~~  159 (207)
T cd04273         150 HVLGMPHDGD  159 (207)
T ss_pred             hhcCCCCCCC
Confidence            9999999853


No 36 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.65  E-value=0.00062  Score=54.89  Aligned_cols=17  Identities=18%  Similarity=0.128  Sum_probs=15.5

Q ss_pred             HHHHHhhhhhcCCCCcc
Q 042321          151 SVCMELIFKELGIFINY  167 (169)
Q Consensus       151 ~va~HEiGHaLGL~h~~  167 (169)
                      .|++||+||.||+.|+.
T Consensus       147 ~t~AHElGHnLGm~HD~  163 (228)
T cd04271         147 QVFAHEIGHTFGAVHDC  163 (228)
T ss_pred             eehhhhhhhhcCCCCCC
Confidence            58999999999999974


No 37 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.62  E-value=0.00091  Score=52.28  Aligned_cols=20  Identities=10%  Similarity=0.027  Sum_probs=14.2

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      +-.+++|||||.||+.|+.+
T Consensus       131 ~a~~~AHelGH~lGm~HD~~  150 (199)
T PF01421_consen  131 FAVIIAHELGHNLGMPHDGD  150 (199)
T ss_dssp             HHHHHHHHHHHHTT---TTT
T ss_pred             HHHHHHHHHHHhcCCCCCCC
Confidence            55688999999999999864


No 38 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=96.49  E-value=0.0014  Score=52.27  Aligned_cols=20  Identities=10%  Similarity=-0.152  Sum_probs=17.4

Q ss_pred             chHHHHHHhhhhhcCCCCcc
Q 042321          148 ESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       148 ~~~~va~HEiGHaLGL~h~~  167 (169)
                      ....+++||+||.||+.|+.
T Consensus       144 ~~~~~~AHElGH~lG~~HD~  163 (220)
T cd04272         144 YGVYTMTHELAHLLGAPHDG  163 (220)
T ss_pred             ccHHHHHHHHHHHhCCCCCC
Confidence            35789999999999999973


No 39 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=96.46  E-value=0.0029  Score=57.57  Aligned_cols=37  Identities=22%  Similarity=0.247  Sum_probs=34.9

Q ss_pred             cchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321           21 DDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        21 ~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      ..++.++.+|+|+||+..||+++|+++..|++.|+.|
T Consensus       307 ~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        307 AAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             cccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            3578999999999999999999999999999999988


No 40 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=96.18  E-value=0.0013  Score=54.74  Aligned_cols=19  Identities=21%  Similarity=0.209  Sum_probs=17.3

Q ss_pred             HHHHHHhhhhhcCCCCccC
Q 042321          150 QSVCMELIFKELGIFINYN  168 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~~  168 (169)
                      ..|++||+||.|||.+-|+
T Consensus       166 igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             eeeeehhhhcccCCCcccc
Confidence            5799999999999999886


No 41 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.61  E-value=0.012  Score=52.57  Aligned_cols=37  Identities=22%  Similarity=0.311  Sum_probs=34.0

Q ss_pred             cchhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321           21 DDFDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        21 ~~~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      ..|+.++.+|+|+||+..|||..|++++.|++.|+--
T Consensus       263 ~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~s  299 (561)
T COG2989         263 PAYDPELVEAVKRFQARHGLPADGVIGPATRAALNVS  299 (561)
T ss_pred             ccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhccC
Confidence            4688999999999999999999999999999988764


No 42 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=95.26  E-value=0.0083  Score=50.46  Aligned_cols=17  Identities=6%  Similarity=-0.103  Sum_probs=14.0

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      ...+.+||+||++||.|
T Consensus       193 ~~~~f~HE~GH~~GL~H  209 (305)
T PF10462_consen  193 YGNEFSHELGHNFGLGH  209 (305)
T ss_dssp             SHHHHHHHHHHTTT--S
T ss_pred             ccceeehhhhhhcCCCC
Confidence            67799999999999999


No 43 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.15  E-value=0.0094  Score=50.37  Aligned_cols=19  Identities=16%  Similarity=0.165  Sum_probs=16.8

Q ss_pred             hHHHHHHhhhhhcCCCCcc
Q 042321          149 SQSVCMELIFKELGIFINY  167 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~  167 (169)
                      -..|++||+||-|||.|++
T Consensus       150 ~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  150 DWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             cHHHHHHHhhhhccccccC
Confidence            4569999999999999985


No 44 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=94.51  E-value=0.0067  Score=55.77  Aligned_cols=19  Identities=21%  Similarity=0.176  Sum_probs=17.0

Q ss_pred             HHHHHHhhhhhcCCCCccC
Q 042321          150 QSVCMELIFKELGIFINYN  168 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~~  168 (169)
                      ..|++||+||.|||.+-|+
T Consensus       222 iGVfaHEfGH~LGLPDlYD  240 (645)
T PF05547_consen  222 IGVFAHEFGHDLGLPDLYD  240 (645)
T ss_pred             eEEEEeeccccCCCCCccc
Confidence            5589999999999998885


No 45 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.36  E-value=0.048  Score=43.86  Aligned_cols=69  Identities=10%  Similarity=0.068  Sum_probs=38.5

Q ss_pred             HHHHHHhhccCCC---cceEecc-----CCCcccEEEEeecCCCCCCCCCCCCCCceee-----cccccchHHHHHHhhh
Q 042321           92 VARAFQTWAPNTQ---FQFAESQ-----DYRNADIKISFESGDHGAGALSTAVVESWLM-----LQLLLESQSVCMELIF  158 (169)
Q Consensus        92 i~~Af~~Ws~vt~---L~F~ev~-----~~~~ADI~I~F~~~~Hgd~~pfd~~~e~w~~-----~~~~~~~~~va~HEiG  158 (169)
                      ++..|+.++++.+   ++=.|+.     ...-.|++|.-..|.-+|-|  |.....-..     .+..+--..||+||.|
T Consensus        21 vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtDHY--dP~~k~vrLS~~vy~~~SiaAvaVAAHEvG   98 (222)
T PF04298_consen   21 VKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTDHY--DPRNKVVRLSEDVYNGRSIAAVAVAAHEVG   98 (222)
T ss_pred             HHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCCCc--CCCCCEEEeCCccCCCCCHHHHHHHHHHHh
Confidence            4677777777642   2222221     11246889998877545533  322211111     2234446779999999


Q ss_pred             hhcC
Q 042321          159 KELG  162 (169)
Q Consensus       159 HaLG  162 (169)
                      ||+=
T Consensus        99 HAiQ  102 (222)
T PF04298_consen   99 HAIQ  102 (222)
T ss_pred             HHHh
Confidence            9973


No 46 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=93.21  E-value=0.049  Score=47.82  Aligned_cols=18  Identities=11%  Similarity=-0.103  Sum_probs=15.5

Q ss_pred             HHHHHHhhhhhcCCCCcc
Q 042321          150 QSVCMELIFKELGIFINY  167 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~  167 (169)
                      ....+|||||++|+.|.-
T Consensus       317 iGA~lHEiGH~fg~pH~~  334 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQE  334 (423)
T ss_pred             hHHHHHHHHHhcCCCCCC
Confidence            447899999999999964


No 47 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=93.16  E-value=0.067  Score=40.58  Aligned_cols=35  Identities=11%  Similarity=0.253  Sum_probs=31.4

Q ss_pred             hhHHHHHHHHHHHHhcCC-CccccCChHHhhhhhcC
Q 042321           23 FDELLESAIKNYQLNYHL-KSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        23 ~~~~~~~ai~~~Q~~~gL-~~TG~lD~~T~~~m~~p   57 (169)
                      +......|++.||+.++| +++|.++++|...+...
T Consensus        70 ~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~  105 (185)
T COG3409          70 YGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQ  105 (185)
T ss_pred             cCcccHHHHHHHhhhcCcccccccccHHHHHHHHHH
Confidence            457789999999999999 89999999999988774


No 48 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=92.13  E-value=0.081  Score=43.70  Aligned_cols=20  Identities=15%  Similarity=0.207  Sum_probs=12.8

Q ss_pred             chHHHHHHhhhhhcC-CCCcc
Q 042321          148 ESQSVCMELIFKELG-IFINY  167 (169)
Q Consensus       148 ~~~~va~HEiGHaLG-L~h~~  167 (169)
                      .+..|++||+||++| |.+-|
T Consensus       215 ~~~~v~vHE~GHsf~~LaDEY  235 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADEY  235 (264)
T ss_dssp             THHHHHHHHHHHHTT------
T ss_pred             cccceeeeecccccccccccc
Confidence            478999999999765 55544


No 49 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=92.05  E-value=0.081  Score=49.41  Aligned_cols=20  Identities=15%  Similarity=0.091  Sum_probs=17.4

Q ss_pred             chHHHHHHhhhhhcCCCCcc
Q 042321          148 ESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       148 ~~~~va~HEiGHaLGL~h~~  167 (169)
                      ....|++||+||.|||.|+-
T Consensus       322 ~~a~v~AhelgH~lGm~hD~  341 (716)
T KOG3607|consen  322 AFAVVLAHELGHNLGMIHDE  341 (716)
T ss_pred             hHHHHHHHHHHhhcCccccc
Confidence            36779999999999999963


No 50 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=92.05  E-value=0.062  Score=39.27  Aligned_cols=17  Identities=18%  Similarity=0.108  Sum_probs=15.0

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      ..+|++|||||-.||.+
T Consensus       109 vthvliHEIgHhFGLsD  125 (136)
T COG3824         109 VTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             hhhhhhhhhhhhcCCCh
Confidence            56799999999999985


No 51 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.74  E-value=0.094  Score=47.20  Aligned_cols=17  Identities=18%  Similarity=0.077  Sum_probs=14.3

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      +..|++|||.||||+.-
T Consensus       210 ~~~~~~HEi~HaLGFs~  226 (521)
T PF01457_consen  210 FFRTVIHEIAHALGFSS  226 (521)
T ss_dssp             HHHHHHHHHHHHTT-SH
T ss_pred             ccceeeeeeeeeeeecc
Confidence            77899999999999863


No 52 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=91.26  E-value=0.11  Score=47.17  Aligned_cols=16  Identities=6%  Similarity=-0.186  Sum_probs=13.1

Q ss_pred             ccchHHHHHHhhhhhc
Q 042321          146 LLESQSVCMELIFKEL  161 (169)
Q Consensus       146 ~~~~~~va~HEiGHaL  161 (169)
                      ..+-..|++||+|||+
T Consensus       372 ~~~~v~TL~HE~GHa~  387 (587)
T TIGR02290       372 SRRDVSTLAHELGHAY  387 (587)
T ss_pred             CchhHHHHHHHhhHHH
Confidence            3456779999999997


No 53 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=90.44  E-value=0.17  Score=35.64  Aligned_cols=17  Identities=18%  Similarity=0.131  Sum_probs=13.4

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      ...|++|||||-+|+..
T Consensus        73 I~~tlvhEiah~fG~~~   89 (97)
T PF06262_consen   73 IRDTLVHEIAHHFGISD   89 (97)
T ss_dssp             HHHHHHHHHHHHTT--H
T ss_pred             HHHHHHHHHHHHcCCCH
Confidence            67799999999999863


No 54 
>PTZ00337 surface protease GP63; Provisional
Probab=90.00  E-value=0.16  Score=46.29  Aligned_cols=17  Identities=18%  Similarity=0.141  Sum_probs=15.3

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      ++.|++|||.||||+..
T Consensus       229 ~~~v~~HEi~HALGFs~  245 (567)
T PTZ00337        229 DVRVAAHELGHALGFVR  245 (567)
T ss_pred             HHHHHHHHHHHHHccCH
Confidence            78899999999999853


No 55 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=88.68  E-value=0.16  Score=46.78  Aligned_cols=21  Identities=19%  Similarity=0.047  Sum_probs=18.1

Q ss_pred             hHHHHHHhhhhhcCCCCccCC
Q 042321          149 SQSVCMELIFKELGIFINYNI  169 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~~  169 (169)
                      ..-|++|||||.-|-.|++.|
T Consensus       392 s~lt~AHEiGHNfGSpHDp~~  412 (764)
T KOG3658|consen  392 SDLTLAHEIGHNFGSPHDPDI  412 (764)
T ss_pred             hheeehhhhccccCCCCCCCC
Confidence            455889999999999999864


No 56 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=88.40  E-value=0.24  Score=43.74  Aligned_cols=18  Identities=11%  Similarity=-0.125  Sum_probs=15.2

Q ss_pred             HHHHHHhhhhhcCCCCcc
Q 042321          150 QSVCMELIFKELGIFINY  167 (169)
Q Consensus       150 ~~va~HEiGHaLGL~h~~  167 (169)
                      ...++||+||.||+.|..
T Consensus       302 lGA~~HElGH~lgcpHq~  319 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQS  319 (614)
T ss_pred             HHHHHHHhhhccCCCCCC
Confidence            446799999999999964


No 57 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=87.95  E-value=0.53  Score=31.13  Aligned_cols=29  Identities=17%  Similarity=0.200  Sum_probs=22.4

Q ss_pred             HHHHHHHHhcC----CCccccCChHHhhhhhcC
Q 042321           29 SAIKNYQLNYH----LKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        29 ~ai~~~Q~~~g----L~~TG~lD~~T~~~m~~p   57 (169)
                      .|+|.+|+-.|    +.+.|++.+.|+..+...
T Consensus         1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al~~~   33 (72)
T PF09374_consen    1 RAIKFLQRALGLGADVAVDGIIGPKTLAALNAA   33 (72)
T ss_dssp             HHHHHHHHHHT----S--SS--SHHHHHHHHHS
T ss_pred             CHHHHHHHHHccCCccCCCCCcCHHHHHHHhhC
Confidence            48999999999    999999999999998666


No 58 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=87.60  E-value=0.29  Score=39.40  Aligned_cols=13  Identities=8%  Similarity=-0.184  Sum_probs=11.9

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...|+.||+||||
T Consensus        92 ~~~~l~HE~GHAl  104 (220)
T PF14247_consen   92 VLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHH
Confidence            7889999999996


No 59 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=87.31  E-value=0.3  Score=43.44  Aligned_cols=15  Identities=20%  Similarity=-0.082  Sum_probs=13.4

Q ss_pred             chHHHHHHhhhhhcC
Q 042321          148 ESQSVCMELIFKELG  162 (169)
Q Consensus       148 ~~~~va~HEiGHaLG  162 (169)
                      ++-.|++|||||..|
T Consensus       129 ElagViAHEigHv~q  143 (484)
T COG4783         129 ELAGVIAHEIGHVAQ  143 (484)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            489999999999876


No 60 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.81  E-value=0.37  Score=44.31  Aligned_cols=16  Identities=13%  Similarity=0.179  Sum_probs=14.7

Q ss_pred             hHHHHHHhhhhhcCCC
Q 042321          149 SQSVCMELIFKELGIF  164 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~  164 (169)
                      ...|++|||.||||..
T Consensus       256 ~~rv~~HEi~HALGFS  271 (622)
T PTZ00257        256 TTRTVTHEVAHALGFS  271 (622)
T ss_pred             HHHHHHHHHHHHhcCC
Confidence            6889999999999985


No 61 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=86.03  E-value=0.69  Score=36.70  Aligned_cols=78  Identities=9%  Similarity=0.000  Sum_probs=44.7

Q ss_pred             HHHHHHHHHhhccCC---CcceEeccC-----CCcccEEEEeecCCCCCCCCCCCCCCce--e---ecccccchHHHHHH
Q 042321           89 INPVARAFQTWAPNT---QFQFAESQD-----YRNADIKISFESGDHGAGALSTAVVESW--L---MLQLLLESQSVCME  155 (169)
Q Consensus        89 ~~~i~~Af~~Ws~vt---~L~F~ev~~-----~~~ADI~I~F~~~~Hgd~~pfd~~~e~w--~---~~~~~~~~~~va~H  155 (169)
                      +.-++.+++.||+|.   +++=.||..     ..--|+++.-.+|.=-|-  +|.....-  +   ..+..+.-..||+|
T Consensus        21 Q~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTDH--YDP~~kvvrLSe~~y~g~Sia~~aVAAH   98 (226)
T COG2738          21 QSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTDH--YDPRRKVVRLSEANYYGPSIAAIAVAAH   98 (226)
T ss_pred             HHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCcccc--cChhhheeeccccccCCccHHHHHHHHH
Confidence            455788999999985   343333321     113477777666543321  22211111  1   12344457789999


Q ss_pred             hhhhhcCCCCccC
Q 042321          156 LIFKELGIFINYN  168 (169)
Q Consensus       156 EiGHaLGL~h~~~  168 (169)
                      |.|||+==.++|.
T Consensus        99 EVGHAiQd~~~Y~  111 (226)
T COG2738          99 EVGHAIQDQEDYA  111 (226)
T ss_pred             HhhHHHhhhcccH
Confidence            9999997766663


No 62 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=85.35  E-value=0.46  Score=32.86  Aligned_cols=13  Identities=8%  Similarity=-0.156  Sum_probs=11.1

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -..+++|||||.+
T Consensus        42 ~~f~laHELgH~~   54 (122)
T PF06114_consen   42 QRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHH
Confidence            5669999999975


No 63 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=84.55  E-value=0.43  Score=38.21  Aligned_cols=13  Identities=8%  Similarity=-0.090  Sum_probs=10.9

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -+-|++||+||+|
T Consensus        72 ~rFtlAHELGH~l   84 (213)
T COG2856          72 KRFTLAHELGHAL   84 (213)
T ss_pred             HHHHHHHHHhHHH
Confidence            3459999999997


No 64 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=83.46  E-value=1.2  Score=40.27  Aligned_cols=17  Identities=6%  Similarity=-0.271  Sum_probs=13.7

Q ss_pred             cccchHHHHHHhhhhhc
Q 042321          145 LLLESQSVCMELIFKEL  161 (169)
Q Consensus       145 ~~~~~~~va~HEiGHaL  161 (169)
                      ...+-..|++||+|||+
T Consensus       333 ~t~~dv~TL~HElGHa~  349 (549)
T TIGR02289       333 GTSGDIDVLTHEAGHAF  349 (549)
T ss_pred             CChhHHHHHHHHhhHHH
Confidence            34457789999999996


No 65 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=82.66  E-value=0.71  Score=35.98  Aligned_cols=15  Identities=13%  Similarity=-0.148  Sum_probs=12.9

Q ss_pred             chHHHHHHhhhhhcC
Q 042321          148 ESQSVCMELIFKELG  162 (169)
Q Consensus       148 ~~~~va~HEiGHaLG  162 (169)
                      ++..|++||+||...
T Consensus        88 el~aVlaHElgH~~~  102 (226)
T PF01435_consen   88 ELAAVLAHELGHIKH  102 (226)
T ss_dssp             HHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHc
Confidence            488999999999863


No 66 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=81.81  E-value=0.77  Score=36.32  Aligned_cols=13  Identities=23%  Similarity=0.061  Sum_probs=11.3

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|++||+||++
T Consensus        38 ~~~v~iHElgH~~   50 (208)
T cd06161          38 FLSVLLHELGHAL   50 (208)
T ss_pred             HHHHHHHHHHHHH
Confidence            5679999999985


No 67 
>PRK03982 heat shock protein HtpX; Provisional
Probab=81.03  E-value=0.82  Score=37.94  Aligned_cols=14  Identities=14%  Similarity=-0.016  Sum_probs=12.2

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++||+||.-
T Consensus       124 El~AVlAHElgHi~  137 (288)
T PRK03982        124 ELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHH
Confidence            48999999999963


No 68 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=80.61  E-value=1.7  Score=37.32  Aligned_cols=16  Identities=6%  Similarity=-0.056  Sum_probs=12.9

Q ss_pred             ccchHHHHHHhhhhhc
Q 042321          146 LLESQSVCMELIFKEL  161 (169)
Q Consensus       146 ~~~~~~va~HEiGHaL  161 (169)
                      ..+-..+++||+|||+
T Consensus       219 ~~~~v~tl~HE~GHa~  234 (427)
T cd06459         219 TLDDVFTLAHELGHAF  234 (427)
T ss_pred             ChhhHHHHHHHhhHHH
Confidence            3456779999999985


No 69 
>PRK03001 M48 family peptidase; Provisional
Probab=79.77  E-value=0.95  Score=37.44  Aligned_cols=14  Identities=14%  Similarity=0.007  Sum_probs=12.2

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++||+||.-
T Consensus       123 El~aVlAHElgHi~  136 (283)
T PRK03001        123 EIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHh
Confidence            48999999999964


No 70 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=78.94  E-value=1.1  Score=36.74  Aligned_cols=14  Identities=14%  Similarity=-0.081  Sum_probs=12.3

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++||+||..
T Consensus       156 El~aVlaHElgHi~  169 (302)
T COG0501         156 ELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHh
Confidence            48999999999974


No 71 
>PRK03072 heat shock protein HtpX; Provisional
Probab=78.93  E-value=1  Score=37.43  Aligned_cols=14  Identities=14%  Similarity=-0.043  Sum_probs=12.3

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++||+||.-
T Consensus       126 El~aVlAHElgHi~  139 (288)
T PRK03072        126 ELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHh
Confidence            48999999999964


No 72 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=78.85  E-value=0.99  Score=38.27  Aligned_cols=16  Identities=0%  Similarity=-0.168  Sum_probs=12.9

Q ss_pred             ccchHHHHHHhhhhhc
Q 042321          146 LLESQSVCMELIFKEL  161 (169)
Q Consensus       146 ~~~~~~va~HEiGHaL  161 (169)
                      ..+-..|++||+|||+
T Consensus       151 ~~~~v~tl~HE~GHa~  166 (365)
T cd06258         151 GHDDINTLFHEFGHAV  166 (365)
T ss_pred             CHHHHHHHHHHHhHHH
Confidence            3446779999999997


No 73 
>PRK01345 heat shock protein HtpX; Provisional
Probab=78.76  E-value=1.1  Score=37.97  Aligned_cols=14  Identities=14%  Similarity=-0.014  Sum_probs=12.4

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++|||||.-
T Consensus       123 EL~aVlAHElgHi~  136 (317)
T PRK01345        123 EVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHH
Confidence            48999999999975


No 74 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=78.76  E-value=2  Score=39.02  Aligned_cols=17  Identities=12%  Similarity=-0.121  Sum_probs=13.8

Q ss_pred             cccchHHHHHHhhhhhc
Q 042321          145 LLLESQSVCMELIFKEL  161 (169)
Q Consensus       145 ~~~~~~~va~HEiGHaL  161 (169)
                      ...+-..|++||+|||+
T Consensus       374 ~~~~dv~TLaHElGHa~  390 (591)
T TIGR00181       374 GTLNSVFTLAHELGHSM  390 (591)
T ss_pred             CCcchHHHHHHHhhhHH
Confidence            34567889999999995


No 75 
>PLN02791 Nudix hydrolase homolog
Probab=78.19  E-value=1.1  Score=42.37  Aligned_cols=17  Identities=6%  Similarity=-0.064  Sum_probs=14.8

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 042321          149 SQSVCMELIFKELGIFI  165 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h  165 (169)
                      +.+|++||+||.+|=..
T Consensus       585 ~~~v~lHElgHGsG~~~  601 (770)
T PLN02791        585 FTHTICHECCHGIGPHT  601 (770)
T ss_pred             HHHHHHHHhhccccccc
Confidence            78899999999999643


No 76 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=77.02  E-value=1.3  Score=35.57  Aligned_cols=13  Identities=23%  Similarity=0.102  Sum_probs=11.4

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|.+||+||++
T Consensus        53 ~~~v~iHElgH~~   65 (227)
T cd06164          53 FASVLLHELGHSL   65 (227)
T ss_pred             HHHHHHHHHHHHH
Confidence            6779999999986


No 77 
>PRK04897 heat shock protein HtpX; Provisional
Probab=76.91  E-value=1.3  Score=37.03  Aligned_cols=14  Identities=21%  Similarity=-0.071  Sum_probs=12.2

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      ++..|++||+||.-
T Consensus       136 El~aVlAHElgHi~  149 (298)
T PRK04897        136 ELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHh
Confidence            48999999999964


No 78 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=76.62  E-value=1.3  Score=38.86  Aligned_cols=15  Identities=0%  Similarity=-0.066  Sum_probs=12.2

Q ss_pred             cchHHHHHHhhhhhc
Q 042321          147 LESQSVCMELIFKEL  161 (169)
Q Consensus       147 ~~~~~va~HEiGHaL  161 (169)
                      .+-..|+.||+|||+
T Consensus       240 ~~~v~tLfHE~GHa~  254 (458)
T PF01432_consen  240 HDDVETLFHEFGHAM  254 (458)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             hhhHHHHHHHHhHHH
Confidence            345779999999986


No 79 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=76.03  E-value=1.5  Score=36.59  Aligned_cols=13  Identities=8%  Similarity=-0.176  Sum_probs=11.3

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|++||+||++
T Consensus       135 ~isvvvHElgHal  147 (277)
T cd06162         135 LISGVVHEMGHGV  147 (277)
T ss_pred             HHHHHHHHHHHHH
Confidence            5679999999986


No 80 
>PRK01265 heat shock protein HtpX; Provisional
Probab=75.22  E-value=1.5  Score=37.25  Aligned_cols=13  Identities=15%  Similarity=-0.054  Sum_probs=11.6

Q ss_pred             chHHHHHHhhhhh
Q 042321          148 ESQSVCMELIFKE  160 (169)
Q Consensus       148 ~~~~va~HEiGHa  160 (169)
                      ++..|++||+||.
T Consensus       139 El~aVlAHElgHi  151 (324)
T PRK01265        139 EIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHH
Confidence            4899999999995


No 81 
>PRK05457 heat shock protein HtpX; Provisional
Probab=75.03  E-value=1.6  Score=36.36  Aligned_cols=13  Identities=23%  Similarity=0.087  Sum_probs=11.6

Q ss_pred             chHHHHHHhhhhh
Q 042321          148 ESQSVCMELIFKE  160 (169)
Q Consensus       148 ~~~~va~HEiGHa  160 (169)
                      ++..|++||+||.
T Consensus       133 El~aVlAHElgHi  145 (284)
T PRK05457        133 EVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHH
Confidence            4899999999995


No 82 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=75.02  E-value=1.6  Score=32.97  Aligned_cols=12  Identities=8%  Similarity=0.041  Sum_probs=10.3

Q ss_pred             HHHHHHhhhhhc
Q 042321          150 QSVCMELIFKEL  161 (169)
Q Consensus       150 ~~va~HEiGHaL  161 (169)
                      ..|++||++|++
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            559999999974


No 83 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=74.77  E-value=1.5  Score=38.88  Aligned_cols=14  Identities=0%  Similarity=-0.061  Sum_probs=12.0

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      +-..|+.||+|||+
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46678999999998


No 84 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=74.46  E-value=9.3  Score=29.42  Aligned_cols=71  Identities=14%  Similarity=0.093  Sum_probs=41.2

Q ss_pred             hHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCCCCCCCCCCCC-Cceeecccccc-hHHHHHHhhhhhcCCCC
Q 042321           88 AINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDHGAGALSTAVV-ESWLMLQLLLE-SQSVCMELIFKELGIFI  165 (169)
Q Consensus        88 ~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~Hgd~~pfd~~~-e~w~~~~~~~~-~~~va~HEiGHaLGL~h  165 (169)
                      .+..+..-+..|+...++.        ...|+|.-.....|.+ +-++.. =+|.+...-.+ +..|++||+-|..-..|
T Consensus       110 ~~~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc-~~~~~I~ln~~L~~~P~~~idYVvvHEL~Hl~~~nH  180 (205)
T PF01863_consen  110 AKEYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC-SSKGNITLNWRLVMAPPEVIDYVVVHELCHLRHPNH  180 (205)
T ss_pred             HHHHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC-CCCCcEEeecccccCCccHHHHHHHHHHHHhccCCC
Confidence            3455566666777665553        3456666656555543 433321 12222111112 67899999999999999


Q ss_pred             cc
Q 042321          166 NY  167 (169)
Q Consensus       166 ~~  167 (169)
                      +-
T Consensus       181 s~  182 (205)
T PF01863_consen  181 SK  182 (205)
T ss_pred             CH
Confidence            74


No 85 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=74.44  E-value=3.4  Score=27.48  Aligned_cols=34  Identities=21%  Similarity=0.247  Sum_probs=20.1

Q ss_pred             hhHHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321           23 FDELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        23 ~~~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      ...++.++|++. -++.=+++|..|++|.+.|..-
T Consensus        17 ~~~evq~~L~~l-Gyy~g~~~g~~d~a~~~Al~~~   50 (74)
T PF08823_consen   17 VAREVQEALKRL-GYYKGEADGVWDEATEDALRAW   50 (74)
T ss_pred             HHHHHHHHHHHc-CCccCCCCCcccHHHHHHHHHH
Confidence            344555555544 2233377788888887776554


No 86 
>PRK02391 heat shock protein HtpX; Provisional
Probab=74.42  E-value=1.7  Score=36.45  Aligned_cols=13  Identities=15%  Similarity=0.049  Sum_probs=11.7

Q ss_pred             chHHHHHHhhhhh
Q 042321          148 ESQSVCMELIFKE  160 (169)
Q Consensus       148 ~~~~va~HEiGHa  160 (169)
                      ++..|++||+||.
T Consensus       132 El~aVlaHElgHi  144 (296)
T PRK02391        132 ELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHH
Confidence            4899999999995


No 87 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=74.35  E-value=1.7  Score=35.85  Aligned_cols=13  Identities=8%  Similarity=0.082  Sum_probs=11.2

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|++||+||++
T Consensus       118 ~isv~iHElgHa~  130 (263)
T cd06159         118 VVGVVVHELSHGI  130 (263)
T ss_pred             HHHHHHHHHHHHH
Confidence            5678999999985


No 88 
>PRK02870 heat shock protein HtpX; Provisional
Probab=73.82  E-value=1.8  Score=37.04  Aligned_cols=13  Identities=23%  Similarity=0.069  Sum_probs=11.9

Q ss_pred             chHHHHHHhhhhh
Q 042321          148 ESQSVCMELIFKE  160 (169)
Q Consensus       148 ~~~~va~HEiGHa  160 (169)
                      ++..|++||+||.
T Consensus       172 EL~aVlAHELgHi  184 (336)
T PRK02870        172 ELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHH
Confidence            4899999999997


No 89 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=73.52  E-value=1.7  Score=38.47  Aligned_cols=24  Identities=0%  Similarity=-0.128  Sum_probs=16.3

Q ss_pred             hHHHHHHHHHhhccCCCcceEecc
Q 042321           88 AINPVARAFQTWAPNTQFQFAESQ  111 (169)
Q Consensus        88 ~~~~i~~Af~~Ws~vt~L~F~ev~  111 (169)
                      +..+++.-|..-+.+=+|+|+++.
T Consensus       132 l~~vl~gl~~~~~~lfgi~~~~~~  155 (458)
T cd06457         132 LGTVMEGLSRLFSRLYGIRLVPVP  155 (458)
T ss_pred             HHHHHHHHHHHHHHHhCeEEEecC
Confidence            445556666666677788888875


No 90 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=73.18  E-value=0.83  Score=43.53  Aligned_cols=23  Identities=13%  Similarity=-0.068  Sum_probs=18.8

Q ss_pred             cccchHHHHHHhhhhhcCCCCcc
Q 042321          145 LLLESQSVCMELIFKELGIFINY  167 (169)
Q Consensus       145 ~~~~~~~va~HEiGHaLGL~h~~  167 (169)
                      .++....|++||+||.+|+.|+.
T Consensus       313 ~gl~~aftiahE~GH~~gm~hd~  335 (845)
T KOG3538|consen  313 TGLPSAFTIAHELGHNFGMEHDG  335 (845)
T ss_pred             CCCccceeeeeccccccCcccCC
Confidence            34446679999999999999984


No 91 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=73.16  E-value=2  Score=29.77  Aligned_cols=13  Identities=8%  Similarity=-0.010  Sum_probs=11.3

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|++||+.|++
T Consensus        25 ~~~~l~HE~~H~~   37 (128)
T PF13485_consen   25 LDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHH
Confidence            5689999999985


No 92 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=71.74  E-value=2.2  Score=33.24  Aligned_cols=14  Identities=7%  Similarity=-0.104  Sum_probs=11.5

Q ss_pred             hHHHHHHhhhhhcC
Q 042321          149 SQSVCMELIFKELG  162 (169)
Q Consensus       149 ~~~va~HEiGHaLG  162 (169)
                      +..+.+||+||++-
T Consensus        41 ~~~l~iHElgH~~~   54 (183)
T cd06160          41 LAILGIHEMGHYLA   54 (183)
T ss_pred             HHHHHHHHHHHHHH
Confidence            56688999999863


No 93 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=71.64  E-value=2  Score=37.62  Aligned_cols=13  Identities=15%  Similarity=0.089  Sum_probs=11.0

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...++.||+||||
T Consensus       131 vi~~vvHElGHal  143 (484)
T KOG2921|consen  131 VITVVVHELGHAL  143 (484)
T ss_pred             HHHHHHHHhhHHH
Confidence            4567899999998


No 94 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=70.60  E-value=2.2  Score=37.46  Aligned_cols=16  Identities=6%  Similarity=-0.043  Sum_probs=12.7

Q ss_pred             ccchHHHHHHhhhhhc
Q 042321          146 LLESQSVCMELIFKEL  161 (169)
Q Consensus       146 ~~~~~~va~HEiGHaL  161 (169)
                      ..+-..|+.||+||||
T Consensus       205 ~~~~v~tLfHEfGHal  220 (422)
T cd06456         205 THDEVTTLFHEFGHAL  220 (422)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            3446678899999998


No 95 
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=68.20  E-value=2.4  Score=38.62  Aligned_cols=14  Identities=14%  Similarity=-0.007  Sum_probs=11.9

Q ss_pred             hHHHHHHhh-hhhcC
Q 042321          149 SQSVCMELI-FKELG  162 (169)
Q Consensus       149 ~~~va~HEi-GHaLG  162 (169)
                      +.+|++||+ ||..|
T Consensus       293 ~~~v~lHEllGHGsG  307 (549)
T PF03571_consen  293 FVQVGLHELLGHGSG  307 (549)
T ss_dssp             HHHHHHHHHCCCCS-
T ss_pred             hhhhhHHhhccCcCc
Confidence            678999999 99988


No 96 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=67.10  E-value=9.1  Score=25.42  Aligned_cols=46  Identities=15%  Similarity=0.334  Sum_probs=31.9

Q ss_pred             CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCC----ccccCChHHhhhh
Q 042321            1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLK----STEVLDAGTVGKM   54 (169)
Q Consensus         1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~----~TG~lD~~T~~~m   54 (169)
                      |.+.||+.....        ..+++.+.+|++.|+-.-+|.    .-|.+|...++.|
T Consensus        25 L~~lGyy~g~~~--------g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L   74 (74)
T PF08823_consen   25 LKRLGYYKGEAD--------GVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL   74 (74)
T ss_pred             HHHcCCccCCCC--------CcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence            567899865531        124689999999999886664    4455787776543


No 97 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=67.09  E-value=2.6  Score=35.33  Aligned_cols=13  Identities=8%  Similarity=-0.156  Sum_probs=10.3

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...|++||.||.+
T Consensus       170 ~ayVlAHEyGHHV  182 (292)
T PF04228_consen  170 QAYVLAHEYGHHV  182 (292)
T ss_pred             HHHHHHHHHHHHH
Confidence            4558999999963


No 98 
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=64.44  E-value=3.3  Score=39.27  Aligned_cols=66  Identities=18%  Similarity=0.235  Sum_probs=33.9

Q ss_pred             HHHHHhhccCCCcceEeccCCCcccEEE-EeecCC-CCCCCCCCCCCCceee-----ccccc---chHHHHHHhhhhhc
Q 042321           93 ARAFQTWAPNTQFQFAESQDYRNADIKI-SFESGD-HGAGALSTAVVESWLM-----LQLLL---ESQSVCMELIFKEL  161 (169)
Q Consensus        93 ~~Af~~Ws~vt~L~F~ev~~~~~ADI~I-~F~~~~-Hgd~~pfd~~~e~w~~-----~~~~~---~~~~va~HEiGHaL  161 (169)
                      ..-|...-+.+.|.|...... +.++.= .|.+.+ ||-|.++-+.  .|..     ++..+   -.-..++|||||+-
T Consensus       173 ~~i~~~Yd~l~Gl~~~~~~~~-~~n~~~kYF~KAD~~G~G~AYY~~--~w~a~ss~s~~~~L~~~~~nW~~LHEIgHgY  248 (775)
T PF03272_consen  173 NDIFKFYDDLTGLSDDPSDPV-DKNFNNKYFAKADKSGPGAAYYGS--NWTAQSSSSLSFYLNPSPTNWGALHEIGHGY  248 (775)
T ss_pred             HHHHHHHHhhhCCCCCCCccc-ccccccceEEEecCCCCCCccccc--cceecCchhHHHHhCCCCCCchhhhhhhhhc
Confidence            455666666667777654321 111111 254443 6655555443  4532     11222   13458999999984


No 99 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=64.25  E-value=34  Score=27.32  Aligned_cols=57  Identities=16%  Similarity=0.183  Sum_probs=42.2

Q ss_pred             CCCCCCCcceEEEeccCCch---hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecCCC
Q 042321           67 PRWPASKFSLTYAFLPGTRG---DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESGDH  127 (169)
Q Consensus        67 ~kW~~~k~~LTy~~~~~~~~---~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~~H  127 (169)
                      .||.  + .++..+.+.++.   .-.+.+..-++.-+.++.+.+..+... +|+|.|.|...+.
T Consensus        28 ~Rw~--~-PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~   87 (213)
T PF11150_consen   28 RRWE--G-PVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDD   87 (213)
T ss_pred             ccCC--C-CeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccch
Confidence            5998  3 478888754443   233555555666688999999998876 8999999999876


No 100
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=64.07  E-value=3.5  Score=36.06  Aligned_cols=13  Identities=8%  Similarity=-0.148  Sum_probs=10.8

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -..+++||.||||
T Consensus       159 ~l~t~iHE~GHal  171 (396)
T cd06460         159 ALFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHHhhHHH
Confidence            4568899999996


No 101
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=63.29  E-value=3.8  Score=34.71  Aligned_cols=18  Identities=11%  Similarity=0.054  Sum_probs=15.4

Q ss_pred             cchHHHHHHhhhhhcCCC
Q 042321          147 LESQSVCMELIFKELGIF  164 (169)
Q Consensus       147 ~~~~~va~HEiGHaLGL~  164 (169)
                      ..+..|++||++|-+|..
T Consensus       194 ~~~P~T~~HElAHq~G~a  211 (318)
T PF12725_consen  194 YSLPFTICHELAHQLGFA  211 (318)
T ss_pred             ccccHHHHHHHHHHhCCC
Confidence            347889999999999975


No 102
>PRK10911 oligopeptidase A; Provisional
Probab=61.81  E-value=4  Score=38.11  Aligned_cols=14  Identities=7%  Similarity=-0.071  Sum_probs=11.8

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      +-..|+.||+||+|
T Consensus       462 ~~v~tlfHEfGHal  475 (680)
T PRK10911        462 DEVITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHHHhHHH
Confidence            35668999999998


No 103
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=60.19  E-value=4.2  Score=37.41  Aligned_cols=16  Identities=6%  Similarity=-0.103  Sum_probs=13.0

Q ss_pred             ccchHHHHHHhhhhhc
Q 042321          146 LLESQSVCMELIFKEL  161 (169)
Q Consensus       146 ~~~~~~va~HEiGHaL  161 (169)
                      .++-..|++||+||+.
T Consensus       377 t~~dV~TLaHElGHs~  392 (598)
T COG1164         377 TLRDVFTLAHELGHSV  392 (598)
T ss_pred             chhHHHHHHHHccHHH
Confidence            4456889999999985


No 104
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.06  E-value=2.4  Score=38.56  Aligned_cols=17  Identities=24%  Similarity=0.225  Sum_probs=15.6

Q ss_pred             HHHHhhhhhcCCCCccC
Q 042321          152 VCMELIFKELGIFINYN  168 (169)
Q Consensus       152 va~HEiGHaLGL~h~~~  168 (169)
                      |.+||.||-|||.+.|.
T Consensus       349 v~ahEyghdlglPDEyd  365 (760)
T COG4412         349 VEAHEYGHDLGLPDEYD  365 (760)
T ss_pred             eeeeecccccCCCcccc
Confidence            78999999999998875


No 105
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=59.23  E-value=14  Score=30.39  Aligned_cols=34  Identities=15%  Similarity=0.326  Sum_probs=23.2

Q ss_pred             cchhHHHHHHHHHHHHhcCCC-ccccCChHHhhhh
Q 042321           21 DDFDELLESAIKNYQLNYHLK-STEVLDAGTVGKM   54 (169)
Q Consensus        21 ~~~~~~~~~ai~~~Q~~~gL~-~TG~lD~~T~~~m   54 (169)
                      ..+++..+..|+.||.-|+=. .+|+.|.+|++.+
T Consensus       217 ~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l  251 (257)
T COG3023         217 GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAIL  251 (257)
T ss_pred             chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHH
Confidence            446777777777777776522 5777777777654


No 106
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=59.06  E-value=6.8  Score=35.79  Aligned_cols=30  Identities=13%  Similarity=-0.099  Sum_probs=19.9

Q ss_pred             CCCCCCCCcee------ecccccchHHHHHHhhhhh
Q 042321          131 ALSTAVVESWL------MLQLLLESQSVCMELIFKE  160 (169)
Q Consensus       131 ~pfd~~~e~w~------~~~~~~~~~~va~HEiGHa  160 (169)
                      +||.|...+.-      ++..+-.+..|++|||-|+
T Consensus       264 FP~gGMENPcltF~TpTllaGDrsl~~vIaHEIAHS  299 (613)
T KOG1047|consen  264 FPFGGMENPCLTFVTPTLLAGDRSLVDVIAHEIAHS  299 (613)
T ss_pred             CCcccccCcceeeecchhhcCCcchhhHHHHHhhhh
Confidence            67777654441      1233334789999999997


No 107
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=56.27  E-value=5.9  Score=34.97  Aligned_cols=13  Identities=15%  Similarity=-0.156  Sum_probs=11.5

Q ss_pred             chHHHHHHhhhhh
Q 042321          148 ESQSVCMELIFKE  160 (169)
Q Consensus       148 ~~~~va~HEiGHa  160 (169)
                      ++..|++||+||-
T Consensus       279 el~AVl~HELGHW  291 (428)
T KOG2719|consen  279 ELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHhhHH
Confidence            4889999999995


No 108
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=52.77  E-value=9.9  Score=31.45  Aligned_cols=21  Identities=24%  Similarity=0.393  Sum_probs=13.3

Q ss_pred             CCcccEEEEeecCCCCCCCCCC
Q 042321          113 YRNADIKISFESGDHGAGALST  134 (169)
Q Consensus       113 ~~~ADI~I~F~~~~Hgd~~pfd  134 (169)
                      ...|||+|.-....+|| .|+.
T Consensus        98 ~~~Adv~V~~~~~~~~d-~PyT  118 (262)
T PF08434_consen   98 YETADVRVASPNPKYGD-EPYT  118 (262)
T ss_pred             hcCCCEEEeCCCCCCCC-CCce
Confidence            45899999854444455 3543


No 109
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=48.75  E-value=9  Score=35.86  Aligned_cols=34  Identities=18%  Similarity=0.160  Sum_probs=20.6

Q ss_pred             hHHHH-HHHHHhhccCCCcceEeccC--CCcccEEEE
Q 042321           88 AINPV-ARAFQTWAPNTQFQFAESQD--YRNADIKIS  121 (169)
Q Consensus        88 ~~~~i-~~Af~~Ws~vt~L~F~ev~~--~~~ADI~I~  121 (169)
                      ...++ .--|..-+..=.|+|+++.+  ....|++..
T Consensus       362 l~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~~  398 (681)
T PRK10280        362 LNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVW  398 (681)
T ss_pred             HHHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeEE
Confidence            34555 35666666777888988653  235566543


No 110
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=48.49  E-value=9.1  Score=35.47  Aligned_cols=13  Identities=8%  Similarity=-0.015  Sum_probs=11.6

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...||.||+||||
T Consensus       408 ~~~~a~he~gha~  420 (644)
T PRK10733        408 KESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHH
Confidence            5779999999996


No 111
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=48.39  E-value=9.9  Score=32.77  Aligned_cols=13  Identities=15%  Similarity=-0.015  Sum_probs=11.6

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +-.|+.|||+|++
T Consensus       275 lAtvLgHE~aHaV  287 (424)
T KOG2661|consen  275 LATVLGHEIAHAV  287 (424)
T ss_pred             HHHHHHHHHHHHH
Confidence            7779999999986


No 112
>CHL00176 ftsH cell division protein; Validated
Probab=45.76  E-value=10  Score=35.25  Aligned_cols=13  Identities=0%  Similarity=0.048  Sum_probs=11.5

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...||.||.||||
T Consensus       438 ~~~vA~hEaGhA~  450 (638)
T CHL00176        438 KRLIAYHEVGHAI  450 (638)
T ss_pred             HHHHHHHhhhhHH
Confidence            5679999999998


No 113
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=45.59  E-value=9.6  Score=31.28  Aligned_cols=18  Identities=11%  Similarity=0.167  Sum_probs=12.8

Q ss_pred             cchHHHHHHhhhhhcCCC
Q 042321          147 LESQSVCMELIFKELGIF  164 (169)
Q Consensus       147 ~~~~~va~HEiGHaLGL~  164 (169)
                      -|-..|++||+-|+|=+.
T Consensus       152 ~dg~NVviHEfAH~LD~~  169 (253)
T PF06167_consen  152 NDGHNVVIHEFAHKLDME  169 (253)
T ss_dssp             SSS--HHHHHHHHHHHCT
T ss_pred             CCCcchHHHHHHHHHHhh
Confidence            346789999999999543


No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=45.19  E-value=11  Score=31.38  Aligned_cols=18  Identities=6%  Similarity=-0.189  Sum_probs=14.9

Q ss_pred             chHHHHHHhhhhhcCCCC
Q 042321          148 ESQSVCMELIFKELGIFI  165 (169)
Q Consensus       148 ~~~~va~HEiGHaLGL~h  165 (169)
                      ++..|++||+||.-|=++
T Consensus       202 ~yyaTl~HElghwtgh~~  219 (316)
T COG4227         202 NYYATLLHELGHWTGHEA  219 (316)
T ss_pred             hHHHHHHHHhccccCchh
Confidence            488899999999988543


No 115
>COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism]
Probab=44.47  E-value=8.9  Score=34.37  Aligned_cols=12  Identities=8%  Similarity=0.041  Sum_probs=10.0

Q ss_pred             HHHHHHhhhhhc
Q 042321          150 QSVCMELIFKEL  161 (169)
Q Consensus       150 ~~va~HEiGHaL  161 (169)
                      ..|++||+.|..
T Consensus       338 LDVvAHElTHGv  349 (507)
T COG3227         338 LDVVAHELTHGV  349 (507)
T ss_pred             cceehhhhcchh
Confidence            459999999974


No 116
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=41.99  E-value=15  Score=23.20  Aligned_cols=18  Identities=17%  Similarity=0.265  Sum_probs=15.4

Q ss_pred             hHHHHHHhhhhhcCCCCc
Q 042321          149 SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~  166 (169)
                      ...-++|++.+.+||.|.
T Consensus        29 ~eR~~vH~lA~~~gL~s~   46 (60)
T cd02641          29 HDRLLVHELAEELGLRHE   46 (60)
T ss_pred             HHHHHHHHHHHHcCCceE
Confidence            566789999999999875


No 117
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=41.81  E-value=13  Score=24.92  Aligned_cols=13  Identities=0%  Similarity=0.033  Sum_probs=11.0

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -..+++||+.|.+
T Consensus        61 ~~~llaHEl~Hv~   73 (79)
T PF13699_consen   61 GRALLAHELAHVV   73 (79)
T ss_pred             cchhHhHHHHHHH
Confidence            4679999999985


No 118
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=41.19  E-value=12  Score=33.72  Aligned_cols=12  Identities=8%  Similarity=-0.191  Sum_probs=9.5

Q ss_pred             HHHHHHhhhhhc
Q 042321          150 QSVCMELIFKEL  161 (169)
Q Consensus       150 ~~va~HEiGHaL  161 (169)
                      ..-++||.||||
T Consensus       260 l~s~iHE~GHal  271 (494)
T PF02074_consen  260 LFSTIHETGHAL  271 (494)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHH
Confidence            345689999997


No 119
>PF06540 GMAP:  Galanin message associated peptide (GMAP);  InterPro: IPR013068 Galanin is a peptide hormone that controls various biological activities []. Galanin-like immuno-reactivity has been found in the central and peripheral nervous systems of mammals, with high concentrations demonstrated in discrete regions of the central nervous system, including the median eminence, hypothalamus, arcuate nucleus, septum, neuro-intermediate lobe of the pituitary, and the spinal cord. Its localisation within neurosecretory granules suggests that galanin may function as a neurotransmitter, and it has been shown to coexist with a variety of other peptide and amine neurotransmitters within individual neurons []. Although the precise physiological role of galanin is uncertain, it has a number of pharmacological properties: it stimulates food intake, when injected into the third ventricle of rats; it increases levels of plasma growth hormone and prolactin, and decreases dopamine levels in the median eminence []; and infusion into humans results in hyperglycemia and glucose intolerance, and inhibits pancreatic release of insulin, somatostatin and pancreatic peptide. Galanin also modulates smooth muscle contractility within the gastro-intestinal and genito-urinary tracts, all such activities suggesting that the hormone may play an important role in the nervous modulation of endocrine and smooth muscle function []. This domain represents the galanin message-associated peptide (GMAP) domain which is found C-terminal to the galanin domain in the preprogalanin precursor protein. GMAP sequences in different species show a high degree of homology, but the biological function of the GMAP peptide is not known [].
Probab=39.87  E-value=14  Score=23.66  Aligned_cols=25  Identities=28%  Similarity=0.392  Sum_probs=22.6

Q ss_pred             hHHHHHHHHHHHHhcCCCccccCCh
Q 042321           24 DELLESAIKNYQLNYHLKSTEVLDA   48 (169)
Q Consensus        24 ~~~~~~ai~~~Q~~~gL~~TG~lD~   48 (169)
                      ++++...|..|=.|.+|+..|-||.
T Consensus        22 d~nivrTiiEFLtfLhLKEaGALd~   46 (62)
T PF06540_consen   22 DDNIVRTIIEFLTFLHLKEAGALDN   46 (62)
T ss_pred             hhHHHHHHHHHHHHHHHHHhcchhc
Confidence            6788999999999999999999984


No 120
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=39.50  E-value=13  Score=33.37  Aligned_cols=9  Identities=11%  Similarity=-0.057  Sum_probs=7.9

Q ss_pred             HHHhhhhhc
Q 042321          153 CMELIFKEL  161 (169)
Q Consensus       153 a~HEiGHaL  161 (169)
                      ++||.||||
T Consensus       263 ~iHE~GHAl  271 (497)
T COG2317         263 TIHETGHAL  271 (497)
T ss_pred             HHHhhhhHH
Confidence            489999997


No 121
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=39.02  E-value=8.7  Score=35.27  Aligned_cols=13  Identities=15%  Similarity=-0.118  Sum_probs=10.9

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +-.|+.|||||+.
T Consensus       487 IGaVIgHEI~HgF  499 (654)
T COG3590         487 IGAVIGHEIGHGF  499 (654)
T ss_pred             ccceehhhhcccc
Confidence            5679999999974


No 122
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=38.15  E-value=17  Score=32.56  Aligned_cols=17  Identities=6%  Similarity=-0.154  Sum_probs=13.2

Q ss_pred             cccchHHHHHHhhhhhc
Q 042321          145 LLLESQSVCMELIFKEL  161 (169)
Q Consensus       145 ~~~~~~~va~HEiGHaL  161 (169)
                      ...+-+.|+.||+||+-
T Consensus       243 ~t~~D~~t~~HE~GH~~  259 (477)
T cd06461         243 VNMEDFVTVHHEMGHIQ  259 (477)
T ss_pred             CCHHHHHHHHHHHHHHH
Confidence            34446779999999996


No 123
>PF04315 DUF462:  Protein of unknown function, DUF462;  InterPro: IPR007411 This family consists of bacterial proteins of uncharacterised function.
Probab=36.06  E-value=21  Score=27.49  Aligned_cols=15  Identities=7%  Similarity=-0.232  Sum_probs=12.3

Q ss_pred             cchHHHHHHhhhhhc
Q 042321          147 LESQSVCMELIFKEL  161 (169)
Q Consensus       147 ~~~~~va~HEiGHaL  161 (169)
                      -+++.-|+|||.|=.
T Consensus        39 hgf~aSALHEIaHWc   53 (164)
T PF04315_consen   39 HGFFASALHEIAHWC   53 (164)
T ss_pred             cchHHHHHHHHHHHH
Confidence            358999999999954


No 124
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=36.01  E-value=21  Score=28.62  Aligned_cols=13  Identities=8%  Similarity=0.143  Sum_probs=11.0

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +-++++||.+||-
T Consensus        93 ~gsiLAHE~mHa~  105 (212)
T PF12315_consen   93 TGSILAHELMHAW  105 (212)
T ss_pred             HhhHHHHHHHHHH
Confidence            5778999999983


No 125
>PF14891 Peptidase_M91:  Effector protein
Probab=35.70  E-value=21  Score=27.25  Aligned_cols=13  Identities=8%  Similarity=0.015  Sum_probs=11.0

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -..|+.||+.||.
T Consensus       103 p~v~L~HEL~HA~  115 (174)
T PF14891_consen  103 PFVVLYHELIHAY  115 (174)
T ss_pred             HHHHHHHHHHHHH
Confidence            5678999999985


No 126
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=35.54  E-value=19  Score=32.10  Aligned_cols=13  Identities=15%  Similarity=-0.013  Sum_probs=11.2

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      ...+|.||.||||
T Consensus       311 ~~~~A~hEaGhAl  323 (495)
T TIGR01241       311 KKLVAYHEAGHAL  323 (495)
T ss_pred             HHHHHHHHHhHHH
Confidence            4679999999996


No 127
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=33.98  E-value=29  Score=33.20  Aligned_cols=12  Identities=8%  Similarity=0.044  Sum_probs=9.8

Q ss_pred             hHHHHHHhhhhh
Q 042321          149 SQSVCMELIFKE  160 (169)
Q Consensus       149 ~~~va~HEiGHa  160 (169)
                      ...|++|||.|-
T Consensus       287 ~~~viaHElAHq  298 (831)
T TIGR02412       287 RAGVILHEMAHM  298 (831)
T ss_pred             HHHHHHHHHHHH
Confidence            457999999993


No 128
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=33.87  E-value=22  Score=26.25  Aligned_cols=13  Identities=0%  Similarity=-0.000  Sum_probs=11.6

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +..|++||+=|++
T Consensus        59 l~~~l~HEm~H~~   71 (146)
T smart00731       59 LRETLLHELCHAA   71 (146)
T ss_pred             HHhhHHHHHHHHH
Confidence            7779999999986


No 129
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=33.48  E-value=19  Score=29.54  Aligned_cols=12  Identities=0%  Similarity=-0.466  Sum_probs=9.0

Q ss_pred             HHHHHHhhhhhc
Q 042321          150 QSVCMELIFKEL  161 (169)
Q Consensus       150 ~~va~HEiGHaL  161 (169)
                      ..-..|||||-+
T Consensus       220 ~WG~~HE~GH~~  231 (307)
T PF13402_consen  220 GWGPWHELGHNH  231 (307)
T ss_dssp             -HHHHHHHHHHH
T ss_pred             Ceeehhhhhhhc
Confidence            446799999964


No 130
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=33.04  E-value=20  Score=33.56  Aligned_cols=35  Identities=14%  Similarity=0.185  Sum_probs=24.2

Q ss_pred             hhHHHHHHHHHhhccCCCcceEeccCC--CcccEEEE
Q 042321           87 DAINPVARAFQTWAPNTQFQFAESQDY--RNADIKIS  121 (169)
Q Consensus        87 ~~~~~i~~Af~~Ws~vt~L~F~ev~~~--~~ADI~I~  121 (169)
                      ...+++.-.|.+-...=+|+|+|..+.  ...|+++-
T Consensus       362 ~l~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~  398 (683)
T COG0339         362 PLNKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF  398 (683)
T ss_pred             ChhHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence            345677777777778888999997432  35677754


No 131
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=32.83  E-value=25  Score=22.42  Aligned_cols=19  Identities=16%  Similarity=0.186  Sum_probs=15.8

Q ss_pred             hHHHHHHhhhhhcCCCCcc
Q 042321          149 SQSVCMELIFKELGIFINY  167 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~  167 (169)
                      -..-++|.+-|-|||.|.+
T Consensus        29 ~eRriih~la~~lGL~~~s   47 (60)
T cd02639          29 AERRIVHLLASRLGLNHVS   47 (60)
T ss_pred             HHHHHHHHHHHHcCCceEE
Confidence            4557899999999999875


No 132
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=30.75  E-value=23  Score=29.68  Aligned_cols=12  Identities=8%  Similarity=-0.103  Sum_probs=9.8

Q ss_pred             hHHHHHHhhhhh
Q 042321          149 SQSVCMELIFKE  160 (169)
Q Consensus       149 ~~~va~HEiGHa  160 (169)
                      ...|++||.||-
T Consensus       167 qaYViAHEVGHH  178 (295)
T COG2321         167 QAYVIAHEVGHH  178 (295)
T ss_pred             HHHHHHhhhhHH
Confidence            456999999984


No 133
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=30.46  E-value=31  Score=31.75  Aligned_cols=12  Identities=8%  Similarity=0.014  Sum_probs=10.3

Q ss_pred             hHHHHHHhhhhh
Q 042321          149 SQSVCMELIFKE  160 (169)
Q Consensus       149 ~~~va~HEiGHa  160 (169)
                      ...|++|||.|.
T Consensus       279 ~~~viaHElAHq  290 (601)
T TIGR02411       279 NVDVIAHELAHS  290 (601)
T ss_pred             hhhhHHHHHHhh
Confidence            567999999994


No 134
>KOG2987 consensus Fatty acid desaturase [Lipid transport and metabolism]
Probab=29.35  E-value=26  Score=29.22  Aligned_cols=20  Identities=20%  Similarity=0.135  Sum_probs=16.9

Q ss_pred             hHHHHHHhhhhhcCCCCccC
Q 042321          149 SQSVCMELIFKELGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~~~  168 (169)
                      ...-|+|||-|-|+.++++.
T Consensus        83 sl~LAIHeiSHN~aFg~~rp  102 (324)
T KOG2987|consen   83 SLTLAIHEISHNLAFGTNRP  102 (324)
T ss_pred             hHHHHHHHhhhhhhcccCch
Confidence            45689999999999998764


No 135
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=29.15  E-value=22  Score=29.41  Aligned_cols=19  Identities=5%  Similarity=0.032  Sum_probs=15.5

Q ss_pred             cchHHHHHHhhhhhcCCCC
Q 042321          147 LESQSVCMELIFKELGIFI  165 (169)
Q Consensus       147 ~~~~~va~HEiGHaLGL~h  165 (169)
                      .|-..|++||+-|.|=+.+
T Consensus       140 ~dg~NVvIHEFAH~LDm~~  158 (260)
T PRK15410        140 ASGFNLIIHEVAHKLDMRN  158 (260)
T ss_pred             CCCcchhHhHHHhHhhhhc
Confidence            3567899999999997665


No 136
>PF15639 Tox-MPTase3:  Metallopeptidase toxin 3
Probab=27.75  E-value=1.3e+02  Score=22.28  Aligned_cols=69  Identities=13%  Similarity=0.162  Sum_probs=41.1

Q ss_pred             HHHHHHHHhhccCCCcceEeccC-CCcccEEEEeecCCC--CCCCCCCCCCCceeec-------c---------cccchH
Q 042321           90 NPVARAFQTWAPNTQFQFAESQD-YRNADIKISFESGDH--GAGALSTAVVESWLML-------Q---------LLLESQ  150 (169)
Q Consensus        90 ~~i~~Af~~Ws~vt~L~F~ev~~-~~~ADI~I~F~~~~H--gd~~pfd~~~e~w~~~-------~---------~~~~~~  150 (169)
                      ..+-+|++.|...+-..-++.-. +..+-|+|.-..+.|  |-..|+++..+-. +.       +         ..+-..
T Consensus        23 p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~A~G~t~p~~~~~~i~-I~~~lV~~~Ek~~~~~r~~~~~~v~  101 (135)
T PF15639_consen   23 PAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGTAYGYTTPDDGSQVIR-IDGDLVNMFEKGPGERRAGNVYLVG  101 (135)
T ss_pred             hHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccceeeeeecCCCCCcEEE-eeHHHhhhhhcCCcccccceEEEee
Confidence            55678888887766444444322 456889999888854  4445666443221 10       0         011156


Q ss_pred             HHHHHhhhh
Q 042321          151 SVCMELIFK  159 (169)
Q Consensus       151 ~va~HEiGH  159 (169)
                      .+++||.=|
T Consensus       102 ~TlLHEliH  110 (135)
T PF15639_consen  102 STLLHELIH  110 (135)
T ss_pred             hHHHHHHHH
Confidence            799999877


No 137
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=27.52  E-value=31  Score=30.04  Aligned_cols=15  Identities=13%  Similarity=-0.110  Sum_probs=12.4

Q ss_pred             cchHHHHHHhhhhhc
Q 042321          147 LESQSVCMELIFKEL  161 (169)
Q Consensus       147 ~~~~~va~HEiGHaL  161 (169)
                      -++-.|+.||+||..
T Consensus       122 sEvAAVl~HEmgHVt  136 (479)
T COG4784         122 SEVAAVLAHEMGHVT  136 (479)
T ss_pred             HHHHHHHHhhhhhee
Confidence            358889999999964


No 138
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=27.39  E-value=37  Score=24.94  Aligned_cols=14  Identities=7%  Similarity=-0.031  Sum_probs=12.2

Q ss_pred             chHHHHHHhhhhhc
Q 042321          148 ESQSVCMELIFKEL  161 (169)
Q Consensus       148 ~~~~va~HEiGHaL  161 (169)
                      .+..|++||+=|+.
T Consensus        59 ~~~~tL~HEm~H~~   72 (157)
T PF10263_consen   59 ELIDTLLHEMAHAA   72 (157)
T ss_pred             HHHHHHHHHHHHHH
Confidence            38889999999986


No 139
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=27.18  E-value=67  Score=31.13  Aligned_cols=20  Identities=25%  Similarity=0.315  Sum_probs=14.9

Q ss_pred             hHHHHHHhhhhh-cCCCCccC
Q 042321          149 SQSVCMELIFKE-LGIFINYN  168 (169)
Q Consensus       149 ~~~va~HEiGHa-LGL~h~~~  168 (169)
                      .-.+.+||-||. -|+.+.||
T Consensus       149 ~~r~~VheWah~R~GvfdEy~  169 (863)
T TIGR00868       149 RGRVFVHEWAHLRWGVFDEYN  169 (863)
T ss_pred             ccchhhhhhhhhhcccccccc
Confidence            345899999996 46666665


No 140
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=25.15  E-value=73  Score=25.97  Aligned_cols=30  Identities=20%  Similarity=0.223  Sum_probs=26.4

Q ss_pred             HHHHHHHHHhcC----CCccccCChHHhhhhhcC
Q 042321           28 ESAIKNYQLNYH----LKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        28 ~~ai~~~Q~~~g----L~~TG~lD~~T~~~m~~p   57 (169)
                      -.|++-+|+-.+    .++.|++-..|+..+++-
T Consensus        94 ~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~~~  127 (252)
T COG3926          94 GRAAKWLQKALGPAYTVRVDGVIGAQTLAALKKD  127 (252)
T ss_pred             chHHHHHHHHhccCCcccccCcccHHHHHHHHhc
Confidence            578999999999    999999999999877654


No 141
>PF08571 Yos1:  Yos1-like;  InterPro: IPR013880  In yeast, Yos1 is a subunit of the Yip1p-Yif1p complex and is required for transport between the endoplasmic reticulum and the Golgi complex. Yos1 appears to be conserved in eukaryotes []. 
Probab=24.97  E-value=1.2e+02  Score=20.48  Aligned_cols=41  Identities=12%  Similarity=0.199  Sum_probs=22.6

Q ss_pred             CCCccCCCCCCCCCCCCCCCcchhHHHHHHHHHHHHhcCCC
Q 042321            1 MGKFGYLNYDNSNNQTHANDDDFDELLESAIKNYQLNYHLK   41 (169)
Q Consensus         1 L~~fGYl~~~~~~~~~~~~~~~~~~~~~~ai~~~Q~~~gL~   41 (169)
                      |.|+|+-.++....|....+......+..-|..-|.++..|
T Consensus        24 L~kiGw~~~~~~~~g~~~~~~siK~qii~LI~aVrT~mRiP   64 (80)
T PF08571_consen   24 LAKIGWSSSQNSGFGFGGDPNSIKAQIINLIRAVRTLMRIP   64 (80)
T ss_pred             hccCCCCCccccccccCCCCcCHHHHHHHHHHHHHHHHHhH
Confidence            67889977665432211222334556666666666665555


No 142
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=23.08  E-value=51  Score=20.87  Aligned_cols=18  Identities=11%  Similarity=0.198  Sum_probs=15.3

Q ss_pred             hHHHHHHhhhhhcCCCCc
Q 042321          149 SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~  166 (169)
                      ...-++|++-+.+||.+.
T Consensus        29 ~eR~~vH~~a~~~gL~s~   46 (60)
T cd02640          29 EERALIHQIAQKYGLKSR   46 (60)
T ss_pred             HHHHHHHHHHHHcCCcee
Confidence            566889999999999864


No 143
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=22.42  E-value=40  Score=31.29  Aligned_cols=13  Identities=8%  Similarity=-0.171  Sum_probs=10.9

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      +-.|++|||||+.
T Consensus       518 iG~vigHEl~H~F  530 (687)
T KOG3624|consen  518 IGFVIGHELTHGF  530 (687)
T ss_pred             HHHHHHHHHhhcc
Confidence            5569999999965


No 144
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=22.30  E-value=42  Score=31.23  Aligned_cols=13  Identities=0%  Similarity=-0.066  Sum_probs=10.6

Q ss_pred             hHHHHHHhhhhhc
Q 042321          149 SQSVCMELIFKEL  161 (169)
Q Consensus       149 ~~~va~HEiGHaL  161 (169)
                      -.-++-||+|||+
T Consensus       479 ~vetLFHEmGHAM  491 (704)
T KOG2090|consen  479 EVETLFHEMGHAM  491 (704)
T ss_pred             HHHHHHHHHHHHH
Confidence            4558899999995


No 145
>PF08634 Pet127:  Mitochondrial protein Pet127;  InterPro: IPR013943  Pet127 has been implicated in mitochondrial RNA stability and/or processing and is localised to the mitochondrial membrane []. 
Probab=21.93  E-value=89  Score=26.06  Aligned_cols=55  Identities=15%  Similarity=0.134  Sum_probs=33.8

Q ss_pred             cCCCccccCChHHhhhhhcCCccccccCCCCCCCCCcceEEEeccCCch------hhHHHHHHHHHhhcc
Q 042321           38 YHLKSTEVLDAGTVGKMRTHDAELQILGSPRWPASKFSLTYAFLPGTRG------DAINPVARAFQTWAP  101 (169)
Q Consensus        38 ~gL~~TG~lD~~T~~~m~~pD~~~~~~g~~kW~~~k~~LTy~~~~~~~~------~~~~~i~~Af~~Ws~  101 (169)
                      =.||.||+.|-.|++...-   ||...   ..   .....|.|....-.      |..+.|+.||-.++=
T Consensus       154 prLPg~G~FDLKTRAv~~I---R~d~~---~~---e~~~gY~I~~~~G~~ESfEREy~DliRsa~LKYsl  214 (274)
T PF08634_consen  154 PRLPGTGVFDLKTRAVVAI---RYDSS---NY---ENGTGYQIRKRFGQFESFEREYYDLIRSAFLKYSL  214 (274)
T ss_pred             CCCCCCCceeccchheeeh---hccCc---ch---hccCCeEeeeeeCchhHHHHHHHHHHHHHHHHHHH
Confidence            4689999999999976422   33211   11   22256666542221      677888888887764


No 146
>COG5192 BMS1 GTP-binding protein required for 40S ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=21.33  E-value=53  Score=30.74  Aligned_cols=22  Identities=14%  Similarity=0.006  Sum_probs=17.4

Q ss_pred             HhcCCCccccCChHHhhhhhcC
Q 042321           36 LNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        36 ~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      +|-.||+--.-|..|+..|-+=
T Consensus       775 RFQsiPvys~~DsrTRnRMlKY  796 (1077)
T COG5192         775 RFQSIPVYSMKDSRTRNRMLKY  796 (1077)
T ss_pred             hhcccceeeecchhhhhhhhhc
Confidence            3445899999999999988754


No 147
>PF07236 Phytoreo_S7:  Phytoreovirus S7 protein;  InterPro: IPR009873 This family consists of several Phytoreovirus S7 proteins which are thought to be viral core proteins [].; GO: 0019012 virion; PDB: 1UF2_K.
Probab=21.21  E-value=32  Score=30.68  Aligned_cols=33  Identities=24%  Similarity=0.204  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHhcCCCccccCChHHhhhhhcC
Q 042321           25 ELLESAIKNYQLNYHLKSTEVLDAGTVGKMRTH   57 (169)
Q Consensus        25 ~~~~~ai~~~Q~~~gL~~TG~lD~~T~~~m~~p   57 (169)
                      --+..||+-|-.+.....+|.+|++|+-+++.|
T Consensus       204 ~li~~a~kyF~~yv~~~~~g~~d~at~i~LN~p  236 (506)
T PF07236_consen  204 MLIMKAIKYFRNYVNNQSSGSMDQATRILLNTP  236 (506)
T ss_dssp             ---------------------------------
T ss_pred             HHHHHHHHHHHHhhhhhhcCCCChHHHHHhcCc
Confidence            357889999999999999999999999999999


No 148
>PF15127 DUF4565:  Protein of unknown function (DUF4565)
Probab=21.13  E-value=63  Score=22.37  Aligned_cols=15  Identities=33%  Similarity=0.603  Sum_probs=12.6

Q ss_pred             hHHHHHHHHHhhccC
Q 042321           88 AINPVARAFQTWAPN  102 (169)
Q Consensus        88 ~~~~i~~Af~~Ws~v  102 (169)
                      .++++.+|++.|+.+
T Consensus        62 SqEIl~dAlqQWA~~   76 (91)
T PF15127_consen   62 SQEILSDALQQWAEN   76 (91)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            468889999999874


No 149
>PF13590 DUF4136:  Domain of unknown function (DUF4136)
Probab=20.91  E-value=1.8e+02  Score=20.81  Aligned_cols=41  Identities=24%  Similarity=0.443  Sum_probs=25.1

Q ss_pred             EEEeccCC--ch--------hhHHHHHHHHHhhccCCCcceEeccCCCcccEEEEeecC
Q 042321           77 TYAFLPGT--RG--------DAINPVARAFQTWAPNTQFQFAESQDYRNADIKISFESG  125 (169)
Q Consensus        77 Ty~~~~~~--~~--------~~~~~i~~Af~~Ws~vt~L~F~ev~~~~~ADI~I~F~~~  125 (169)
                      ||+|....  +.        .++++|+++|.      ..-|+++  ...||+.|.|.-.
T Consensus        19 Ty~~~~~~~~~~~~~~~~~~~i~~~v~~~L~------~~G~~~~--~~~aDl~V~~~~~   69 (151)
T PF13590_consen   19 TYAFVPSSSDPAQSNPLDDQRIQDAVEQELA------AKGYRRV--PENADLLVSYHYS   69 (151)
T ss_pred             eEEEecCCcCccccCcHHHHHHHHHHHHHHH------HCCCeec--ccCCCEEEEEEEE
Confidence            78777652  11        24556666664      3556666  2478888887654


No 150
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.39  E-value=63  Score=20.40  Aligned_cols=18  Identities=11%  Similarity=0.318  Sum_probs=15.4

Q ss_pred             hHHHHHHhhhhhcCCCCc
Q 042321          149 SQSVCMELIFKELGIFIN  166 (169)
Q Consensus       149 ~~~va~HEiGHaLGL~h~  166 (169)
                      ...-++|++-+.+||.+.
T Consensus        28 ~eR~~vH~~a~~~gL~s~   45 (59)
T cd06007          28 HERAVIHRLCRKLGLKSK   45 (59)
T ss_pred             HHHHHHHHHHHHcCCCce
Confidence            566889999999999875


Done!