Query         psy2949
Match_columns 367
No_of_seqs    291 out of 1661
Neff          7.4 
Searched_HMMs 46136
Date          Fri Aug 16 21:41:07 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2949.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2949hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd04282 ZnMc_meprin Zinc-depen 100.0 1.6E-64 3.5E-69  463.7  17.9  206    1-214    24-230 (230)
  2 cd04281 ZnMc_BMP1_TLD Zinc-dep 100.0 5.4E-62 1.2E-66  439.8  18.5  196   16-215     2-200 (200)
  3 cd04283 ZnMc_hatching_enzyme Z 100.0 3.9E-56 8.4E-61  395.8  15.7  177   28-214     5-182 (182)
  4 PF01400 Astacin:  Astacin (Pep 100.0 1.8E-56   4E-61  403.7  13.8  189   22-215     1-190 (191)
  5 KOG3714|consensus              100.0 2.6E-55 5.5E-60  437.6  18.4  202   11-217    69-272 (411)
  6 cd04280 ZnMc_astacin_like Zinc 100.0 3.6E-54 7.8E-59  385.3  15.4  180   26-212     1-180 (180)
  7 cd04327 ZnMc_MMP_like_3 Zinc-d 100.0 9.2E-41   2E-45  303.2  11.1  168   22-212     1-197 (198)
  8 smart00235 ZnMc Zinc-dependent 100.0 5.8E-30 1.3E-34  219.7  12.3  135   21-165     2-140 (140)
  9 cd04268 ZnMc_MMP_like Zinc-dep 100.0 5.8E-29 1.3E-33  218.7  10.9  145   27-212     2-165 (165)
 10 KOG3714|consensus               99.9 1.4E-23 3.1E-28  209.7   3.7  120  191-313   168-287 (411)
 11 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.9 1.3E-23 2.8E-28  190.0   3.0  102  192-295    97-200 (200)
 12 cd04282 ZnMc_meprin Zinc-depen  99.9 2.6E-23 5.7E-28  191.3   2.8  101  192-294   130-230 (230)
 13 cd04283 ZnMc_hatching_enzyme Z  99.8 1.1E-21 2.4E-26  174.9   1.8   96  192-294    87-182 (182)
 14 cd00203 ZnMc Zinc-dependent me  99.8 1.3E-19 2.7E-24  159.4  11.6  145   28-212     2-167 (167)
 15 PF01400 Astacin:  Astacin (Pep  99.8 3.5E-20 7.6E-25  167.3   2.0  103  192-296    89-191 (191)
 16 cd04280 ZnMc_astacin_like Zinc  99.8 1.5E-19 3.3E-24  161.7   1.4   97  192-292    84-180 (180)
 17 cd04278 ZnMc_MMP Zinc-dependen  99.7 9.6E-17 2.1E-21  140.4  11.7   99   23-122     1-125 (157)
 18 cd04277 ZnMc_serralysin_like Z  99.5 4.4E-14 9.5E-19  126.9  12.0  141   37-213    30-186 (186)
 19 cd04279 ZnMc_MMP_like_1 Zinc-d  99.5 1.7E-13 3.6E-18  119.7  11.3   98   28-125     3-125 (156)
 20 PF00413 Peptidase_M10:  Matrix  99.5 7.9E-13 1.7E-17  114.6  13.1  100   23-122     1-123 (154)
 21 cd04327 ZnMc_MMP_like_3 Zinc-d  99.2 2.1E-12 4.5E-17  117.3   1.9   87  192-293   102-198 (198)
 22 PF12388 Peptidase_M57:  Dual-a  99.0 7.1E-09 1.5E-13   93.9  12.2  178    1-212     3-211 (211)
 23 cd04276 ZnMc_MMP_like_2 Zinc-d  98.5 1.2E-06 2.7E-11   79.4  12.7   31   25-56      6-36  (197)
 24 KOG1565|consensus               98.5 7.6E-07 1.6E-11   90.6  11.1  104   17-121    99-228 (469)
 25 cd04268 ZnMc_MMP_like Zinc-dep  97.7 1.4E-05 3.1E-10   69.8   2.0   44  235-292   122-165 (165)
 26 PF02031 Peptidase_M7:  Strepto  97.6   3E-05 6.5E-10   64.5   2.6   86   28-118     3-91  (132)
 27 COG5549 Predicted Zn-dependent  97.2 0.00076 1.6E-08   60.7   6.1   80   40-119   103-202 (236)
 28 PF13583 Reprolysin_4:  Metallo  95.7  0.0045 9.7E-08   56.6   1.5   19  105-123   138-156 (206)
 29 PF13582 Reprolysin_3:  Metallo  95.6  0.0053 1.1E-07   50.9   1.3   19  103-121   106-124 (124)
 30 PF13574 Reprolysin_2:  Metallo  95.4    0.01 2.2E-07   52.7   2.7   20  104-123   111-130 (173)
 31 PF05548 Peptidase_M11:  Gameto  95.3   0.031 6.8E-07   54.4   5.8   81   39-123    86-169 (314)
 32 PF05572 Peptidase_M43:  Pregna  95.2  0.0082 1.8E-07   52.4   1.3   20  106-125    71-90  (154)
 33 cd04267 ZnMc_ADAM_like Zinc-de  94.9   0.015 3.2E-07   52.2   2.2   18  106-123   135-152 (192)
 34 cd00203 ZnMc Zinc-dependent me  94.8   0.023   5E-07   49.4   3.0   40  235-292   128-167 (167)
 35 smart00042 CUB Domain first fo  94.8   0.024 5.3E-07   44.7   2.8   29  308-340     4-32  (102)
 36 PF13688 Reprolysin_5:  Metallo  94.5   0.018 3.9E-07   51.7   1.5   23  104-126   142-164 (196)
 37 cd04275 ZnMc_pappalysin_like Z  93.9  0.0096 2.1E-07   55.2  -1.5   20  106-125   139-158 (225)
 38 cd00041 CUB CUB domain; extrac  93.6   0.074 1.6E-06   42.4   3.4   33  305-341    10-43  (113)
 39 COG1913 Predicted Zn-dependent  93.3   0.035 7.5E-07   49.0   1.0   18  106-123   126-143 (181)
 40 cd04269 ZnMc_adamalysin_II_lik  93.2   0.039 8.5E-07   49.6   1.3   19  105-123   132-150 (194)
 41 cd04272 ZnMc_salivary_gland_MP  93.0   0.064 1.4E-06   49.3   2.5   20  105-124   146-165 (220)
 42 TIGR03296 M6dom_TIGR03296 M6 f  93.0    0.19 4.1E-06   48.3   5.6   19  104-122   165-183 (286)
 43 PF00431 CUB:  CUB domain CUB d  92.4   0.058 1.3E-06   42.8   1.1   34  304-341     8-42  (110)
 44 cd04271 ZnMc_ADAM_fungal Zinc-  92.2   0.053 1.2E-06   50.4   0.7   17  106-122   147-163 (228)
 45 PRK13267 archaemetzincin-like   92.0   0.073 1.6E-06   47.6   1.3   15  106-120   127-141 (179)
 46 cd04270 ZnMc_TACE_like Zinc-de  91.9   0.072 1.6E-06   50.0   1.3   17  106-122   169-185 (244)
 47 PF07998 Peptidase_M54:  Peptid  91.8   0.077 1.7E-06   48.0   1.3   17  106-122   147-163 (194)
 48 cd04273 ZnMc_ADAMTS_like Zinc-  91.7   0.041 8.8E-07   50.2  -0.7   19  105-123   141-159 (207)
 49 PF01421 Reprolysin:  Reprolysi  90.2    0.14 2.9E-06   46.3   1.3   18  106-123   133-150 (199)
 50 PF11350 DUF3152:  Protein of u  89.2    0.15 3.3E-06   46.0   0.9   98   27-124    29-159 (203)
 51 PF10462 Peptidase_M66:  Peptid  89.0    0.17 3.8E-06   48.9   1.1   16  106-121   195-210 (305)
 52 cd02641 R3H_Smubp-2_like R3H d  86.8    0.58 1.3E-05   34.1   2.5   29  105-133    30-58  (60)
 53 PTZ00337 surface protease GP63  86.1    0.25 5.5E-06   51.8   0.3   28   93-120   218-245 (567)
 54 PF12044 Metallopep:  Putative   85.6    0.34 7.4E-06   48.9   1.0   18  106-123   318-335 (423)
 55 PF01457 Peptidase_M8:  Leishma  84.2    0.46   1E-05   49.6   1.3   17  105-121   211-227 (521)
 56 cd06459 M3B_Oligoendopeptidase  82.5     1.8   4E-05   43.3   4.8   20   98-117   216-235 (427)
 57 KOG4525|consensus               81.9    0.57 1.2E-05   46.9   0.8   18  106-123   303-320 (614)
 58 TIGR00181 pepF oligoendopeptid  81.8     1.9   4E-05   45.7   4.7   74   40-118   314-392 (591)
 59 PF12725 DUF3810:  Protein of u  81.0     1.1 2.5E-05   43.7   2.5   75   40-122   133-214 (318)
 60 PF05547 Peptidase_M6:  Immune   80.1    0.37   8E-06   51.3  -1.3   16  104-119   221-236 (645)
 61 TIGR02290 M3_fam_3 oligoendope  79.8     2.6 5.7E-05   44.6   5.0   75   40-118   310-389 (587)
 62 PTZ00257 Glycoprotein GP63 (le  79.1    0.91   2E-05   47.9   1.2   17  104-120   256-272 (622)
 63 PF02408 CUB_2:  CUB-like domai  72.8     6.6 0.00014   32.4   4.6   29  313-341    31-61  (120)
 64 cd02640 R3H_NRF R3H domain of   72.4     3.3 7.2E-05   30.2   2.3   28  106-133    31-58  (60)
 65 PHA02456 zinc metallopeptidase  72.3     2.3 4.9E-05   34.9   1.6   37   77-119    58-94  (141)
 66 cd06007 R3H_DEXH_helicase R3H   71.8     3.7   8E-05   29.8   2.4   29  105-133    29-57  (59)
 67 KOG3607|consensus               71.4     1.9 4.1E-05   46.6   1.2   17  106-122   325-341 (716)
 68 PF09471 Peptidase_M64:  IgA Pe  70.5     2.1 4.6E-05   40.6   1.3   26   26-51     80-105 (264)
 69 PF13398 Peptidase_M50B:  Pepti  66.4     2.8 6.1E-05   38.0   1.1   14  106-119    24-37  (200)
 70 PF04298 Zn_peptidase_2:  Putat  65.2     3.7   8E-05   37.9   1.6   13  105-117    90-102 (222)
 71 COG2856 Predicted Zn peptidase  65.2       3 6.6E-05   38.3   1.1   16  106-123    74-89  (213)
 72 TIGR02289 M3_not_pepF oligoend  62.9     4.7  0.0001   42.3   2.1   21   99-119   332-352 (549)
 73 PF06114 DUF955:  Domain of unk  61.6     4.2 9.1E-05   32.4   1.2   12  105-116    43-54  (122)
 74 cd02646 R3H_G-patch R3H domain  61.0     8.2 0.00018   27.7   2.5   28  106-133    29-56  (58)
 75 PF06262 DUF1025:  Possibl zinc  60.9     4.1 8.8E-05   32.7   1.0   17  106-122    75-91  (97)
 76 PF03272 Enhancin:  Viral enhan  59.3     7.5 0.00016   42.6   2.9  124   22-153   128-280 (775)
 77 cd02642 R3H_encore_like R3H do  59.1     9.5  0.0002   27.9   2.6   29  105-134    33-61  (63)
 78 PF01863 DUF45:  Protein of unk  58.9      27 0.00058   31.2   6.1   17  105-121   165-181 (205)
 79 PF11667 DUF3267:  Protein of u  57.7     6.6 0.00014   31.9   1.7   20  106-125     6-25  (111)
 80 PF12388 Peptidase_M57:  Dual-a  57.4     3.4 7.3E-05   37.9  -0.1   40  232-292   170-211 (211)
 81 PF14247 DUF4344:  Domain of un  56.5     5.5 0.00012   36.8   1.2   11  106-116    94-104 (220)
 82 COG3824 Predicted Zn-dependent  55.9     4.9 0.00011   33.4   0.7   16  106-121   111-126 (136)
 83 PF02163 Peptidase_M50:  Peptid  55.5     5.8 0.00012   35.2   1.1   21  105-125     8-28  (192)
 84 smart00235 ZnMc Zinc-dependent  55.2     2.3 4.9E-05   35.8  -1.5   45  192-245    96-140 (140)
 85 COG4783 Putative Zn-dependent   51.3       7 0.00015   39.9   1.1   15  104-118   130-144 (484)
 86 PF01432 Peptidase_M3:  Peptida  51.0     7.4 0.00016   39.7   1.2   15  105-119   243-257 (458)
 87 cd06258 Peptidase_M3_like The   50.9     7.1 0.00015   38.4   1.0   12  105-116   155-166 (365)
 88 cd05709 S2P-M50 Site-2 proteas  49.3     8.3 0.00018   33.9   1.1   19  106-124    10-28  (180)
 89 TIGR00043 metalloprotein, YbeY  49.3     9.7 0.00021   31.2   1.4   17  106-122    75-91  (110)
 90 PF15639 Tox-MPTase3:  Metallop  48.0      25 0.00055   29.5   3.6   70   43-114    22-110 (135)
 91 PF01447 Peptidase_M4:  Thermol  47.7     9.6 0.00021   33.1   1.2   11  106-116   137-147 (150)
 92 COG1164 Oligoendopeptidase F [  47.0     8.2 0.00018   41.1   0.8   75   40-118   315-394 (598)
 93 cd06455 M3A_TOP Peptidase M3 T  46.5     9.4  0.0002   39.3   1.1   12  106-117   265-276 (472)
 94 cd06457 M3A_MIP Peptidase M3 m  44.4      11 0.00023   38.8   1.1   15  106-120   250-264 (458)
 95 cd06163 S2P-M50_PDZ_RseP-like   42.9      12 0.00027   33.4   1.2   19  106-124    11-29  (182)
 96 PF02130 UPF0054:  Uncharacteri  42.8      13 0.00029   31.9   1.4   16  106-121   110-125 (145)
 97 COG2738 Predicted Zn-dependent  42.2      12 0.00026   33.9   0.9   17  101-117    89-105 (226)
 98 cd06162 S2P-M50_PDZ_SREBP Ster  41.0      13 0.00028   35.6   1.1   17  106-122   137-153 (277)
 99 PF13485 Peptidase_MA_2:  Pepti  40.6      14 0.00031   29.5   1.2   11  106-116    27-37  (128)
100 cd06456 M3A_DCP_Oligopeptidase  39.8      14  0.0003   37.5   1.2   13  105-117   209-221 (422)
101 COG0319 Predicted metal-depend  39.4      16 0.00035   31.8   1.3   18  106-123   110-127 (153)
102 cd06159 S2P-M50_PDZ_Arch Uncha  39.4      14 0.00031   35.0   1.1   21  106-126   120-140 (263)
103 KOG3658|consensus               39.3      10 0.00022   40.3   0.1   16  106-121   394-409 (764)
104 PF01435 Peptidase_M48:  Peptid  39.1      15 0.00033   33.0   1.2   13  105-117    90-102 (226)
105 cd06158 S2P-M50_like_1 Unchara  38.9      15 0.00033   32.7   1.1   12  105-116    10-21  (181)
106 PRK02870 heat shock protein Ht  38.8      12 0.00026   36.9   0.5   15  105-122   174-188 (336)
107 cd06161 S2P-M50_SpoIVFB SpoIVF  38.1      16 0.00035   33.2   1.2   21  106-126    40-60  (208)
108 PRK00016 metal-binding heat sh  37.6      17 0.00037   31.8   1.2   17  106-122   115-131 (159)
109 PRK13963 unkown domain/putativ  35.6      20 0.00044   33.8   1.4   16  106-121   217-232 (258)
110 cd06460 M32_Taq Peptidase fami  35.4      18 0.00038   36.6   1.0   17  106-125   161-177 (396)
111 smart00731 SprT SprT homologue  35.0      20 0.00042   30.7   1.2   15  105-119    60-74  (146)
112 KOG2921|consensus               34.9      18 0.00039   36.2   1.0   33  106-141   133-165 (484)
113 cd06006 R3H_unknown_2 R3H doma  34.7      41  0.0009   24.4   2.6   29  105-133    29-57  (59)
114 PF10263 SprT-like:  SprT-like   34.7      20 0.00043   30.7   1.2   16  105-120    61-76  (157)
115 PF12315 DUF3633:  Protein of u  34.4      20 0.00043   32.8   1.2   14  106-119    95-108 (212)
116 cd06164 S2P-M50_SpoIVFB_CBS Sp  33.8      21 0.00044   33.1   1.2   21  106-126    55-75  (227)
117 PF13699 DUF4157:  Domain of un  32.3      24 0.00052   27.0   1.2   14  104-117    61-74  (79)
118 PRK10911 oligopeptidase A; Pro  32.3      21 0.00045   38.7   1.1   11  106-116   465-475 (680)
119 PF14521 Aspzincin_M35:  Lysine  32.0      20 0.00044   30.8   0.8   14  104-117    96-111 (148)
120 PRK03982 heat shock protein Ht  31.7      22 0.00048   34.0   1.1   12  105-116   126-137 (288)
121 PF02102 Peptidase_M35:  Deuter  30.3      29 0.00062   34.6   1.6   21   98-118   288-311 (359)
122 COG0501 HtpX Zn-dependent prot  30.1      24 0.00053   33.4   1.1   12  105-116   158-169 (302)
123 KOG3538|consensus               30.0      13 0.00029   41.1  -0.9   21  105-125   318-338 (845)
124 PRK03001 M48 family peptidase;  29.8      25 0.00054   33.6   1.0   12  105-116   125-136 (283)
125 PRK03072 heat shock protein Ht  29.4      25 0.00055   33.7   1.0   21   39-59     65-85  (288)
126 PRK01345 heat shock protein Ht  29.4      26 0.00055   34.2   1.1   12  105-116   125-136 (317)
127 PRK05457 heat shock protein Ht  28.3      28  0.0006   33.4   1.1   11  105-115   135-145 (284)
128 PRK10280 dipeptidyl carboxypep  28.0      28 0.00061   37.7   1.1   18   48-65    369-386 (681)
129 cd06160 S2P-M50_like_2 Unchara  27.9      29 0.00063   31.1   1.1   22  106-127    43-64  (183)
130 PF06167 Peptidase_M90:  Glucos  27.8      30 0.00065   32.7   1.2   21   36-56     39-59  (253)
131 PF04228 Zn_peptidase:  Putativ  26.9      31 0.00067   33.3   1.1   11  106-116   172-182 (292)
132 cd02639 R3H_RRM R3H domain of   26.6      37 0.00081   24.7   1.3   17  105-121    30-46  (60)
133 PRK04897 heat shock protein Ht  26.0      31 0.00067   33.3   0.9   11  105-115   138-148 (298)
134 KOG2556|consensus               25.2      34 0.00074   35.1   1.1   19  106-124   257-275 (666)
135 PRK02391 heat shock protein Ht  25.1      34 0.00074   33.0   1.1   11  105-115   134-144 (296)
136 PRK01265 heat shock protein Ht  24.4      34 0.00075   33.5   1.0   11  105-115   141-151 (324)
137 PF05506 DUF756:  Domain of unk  23.8      77  0.0017   24.4   2.7   41  326-366    47-87  (89)
138 PF14891 Peptidase_M91:  Effect  23.5      36 0.00078   30.0   0.8   18  106-123   105-122 (174)
139 cd02325 R3H R3H domain. The na  22.8      82  0.0018   21.3   2.5   30  105-134    29-58  (59)
140 PF08810 KapB:  Kinase associat  22.4      21 0.00046   29.3  -0.8   53    2-54     31-90  (112)
141 PRK04351 hypothetical protein;  21.7      47   0.001   28.8   1.1   12  105-116    62-73  (149)
142 PF01431 Peptidase_M13:  Peptid  21.2      44 0.00095   29.9   0.9   13  106-118    38-50  (206)
143 TIGR00054 RIP metalloprotease   21.1      45 0.00097   33.8   1.1   11  106-116    16-26  (420)
144 PF07108 PipA:  PipA protein;    21.0      94   0.002   27.2   2.8   52   68-128   122-177 (200)
145 PF13058 DUF3920:  Protein of u  21.0      49  0.0011   27.2   1.0   15  106-122    78-92  (126)
146 PRK15410 DgsA anti-repressor M  20.7      44 0.00095   31.7   0.8   17  104-120   142-158 (260)
147 PF13203 DUF2201_N:  Putative m  20.5      53  0.0011   31.3   1.4   20  106-126    62-81  (292)

No 1  
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=100.00  E-value=1.6e-64  Score=463.73  Aligned_cols=206  Identities=36%  Similarity=0.658  Sum_probs=194.6

Q ss_pred             CCCccC-CccccccccccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCC
Q psy2949           1 MEGDIL-IPRERDSRNLVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFG   79 (367)
Q Consensus         1 ~~GDi~-~~~q~~~r~a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~g   79 (367)
                      |||||+ +++  ++|+|+.....+||+ +|||.| +.+|+..+++.|++||++|++.|||||+|++++.+||.|.. +.|
T Consensus        24 ~eGDI~l~~~--~~R~a~~~~~~~Wp~-~VPY~i-~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e~~yi~i~~-~~G   98 (230)
T cd04282          24 FEGDILLDEG--QSRNGLIGDTYRWPF-PIPYIL-DDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFK-GSG   98 (230)
T ss_pred             ccccccCCcc--cccccccCcccCCCc-ceeEEE-CCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCCCcEEEEEc-CCC
Confidence            799999 775  468999999999999 999999 88899999999999999999999999999999999999974 689


Q ss_pred             ccCCCCCCCCCcccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCC
Q psy2949          80 CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGM  159 (367)
Q Consensus        80 c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~  159 (367)
                      |||+||+.+|.|.|+   |+.+|...|+|+|||||||||+|||+|||||+||+|+|+||.++..+||.|++...+.++|+
T Consensus        99 C~S~vG~~gg~q~is---l~~~C~~~Gti~HEl~HalGf~HEqsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~  175 (230)
T cd04282          99 CWSMVGDQQGGQNLS---IGAGCDYKATVEHEFLHALGFYHEQSRSDRDDYVKIWWDQILSGREHNFNKYDDSFSTDLNT  175 (230)
T ss_pred             eeeccCccCCeEEEE---ECCCcCCCchHHHHHHHHhCCcccccccccccceEEeecccCchHHHHhhhcCccccccCCC
Confidence            999999999999998   78899999999999999999999999999999999999999999999999999888888999


Q ss_pred             cCCCCCCcCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHhcC
Q psy2949         160 PYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKC  214 (367)
Q Consensus       160 ~YD~~SIMhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C  214 (367)
                      ||||+|||||++.+||+++..+||+|+++.+...|||+.+||..|+.+||+||+|
T Consensus       176 pYDy~SIMHY~~~aFs~~~~~pTi~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C  230 (230)
T cd04282         176 PYDYESVMHYSPFSFNKGASEPTITTKIPEFNDIIGQRLDFSDIDLERLNRMYNC  230 (230)
T ss_pred             CCCcccccccCCCccccCCCCceeeecCCcccccccccCCCCHHHHHHHHHhcCC
Confidence            9999999999999999994459999999877788999999999999999999998


No 2  
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=100.00  E-value=5.4e-62  Score=439.83  Aligned_cols=196  Identities=38%  Similarity=0.702  Sum_probs=183.5

Q ss_pred             ccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCC-CCcccc
Q psy2949          16 LVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP-IGTGID   94 (367)
Q Consensus        16 a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~-g~q~i~   94 (367)
                      |.....++||+++|||.| +.+|+..+++.|++||++|++.|||||+|++++.+||.|...+.||||+||+.+ |.|.|+
T Consensus         2 ~~~~~~~~Wp~~~VpY~i-~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~~~yi~f~~~~~GC~S~vG~~~~g~q~is   80 (200)
T cd04281           2 ATARKERIWPGGVIPYVI-DGNFTGSQRAMFKQAMRHWENFTCVTFVERTPEENYIVFTYRPCGCCSYVGRRGNGPQAIS   80 (200)
T ss_pred             CccCccCcCCCCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCceEEEECCCCCCEEEEEECCCCeeEcCCCcCCCceeee
Confidence            556678999999999999 888999999999999999999999999999999999999866689999999987 789887


Q ss_pred             cccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCccccee
Q psy2949          95 IFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAF  174 (367)
Q Consensus        95 ~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~f  174 (367)
                         |+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+..+|+||||+|||||++.+|
T Consensus        81 ---l~~~C~~~Gti~HEl~HaLGf~HEhsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aF  157 (200)
T cd04281          81 ---IGKNCDKFGIVVHELGHVIGFWHEHTRPDRDDHVTIIRENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTF  157 (200)
T ss_pred             ---cCCCcCcCchHHHHHHHHhcCcchhccccccceEEEeecccCcchhhhhhhcCccccccCCCccCCCccccCCCCcc
Confidence               78999999999999999999999999999999999999999999999999999888888999999999999999999


Q ss_pred             ccCCCcceeeeCCCC--CCCccCccccccHHHHHHHHHHhcCC
Q psy2949         175 SKDGVSKTIVPLYPG--AEDTMGQRDAMSRVDLAKLNRLYKCP  215 (367)
Q Consensus       175 s~~~~~~ti~~~~~~--~~~~IG~r~~lS~~D~~~in~lY~C~  215 (367)
                      ++++..+||+|+++.  +...|||+.+||+.|+.+||+||+|+
T Consensus       158 s~~~~~~Ti~p~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C~  200 (200)
T cd04281         158 SRGMFLDTILPKRDPNGVRPEIGQRTRLSEGDIIQANKLYKCP  200 (200)
T ss_pred             ccCCCCCceEECCCcccccccccccCCCCHHHHHHHHHhcCCC
Confidence            999877999998774  45779999999999999999999995


No 3  
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=100.00  E-value=3.9e-56  Score=395.81  Aligned_cols=177  Identities=42%  Similarity=0.773  Sum_probs=165.3

Q ss_pred             eeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCCCCcccccccCC-Ccccccc
Q psy2949          28 TVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGG-RVCFLKG  106 (367)
Q Consensus        28 ~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~-~~c~~~g  106 (367)
                      +|||.| +.+|++.+++.|++||++|++.|||||++++++.+||.|.. +.||||+||+.++.|.|+   ++ .+|...|
T Consensus         5 ~IpY~i-~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~~~yi~~~~-~~gC~S~vG~~gg~q~i~---l~~~~C~~~G   79 (182)
T cd04283           5 YVPYVI-SPQYSENERAVIEKAMQEFETLTCVRFVPRTTERDYLNIES-RSGCWSYIGRQGGRQTVS---LQKQGCMYKG   79 (182)
T ss_pred             EEEEEE-CCCCCHHHHHHHHHHHHHHHhCCceeeEECCCCCcEEEEEc-CCCceEecCccCCceeEe---cCCCCcCccc
Confidence            799999 88899999999999999999999999999998899999974 689999999999999998   55 6899999


Q ss_pred             hhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCcceeeeC
Q psy2949         107 KIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVSKTIVPL  186 (367)
Q Consensus       107 ~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~~ti~~~  186 (367)
                      +|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++.   .++++||||+|||||++.+||+++. +||+|+
T Consensus        80 ~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~  155 (182)
T cd04283          80 IIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPI  155 (182)
T ss_pred             hHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEEC
Confidence            999999999999999999999999999999999999999999864   3578999999999999999999887 999999


Q ss_pred             CCCCCCccCccccccHHHHHHHHHHhcC
Q psy2949         187 YPGAEDTMGQRDAMSRVDLAKLNRLYKC  214 (367)
Q Consensus       187 ~~~~~~~IG~r~~lS~~D~~~in~lY~C  214 (367)
                      ++.. ..||||.+||+.|+.+||+||.|
T Consensus       156 ~~~~-~~iGqr~~lS~~Di~~iN~~Y~C  182 (182)
T cd04283         156 PDPN-VPIGQRQGMSNLDILRINKLYNC  182 (182)
T ss_pred             Cccc-ccccCCCCCCHHHHHHHHHhcCC
Confidence            7643 57999999999999999999988


No 4  
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=100.00  E-value=1.8e-56  Score=403.66  Aligned_cols=189  Identities=44%  Similarity=0.855  Sum_probs=151.0

Q ss_pred             CCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCCCccCCCCCCCCCcccccccCCC
Q psy2949          22 QLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGGR  100 (367)
Q Consensus        22 ~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~~  100 (367)
                      ++||+++|||.| ++++++.+++.|++||++|++.|||+|++++ ...++|.|. .+.||||+||+.++.|.|+   |+.
T Consensus         1 ~~Wp~~~IpY~~-~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~-~~~gC~S~vG~~~g~q~i~---l~~   75 (191)
T PF01400_consen    1 KKWPNGTIPYYI-DPSFSSSQRQRIRKAMDEWEKNTCIRFVERTENEDDYISFS-NGSGCWSYVGRQGGEQTIN---LGD   75 (191)
T ss_dssp             -S-GGGEEEEEE-ETTS-HHHHHHHHHHHHHHHHHSSEEEEE-SSSSSSEEEEE-SSSSEEEESS--SSEEEEE---E-T
T ss_pred             CcCCCCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCCEEEEECCCCCceEEEee-cCccccchhhhcCcceeEE---ecc
Confidence            589999999999 7889999999999999999999999999999 888899886 5699999999999999998   788


Q ss_pred             cccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCc
Q psy2949         101 VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVS  180 (367)
Q Consensus       101 ~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~  180 (367)
                      +|...|+|+|||||||||.|||+|||||+||.|+|+||.++...||.+.+...+..+++|||++|||||++.+|++++..
T Consensus        76 ~c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~  155 (191)
T PF01400_consen   76 GCFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQ  155 (191)
T ss_dssp             TC-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS
T ss_pred             eeCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCC
Confidence            99999999999999999999999999999999999999999999999998888888999999999999999999999977


Q ss_pred             ceeeeCCCCCCCccCccccccHHHHHHHHHHhcCC
Q psy2949         181 KTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKCP  215 (367)
Q Consensus       181 ~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C~  215 (367)
                      +||+|++..+...||++.+||+.|+++||.+|+|+
T Consensus       156 ~ti~~~~~~~~~~iG~~~~lS~~D~~~in~~Y~C~  190 (191)
T PF01400_consen  156 PTITPKDPKYQETIGQRNRLSFTDIKQINKMYNCP  190 (191)
T ss_dssp             -SEEESSTTCTS-GGG-SS--HHHHHHHHHHTT--
T ss_pred             CeEEecCCCCcccccccCCCCHHHHHHHHHhcCCC
Confidence            99999998777899999999999999999999996


No 5  
>KOG3714|consensus
Probab=100.00  E-value=2.6e-55  Score=437.59  Aligned_cols=202  Identities=45%  Similarity=0.813  Sum_probs=188.5

Q ss_pred             cccccccccCCCCcCCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCCCccCCCCCCCC
Q psy2949          11 RDSRNLVLYQAQLWPDKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGFGCASPVGYFPI   89 (367)
Q Consensus        11 ~~~r~a~~~~~~~Wp~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~gc~S~vG~~~g   89 (367)
                      +..|+++.+..++||+++|||.| +++|+..+|+.|+.||++|+++|||+|+|++ ...+++.|.... ||||+||+.++
T Consensus        69 ~~~r~~~~~~~~~Wp~~~ipY~i-~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~~-gC~S~VGr~gg  146 (411)
T KOG3714|consen   69 RSRRNGTSNPERRWPNGVIPYYI-DGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTGG-GCYSYVGRRGG  146 (411)
T ss_pred             hhhhhcccChhhcCCCCeeeeEE-CCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCCCcceEEEeCCC-cceeeeCccCC
Confidence            46788888999999999999999 7779999999999999999999999999998 567888886554 99999999998


Q ss_pred             C-cccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcC
Q psy2949          90 G-TGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMH  168 (367)
Q Consensus        90 ~-q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMh  168 (367)
                      . |.++   ++.+|...|+|+|||||||||+|||+|||||+||+|+|+||.++..+||.|++......+++||||+||||
T Consensus       147 ~~q~~s---l~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMH  223 (411)
T KOG3714|consen  147 GQQLLS---LGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMH  223 (411)
T ss_pred             Ccccee---cCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCCcccc
Confidence            6 7666   89999999999999999999999999999999999999999999999999999988999999999999999


Q ss_pred             cccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHhcCCCC
Q psy2949         169 YSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYKCPKN  217 (367)
Q Consensus       169 Y~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~C~~n  217 (367)
                      |++.+|++++..+||+|++...+.+||||..+|+.|+.+||+||.|+..
T Consensus       224 Y~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~  272 (411)
T KOG3714|consen  224 YAPYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPEC  272 (411)
T ss_pred             cCCcccCcCCCCCceecccccccccccccCcCCHHHHHHHHhhcCCCcc
Confidence            9999999999988999999987778999999999999999999999643


No 6  
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=100.00  E-value=3.6e-54  Score=385.26  Aligned_cols=180  Identities=44%  Similarity=0.815  Sum_probs=170.0

Q ss_pred             CCeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCCccCCCCCCCCCcccccccCCCccccc
Q psy2949          26 DKTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFPIGTGIDIFLGGRVCFLK  105 (367)
Q Consensus        26 ~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~g~q~i~~~~l~~~c~~~  105 (367)
                      +++|||.| +.++++.+++.|++||++|++.|||+|+|++.+.++|.|... .||||.||+.++.|.|+   |+.+|...
T Consensus         1 ~~~VpY~i-~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~~~~~I~f~~~-~Gc~S~vG~~~~~q~i~---l~~~c~~~   75 (180)
T cd04280           1 NGTVPYVI-DGSFDESDRSLILRAMREIESNTCIRFVPRTTEKDYIRIVKG-SGCWSYVGRVGGRQVVS---LGSGCFSL   75 (180)
T ss_pred             CCEEEEEE-CCCCCHHHHHHHHHHHHHHHhCCcceEEECCCCCcEEEEEcC-CCcceecCccCCceeEE---eCCCcCcC
Confidence            57999999 777999999999999999999999999999988999999765 89999999999999998   78899999


Q ss_pred             chhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCcceeee
Q psy2949         106 GKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVSKTIVP  185 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~~ti~~  185 (367)
                      |+|+|||||||||+|||+|||||+||.|+|+||.++...||.|++...+..+++||||+|||||++.+|++++. +||+|
T Consensus        76 g~v~HE~~HalG~~HEh~R~DRD~yv~i~~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~  154 (180)
T cd04280          76 GTIVHELMHALGFYHEQSRPDRDDYVTINWENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVP  154 (180)
T ss_pred             chhHHHHHHHhcCcchhcccccCCeEEEeecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEE
Confidence            99999999999999999999999999999999999999999999988888889999999999999999999977 99999


Q ss_pred             CCCCCCCccCccccccHHHHHHHHHHh
Q psy2949         186 LYPGAEDTMGQRDAMSRVDLAKLNRLY  212 (367)
Q Consensus       186 ~~~~~~~~IG~r~~lS~~D~~~in~lY  212 (367)
                      +++.+.. +|++.+||+.|+++||+||
T Consensus       155 ~~~~~~~-~g~~~~~S~~D~~~in~~Y  180 (180)
T cd04280         155 KDPGYQI-IGQREGLSFLDIKKINKMY  180 (180)
T ss_pred             CCchhhc-ccCCCCCCHHHHHHHHHhC
Confidence            9987544 8999999999999999998


No 7  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=100.00  E-value=9.2e-41  Score=303.23  Aligned_cols=168  Identities=26%  Similarity=0.292  Sum_probs=132.9

Q ss_pred             CCcCCCeeE-EEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEe-cCCCCccCCCCCCC-----CCcccc
Q psy2949          22 QLWPDKTVY-YNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFR-NTGFGCASPVGYFP-----IGTGID   94 (367)
Q Consensus        22 ~~Wp~~~Vp-Y~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~-~~~~gc~S~vG~~~-----g~q~i~   94 (367)
                      ++||+++|+ |.| ..++++.+|+.|++||++|++.|||+|++++.....|+|. ..+.||||+||+.+     +.+.|+
T Consensus         1 ~~W~~~~~~~~~f-~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~~~Gc~S~vG~~~~~~~~~~~t~~   79 (198)
T cd04327           1 KLWRNGTVLRIAF-LGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGDGYWSYVGTDALLIGADAPTMN   79 (198)
T ss_pred             CCCCCCCeEEEEe-CCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEecCCCCCCCcCCcccccCCCCceee
Confidence            589999655 777 6667899999999999999999999999998434444443 34589999999953     345565


Q ss_pred             cccCCC------cccccchhHHHHHhhhcccccccCCCCCcceEEeeCC--cCC-----------CCC--cccccCCCCc
Q psy2949          95 IFLGGR------VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLREN--IGP-----------GHE--FNLERRPTGS  153 (367)
Q Consensus        95 ~~~l~~------~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~n--i~~-----------~~~--~nf~k~~~~~  153 (367)
                         ++.      +|...++|+|||||||||.|||+|||||    |.|++  |.+           +..  .+|.|+..  
T Consensus        80 ---l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh~rpdrd----i~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~--  150 (198)
T cd04327          80 ---LGWFTDDTPDPEFSRVVLHEFGHALGFIHEHQSPAAN----IPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD--  150 (198)
T ss_pred             ---eeeecCCCchhhHHHHHHHHHHHHhcCcccccCCCCC----CCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--
Confidence               333      4556699999999999999999999999    67773  321           112  56888764  


Q ss_pred             ccccCCcCCCCCCcCcc-cceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949         154 VRTFGMPYDYGSIMHYS-GIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  212 (367)
Q Consensus       154 ~~~~g~~YD~~SIMhY~-~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY  212 (367)
                      ...+++||||+|||||+ +.+|++++. +|            |++.+||..|+++|+.||
T Consensus       151 ~~~~~~~yD~~SIMHY~~~~~~t~~g~-~t------------~~~~~lS~~D~~~i~~~Y  197 (198)
T cd04327         151 GDVAYSPYDPDSIMHYPFPGSLTLDGE-EV------------PPNRTLSDKDKAFMRLLY  197 (198)
T ss_pred             ccccCCCCCcHHHHcCCCcHHhhcCCc-cc------------CCCCCCCHHHHHHHHHhC
Confidence            33578999999999999 789998877 33            677889999999999999


No 8  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.96  E-value=5.8e-30  Score=219.67  Aligned_cols=135  Identities=34%  Similarity=0.610  Sum_probs=115.9

Q ss_pred             CCCcCCCeeEEEEcCCCCCHHH-HHHHHHHHHHHHHcCCcEEEEcC-CCCceEEEecCCC-Ccc-CCCCCCCCCcccccc
Q psy2949          21 AQLWPDKTVYYNFEDSEFTIYE-KTLVENAIQDLRMHTCVRFVPRT-NQDTYLRFRNTGF-GCA-SPVGYFPIGTGIDIF   96 (367)
Q Consensus        21 ~~~Wp~~~VpY~~~~~~~~~~~-~~~I~~A~~~w~~~tci~F~~~~-~~~~~I~i~~~~~-gc~-S~vG~~~g~q~i~~~   96 (367)
                      .++||+++|+|+|+...+++.+ ++.|++||+.|++.+||+|++++ +...+|.|..... ||+ |++|+.++.+.++  
T Consensus         2 ~~~W~~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~~~g~~~a~~g~~~g~~~~~--   79 (140)
T smart00235        2 SKKWPKGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDGSGCTLSHAGRPGGDQHFS--   79 (140)
T ss_pred             CCcCCCCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECCCCCcceeeeecCCCceEEE--
Confidence            4789999999999326778887 99999999999999999999987 5677888876543 998 9999988888887  


Q ss_pred             cCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCC
Q psy2949          97 LGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGS  165 (367)
Q Consensus        97 ~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~S  165 (367)
                       ++.+|...++++|||||||||.|||+|||||+|+.|+|+++    ..+|.+..   .+.++.+|||+|
T Consensus        80 -~~~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~----~~~~~~~~---~~~~~~~~~yg~  140 (140)
T smart00235       80 -LGNGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNI----TRNFDLSN---DDSLGIPYDYGS  140 (140)
T ss_pred             -ccCCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhh----hhcccccc---ccCCCchhccCc
Confidence             57789999999999999999999999999999999999998    35666543   245778999986


No 9  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.96  E-value=5.8e-29  Score=218.70  Aligned_cols=145  Identities=25%  Similarity=0.322  Sum_probs=119.6

Q ss_pred             CeeEEEEcCCCCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC-CceEEEecC-----CCCccCCCCCC--CCCcccccccC
Q psy2949          27 KTVYYNFEDSEFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTYLRFRNT-----GFGCASPVGYF--PIGTGIDIFLG   98 (367)
Q Consensus        27 ~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~I~i~~~-----~~gc~S~vG~~--~g~q~i~~~~l   98 (367)
                      .+|+|.| +..++...|+.|++||+.|++.+||+|++++.. ...|+|...     +.||+|++|+.  ++.+.|.   +
T Consensus         2 ~~I~y~i-~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i~---~   77 (165)
T cd04268           2 KPITYYI-DDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEIL---L   77 (165)
T ss_pred             CCEEEEE-cCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCCccEE---e
Confidence            4799999 778899999999999999999999999999853 566666432     58999999985  4455555   3


Q ss_pred             CCccc-----------ccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCc
Q psy2949          99 GRVCF-----------LKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIM  167 (367)
Q Consensus        99 ~~~c~-----------~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIM  167 (367)
                      +..|.           ..++++|||||||||.|||+|||||.++                       ..++.+||+.|||
T Consensus        78 ~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~-----------------------~~~~~~~~~~SvM  134 (165)
T cd04268          78 ARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNV-----------------------DLLAEKGDTSSVM  134 (165)
T ss_pred             eEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcc-----------------------hhhccCCCCcccC
Confidence            44443           4599999999999999999999999876                       3467899999999


Q ss_pred             CcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949         168 HYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  212 (367)
Q Consensus       168 hY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY  212 (367)
                      ||....|+.+              ..+|++..++..|+..|+.||
T Consensus       135 ~y~~~~~~~~--------------~~~~~~~~~~~~Di~ai~~lY  165 (165)
T cd04268         135 DYAPSNFSIQ--------------LGDGQKYTIGPYDIAAIKKLY  165 (165)
T ss_pred             CCCccccccc--------------cccccCCCCCHHHHHHHHhcC
Confidence            9999887655              125778889999999999998


No 10 
>KOG3714|consensus
Probab=99.88  E-value=1.4e-23  Score=209.66  Aligned_cols=120  Identities=38%  Similarity=0.560  Sum_probs=105.1

Q ss_pred             CCccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCC
Q psy2949         191 EDTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGA  270 (367)
Q Consensus       191 ~~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~  270 (367)
                      .|++|..|+.|+.|++....+-  .+||.++...||++++...++++++||||+|||||++++||+++..+||+|+++..
T Consensus       168 ~HaLGf~HehsR~DRD~yV~I~--~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMHY~~~afs~~~~~~ti~~~~~~~  245 (411)
T KOG3714|consen  168 MHALGFWHEHSRPDRDNYVSIN--WDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMHYAPYAFSKNGSLPTIVPKDNGF  245 (411)
T ss_pred             HHHhhhhhccCcccccCceEEe--eccCChhhhhhhhhcChhhhhccCCcccCCcccccCCcccCcCCCCCceecccccc
Confidence            4889999999999999855544  48999999999999999999999999999999999999999999999999999888


Q ss_pred             ccccccccCCCHHHHHHHhhhccCCCCcccCccccccccCCCC
Q psy2949         271 EDTMGQRDAMSRVDLAKLNRLYKCPKNYYQGFDIQGFYSTSGP  313 (367)
Q Consensus       271 ~~~~gqr~~~s~~D~~~in~~Y~C~~~~~~~~~~~g~~~~~~~  313 (367)
                      +.+||||.++|+.|+.+||+||+|+.++...+ .+|..+++++
T Consensus       246 ~~~mGqr~~~S~~Di~~iN~~Y~C~~~~~~~~-~~~g~~~~~~  287 (411)
T KOG3714|consen  246 QNTMGQRERLSFYDIRKINKLYCCPECCELLC-NNGGTNPSNC  287 (411)
T ss_pred             cccccccCcCCHHHHHHHHhhcCCCccCcccc-cCCCCCCccc
Confidence            77899999999999999999999999887444 3434444333


No 11 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.88  E-value=1.3e-23  Score=190.02  Aligned_cols=102  Identities=31%  Similarity=0.540  Sum_probs=92.5

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCC--
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPG--  269 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~--  269 (367)
                      |++|..|+.|+.|++....+-  .+||.++...||.+.+...++++++||||+|||||++.+||+++..+||+|+.+.  
T Consensus        97 HaLGf~HEhsRpDRD~yV~I~--~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aFs~~~~~~Ti~p~~~~~~  174 (200)
T cd04281          97 HVIGFWHEHTRPDRDDHVTII--RENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTFSRGMFLDTILPKRDPNG  174 (200)
T ss_pred             HHhcCcchhccccccceEEEe--ecccCcchhhhhhhcCccccccCCCccCCCccccCCCCccccCCCCCceEECCCccc
Confidence            789999999999999855543  5789999999999999888899999999999999999999999878999998763  


Q ss_pred             CccccccccCCCHHHHHHHhhhccCC
Q psy2949         270 AEDTMGQRDAMSRVDLAKLNRLYKCP  295 (367)
Q Consensus       270 ~~~~~gqr~~~s~~D~~~in~~Y~C~  295 (367)
                      +..+||||.+||+.|+++||+||+||
T Consensus       175 ~~~~iGqr~~lS~~Di~~iN~~Y~C~  200 (200)
T cd04281         175 VRPEIGQRTRLSEGDIIQANKLYKCP  200 (200)
T ss_pred             ccccccccCCCCHHHHHHHHHhcCCC
Confidence            45689999999999999999999997


No 12 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.87  E-value=2.6e-23  Score=191.26  Aligned_cols=101  Identities=33%  Similarity=0.580  Sum_probs=93.7

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE  271 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~  271 (367)
                      |++|..|+.|+.|++....+.  .+||.++...||+|++...++++++||||+|||||++++||+++..+||+|+.+.+.
T Consensus       130 HalGf~HEqsRpDRD~yV~I~--~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SIMHY~~~aFs~~~~~pTi~~~~~~~~  207 (230)
T cd04282         130 HALGFYHEQSRSDRDDYVKIW--WDQILSGREHNFNKYDDSFSTDLNTPYDYESVMHYSPFSFNKGASEPTITTKIPEFN  207 (230)
T ss_pred             HHhCCcccccccccccceEEe--ecccCchHHHHhhhcCccccccCCCCCCcccccccCCCccccCCCCceeeecCCccc
Confidence            789999999999999966665  578999999999999998889999999999999999999999866799999988877


Q ss_pred             cccccccCCCHHHHHHHhhhccC
Q psy2949         272 DTMGQRDAMSRVDLAKLNRLYKC  294 (367)
Q Consensus       272 ~~~gqr~~~s~~D~~~in~~Y~C  294 (367)
                      .+||||.+||+.|+++||+||+|
T Consensus       208 ~~iGqr~~lS~~Di~~iN~~Y~C  230 (230)
T cd04282         208 DIIGQRLDFSDIDLERLNRMYNC  230 (230)
T ss_pred             ccccccCCCCHHHHHHHHHhcCC
Confidence            89999999999999999999999


No 13 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.83  E-value=1.1e-21  Score=174.86  Aligned_cols=96  Identities=34%  Similarity=0.644  Sum_probs=85.7

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE  271 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~  271 (367)
                      |++|..++.|+.|++....+.  .+||.++...||++.+.   .+++.||||+|||||++.+||+++. +||+|+.+. .
T Consensus        87 HaLG~~HEhsRpDRD~yV~I~--~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~~~~-~  159 (182)
T cd04283          87 HALGFYHEQTRSDRDKYVRIN--WENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPIPDP-N  159 (182)
T ss_pred             HHhCCcccccccccCceEEEe--hhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEECCcc-c
Confidence            789999999999999966655  58899999999999865   4678999999999999999999875 899998654 3


Q ss_pred             cccccccCCCHHHHHHHhhhccC
Q psy2949         272 DTMGQRDAMSRVDLAKLNRLYKC  294 (367)
Q Consensus       272 ~~~gqr~~~s~~D~~~in~~Y~C  294 (367)
                      .+||||.+||+.|+++||+||+|
T Consensus       160 ~~iGqr~~lS~~Di~~iN~~Y~C  182 (182)
T cd04283         160 VPIGQRQGMSNLDILRINKLYNC  182 (182)
T ss_pred             ccccCCCCCCHHHHHHHHHhcCC
Confidence            58999999999999999999999


No 14 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.81  E-value=1.3e-19  Score=159.36  Aligned_cols=145  Identities=28%  Similarity=0.327  Sum_probs=107.6

Q ss_pred             eeEEEEcCC-------CCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC--CceEEEecC------CCCccCCCCCC--CCC
Q psy2949          28 TVYYNFEDS-------EFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ--DTYLRFRNT------GFGCASPVGYF--PIG   90 (367)
Q Consensus        28 ~VpY~~~~~-------~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~--~~~I~i~~~------~~gc~S~vG~~--~g~   90 (367)
                      .|+|.|...       .++...++.|+.|++.|+..+||+|+++..+  ...|.|...      +.+|+|++|..  +..
T Consensus         2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~   81 (167)
T cd00203           2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR   81 (167)
T ss_pred             EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence            466766322       2677899999999999999999999999854  555555321      46789999885  322


Q ss_pred             cccccccCCCc----ccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCC
Q psy2949          91 TGIDIFLGGRV----CFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSI  166 (367)
Q Consensus        91 q~i~~~~l~~~----c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SI  166 (367)
                      ..+.   +...    .....+++|||||||||.|++.+++||.+|++.                   ......++|..||
T Consensus        82 ~~~~---~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~-------------------~~~~~~~~~~~si  139 (167)
T cd00203          82 GVGV---LQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTID-------------------DTLNAEDDDYYSV  139 (167)
T ss_pred             CcEE---EecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcccc-------------------ccccCCCCCCCeE
Confidence            2222   1112    245589999999999999999999999988772                   1124578999999


Q ss_pred             cCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHHHHh
Q psy2949         167 MHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  212 (367)
Q Consensus       167 MhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY  212 (367)
                      |||....++                  ++++..+|..|+.+|+.+|
T Consensus       140 M~y~~~~~~------------------~~~~~~fS~~d~~~i~~~Y  167 (167)
T cd00203         140 MSYTKGSFS------------------DGQRKDFSQCDIDQINKLY  167 (167)
T ss_pred             eccCccccC------------------cccCCCcCHHHHHHHHhhC
Confidence            999875432                  4566779999999999987


No 15 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.78  E-value=3.5e-20  Score=167.33  Aligned_cols=103  Identities=37%  Similarity=0.669  Sum_probs=73.0

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE  271 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~  271 (367)
                      |++|..++.++.|++....+.  .+||.++...||.+.+......++.||||+|||||++.+|++++..+||+|+++...
T Consensus        89 HaLG~~HEh~RpDRd~yi~i~--~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~~ti~~~~~~~~  166 (191)
T PF01400_consen   89 HALGFWHEHQRPDRDNYITIN--WDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQPTITPKDPKYQ  166 (191)
T ss_dssp             HHHTB--GGGSTTGGGTEEE---GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS-SEEESSTTCT
T ss_pred             HHHhhhhhhhccccccEEEEe--hhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCCCeEEecCCCCc
Confidence            789999999999999866665  478999999999999877788899999999999999999999998899999988778


Q ss_pred             cccccccCCCHHHHHHHhhhccCCC
Q psy2949         272 DTMGQRDAMSRVDLAKLNRLYKCPK  296 (367)
Q Consensus       272 ~~~gqr~~~s~~D~~~in~~Y~C~~  296 (367)
                      .+|||+.+||+.|+++||.||+|+.
T Consensus       167 ~~iG~~~~lS~~D~~~in~~Y~C~~  191 (191)
T PF01400_consen  167 ETIGQRNRLSFTDIKQINKMYNCPE  191 (191)
T ss_dssp             S-GGG-SS--HHHHHHHHHHTT---
T ss_pred             ccccccCCCCHHHHHHHHHhcCCCC
Confidence            8999999999999999999999973


No 16 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.75  E-value=1.5e-19  Score=161.75  Aligned_cols=97  Identities=39%  Similarity=0.672  Sum_probs=87.2

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCCcccccccccccCCCceeeeeCCCCCc
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAE  271 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~  271 (367)
                      |++|..++.++.|++....+.  .+||.++...||.+.+.....+++.||||+|||||++.+|++++. +||+|+++...
T Consensus        84 HalG~~HEh~R~DRD~yv~i~--~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~~~~~~~  160 (180)
T cd04280          84 HALGFYHEQSRPDRDDYVTIN--WENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVPKDPGYQ  160 (180)
T ss_pred             HHhcCcchhcccccCCeEEEe--ecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEECCchhh
Confidence            789999999999999855554  478999999999999998888899999999999999999999864 89999987764


Q ss_pred             cccccccCCCHHHHHHHhhhc
Q psy2949         272 DTMGQRDAMSRVDLAKLNRLY  292 (367)
Q Consensus       272 ~~~gqr~~~s~~D~~~in~~Y  292 (367)
                      . ||||.+||+.|+++||+||
T Consensus       161 ~-~g~~~~~S~~D~~~in~~Y  180 (180)
T cd04280         161 I-IGQREGLSFLDIKKINKMY  180 (180)
T ss_pred             c-ccCCCCCCHHHHHHHHHhC
Confidence            3 9999999999999999998


No 17 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.70  E-value=9.6e-17  Score=140.42  Aligned_cols=99  Identities=18%  Similarity=0.094  Sum_probs=66.8

Q ss_pred             CcCCCeeEEEEcCCCC----CHHHHHHHHHHHHHHHHcCCcEEEEcCCC-Cce--EEEecCCCCc-cCCCCCC-------
Q psy2949          23 LWPDKTVYYNFEDSEF----TIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTY--LRFRNTGFGC-ASPVGYF-------   87 (367)
Q Consensus        23 ~Wp~~~VpY~~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~--I~i~~~~~gc-~S~vG~~-------   87 (367)
                      +||+++|+|+| +...    ....++.|++||+.|+..+||+|++++.. ..-  |.+.....+| +++-|..       
T Consensus         1 kW~~~~itY~i-~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a~   79 (157)
T cd04278           1 KWSKTNLTYRI-LNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHAF   79 (157)
T ss_pred             CCCCCceeEEE-ECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccccccc
Confidence            69999999999 5443    37889999999999999999999998743 333  4444444454 3333321       


Q ss_pred             -----CCCcccc---cccC---CCcccccchhHHHHHhhhcccccc
Q psy2949          88 -----PIGTGID---IFLG---GRVCFLKGKIQHEILHSLGFWHEH  122 (367)
Q Consensus        88 -----~g~q~i~---~~~l---~~~c~~~g~i~HEigHALG~~HEh  122 (367)
                           .+...++   ..++   ..+.....+++|||||||||.|++
T Consensus        80 ~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~  125 (157)
T cd04278          80 FPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS  125 (157)
T ss_pred             CCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence                 1111222   1111   124445689999999999999974


No 18 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.55  E-value=4.4e-14  Score=126.95  Aligned_cols=141  Identities=17%  Similarity=0.141  Sum_probs=91.8

Q ss_pred             CCCHHHHHHHHHHHHHHHHcCCcEEEEcCCC-CceEEEecCC-----CCccCCCCCC------CCCcccccccC----CC
Q psy2949          37 EFTIYEKTLVENAIQDLRMHTCVRFVPRTNQ-DTYLRFRNTG-----FGCASPVGYF------PIGTGIDIFLG----GR  100 (367)
Q Consensus        37 ~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~I~i~~~~-----~gc~S~vG~~------~g~q~i~~~~l----~~  100 (367)
                      .+++.+++.|++||+.|++.+||+|++++.. ...|+|....     .+++++.+..      .+.-.++....    ..
T Consensus        30 ~~~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~  109 (186)
T cd04277          30 ALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSP  109 (186)
T ss_pred             cCCHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCC
Confidence            4578999999999999999999999999843 5677775432     3445544432      22333442111    23


Q ss_pred             cccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCCcccccCCcCCCCCCcCcccceeccCCCc
Q psy2949         101 VCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTGSVRTFGMPYDYGSIMHYSGIAFSKDGVS  180 (367)
Q Consensus       101 ~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~~~~~~g~~YD~~SIMhY~~~~fs~~~~~  180 (367)
                      ++....+++|||||||||.|+|..++.+.+-.                       .....-.-.|||-|.......  . 
T Consensus       110 g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~-----------------------~~~~~~~~~SVMSY~~~~~~~--~-  163 (186)
T cd04277         110 GSYGYQTIIHEIGHALGLEHPGDYNGGDPVPP-----------------------TYALDSREYTVMSYNSGYGNG--A-  163 (186)
T ss_pred             ChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCc-----------------------cccccCcceEEEeecCCCCCC--c-
Confidence            66677999999999999999999887653210                       001112345899996542110  0 


Q ss_pred             ceeeeCCCCCCCccCccccccHHHHHHHHHHhc
Q psy2949         181 KTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYK  213 (367)
Q Consensus       181 ~ti~~~~~~~~~~IG~r~~lS~~D~~~in~lY~  213 (367)
                       .         ..-+....+...|++.|..||+
T Consensus       164 -~---------~~~~~~~~~~~~DI~AlQ~lYG  186 (186)
T cd04277         164 -S---------AGGGYPQTPMLLDIAALQYLYG  186 (186)
T ss_pred             -c---------ccCcccCCccHHHHHHHHHhhC
Confidence             0         0113335678899999999984


No 19 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.50  E-value=1.7e-13  Score=119.74  Aligned_cols=98  Identities=21%  Similarity=0.244  Sum_probs=60.9

Q ss_pred             eeEEEEcCCC-----CCHHHHHHHHHHHHHHHHcCCcEEEEcCC---CCceEEEecC-------CCCccCCCCCCC----
Q psy2949          28 TVYYNFEDSE-----FTIYEKTLVENAIQDLRMHTCVRFVPRTN---QDTYLRFRNT-------GFGCASPVGYFP----   88 (367)
Q Consensus        28 ~VpY~~~~~~-----~~~~~~~~I~~A~~~w~~~tci~F~~~~~---~~~~I~i~~~-------~~gc~S~vG~~~----   88 (367)
                      .|+|+++...     .....++.|++||+.|++.+||+|++...   ..+.+.++..       +..|.++.+..-    
T Consensus         3 ~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~~   82 (156)
T cd04279           3 PIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGN   82 (156)
T ss_pred             CeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCCc
Confidence            5788884432     25678999999999999999999999874   3333222221       233433333210    


Q ss_pred             --C--CcccccccCCC--cccccchhHHHHHhhhcccccccCC
Q psy2949          89 --I--GTGIDIFLGGR--VCFLKGKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus        89 --g--~q~i~~~~l~~--~c~~~g~i~HEigHALG~~HEh~rp  125 (367)
                        +  ...++......  .....++++|||||||||.||++++
T Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~  125 (156)
T cd04279          83 RKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRP  125 (156)
T ss_pred             ccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCc
Confidence              1  11122100000  1223589999999999999999865


No 20 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.47  E-value=7.9e-13  Score=114.59  Aligned_cols=100  Identities=20%  Similarity=0.246  Sum_probs=61.8

Q ss_pred             CcCCCeeEEEEcCCC--C-CHHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEe--cC-CCCccCCCCCC--------C
Q psy2949          23 LWPDKTVYYNFEDSE--F-TIYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFR--NT-GFGCASPVGYF--------P   88 (367)
Q Consensus        23 ~Wp~~~VpY~~~~~~--~-~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~--~~-~~gc~S~vG~~--------~   88 (367)
                      +||..+|.|+|....  + .+..+++|++||+.|+..+.|+|++......-|.|.  .. ...+..+-|..        +
T Consensus         1 ~W~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~adi~i~~~~~~~~~~~~~~~~~~~~~~~~~~   80 (154)
T PF00413_consen    1 KWPKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGNADIRISFGSNNHGDGYSFDGSGGTLAHAYFP   80 (154)
T ss_dssp             SSSSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSSCSEEEEEESSSSSSSS-CSSSSSESEEEEES
T ss_pred             CCCCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCCcceeeeeeccccCcccccccceeeeeccccc
Confidence            699999999993322  1 256889999999999999999999998433444442  22 12222221211        1


Q ss_pred             -----CCcccc---cccCCCccc-ccchhHHHHHhhhcccccc
Q psy2949          89 -----IGTGID---IFLGGRVCF-LKGKIQHEILHSLGFWHEH  122 (367)
Q Consensus        89 -----g~q~i~---~~~l~~~c~-~~g~i~HEigHALG~~HEh  122 (367)
                           +...++   ......... ...+++|||||||||.|++
T Consensus        81 ~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~  123 (154)
T PF00413_consen   81 NNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN  123 (154)
T ss_dssp             SSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred             cccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence                 111111   000111222 2379999999999999975


No 21 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.24  E-value=2.1e-12  Score=117.35  Aligned_cols=87  Identities=21%  Similarity=0.220  Sum_probs=64.2

Q ss_pred             CccCccccccHHHHH------HHHHHhcCCCC-cccCcc--cccccCCCCcccccccCCCCCCccccc-ccccccCCCce
Q psy2949         192 DTMGQRDAMSRVDLA------KLNRLYKCPKN-YYQGHE--FNLERRPAGSVRTFSMPYDYGSIMHYS-GIAFSKDGVSK  261 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~------~in~lY~C~~n-I~~g~~--~Nf~~~~~~~~~~~~~pYDy~SiMhY~-~~~fs~~~~~~  261 (367)
                      |++|..|+.|+.|++      .|...|.-..+ +..+..  .+|.+.+.  ...++.||||+|||||+ +.+|++++. +
T Consensus       102 HaLG~~HEh~rpdrdi~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~--~~~~~~~yD~~SIMHY~~~~~~t~~g~-~  178 (198)
T cd04327         102 HALGFIHEHQSPAANIPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD--GDVAYSPYDPDSIMHYPFPGSLTLDGE-E  178 (198)
T ss_pred             HHhcCcccccCCCCCCCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--ccccCCCCCcHHHHcCCCcHHhhcCCc-c
Confidence            788888999999886      22233322212 222222  67888764  34689999999999999 899998875 2


Q ss_pred             eeeeCCCCCccccccccCCCHHHHHHHhhhcc
Q psy2949         262 TIVPLYPGAEDTMGQRDAMSRVDLAKLNRLYK  293 (367)
Q Consensus       262 ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y~  293 (367)
                      |            |++.+||..|+++||+||.
T Consensus       179 t------------~~~~~lS~~D~~~i~~~Yp  198 (198)
T cd04327         179 V------------PPNRTLSDKDKAFMRLLYP  198 (198)
T ss_pred             c------------CCCCCCCHHHHHHHHHhCc
Confidence            2            6778999999999999994


No 22 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=98.97  E-value=7.1e-09  Score=93.93  Aligned_cols=178  Identities=22%  Similarity=0.264  Sum_probs=95.2

Q ss_pred             CCCccC-Ccccccccc----cc---ccCCCCcC--CCeeEEEEcCC----CCCHHHHHHHHHHHHHHHH-cCCcEEEEc-
Q psy2949           1 MEGDIL-IPRERDSRN----LV---LYQAQLWP--DKTVYYNFEDS----EFTIYEKTLVENAIQDLRM-HTCVRFVPR-   64 (367)
Q Consensus         1 ~~GDi~-~~~q~~~r~----a~---~~~~~~Wp--~~~VpY~~~~~----~~~~~~~~~I~~A~~~w~~-~tci~F~~~-   64 (367)
                      .||||+ +++|+++..    ..   -++..+|.  ..+|...+ ..    +++...+..+.+||+.|+. ...|+|.-. 
T Consensus         3 vegDi~~s~~~l~~~~~~~~~~~eqYrTtnlV~~~~~~I~i~~-~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~   81 (211)
T PF12388_consen    3 VEGDIFVSKKQLREMLQSDDGTSEQYRTTNLVGSSPRTITIIG-YTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTF   81 (211)
T ss_pred             eccceEEcHHHHhhhhccCCcchhheeeeeeecCCCCEEEEEe-CCCccccccHHHHHHHHHHHHHHHhhCCceEEEEec
Confidence            489999 887743322    11   12345673  34788888 55    5678899999999999999 557888411 


Q ss_pred             -C--CCCceEEEecCC---CCccCCCCCC--CCC--cccccccCCCccc--ccchhHHHHHhhhcccccccCCCCCcceE
Q psy2949          65 -T--NQDTYLRFRNTG---FGCASPVGYF--PIG--TGIDIFLGGRVCF--LKGKIQHEILHSLGFWHEHTRPDRDQFVR  132 (367)
Q Consensus        65 -~--~~~~~I~i~~~~---~gc~S~vG~~--~g~--q~i~~~~l~~~c~--~~g~i~HEigHALG~~HEh~rpDRD~yv~  132 (367)
                       .  ...+.+.+....   .|--..-|..  +|.  ..+....+...-.  ...+|+|||||+|||.|+.          
T Consensus        82 g~~~~~~di~v~~~~~~~~~G~ggsAGFP~s~G~P~~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHTD----------  151 (211)
T PF12388_consen   82 GTNYQNADIIVYSNSSNNPSGAGGSAGFPTSNGNPYKFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHTD----------  151 (211)
T ss_pred             CCCcCCCceEEEeccCCCCCCcceeccCCCCCCCCCceEEEEecCCCchhHHHHHHHHHhhhhccccccC----------
Confidence             1  233444442211   1111111222  121  1111111111111  2369999999999999975          


Q ss_pred             EeeCCcCC-CCCcccccCCCCcccccCCc--CCCCCCcCcccceeccCCCcceeeeCCCCCCCccCccccccHHHHHHHH
Q psy2949         133 VLRENIGP-GHEFNLERRPTGSVRTFGMP--YDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLN  209 (367)
Q Consensus       133 I~~~ni~~-~~~~nf~k~~~~~~~~~g~~--YD~~SIMhY~~~~fs~~~~~~ti~~~~~~~~~~IG~r~~lS~~D~~~in  209 (367)
                        |.+-.. +...|=.--....+..-|+|  +|..|||.-.-.                     -+...+++..|+..|+
T Consensus       152 --~~~R~SCG~~~nEg~~~vGAi~IpGTPt~~d~~SiM~ac~~---------------------~~~~~~f~~~Di~Al~  208 (211)
T PF12388_consen  152 --YFNRSSCGSGGNEGSAGVGAIHIPGTPTGADPNSIMNACFS---------------------SGEDGEFTSNDITALN  208 (211)
T ss_pred             --cCCcccccccCCcCccccceEECCCCCCCCCCchhhhcccc---------------------CCCCCCcChhhHHHHH
Confidence              222110 00000000000011222443  599999987421                     1234568999999999


Q ss_pred             HHh
Q psy2949         210 RLY  212 (367)
Q Consensus       210 ~lY  212 (367)
                      .||
T Consensus       209 ~lY  211 (211)
T PF12388_consen  209 YLY  211 (211)
T ss_pred             hhC
Confidence            998


No 23 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.55  E-value=1.2e-06  Score=79.36  Aligned_cols=31  Identities=13%  Similarity=0.296  Sum_probs=27.1

Q ss_pred             CCCeeEEEEcCCCCCHHHHHHHHHHHHHHHHc
Q psy2949          25 PDKTVYYNFEDSEFTIYEKTLVENAIQDLRMH   56 (367)
Q Consensus        25 p~~~VpY~~~~~~~~~~~~~~I~~A~~~w~~~   56 (367)
                      |+..|.|.| +..++...++.|++|++.|++.
T Consensus         6 pk~pItyyI-~~~~p~~~r~aI~~A~~~Wn~~   36 (197)
T cd04276           6 PKEPIVYYL-DNTFPEKYRDAIREGVLYWNKA   36 (197)
T ss_pred             CCCCEEEEe-cCCCcHHHHHHHHHHHHHHHHH
Confidence            567899999 6667789999999999999885


No 24 
>KOG1565|consensus
Probab=98.50  E-value=7.6e-07  Score=90.57  Aligned_cols=104  Identities=16%  Similarity=0.111  Sum_probs=66.4

Q ss_pred             cccCCCCcCCCeeEEEEcCCCC----CHHHHHHHHHHHHHHHHcCCcEEEEcCC-CCceEEEe--cCCCCccC-------
Q psy2949          17 VLYQAQLWPDKTVYYNFEDSEF----TIYEKTLVENAIQDLRMHTCVRFVPRTN-QDTYLRFR--NTGFGCAS-------   82 (367)
Q Consensus        17 ~~~~~~~Wp~~~VpY~~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~-~~~~I~i~--~~~~gc~S-------   82 (367)
                      ......+|+...|+|+| ....    ....+.++++|+..|++.|-++|.|+.. ...-|.|.  ....|+..       
T Consensus        99 ~~~~~~kW~k~~lT~ri-~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g  177 (469)
T KOG1565|consen   99 YFPGKPKWNKEHLTYRI-KNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGG  177 (469)
T ss_pred             cCcccCcccccccceec-cccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCC
Confidence            33456799999999999 3322    4678999999999999999999999984 44444443  22223211       


Q ss_pred             CCCC-------CCCCcccc---cccCC--CcccccchhHHHHHhhhccccc
Q psy2949          83 PVGY-------FPIGTGID---IFLGG--RVCFLKGKIQHEILHSLGFWHE  121 (367)
Q Consensus        83 ~vG~-------~~g~q~i~---~~~l~--~~c~~~g~i~HEigHALG~~HE  121 (367)
                      .+.+       .+|.-...   .+.++  .+-....|++|||||||||.|.
T Consensus       178 ~laHAf~Pg~~~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS  228 (469)
T KOG1565|consen  178 VLAHAFFPGPGIGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHS  228 (469)
T ss_pred             ceecccCCCCCCCCccccCcccceeccCCccchhHHHhhhhcccccccCCC
Confidence            1111       11111111   22221  1334458999999999999998


No 25 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=97.72  E-value=1.4e-05  Score=69.80  Aligned_cols=44  Identities=27%  Similarity=0.391  Sum_probs=38.6

Q ss_pred             cccccCCCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949         235 RTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  292 (367)
Q Consensus       235 ~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y  292 (367)
                      ..++.+||+.|||||....|+.+              ..+|++..++..||+.|+.||
T Consensus       122 ~~~~~~~~~~SvM~y~~~~~~~~--------------~~~~~~~~~~~~Di~ai~~lY  165 (165)
T cd04268         122 DLLAEKGDTSSVMDYAPSNFSIQ--------------LGDGQKYTIGPYDIAAIKKLY  165 (165)
T ss_pred             hhhccCCCCcccCCCCccccccc--------------cccccCCCCCHHHHHHHHhcC
Confidence            57888999999999999998765              245788999999999999998


No 26 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=97.65  E-value=3e-05  Score=64.48  Aligned_cols=86  Identities=20%  Similarity=0.091  Sum_probs=45.5

Q ss_pred             eeEEEEcCCCCCHHHHHHHHHHHHHHHH-cCCcEEEEcCCCCceEEEecCCCCccCCCCCCC-CCcccccc-cCCCcccc
Q psy2949          28 TVYYNFEDSEFTIYEKTLVENAIQDLRM-HTCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP-IGTGIDIF-LGGRVCFL  104 (367)
Q Consensus        28 ~VpY~~~~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~-g~q~i~~~-~l~~~c~~  104 (367)
                      +|.|.- +.  .++.+..|.+|.+.|+. .++|+|++...  ..+++......-+|+-...+ |.-.|-.. ....+-..
T Consensus         3 tv~Yda-s~--A~~f~~~i~~aa~iWN~sV~NV~L~~~s~--a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy~~   77 (132)
T PF02031_consen    3 TVYYDA-SR--APEFRSAIAQAAQIWNSSVSNVRLVEGSS--ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGYNS   77 (132)
T ss_dssp             EEEEEE-EE---GGGHHHHHHHHHHHHHH-SSEEEEE-SS---SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS-H
T ss_pred             EEEEeC-CC--CchhHHHHHHHHHHHhcccCceEEeecCC--CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCCcc
Confidence            466665 32  45788999999999987 78999999875  33334322222233322221 11112100 00123333


Q ss_pred             cchhHHHHHhhhcc
Q psy2949         105 KGKIQHEILHSLGF  118 (367)
Q Consensus       105 ~g~i~HEigHALG~  118 (367)
                      ..+++|||||.|||
T Consensus        78 ~RIaaHE~GHiLGL   91 (132)
T PF02031_consen   78 TRIAAHELGHILGL   91 (132)
T ss_dssp             HHHHHHHHHHHHT-
T ss_pred             ceeeeehhccccCC
Confidence            47999999999997


No 27 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.20  E-value=0.00076  Score=60.71  Aligned_cols=80  Identities=15%  Similarity=0.023  Sum_probs=44.2

Q ss_pred             HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecCCCC-ccCCCCC-------------CCCCcccc---cccCCCcc
Q psy2949          40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNTGFG-CASPVGY-------------FPIGTGID---IFLGGRVC  102 (367)
Q Consensus        40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~~~g-c~S~vG~-------------~~g~q~i~---~~~l~~~c  102 (367)
                      +...+.+..|++.|.+.-.+..+++.++.|......+..| -|.-.|+             .-+.-.+|   .+....+-
T Consensus       103 p~wq~a~~tava~wa~~fpl~ive~~eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         103 PRWQGAYLTAVAGWAKTFPLIIVERFEEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             hhHHHHHHHHHHHHHHhCCceeeecceeeeEEEEecCCCCCcccccchHHHHHHHHHHhhhccCcccccccccccCCccc
Confidence            3568999999999999866666776554444333333222 1222222             11111122   00011111


Q ss_pred             c---ccchhHHHHHhhhccc
Q psy2949         103 F---LKGKIQHEILHSLGFW  119 (367)
Q Consensus       103 ~---~~g~i~HEigHALG~~  119 (367)
                      .   ..+++.||+|||||++
T Consensus       183 ~~e~L~~tarhElGhaLgi~  202 (236)
T COG5549         183 LRENLNPTARHELGHALGIW  202 (236)
T ss_pred             chhhhhHHHHHhhcchheec
Confidence            1   2389999999999976


No 28 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=95.73  E-value=0.0045  Score=56.62  Aligned_cols=19  Identities=37%  Similarity=0.588  Sum_probs=17.4

Q ss_pred             cchhHHHHHhhhccccccc
Q psy2949         105 KGKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~  123 (367)
                      ..+++|||||+||+.|+..
T Consensus       138 ~~~~aHEiGH~lGl~H~~~  156 (206)
T PF13583_consen  138 YQTFAHEIGHNLGLRHDFD  156 (206)
T ss_pred             chHHHHHHHHHhcCCCCcc
Confidence            3679999999999999987


No 29 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=95.58  E-value=0.0053  Score=50.92  Aligned_cols=19  Identities=37%  Similarity=0.455  Sum_probs=13.2

Q ss_pred             cccchhHHHHHhhhccccc
Q psy2949         103 FLKGKIQHEILHSLGFWHE  121 (367)
Q Consensus       103 ~~~g~i~HEigHALG~~HE  121 (367)
                      ....+++|||||.||+.|-
T Consensus       106 ~~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  106 SGVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             TSTTHHHHHHHHHTT----
T ss_pred             ccceEeeehhhHhcCCCCC
Confidence            3458999999999999993


No 30 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=95.45  E-value=0.01  Score=52.72  Aligned_cols=20  Identities=30%  Similarity=0.345  Sum_probs=14.9

Q ss_pred             ccchhHHHHHhhhccccccc
Q psy2949         104 LKGKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       104 ~~g~i~HEigHALG~~HEh~  123 (367)
                      ...+++|||||.||..|...
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~  130 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFD  130 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSS
T ss_pred             eeeeehhhhHhhcCCCCCCC
Confidence            34689999999999999987


No 31 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.34  E-value=0.031  Score=54.40  Aligned_cols=81  Identities=20%  Similarity=0.249  Sum_probs=48.1

Q ss_pred             CHHHHHHHHHHHHHHHHcC-Cc-EEEEcCCCCceEEEecCCCCc-cCCCCCCCCCcccccccCCCcccccchhHHHHHhh
Q psy2949          39 TIYEKTLVENAIQDLRMHT-CV-RFVPRTNQDTYLRFRNTGFGC-ASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHS  115 (367)
Q Consensus        39 ~~~~~~~I~~A~~~w~~~t-ci-~F~~~~~~~~~I~i~~~~~gc-~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHA  115 (367)
                      +..+.-.+..+.+.+-+.. .+ .|   ..-..+|.|...+..| |+-+|..+|.+..- ..-+.++...++++||+||-
T Consensus        86 ~~~d~~~~~~~Ad~~a~~~lG~~~~---s~y~h~vyvlP~~~~C~w~Gla~v~G~~~~~-~~~~~~~~~~~~~~HElgHN  161 (314)
T PF05548_consen   86 STDDWYGWADAADAAARAQLGVNAF---SSYTHRVYVLPPGFACGWAGLATVPGSQSWL-WISGYGVQDWATIMHELGHN  161 (314)
T ss_pred             chHHHHHHHHHHHHHHHhhcCcccc---cccceEEEEcCCCCCCCceEEeecCCcceee-eecCcccccHHHHHHHhhhh
Confidence            3344556777777776654 33 23   2234456665555456 44444454544221 11234566778999999999


Q ss_pred             hccccccc
Q psy2949         116 LGFWHEHT  123 (367)
Q Consensus       116 LG~~HEh~  123 (367)
                      |||.|...
T Consensus       162 ~GL~Ha~~  169 (314)
T PF05548_consen  162 LGLWHAGR  169 (314)
T ss_pred             ccccccCC
Confidence            99999963


No 32 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=95.24  E-value=0.0082  Score=52.39  Aligned_cols=20  Identities=30%  Similarity=0.527  Sum_probs=13.5

Q ss_pred             chhHHHHHhhhcccccccCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rp  125 (367)
                      .|++||+||-|||.|-.+..
T Consensus        71 ~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   71 KTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             HHHHHHHHHHTT---TT---
T ss_pred             cchhhhhhhhhcccccccCC
Confidence            79999999999999998754


No 33 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=94.94  E-value=0.015  Score=52.19  Aligned_cols=18  Identities=33%  Similarity=0.429  Sum_probs=16.5

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      .+++||+||.||+.|...
T Consensus       135 ~~~aHElGH~lG~~HD~~  152 (192)
T cd04267         135 LTMAHELGHNLGAEHDGG  152 (192)
T ss_pred             hhhhhhHHhhcCCcCCCC
Confidence            689999999999999864


No 34 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=94.80  E-value=0.023  Score=49.38  Aligned_cols=40  Identities=38%  Similarity=0.577  Sum_probs=33.3

Q ss_pred             cccccCCCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949         235 RTFSMPYDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  292 (367)
Q Consensus       235 ~~~~~pYDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y  292 (367)
                      .....++++.|||||....|+                  ++++..||.-|+.+|+++|
T Consensus       128 ~~~~~~~~~~siM~y~~~~~~------------------~~~~~~fS~~d~~~i~~~Y  167 (167)
T cd00203         128 TLNAEDDDYYSVMSYTKGSFS------------------DGQRKDFSQCDIDQINKLY  167 (167)
T ss_pred             cccCCCCCCCeEeccCccccC------------------cccCCCcCHHHHHHHHhhC
Confidence            356778999999999866553                  4677889999999999998


No 35 
>smart00042 CUB Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
Probab=94.76  E-value=0.024  Score=44.69  Aligned_cols=29  Identities=21%  Similarity=0.285  Sum_probs=24.8

Q ss_pred             ccCCCCCCCCCCCCCCCCcEEEEcccCCCcccc
Q psy2949         308 YSTSGPIPDLGYLPTGSGWFYKIGGPSDDRKIM  340 (367)
Q Consensus       308 ~~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~  340 (367)
                      .+|++|.+    ++.+.+|.|.|.+|++.++.+
T Consensus         4 ~Sp~yP~~----y~~~~~C~w~i~~~~g~~i~l   32 (102)
T smart00042        4 TSPNYPQS----YPNNLDCVWTIRAPPGYRIEL   32 (102)
T ss_pred             eCCCCCcC----CCCCCcEEEEEECCCCeEEEE
Confidence            37888876    789999999999999987765


No 36 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=94.49  E-value=0.018  Score=51.74  Aligned_cols=23  Identities=30%  Similarity=0.254  Sum_probs=14.4

Q ss_pred             ccchhHHHHHhhhcccccccCCC
Q psy2949         104 LKGKIQHEILHSLGFWHEHTRPD  126 (367)
Q Consensus       104 ~~g~i~HEigHALG~~HEh~rpD  126 (367)
                      ..-+++|||||.||..|.+..+.
T Consensus       142 ~~~~~AHEiGH~lGa~HD~~~~~  164 (196)
T PF13688_consen  142 GAITFAHEIGHNLGAPHDGDYSS  164 (196)
T ss_dssp             HHHHHHHHHHHHTT-----SSS-
T ss_pred             eehhhHHhHHHhcCCCCCCCCCC
Confidence            34799999999999999887654


No 37 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=93.92  E-value=0.0096  Score=55.22  Aligned_cols=20  Identities=30%  Similarity=0.506  Sum_probs=18.0

Q ss_pred             chhHHHHHhhhcccccccCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rp  125 (367)
                      .|++||+||.|||+|.++--
T Consensus       139 ~t~~HEvGH~lGL~HtF~~~  158 (225)
T cd04275         139 DTATHEVGHWLGLYHTFQGG  158 (225)
T ss_pred             ceeEEeccceeeeeeeecCC
Confidence            69999999999999999753


No 38 
>cd00041 CUB CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
Probab=93.61  E-value=0.074  Score=42.37  Aligned_cols=33  Identities=21%  Similarity=0.295  Sum_probs=27.0

Q ss_pred             cccc-cCCCCCCCCCCCCCCCCcEEEEcccCCCccccc
Q psy2949         305 QGFY-STSGPIPDLGYLPTGSGWFYKIGGPSDDRKIMD  341 (367)
Q Consensus       305 ~g~~-~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~~  341 (367)
                      .|.+ +|++|.+    ++.+.+|.|.|.+|.++|+.+-
T Consensus        10 ~g~i~Sp~~p~~----~~~~~~C~w~i~~~~g~~i~l~   43 (113)
T cd00041          10 SGTISSPNYPNN----YPNNLNCVWTIEAPPGYRIRLT   43 (113)
T ss_pred             CeEEECCCCCCC----CCCCCcEEEEEEcCCCCEEEEE
Confidence            4555 8888876    7788999999999999988764


No 39 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=93.31  E-value=0.035  Score=48.97  Aligned_cols=18  Identities=33%  Similarity=0.468  Sum_probs=15.2

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      --++||+||++||.|=-+
T Consensus       126 KEv~HElGH~~GL~HC~N  143 (181)
T COG1913         126 KEVLHELGHLLGLSHCPN  143 (181)
T ss_pred             HHHHHHhhhhcCcccCCC
Confidence            358999999999999654


No 40 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=93.20  E-value=0.039  Score=49.56  Aligned_cols=19  Identities=32%  Similarity=0.519  Sum_probs=16.7

Q ss_pred             cchhHHHHHhhhccccccc
Q psy2949         105 KGKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~  123 (367)
                      ..+++|||||.||+.|...
T Consensus       132 a~~~AHElGH~lG~~HD~~  150 (194)
T cd04269         132 AVTMAHELGHNLGMEHDDG  150 (194)
T ss_pred             HHHHHHHHHhhcCCCcCCC
Confidence            3799999999999999864


No 41 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=93.04  E-value=0.064  Score=49.34  Aligned_cols=20  Identities=30%  Similarity=0.388  Sum_probs=17.4

Q ss_pred             cchhHHHHHhhhcccccccC
Q psy2949         105 KGKIQHEILHSLGFWHEHTR  124 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~r  124 (367)
                      ..+++||+||.||+.|....
T Consensus       146 ~~~~AHElGH~lG~~HD~~~  165 (220)
T cd04272         146 VYTMTHELAHLLGAPHDGSP  165 (220)
T ss_pred             HHHHHHHHHHHhCCCCCCCC
Confidence            47999999999999998654


No 42 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.95  E-value=0.19  Score=48.30  Aligned_cols=19  Identities=37%  Similarity=0.354  Sum_probs=16.2

Q ss_pred             ccchhHHHHHhhhcccccc
Q psy2949         104 LKGKIQHEILHSLGFWHEH  122 (367)
Q Consensus       104 ~~g~i~HEigHALG~~HEh  122 (367)
                      ..|+++||+||+|||.+.-
T Consensus       165 ~igv~~HE~gH~lGLPDlY  183 (286)
T TIGR03296       165 GVGVIAHELGHDLGLPDLY  183 (286)
T ss_pred             ceeeeehhhhcccCCCccc
Confidence            3599999999999987655


No 43 
>PF00431 CUB:  CUB domain CUB domain entry Spermadhesins family entry Link to schematic domain picture by Peer Bork. ;  InterPro: IPR000859 The CUB domain (for complement C1r/C1s, Uegf, Bmp1) is a structural motif of approximately 110 residues found almost exclusively in extracellular and plasma membrane-associated proteins, many of which are developmentally regulated [, ]. These proteins are involved in a diverse range of functions, including complement activation, developmental patterning, tissue repair, axon guidance and angiogenesis, cell signalling, fertilisation, haemostasis, inflammation, neurotransmission, receptor-mediated endocytosis, and tumour suppression [, ]. Many CUB-containing proteins are peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Proteins containing a CUB domain include:  Mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system.  Cricetidae sp. (Hamster) serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium. Mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement. Vertebrate enteropeptidase (3.4.21.9 from EC), a type II membrane protein of the intestinal brush border, which activates trypsinogen. Vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity. Sea urchin blastula proteins BP10 and SpAN.  Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5. Neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits. Fibropellins I and III from Strongylocentrotus purpuratus (Purple sea urchin). Mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein. Mammalian spermadhesins.  Xenopus laevis embryonic protein UVS.2, which is expressed during dorsoanterior development.  Several of the above proteins consist of a catalytic domain together with several CUB domains interspersed by calcium-binding EGF domains. Some CUB domains appear to be involved in oligomerisation and/or recognition of substrates and binding partners. For example, in the complement proteases, the CUB domains mediate dimerisation and binding to collagen-like regions of target proteins (e.g. C1q for C1r/C1s). The structure of CUB domains consists of a beta-sandwich with a jelly-roll fold. Almost all CUB domains contain four conserved cysteines that probably form two disulphide bridges (C1-C2, C3-C4). The CUB1 domains of C1s and Map19 have calcium-binding sites [].; PDB: 1SFP_A 3KQ4_B 2WNO_A 2QQK_A 2QQL_A 2QQO_B 2QQM_A 3POJ_A 3POB_A 3POG_B ....
Probab=92.40  E-value=0.058  Score=42.83  Aligned_cols=34  Identities=24%  Similarity=0.355  Sum_probs=26.0

Q ss_pred             ccccc-cCCCCCCCCCCCCCCCCcEEEEcccCCCccccc
Q psy2949         304 IQGFY-STSGPIPDLGYLPTGSGWFYKIGGPSDDRKIMD  341 (367)
Q Consensus       304 ~~g~~-~~~~~~~~~~~~~~~~~c~~~~~~~~~~r~~~~  341 (367)
                      ..|.+ +|++|.+    ++...+|.|.|.+|.+.+|.+-
T Consensus         8 ~~g~i~Sp~yp~~----y~~~~~C~w~i~~~~~~~I~l~   42 (110)
T PF00431_consen    8 NSGIISSPNYPSN----YPSNSDCTWTITAPPGHRIRLT   42 (110)
T ss_dssp             SEEEEESTTTTS-----SSSSEEEEEEEE-STTEEEEEE
T ss_pred             CeEEEECCCCCCC----CCCCCcEeEEEEecccceeeec
Confidence            45555 8888876    7888999999999999987654


No 44 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=92.16  E-value=0.053  Score=50.37  Aligned_cols=17  Identities=35%  Similarity=0.509  Sum_probs=15.3

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      .+++|||||.||+.|--
T Consensus       147 ~t~AHElGHnLGm~HD~  163 (228)
T cd04271         147 QVFAHEIGHTFGAVHDC  163 (228)
T ss_pred             eehhhhhhhhcCCCCCC
Confidence            48999999999999864


No 45 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=91.96  E-value=0.073  Score=47.61  Aligned_cols=15  Identities=40%  Similarity=0.734  Sum_probs=13.5

Q ss_pred             chhHHHHHhhhcccc
Q psy2949         106 GKIQHEILHSLGFWH  120 (367)
Q Consensus       106 g~i~HEigHALG~~H  120 (367)
                      -.++||+||.||+.|
T Consensus       127 k~~~HElGH~lGL~H  141 (179)
T PRK13267        127 KEVTHELGHTLGLEH  141 (179)
T ss_pred             HHHHHHHHHHcCCcc
Confidence            358999999999999


No 46 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=91.95  E-value=0.072  Score=49.99  Aligned_cols=17  Identities=29%  Similarity=0.323  Sum_probs=15.0

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      .+++|||||.||..|--
T Consensus       169 ~t~AHElGHnlGm~HD~  185 (244)
T cd04270         169 LVTAHELGHNFGSPHDP  185 (244)
T ss_pred             HHHHHHHHHhcCCCCCC
Confidence            68999999999998854


No 47 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=91.79  E-value=0.077  Score=48.01  Aligned_cols=17  Identities=29%  Similarity=0.364  Sum_probs=11.6

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      -.++||+||++||.|=.
T Consensus       147 Kea~HElGH~~GL~HC~  163 (194)
T PF07998_consen  147 KEAVHELGHLFGLDHCE  163 (194)
T ss_dssp             HHHHHHHHHHTT----S
T ss_pred             HHHHHHHHHHcCCcCCC
Confidence            57999999999999854


No 48 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=91.68  E-value=0.041  Score=50.16  Aligned_cols=19  Identities=37%  Similarity=0.454  Sum_probs=17.0

Q ss_pred             cchhHHHHHhhhccccccc
Q psy2949         105 KGKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~  123 (367)
                      ..+++|||||.||+.|...
T Consensus       141 a~~~aHElGH~LG~~HD~~  159 (207)
T cd04273         141 AFTIAHELGHVLGMPHDGD  159 (207)
T ss_pred             EEeeeeechhhcCCCCCCC
Confidence            4789999999999999865


No 49 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=90.18  E-value=0.14  Score=46.26  Aligned_cols=18  Identities=39%  Similarity=0.516  Sum_probs=13.5

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      .+++|||||.||+.|...
T Consensus       133 ~~~AHelGH~lGm~HD~~  150 (199)
T PF01421_consen  133 VIIAHELGHNLGMPHDGD  150 (199)
T ss_dssp             HHHHHHHHHHTT---TTT
T ss_pred             HHHHHHHHHhcCCCCCCC
Confidence            688999999999999876


No 50 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=89.25  E-value=0.15  Score=46.05  Aligned_cols=98  Identities=20%  Similarity=0.255  Sum_probs=53.3

Q ss_pred             CeeEEEEcCC-CC--C-----HHHHHHHHHHHHH---HHHcCCcEEEEcCC-CCce-EEEecC---CCCcc-CC-CCC--
Q psy2949          27 KTVYYNFEDS-EF--T-----IYEKTLVENAIQD---LRMHTCVRFVPRTN-QDTY-LRFRNT---GFGCA-SP-VGY--   86 (367)
Q Consensus        27 ~~VpY~~~~~-~~--~-----~~~~~~I~~A~~~---w~~~tci~F~~~~~-~~~~-I~i~~~---~~gc~-S~-vG~--   86 (367)
                      .++.|.++-+ ++  +     ...-..|...+..   |...--++|+.+.. ..++ |.+...   ..-|. .. -|.  
T Consensus        29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~~~Df~I~Lasp~T~~~lC~g~~~~~e~S  108 (203)
T PF11350_consen   29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSGAPDFRISLASPGTTDRLCAGLDTSGETS  108 (203)
T ss_pred             ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCCCCCEEEEECCcchhhhhccCcCcCceeE
Confidence            4688888522 22  1     3455556555544   88888899999983 3344 333322   34461 11 111  


Q ss_pred             --C--CCCccccc--ccCCCcc-------cccchhHHHHHhhhcccccccC
Q psy2949          87 --F--PIGTGIDI--FLGGRVC-------FLKGKIQHEILHSLGFWHEHTR  124 (367)
Q Consensus        87 --~--~g~q~i~~--~~l~~~c-------~~~g~i~HEigHALG~~HEh~r  124 (367)
                        .  ++.-.||.  +..+..-       ...-+|-||.||+||..||-.-
T Consensus       109 C~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cp  159 (203)
T PF11350_consen  109 CRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCP  159 (203)
T ss_pred             eecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCC
Confidence              1  23444551  1111111       1125899999999999887653


No 51 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=89.03  E-value=0.17  Score=48.87  Aligned_cols=16  Identities=31%  Similarity=0.575  Sum_probs=12.8

Q ss_pred             chhHHHHHhhhccccc
Q psy2949         106 GKIQHEILHSLGFWHE  121 (367)
Q Consensus       106 g~i~HEigHALG~~HE  121 (367)
                      ++.+||+||++||.|-
T Consensus       195 ~~f~HE~GH~~GL~H~  210 (305)
T PF10462_consen  195 NEFSHELGHNFGLGHY  210 (305)
T ss_dssp             HHHHHHHHHTTT--SS
T ss_pred             ceeehhhhhhcCCCCC
Confidence            7899999999999993


No 52 
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=86.81  E-value=0.58  Score=34.09  Aligned_cols=29  Identities=31%  Similarity=0.511  Sum_probs=25.7

Q ss_pred             cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949         105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV  133 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I  133 (367)
                      ...++|++.+.+||.|+..-...+.+|.|
T Consensus        30 eR~~vH~lA~~~gL~s~S~G~g~~R~v~v   58 (60)
T cd02641          30 DRLLVHELAEELGLRHESTGEGSDRVITV   58 (60)
T ss_pred             HHHHHHHHHHHcCCceEeeCCCCceEEEe
Confidence            37899999999999999988888888776


No 53 
>PTZ00337 surface protease GP63; Provisional
Probab=86.06  E-value=0.25  Score=51.78  Aligned_cols=28  Identities=25%  Similarity=0.081  Sum_probs=19.1

Q ss_pred             cccccCCCcccccchhHHHHHhhhcccc
Q psy2949          93 IDIFLGGRVCFLKGKIQHEILHSLGFWH  120 (367)
Q Consensus        93 i~~~~l~~~c~~~g~i~HEigHALG~~H  120 (367)
                      ++|..+...-....+++|||.|||||-.
T Consensus       218 ~np~~i~~~~~~~~v~~HEi~HALGFs~  245 (567)
T PTZ00337        218 FDPRQIAVTNGDVRVAAHELGHALGFVR  245 (567)
T ss_pred             ECHHHccchhHHHHHHHHHHHHHHccCH
Confidence            3343343333345899999999999944


No 54 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=85.62  E-value=0.34  Score=48.95  Aligned_cols=18  Identities=39%  Similarity=0.549  Sum_probs=16.3

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      |.++||+||+||+.|.-+
T Consensus       318 GA~lHEiGH~fg~pH~~~  335 (423)
T PF12044_consen  318 GAFLHEIGHLFGCPHQED  335 (423)
T ss_pred             HHHHHHHHHhcCCCCCCC
Confidence            789999999999999764


No 55 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=84.22  E-value=0.46  Score=49.55  Aligned_cols=17  Identities=41%  Similarity=0.604  Sum_probs=14.0

Q ss_pred             cchhHHHHHhhhccccc
Q psy2949         105 KGKIQHEILHSLGFWHE  121 (367)
Q Consensus       105 ~g~i~HEigHALG~~HE  121 (367)
                      ..+++||++|||||--.
T Consensus       211 ~~~~~HEi~HaLGFs~~  227 (521)
T PF01457_consen  211 FRTVIHEIAHALGFSSS  227 (521)
T ss_dssp             HHHHHHHHHHHTT-SHH
T ss_pred             cceeeeeeeeeeeeccc
Confidence            37999999999999764


No 56 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=82.47  E-value=1.8  Score=43.35  Aligned_cols=20  Identities=15%  Similarity=-0.069  Sum_probs=14.6

Q ss_pred             CCCcccccchhHHHHHhhhc
Q psy2949          98 GGRVCFLKGKIQHEILHSLG  117 (367)
Q Consensus        98 l~~~c~~~g~i~HEigHALG  117 (367)
                      +.......-|++||+|||+=
T Consensus       216 ~~~~~~~v~tl~HE~GHa~h  235 (427)
T cd06459         216 FNGTLDDVFTLAHELGHAFH  235 (427)
T ss_pred             CCCChhhHHHHHHHhhHHHH
Confidence            34445556899999999974


No 57 
>KOG4525|consensus
Probab=81.89  E-value=0.57  Score=46.93  Aligned_cols=18  Identities=39%  Similarity=0.615  Sum_probs=16.2

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      |.++|||||.||+.|.-.
T Consensus       303 GA~~HElGH~lgcpHq~~  320 (614)
T KOG4525|consen  303 GAVCHELGHCLGCPHQSE  320 (614)
T ss_pred             HHHHHHhhhccCCCCCCC
Confidence            789999999999999753


No 58 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=81.84  E-value=1.9  Score=45.66  Aligned_cols=74  Identities=11%  Similarity=0.077  Sum_probs=35.7

Q ss_pred             HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecC----CCC-ccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949          40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNT----GFG-CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH  114 (367)
Q Consensus        40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~----~~g-c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH  114 (367)
                      ++.++.|.+|++.+... ...++...-...||-+..+    +++ |++..+ ..+--.+|   +...-...-|++|||||
T Consensus       314 ~~a~~~v~~~~~~~g~~-~~~~~~~~~~~~wiD~~~r~gK~~Ga~~~~~~~-~~p~il~N---~~~~~~dv~TLaHElGH  388 (591)
T TIGR00181       314 EEAKELILKSLEPLGEE-YIKILKRAFNERWVDYAENKGKRSGAYSIGGYK-VKPYILMN---WDGTLNSVFTLAHELGH  388 (591)
T ss_pred             HHHHHHHHHHHhccCHH-HHHHHHHHhhCCCeeecCCCCCCCCcccCCCCC-CCCeEEEe---cCCCcchHHHHHHHhhh
Confidence            34566677787543221 1111111112246665432    222 444433 22222233   44444456899999999


Q ss_pred             hhcc
Q psy2949         115 SLGF  118 (367)
Q Consensus       115 ALG~  118 (367)
                      |+=.
T Consensus       389 a~H~  392 (591)
T TIGR00181       389 SMHS  392 (591)
T ss_pred             HHHH
Confidence            9743


No 59 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=80.96  E-value=1.1  Score=43.65  Aligned_cols=75  Identities=17%  Similarity=0.020  Sum_probs=41.2

Q ss_pred             HHHHHHHHHHHHHHHHc-CCcEEEEcCCCCceEEEecCCCCccCCCCCCC------CCcccccccCCCcccccchhHHHH
Q psy2949          40 IYEKTLVENAIQDLRMH-TCVRFVPRTNQDTYLRFRNTGFGCASPVGYFP------IGTGIDIFLGGRVCFLKGKIQHEI  112 (367)
Q Consensus        40 ~~~~~~I~~A~~~w~~~-tci~F~~~~~~~~~I~i~~~~~gc~S~vG~~~------g~q~i~~~~l~~~c~~~g~i~HEi  112 (367)
                      .+..+.+++|++.+.+. ..+.....  ...-+.+    +.-.|++|..|      ++-.+|.  .-.......|++||+
T Consensus       133 ~~i~~~~~~~y~~l~~~~p~l~~~~~--~~K~~l~----S~~~s~~g~~Gyy~PFT~EA~vN~--~~p~~~~P~T~~HEl  204 (318)
T PF12725_consen  133 EEIFEEAREGYENLAERYPFLSGYYP--SPKPSLF----SRLMSYMGISGYYNPFTGEANVNT--DLPPYSLPFTICHEL  204 (318)
T ss_pred             HHHHHHHHHHHHHHHHhCCccCCCCC--CCcHhhc----cHHHHHccCceEEcCCcceeecCC--CCCcccccHHHHHHH
Confidence            56677788888888763 33322211  1111111    22234444432      3333441  112334458999999


Q ss_pred             Hhhhcccccc
Q psy2949         113 LHSLGFWHEH  122 (367)
Q Consensus       113 gHALG~~HEh  122 (367)
                      +|-+||..|.
T Consensus       205 AHq~G~a~E~  214 (318)
T PF12725_consen  205 AHQLGFASED  214 (318)
T ss_pred             HHHhCCCCHH
Confidence            9999999996


No 60 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=80.07  E-value=0.37  Score=51.29  Aligned_cols=16  Identities=38%  Similarity=0.407  Sum_probs=14.2

Q ss_pred             ccchhHHHHHhhhccc
Q psy2949         104 LKGKIQHEILHSLGFW  119 (367)
Q Consensus       104 ~~g~i~HEigHALG~~  119 (367)
                      ..|+++|||||.|||.
T Consensus       221 giGVfaHEfGH~LGLP  236 (645)
T PF05547_consen  221 GIGVFAHEFGHDLGLP  236 (645)
T ss_pred             ceEEEEeeccccCCCC
Confidence            4699999999999984


No 61 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=79.85  E-value=2.6  Score=44.56  Aligned_cols=75  Identities=13%  Similarity=0.037  Sum_probs=37.6

Q ss_pred             HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEec----C-CCCccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949          40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRN----T-GFGCASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH  114 (367)
Q Consensus        40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~----~-~~gc~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH  114 (367)
                      ++.++.|.+|++.|...-.- |..+.-...+|-+..    . +.-|++..+.....-.+|   +.......-|++||+||
T Consensus       310 e~a~~~v~~~~~~l~~e~~~-~~~~~~~~~~iD~~~r~gK~~Ga~~~~~~~~~~p~i~~N---~~~~~~~v~TL~HE~GH  385 (587)
T TIGR02290       310 DEAKELVLEAFGKFSPEMAD-FAEKAFEEGWIDAEPRPGKRGGAFCTGFPPSKEPRVLMN---YDGSRRDVSTLAHELGH  385 (587)
T ss_pred             HHHHHHHHHHHHhcCHHHHH-HHHHHHHcCCcccCCCCCCCCCcccCCCCCCCCCEEEEe---cCCCchhHHHHHHHhhH
Confidence            35678888888776543111 111111233444422    1 222444343222222233   34444456899999999


Q ss_pred             hhcc
Q psy2949         115 SLGF  118 (367)
Q Consensus       115 ALG~  118 (367)
                      |+=.
T Consensus       386 a~H~  389 (587)
T TIGR02290       386 AYHS  389 (587)
T ss_pred             HHHH
Confidence            9943


No 62 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=79.07  E-value=0.91  Score=47.91  Aligned_cols=17  Identities=35%  Similarity=0.661  Sum_probs=14.5

Q ss_pred             ccchhHHHHHhhhcccc
Q psy2949         104 LKGKIQHEILHSLGFWH  120 (367)
Q Consensus       104 ~~g~i~HEigHALG~~H  120 (367)
                      ...+++|||.|||||--
T Consensus       256 ~~rv~~HEi~HALGFS~  272 (622)
T PTZ00257        256 TTRTVTHEVAHALGFSS  272 (622)
T ss_pred             HHHHHHHHHHHHhcCCH
Confidence            35899999999999944


No 63 
>PF02408 CUB_2:  CUB-like domain;  InterPro: IPR003366 This domain is found in a family of hypothetical Caenorhabditis elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain (IPR000859 from INTERPRO). The aligned region is approximately 130 amino acids long and contains two conserved cysteine residues.
Probab=72.82  E-value=6.6  Score=32.43  Aligned_cols=29  Identities=14%  Similarity=0.232  Sum_probs=22.3

Q ss_pred             CCCCCCCCCCCCCcEEEEcccCC--Cccccc
Q psy2949         313 PIPDLGYLPTGSGWFYKIGGPSD--DRKIMD  341 (367)
Q Consensus       313 ~~~~~~~~~~~~~c~~~~~~~~~--~r~~~~  341 (367)
                      ...++-.+|++.+|.|.|.+|.|  -++.|-
T Consensus        31 ~s~~~~~~p~n~~C~y~i~iP~G~~a~v~~~   61 (120)
T PF02408_consen   31 GSTSPPQFPANQNCTYQINIPKGYYAKVTLS   61 (120)
T ss_pred             CCCCccccCCCCceEEEEEcCCceEEEEEEE
Confidence            34466789999999999999998  444443


No 64 
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=72.43  E-value=3.3  Score=30.16  Aligned_cols=28  Identities=14%  Similarity=0.341  Sum_probs=25.0

Q ss_pred             chhHHHHHhhhcccccccCCCCCcceEE
Q psy2949         106 GKIQHEILHSLGFWHEHTRPDRDQFVRV  133 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpDRD~yv~I  133 (367)
                      ..++|++.+.+||.|+-.-...+.||.|
T Consensus        31 R~~vH~~a~~~gL~s~S~G~g~~R~v~v   58 (60)
T cd02640          31 RALIHQIAQKYGLKSRSYGSGNDRYLVI   58 (60)
T ss_pred             HHHHHHHHHHcCCceeeEeCCCCeEEEE
Confidence            7899999999999999888888888776


No 65 
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=72.31  E-value=2.3  Score=34.85  Aligned_cols=37  Identities=27%  Similarity=0.471  Sum_probs=26.3

Q ss_pred             CCCccCCCCCCCCCcccccccCCCcccccchhHHHHHhhhccc
Q psy2949          77 GFGCASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFW  119 (367)
Q Consensus        77 ~~gc~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~  119 (367)
                      ++=|.|++|..-    |.|..-.+||  ..|+.||+.|++.|.
T Consensus        58 ~~LCG~~~~~i~----IDP~~~~KGC--~~TL~HEL~H~WQ~R   94 (141)
T PHA02456         58 QDLCGQFVGWIE----IDPDYANKGC--RDTLAHELNHAWQFR   94 (141)
T ss_pred             cchhhcceeEEE----ECCcccccch--HHHHHHHHHHHHhhh
Confidence            566888888752    3344446788  569999999998653


No 66 
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=71.82  E-value=3.7  Score=29.83  Aligned_cols=29  Identities=14%  Similarity=0.268  Sum_probs=24.8

Q ss_pred             cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949         105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV  133 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I  133 (367)
                      ...++|++.+.+||.|+-.-...+.||.|
T Consensus        29 eR~~vH~~a~~~gL~s~S~G~g~~R~v~v   57 (59)
T cd06007          29 ERAVIHRLCRKLGLKSKSKGKGSNRRLSV   57 (59)
T ss_pred             HHHHHHHHHHHcCCCceeecCCCCeEEEE
Confidence            37899999999999999877777777766


No 67 
>KOG3607|consensus
Probab=71.37  E-value=1.9  Score=46.64  Aligned_cols=17  Identities=35%  Similarity=0.628  Sum_probs=16.2

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      .+++|||||.||+.|+-
T Consensus       325 ~v~AhelgH~lGm~hD~  341 (716)
T KOG3607|consen  325 VVLAHELGHNLGMIHDE  341 (716)
T ss_pred             HHHHHHHHhhcCccccc
Confidence            79999999999999987


No 68 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=70.51  E-value=2.1  Score=40.65  Aligned_cols=26  Identities=12%  Similarity=0.077  Sum_probs=14.1

Q ss_pred             CCeeEEEEcCCCCCHHHHHHHHHHHH
Q psy2949          26 DKTVYYNFEDSEFTIYEKTLVENAIQ   51 (367)
Q Consensus        26 ~~~VpY~~~~~~~~~~~~~~I~~A~~   51 (367)
                      ...|...|-..+|++.++..+++.++
T Consensus        80 ~~~i~ivilgDGYT~~E~~~F~~da~  105 (264)
T PF09471_consen   80 GNRIDIVILGDGYTAAEMDKFEEDAR  105 (264)
T ss_dssp             TTSEEEEEEEES--GGGHHHHHHHHH
T ss_pred             cCCeEEEEEcCCCCHHHHHHHHHHHH
Confidence            34577777566787766655544333


No 69 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=66.42  E-value=2.8  Score=38.04  Aligned_cols=14  Identities=29%  Similarity=0.309  Sum_probs=11.7

Q ss_pred             chhHHHHHhhhccc
Q psy2949         106 GKIQHEILHSLGFW  119 (367)
Q Consensus       106 g~i~HEigHALG~~  119 (367)
                      +++.||+|||+--.
T Consensus        24 ~t~~HE~gHal~a~   37 (200)
T PF13398_consen   24 VTFVHELGHALAAL   37 (200)
T ss_pred             HHHHHHHHHHHHHH
Confidence            79999999998533


No 70 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=65.22  E-value=3.7  Score=37.91  Aligned_cols=13  Identities=23%  Similarity=0.342  Sum_probs=11.6

Q ss_pred             cchhHHHHHhhhc
Q psy2949         105 KGKIQHEILHSLG  117 (367)
Q Consensus       105 ~g~i~HEigHALG  117 (367)
                      .++++||.|||+-
T Consensus        90 vaVAAHEvGHAiQ  102 (222)
T PF04298_consen   90 VAVAAHEVGHAIQ  102 (222)
T ss_pred             HHHHHHHHhHHHh
Confidence            4899999999985


No 71 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=65.21  E-value=3  Score=38.33  Aligned_cols=16  Identities=31%  Similarity=0.436  Sum_probs=12.5

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      -|++|||||+|  .|...
T Consensus        74 FtlAHELGH~l--lH~~~   89 (213)
T COG2856          74 FTLAHELGHAL--LHTDL   89 (213)
T ss_pred             HHHHHHHhHHH--hcccc
Confidence            69999999997  45443


No 72 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=62.92  E-value=4.7  Score=42.35  Aligned_cols=21  Identities=14%  Similarity=0.093  Sum_probs=14.7

Q ss_pred             CCcccccchhHHHHHhhhccc
Q psy2949          99 GRVCFLKGKIQHEILHSLGFW  119 (367)
Q Consensus        99 ~~~c~~~g~i~HEigHALG~~  119 (367)
                      ...-...-|++||+|||+=.+
T Consensus       332 ~~t~~dv~TL~HElGHa~H~~  352 (549)
T TIGR02289       332 NGTSGDIDVLTHEAGHAFHVY  352 (549)
T ss_pred             CCChhHHHHHHHHhhHHHHHH
Confidence            333344589999999998443


No 73 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=61.61  E-value=4.2  Score=32.39  Aligned_cols=12  Identities=33%  Similarity=0.456  Sum_probs=10.3

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .-+++|||||.+
T Consensus        43 ~f~laHELgH~~   54 (122)
T PF06114_consen   43 RFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHH
Confidence            369999999976


No 74 
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=60.99  E-value=8.2  Score=27.67  Aligned_cols=28  Identities=18%  Similarity=0.312  Sum_probs=24.9

Q ss_pred             chhHHHHHhhhcccccccCCCCCcceEE
Q psy2949         106 GKIQHEILHSLGFWHEHTRPDRDQFVRV  133 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpDRD~yv~I  133 (367)
                      ..++|+|.+++||.++.+-...+.||.|
T Consensus        29 R~~vH~lA~~~~L~S~S~G~g~~R~v~v   56 (58)
T cd02646          29 RKTIHKLANCYNLKSKSRGKGKKRFVTV   56 (58)
T ss_pred             HHHHHHHHHHcCCcccccccCCceEEEE
Confidence            6899999999999999888888888776


No 75 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=60.88  E-value=4.1  Score=32.71  Aligned_cols=17  Identities=35%  Similarity=0.597  Sum_probs=13.0

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      -|++||++|-+|+--|.
T Consensus        75 ~tlvhEiah~fG~~~e~   91 (97)
T PF06262_consen   75 DTLVHEIAHHFGISDED   91 (97)
T ss_dssp             HHHHHHHHHHTT--HHH
T ss_pred             HHHHHHHHHHcCCCHHH
Confidence            58999999999987654


No 76 
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=59.30  E-value=7.5  Score=42.60  Aligned_cols=124  Identities=14%  Similarity=0.145  Sum_probs=65.3

Q ss_pred             CCcCCCeeEEEEcCCC-----CCHHHHHHHH---------------HHHHHHHHcCCcEEEEcC------CCCceEEEec
Q psy2949          22 QLWPDKTVYYNFEDSE-----FTIYEKTLVE---------------NAIQDLRMHTCVRFVPRT------NQDTYLRFRN   75 (367)
Q Consensus        22 ~~Wp~~~VpY~~~~~~-----~~~~~~~~I~---------------~A~~~w~~~tci~F~~~~------~~~~~I~i~~   75 (367)
                      ..|.+..-+|++-+.+     .+...|..++               +.++.+...+.|.|.+..      ...-+++--.
T Consensus       128 ~~w~~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~~~~n~~~kYF~KAD~  207 (775)
T PF03272_consen  128 SEWDDSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDPVDKNFNNKYFAKADK  207 (775)
T ss_pred             hhhhhcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCcccccccccceEEEecC
Confidence            4688777888874333     1445555555               556667788898888754      1222232211


Q ss_pred             CCCCccCCCCCCCC---CcccccccCCCcccccchhHHHHHhhhcccccccCCCCCcceEEeeCCcCCCCCcccccCCCC
Q psy2949          76 TGFGCASPVGYFPI---GTGIDIFLGGRVCFLKGKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPGHEFNLERRPTG  152 (367)
Q Consensus        76 ~~~gc~S~vG~~~g---~q~i~~~~l~~~c~~~g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~~~~nf~k~~~~  152 (367)
                      .|.| .+|.|..--   ...|.. -|... ..-..++|||||+--+.-.++.+    ++.=.|+||.+. ..++..++..
T Consensus       208 ~G~G-~AYY~~~w~a~ss~s~~~-~L~~~-~~nW~~LHEIgHgYd~~F~~n~~----~~~EVWnNI~~d-~yQ~~~~~~~  279 (775)
T PF03272_consen  208 SGPG-AAYYGSNWTAQSSSSLSF-YLNPS-PTNWGALHEIGHGYDFGFTRNGT----YLNEVWNNILAD-RYQYTYMTQD  279 (775)
T ss_pred             CCCC-CccccccceecCchhHHH-HhCCC-CCCchhhhhhhhhcceeEeeCCc----chhhhhhhhhhh-hhhhhhcChh
Confidence            2222 334444211   112220 01000 12256899999987666665544    244579999864 3455555433


Q ss_pred             c
Q psy2949         153 S  153 (367)
Q Consensus       153 ~  153 (367)
                      +
T Consensus       280 e  280 (775)
T PF03272_consen  280 E  280 (775)
T ss_pred             h
Confidence            3


No 77 
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=59.06  E-value=9.5  Score=27.85  Aligned_cols=29  Identities=17%  Similarity=0.189  Sum_probs=25.4

Q ss_pred             cchhHHHHHhhhcccccccCCCCCcceEEe
Q psy2949         105 KGKIQHEILHSLGFWHEHTRPDRDQFVRVL  134 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I~  134 (367)
                      ..-++|++....||.||.+... +.+|.|.
T Consensus        33 ~RllvH~la~~~gL~s~s~~~~-~r~vvv~   61 (63)
T cd02642          33 YRLLAHRVAQYYGLDHNVDNSG-GKCVIVN   61 (63)
T ss_pred             HHHHHHHHHHHhCCeeEeecCC-ceEEEEE
Confidence            3789999999999999999987 7777764


No 78 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=58.87  E-value=27  Score=31.18  Aligned_cols=17  Identities=24%  Similarity=0.128  Sum_probs=14.3

Q ss_pred             cchhHHHHHhhhccccc
Q psy2949         105 KGKIQHEILHSLGFWHE  121 (367)
Q Consensus       105 ~g~i~HEigHALG~~HE  121 (367)
                      .-+|+||+.|.+-..|.
T Consensus       165 dYVvvHEL~Hl~~~nHs  181 (205)
T PF01863_consen  165 DYVVVHELCHLRHPNHS  181 (205)
T ss_pred             HHHHHHHHHHhccCCCC
Confidence            37999999999977765


No 79 
>PF11667 DUF3267:  Protein of unknown function (DUF3267);  InterPro: IPR021683  This family of proteins has no known function. 
Probab=57.67  E-value=6.6  Score=31.90  Aligned_cols=20  Identities=30%  Similarity=0.275  Sum_probs=18.1

Q ss_pred             chhHHHHHhhhcccccccCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rp  125 (367)
                      ..++||+.|+|.+...-.++
T Consensus         6 ~~~~HEliH~l~~~~~~~~~   25 (111)
T PF11667_consen    6 LIPLHELIHGLFFKLFGKKP   25 (111)
T ss_pred             eHHHHHHHHHHHHHHhCCCC
Confidence            57899999999999998877


No 80 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=57.40  E-value=3.4  Score=37.90  Aligned_cols=40  Identities=35%  Similarity=0.566  Sum_probs=26.9

Q ss_pred             CcccccccC--CCCCCcccccccccccCCCceeeeeCCCCCccccccccCCCHHHHHHHhhhc
Q psy2949         232 GSVRTFSMP--YDYGSIMHYSGIAFSKDGVSKTIVPLYPGAEDTMGQRDAMSRVDLAKLNRLY  292 (367)
Q Consensus       232 ~~~~~~~~p--YDy~SiMhY~~~~fs~~~~~~ti~~~~~~~~~~~gqr~~~s~~D~~~in~~Y  292 (367)
                      +.+.-.|+|  +|-+|||.-   .|+.                  +....|+..|+..++.||
T Consensus       170 GAi~IpGTPt~~d~~SiM~a---c~~~------------------~~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  170 GAIHIPGTPTGADPNSIMNA---CFSS------------------GEDGEFTSNDITALNYLY  211 (211)
T ss_pred             ceEECCCCCCCCCCchhhhc---cccC------------------CCCCCcChhhHHHHHhhC
Confidence            334455555  599999953   2221                  223469999999999998


No 81 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=56.45  E-value=5.5  Score=36.81  Aligned_cols=11  Identities=36%  Similarity=0.458  Sum_probs=10.0

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      .|+.||+||||
T Consensus        94 ~~l~HE~GHAl  104 (220)
T PF14247_consen   94 FTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHH
Confidence            68999999997


No 82 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=55.89  E-value=4.9  Score=33.41  Aligned_cols=16  Identities=31%  Similarity=0.484  Sum_probs=13.6

Q ss_pred             chhHHHHHhhhccccc
Q psy2949         106 GKIQHEILHSLGFWHE  121 (367)
Q Consensus       106 g~i~HEigHALG~~HE  121 (367)
                      -+++|||||-+|+--+
T Consensus       111 hvliHEIgHhFGLsDd  126 (136)
T COG3824         111 HVLIHEIGHHFGLSDD  126 (136)
T ss_pred             hhhhhhhhhhcCCChh
Confidence            5899999999998654


No 83 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=55.55  E-value=5.8  Score=35.24  Aligned_cols=21  Identities=19%  Similarity=0.134  Sum_probs=15.3

Q ss_pred             cchhHHHHHhhhcccccccCC
Q psy2949         105 KGKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rp  125 (367)
                      ..+++||+||++-....-.+.
T Consensus         8 i~i~~HE~gH~~~a~~~G~~~   28 (192)
T PF02163_consen    8 ISIVLHELGHALAARLYGDKV   28 (192)
T ss_dssp             HHHHHHHHHHHHHHHTTT--B
T ss_pred             ccccccccccccccccccccc
Confidence            368999999999877765544


No 84 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=55.15  E-value=2.3  Score=35.83  Aligned_cols=45  Identities=27%  Similarity=0.425  Sum_probs=29.9

Q ss_pred             CccCccccccHHHHHHHHHHhcCCCCcccCcccccccCCCCcccccccCCCCCC
Q psy2949         192 DTMGQRDAMSRVDLAKLNRLYKCPKNYYQGHEFNLERRPAGSVRTFSMPYDYGS  245 (367)
Q Consensus       192 ~~IG~r~~lS~~D~~~in~lY~C~~nI~~g~~~Nf~~~~~~~~~~~~~pYDy~S  245 (367)
                      |++|..+..++.|++.+..+.  ..++    ..+|...+.   +.+++||||+|
T Consensus        96 HaLGl~H~~~~~drd~~~~~~--~~~~----~~~~~~~~~---~~~~~~~~yg~  140 (140)
T smart00235       96 HALGLYHEQSRSDRDNYMYIN--YTNI----TRNFDLSND---DSLGIPYDYGS  140 (140)
T ss_pred             HHhcCCcCCCCCcccCeEEEe--hhhh----hhccccccc---cCCCchhccCc
Confidence            678888888999987544332  2333    345555432   57889999987


No 85 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=51.32  E-value=7  Score=39.95  Aligned_cols=15  Identities=47%  Similarity=0.470  Sum_probs=12.7

Q ss_pred             ccchhHHHHHhhhcc
Q psy2949         104 LKGKIQHEILHSLGF  118 (367)
Q Consensus       104 ~~g~i~HEigHALG~  118 (367)
                      ..||++|||||+.|-
T Consensus       130 lagViAHEigHv~qr  144 (484)
T COG4783         130 LAGVIAHEIGHVAQR  144 (484)
T ss_pred             HHHHHHHHHHHHhhh
Confidence            349999999999873


No 86 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=51.04  E-value=7.4  Score=39.67  Aligned_cols=15  Identities=20%  Similarity=0.202  Sum_probs=12.2

Q ss_pred             cchhHHHHHhhhccc
Q psy2949         105 KGKIQHEILHSLGFW  119 (367)
Q Consensus       105 ~g~i~HEigHALG~~  119 (367)
                      .-|+.|||||||=..
T Consensus       243 v~tLfHE~GHa~H~~  257 (458)
T PF01432_consen  243 VETLFHEFGHAMHSL  257 (458)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhHHHHHH
Confidence            479999999998544


No 87 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=50.92  E-value=7.1  Score=38.43  Aligned_cols=12  Identities=25%  Similarity=0.382  Sum_probs=10.7

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      ..|+.||+|||+
T Consensus       155 v~tl~HE~GHa~  166 (365)
T cd06258         155 INTLFHEFGHAV  166 (365)
T ss_pred             HHHHHHHHhHHH
Confidence            478999999998


No 88 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=49.31  E-value=8.3  Score=33.95  Aligned_cols=19  Identities=21%  Similarity=0.136  Sum_probs=13.9

Q ss_pred             chhHHHHHhhhcccccccC
Q psy2949         106 GKIQHEILHSLGFWHEHTR  124 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~r  124 (367)
                      .+++||+||++=-.+.-.+
T Consensus        10 ~i~iHE~gH~~~A~~~G~~   28 (180)
T cd05709          10 SVTVHELGHALVARRLGVK   28 (180)
T ss_pred             HHHHHHHHHHHHHHHcCCC
Confidence            6899999999955554433


No 89 
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=49.29  E-value=9.7  Score=31.20  Aligned_cols=17  Identities=41%  Similarity=0.591  Sum_probs=15.4

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      -.++|=++|.||+-|+.
T Consensus        75 ~l~iHG~LHLlGyDH~~   91 (110)
T TIGR00043        75 HLTVHGLLHLLGYDHET   91 (110)
T ss_pred             HHHHHHHHHHcCCCCCC
Confidence            57999999999999985


No 90 
>PF15639 Tox-MPTase3:  Metallopeptidase toxin 3
Probab=47.98  E-value=25  Score=29.53  Aligned_cols=70  Identities=16%  Similarity=0.170  Sum_probs=40.3

Q ss_pred             HHHHHHHHHHHHHcCCcEEEEcC--CCCceEEEecCCCCccCCCCCCC---CCcccccc-----cC--CCc-----cc--
Q psy2949          43 KTLVENAIQDLRMHTCVRFVPRT--NQDTYLRFRNTGFGCASPVGYFP---IGTGIDIF-----LG--GRV-----CF--  103 (367)
Q Consensus        43 ~~~I~~A~~~w~~~tci~F~~~~--~~~~~I~i~~~~~gc~S~vG~~~---g~q~i~~~-----~l--~~~-----c~--  103 (367)
                      ...|-+|+..|-..+-+...+.=  .....|.|.....||  ..|+..   +.|.|.+.     .+  +.+     -.  
T Consensus        22 ~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~--A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~~~   99 (135)
T PF15639_consen   22 VPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGT--AYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNVYL   99 (135)
T ss_pred             ChHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccce--eeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceEEE
Confidence            46778888888665444333322  567789998776777  556532   24444310     01  111     11  


Q ss_pred             ccchhHHHHHh
Q psy2949         104 LKGKIQHEILH  114 (367)
Q Consensus       104 ~~g~i~HEigH  114 (367)
                      -..|++||+-|
T Consensus       100 v~~TlLHEliH  110 (135)
T PF15639_consen  100 VGSTLLHELIH  110 (135)
T ss_pred             eehHHHHHHHH
Confidence            12699999998


No 91 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=47.68  E-value=9.6  Score=33.05  Aligned_cols=11  Identities=27%  Similarity=0.625  Sum_probs=9.8

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      -+|+|||.|++
T Consensus       137 DVvaHEltHGV  147 (150)
T PF01447_consen  137 DVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHH
T ss_pred             ceeeecccccc
Confidence            59999999985


No 92 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=47.03  E-value=8.2  Score=41.05  Aligned_cols=75  Identities=13%  Similarity=-0.104  Sum_probs=37.2

Q ss_pred             HHHHHHHHHHHHHHHHcCCcEEEEcCCCCceEEEecC---CCC--ccCCCCCCCCCcccccccCCCcccccchhHHHHHh
Q psy2949          40 IYEKTLVENAIQDLRMHTCVRFVPRTNQDTYLRFRNT---GFG--CASPVGYFPIGTGIDIFLGGRVCFLKGKIQHEILH  114 (367)
Q Consensus        40 ~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~I~i~~~---~~g--c~S~vG~~~g~q~i~~~~l~~~c~~~g~i~HEigH  114 (367)
                      ++.++.|.++++.......=.|++. -...+|-+..+   ..|  |.+.......---||   +...-..+-|++||+||
T Consensus       315 ~ea~~~v~~~l~~lg~ey~~~~~~a-~~~~WiD~~~~~gKrsGaYs~~~~~~~~p~IlmN---~~gt~~dV~TLaHElGH  390 (598)
T COG1164         315 EEAKELVLKALAPLGPEYAKIARRA-FDERWIDVYPRKGKRSGAYSIGFYKGDHPFILMN---YDGTLRDVFTLAHELGH  390 (598)
T ss_pred             HHHHHHHHHHHHhhCHHHHHHHHHH-HhcCCeeccCCCCCCCCcccCCCCCCCCCeEEEe---CCCchhHHHHHHHHccH
Confidence            3566677777766644322212221 22356655322   233  333322222222333   33344456899999999


Q ss_pred             hhcc
Q psy2949         115 SLGF  118 (367)
Q Consensus       115 ALG~  118 (367)
                      ++=.
T Consensus       391 s~Hs  394 (598)
T COG1164         391 SVHS  394 (598)
T ss_pred             HHHH
Confidence            9853


No 93 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=46.47  E-value=9.4  Score=39.28  Aligned_cols=12  Identities=25%  Similarity=0.288  Sum_probs=10.6

Q ss_pred             chhHHHHHhhhc
Q psy2949         106 GKIQHEILHSLG  117 (367)
Q Consensus       106 g~i~HEigHALG  117 (367)
                      -|+.|||||||=
T Consensus       265 ~TLfHEfGHalH  276 (472)
T cd06455         265 ETFFHEFGHVIH  276 (472)
T ss_pred             HHHHHHHHHHHH
Confidence            689999999983


No 94 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=44.45  E-value=11  Score=38.79  Aligned_cols=15  Identities=20%  Similarity=0.224  Sum_probs=12.0

Q ss_pred             chhHHHHHhhhcccc
Q psy2949         106 GKIQHEILHSLGFWH  120 (367)
Q Consensus       106 g~i~HEigHALG~~H  120 (367)
                      -|+.|||||||=..-
T Consensus       250 ~TLfHEfGHalH~~l  264 (458)
T cd06457         250 ETLFHEMGHAMHSML  264 (458)
T ss_pred             HHHHHHHhHHHHHHH
Confidence            689999999985333


No 95 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=42.95  E-value=12  Score=33.44  Aligned_cols=19  Identities=21%  Similarity=-0.092  Sum_probs=13.1

Q ss_pred             chhHHHHHhhhcccccccC
Q psy2949         106 GKIQHEILHSLGFWHEHTR  124 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~r  124 (367)
                      .+++||+||.+=-.-..-+
T Consensus        11 ~v~iHElGH~~~Ar~~Gv~   29 (182)
T cd06163          11 LIFVHELGHFLVAKLFGVK   29 (182)
T ss_pred             HHHHHHHHHHHHHHHcCCe
Confidence            5799999998744444333


No 96 
>PF02130 UPF0054:  Uncharacterized protein family UPF0054;  InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=42.80  E-value=13  Score=31.85  Aligned_cols=16  Identities=44%  Similarity=0.713  Sum_probs=14.0

Q ss_pred             chhHHHHHhhhccccc
Q psy2949         106 GKIQHEILHSLGFWHE  121 (367)
Q Consensus       106 g~i~HEigHALG~~HE  121 (367)
                      -.++|=++|.||+-|+
T Consensus       110 ~l~vHG~LHLlGyDH~  125 (145)
T PF02130_consen  110 RLLVHGLLHLLGYDHE  125 (145)
T ss_dssp             HHHHHHHHHHTT-SST
T ss_pred             HHHHHHHHHHcCCCCC
Confidence            5799999999999999


No 97 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=42.19  E-value=12  Score=33.90  Aligned_cols=17  Identities=18%  Similarity=0.145  Sum_probs=13.1

Q ss_pred             cccccchhHHHHHhhhc
Q psy2949         101 VCFLKGKIQHEILHSLG  117 (367)
Q Consensus       101 ~c~~~g~i~HEigHALG  117 (367)
                      .....++++||.|||+.
T Consensus        89 Sia~~aVAAHEVGHAiQ  105 (226)
T COG2738          89 SIAAIAVAAHEVGHAIQ  105 (226)
T ss_pred             cHHHHHHHHHHhhHHHh
Confidence            33345899999999985


No 98 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=41.02  E-value=13  Score=35.58  Aligned_cols=17  Identities=24%  Similarity=0.350  Sum_probs=14.1

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      .+++||++||+=-..|.
T Consensus       137 svvvHElgHal~A~~~g  153 (277)
T cd06162         137 SGVVHEMGHGVAAVREQ  153 (277)
T ss_pred             HHHHHHHHHHHHHHHcC
Confidence            78999999998766664


No 99 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=40.64  E-value=14  Score=29.47  Aligned_cols=11  Identities=27%  Similarity=0.504  Sum_probs=10.2

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      .+++||+.|++
T Consensus        27 ~~l~HE~~H~~   37 (128)
T PF13485_consen   27 RVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHH
Confidence            89999999986


No 100
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=39.75  E-value=14  Score=37.54  Aligned_cols=13  Identities=31%  Similarity=0.345  Sum_probs=11.0

Q ss_pred             cchhHHHHHhhhc
Q psy2949         105 KGKIQHEILHSLG  117 (367)
Q Consensus       105 ~g~i~HEigHALG  117 (367)
                      .-|+.|||||||=
T Consensus       209 v~tLfHEfGHalH  221 (422)
T cd06456         209 VTTLFHEFGHALH  221 (422)
T ss_pred             HHHHHHHHHHHHH
Confidence            3689999999984


No 101
>COG0319 Predicted metal-dependent hydrolase [General function prediction only]
Probab=39.39  E-value=16  Score=31.80  Aligned_cols=18  Identities=44%  Similarity=0.586  Sum_probs=16.2

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      -.++|=++|.||+-|+..
T Consensus       110 ~l~vHG~LHLlGYDH~~~  127 (153)
T COG0319         110 HLTIHGILHLLGYDHEED  127 (153)
T ss_pred             HHHHHHHHHHccCCCCCc
Confidence            579999999999999975


No 102
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=39.36  E-value=14  Score=35.02  Aligned_cols=21  Identities=24%  Similarity=0.298  Sum_probs=16.8

Q ss_pred             chhHHHHHhhhcccccccCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPD  126 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpD  126 (367)
                      .+++||+||++=-.++..+.+
T Consensus       120 sv~iHElgHa~~Ar~~G~~V~  140 (263)
T cd06159         120 GVVVHELSHGILARVEGIKVK  140 (263)
T ss_pred             HHHHHHHHHHHHHHHcCCEEC
Confidence            689999999998777765543


No 103
>KOG3658|consensus
Probab=39.27  E-value=10  Score=40.29  Aligned_cols=16  Identities=38%  Similarity=0.499  Sum_probs=14.5

Q ss_pred             chhHHHHHhhhccccc
Q psy2949         106 GKIQHEILHSLGFWHE  121 (367)
Q Consensus       106 g~i~HEigHALG~~HE  121 (367)
                      -|++|||||.+|=.|.
T Consensus       394 lt~AHEiGHNfGSpHD  409 (764)
T KOG3658|consen  394 LTLAHEIGHNFGSPHD  409 (764)
T ss_pred             eeehhhhccccCCCCC
Confidence            4899999999999886


No 104
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=39.08  E-value=15  Score=33.03  Aligned_cols=13  Identities=23%  Similarity=0.250  Sum_probs=11.0

Q ss_pred             cchhHHHHHhhhc
Q psy2949         105 KGKIQHEILHSLG  117 (367)
Q Consensus       105 ~g~i~HEigHALG  117 (367)
                      .++++||+||...
T Consensus        90 ~aVlaHElgH~~~  102 (226)
T PF01435_consen   90 AAVLAHELGHIKH  102 (226)
T ss_dssp             HHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHc
Confidence            3899999999763


No 105
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=38.88  E-value=15  Score=32.74  Aligned_cols=12  Identities=25%  Similarity=0.296  Sum_probs=10.6

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .+.++||++||+
T Consensus        10 ~~i~~HE~aHa~   21 (181)
T cd06158          10 LAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHH
Confidence            378999999998


No 106
>PRK02870 heat shock protein HtpX; Provisional
Probab=38.76  E-value=12  Score=36.89  Aligned_cols=15  Identities=27%  Similarity=0.470  Sum_probs=12.1

Q ss_pred             cchhHHHHHhhhcccccc
Q psy2949         105 KGKIQHEILHSLGFWHEH  122 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh  122 (367)
                      .|+++||+||.   .|.|
T Consensus       174 ~aVlAHELgHi---k~~d  188 (336)
T PRK02870        174 QAVMAHELSHI---RHGD  188 (336)
T ss_pred             HHHHHHHHHHH---Hccc
Confidence            49999999997   5544


No 107
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.05  E-value=16  Score=33.25  Aligned_cols=21  Identities=24%  Similarity=0.166  Sum_probs=16.8

Q ss_pred             chhHHHHHhhhcccccccCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPD  126 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpD  126 (367)
                      .+++||+||++=..+...+..
T Consensus        40 ~v~iHElgH~~~A~~~G~~v~   60 (208)
T cd06161          40 SVLLHELGHALVARRYGIRVR   60 (208)
T ss_pred             HHHHHHHHHHHHHHHcCCCcc
Confidence            689999999997777766654


No 108
>PRK00016 metal-binding heat shock protein; Provisional
Probab=37.64  E-value=17  Score=31.75  Aligned_cols=17  Identities=41%  Similarity=0.505  Sum_probs=15.3

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      -.++|=++|.||+-|+.
T Consensus       115 ~l~iHG~LHLlGYDH~~  131 (159)
T PRK00016        115 HLTVHGILHLLGYDHIE  131 (159)
T ss_pred             HHHHHhhHHhcCCCCCC
Confidence            57999999999999974


No 109
>PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional
Probab=35.64  E-value=20  Score=33.81  Aligned_cols=16  Identities=38%  Similarity=0.580  Sum_probs=15.0

Q ss_pred             chhHHHHHhhhccccc
Q psy2949         106 GKIQHEILHSLGFWHE  121 (367)
Q Consensus       106 g~i~HEigHALG~~HE  121 (367)
                      -.++|=++|.|||-||
T Consensus       217 ~LlIHGlLHLLGYDHe  232 (258)
T PRK13963        217 HLLVHGALHAQGYDHE  232 (258)
T ss_pred             HHHHHHHHHHcCCCCC
Confidence            5799999999999999


No 110
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=35.35  E-value=18  Score=36.56  Aligned_cols=17  Identities=35%  Similarity=0.296  Sum_probs=13.8

Q ss_pred             chhHHHHHhhhcccccccCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rp  125 (367)
                      -+++||+||||   |||+-|
T Consensus       161 ~t~iHE~GHal---ye~~l~  177 (396)
T cd06460         161 FSTIHETGHAL---YEQGLP  177 (396)
T ss_pred             HHHHHHhhHHH---HHhcCC
Confidence            58999999997   777554


No 111
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=35.01  E-value=20  Score=30.70  Aligned_cols=15  Identities=20%  Similarity=0.445  Sum_probs=12.2

Q ss_pred             cchhHHHHHhhhccc
Q psy2949         105 KGKIQHEILHSLGFW  119 (367)
Q Consensus       105 ~g~i~HEigHALG~~  119 (367)
                      ..||+||+.|++=+.
T Consensus        60 ~~~l~HEm~H~~~~~   74 (146)
T smart00731       60 RETLLHELCHAALYL   74 (146)
T ss_pred             HhhHHHHHHHHHHHH
Confidence            379999999988553


No 112
>KOG2921|consensus
Probab=34.92  E-value=18  Score=36.22  Aligned_cols=33  Identities=24%  Similarity=0.258  Sum_probs=21.4

Q ss_pred             chhHHHHHhhhcccccccCCCCCcceEEeeCCcCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPDRDQFVRVLRENIGPG  141 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpDRD~yv~I~~~ni~~~  141 (367)
                      ..++||+||||--.-|+-+-   +++-|..-.|.|+
T Consensus       133 ~~vvHElGHalAA~segV~v---ngfgIfi~aiyPg  165 (484)
T KOG2921|consen  133 TVVVHELGHALAAASEGVQV---NGFGIFIAAIYPG  165 (484)
T ss_pred             HHHHHHhhHHHHHHhcCcee---eeeEEEEEEEcCc
Confidence            57899999999877777543   3344444444443


No 113
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA  or ssRNA in a sequence-specific manner.
Probab=34.72  E-value=41  Score=24.37  Aligned_cols=29  Identities=24%  Similarity=0.453  Sum_probs=25.7

Q ss_pred             cchhHHHHHhhhcccccccCCCCCcceEE
Q psy2949         105 KGKIQHEILHSLGFWHEHTRPDRDQFVRV  133 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I  133 (367)
                      ...++||+..+.||.=|..-+.-+.+|-|
T Consensus        29 ~R~~vHdla~~~gl~SeS~d~Ep~R~V~v   57 (59)
T cd06006          29 QRAFIHELAKDYGLYSESQDPEPKRSVFV   57 (59)
T ss_pred             HHHHHHHHHHHcCCeeEecCCCCCcEEEE
Confidence            37899999999999999999888888766


No 114
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=34.71  E-value=20  Score=30.66  Aligned_cols=16  Identities=19%  Similarity=0.397  Sum_probs=13.3

Q ss_pred             cchhHHHHHhhhcccc
Q psy2949         105 KGKIQHEILHSLGFWH  120 (367)
Q Consensus       105 ~g~i~HEigHALG~~H  120 (367)
                      ..|++||+.||+=+..
T Consensus        61 ~~tL~HEm~H~~~~~~   76 (157)
T PF10263_consen   61 IDTLLHEMAHAAAYVF   76 (157)
T ss_pred             HHHHHHHHHHHHhhhc
Confidence            4899999999986555


No 115
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=34.45  E-value=20  Score=32.80  Aligned_cols=14  Identities=21%  Similarity=0.340  Sum_probs=11.4

Q ss_pred             chhHHHHHhhhccc
Q psy2949         106 GKIQHEILHSLGFW  119 (367)
Q Consensus       106 g~i~HEigHALG~~  119 (367)
                      .+++||+|||+=..
T Consensus        95 siLAHE~mHa~Lrl  108 (212)
T PF12315_consen   95 SILAHELMHAWLRL  108 (212)
T ss_pred             hHHHHHHHHHHhcc
Confidence            58899999998544


No 116
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=33.81  E-value=21  Score=33.12  Aligned_cols=21  Identities=24%  Similarity=0.094  Sum_probs=16.3

Q ss_pred             chhHHHHHhhhcccccccCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPD  126 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpD  126 (367)
                      .+++||++|++=..+...+.+
T Consensus        55 ~v~iHElgH~~~A~~~G~~v~   75 (227)
T cd06164          55 SVLLHELGHSLVARRYGIPVR   75 (227)
T ss_pred             HHHHHHHHHHHHHHHcCCeEC
Confidence            688999999987666666554


No 117
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=32.30  E-value=24  Score=27.03  Aligned_cols=14  Identities=21%  Similarity=0.389  Sum_probs=11.6

Q ss_pred             ccchhHHHHHhhhc
Q psy2949         104 LKGKIQHEILHSLG  117 (367)
Q Consensus       104 ~~g~i~HEigHALG  117 (367)
                      ....++||+.|++-
T Consensus        61 ~~~llaHEl~Hv~Q   74 (79)
T PF13699_consen   61 GRALLAHELAHVVQ   74 (79)
T ss_pred             cchhHhHHHHHHHh
Confidence            35789999999973


No 118
>PRK10911 oligopeptidase A; Provisional
Probab=32.26  E-value=21  Score=38.67  Aligned_cols=11  Identities=36%  Similarity=0.377  Sum_probs=10.2

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      -|+.|||||||
T Consensus       465 ~tlfHEfGHal  475 (680)
T PRK10911        465 ITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHhHHH
Confidence            68999999997


No 119
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=31.97  E-value=20  Score=30.77  Aligned_cols=14  Identities=36%  Similarity=0.665  Sum_probs=11.3

Q ss_pred             ccchhHHHHHh--hhc
Q psy2949         104 LKGKIQHEILH--SLG  117 (367)
Q Consensus       104 ~~g~i~HEigH--ALG  117 (367)
                      ..+|++||+-|  ++|
T Consensus        96 k~~TLiHE~SHf~~~~  111 (148)
T PF14521_consen   96 KEGTLIHEWSHFTAVG  111 (148)
T ss_dssp             HHHHHHHHHHHSCCCT
T ss_pred             hHHHHHHhhhhhhhhc
Confidence            35999999999  644


No 120
>PRK03982 heat shock protein HtpX; Provisional
Probab=31.74  E-value=22  Score=34.03  Aligned_cols=12  Identities=42%  Similarity=0.642  Sum_probs=10.2

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .++++||+||.-
T Consensus       126 ~AVlAHElgHi~  137 (288)
T PRK03982        126 EGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHH
Confidence            499999999953


No 121
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=30.28  E-value=29  Score=34.56  Aligned_cols=21  Identities=19%  Similarity=0.303  Sum_probs=13.6

Q ss_pred             CCCccccc---chhHHHHHhhhcc
Q psy2949          98 GGRVCFLK---GKIQHEILHSLGF  118 (367)
Q Consensus        98 l~~~c~~~---g~i~HEigHALG~  118 (367)
                      +...|...   ++++|||-|+.+.
T Consensus       288 ~~~~C~~qDqatt~LHE~TH~~~V  311 (359)
T PF02102_consen  288 LSNRCHAQDQATTTLHEMTHAPAV  311 (359)
T ss_dssp             S--STT---HHHHHHHHHHT-TTT
T ss_pred             ccccccCCCccchhhhhhhccccc
Confidence            45677654   8999999999875


No 122
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=30.10  E-value=24  Score=33.37  Aligned_cols=12  Identities=25%  Similarity=0.354  Sum_probs=10.4

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .+++.||+||..
T Consensus       158 ~aVlaHElgHi~  169 (302)
T COG0501         158 EAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHh
Confidence            399999999964


No 123
>KOG3538|consensus
Probab=29.98  E-value=13  Score=41.15  Aligned_cols=21  Identities=29%  Similarity=0.387  Sum_probs=17.2

Q ss_pred             cchhHHHHHhhhcccccccCC
Q psy2949         105 KGKIQHEILHSLGFWHEHTRP  125 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rp  125 (367)
                      .-+|+||+||.+|+.|--.-.
T Consensus       318 aftiahE~GH~~gm~hd~~~~  338 (845)
T KOG3538|consen  318 AFTIAHELGHNFGMEHDGRGN  338 (845)
T ss_pred             ceeeeeccccccCcccCCcCC
Confidence            368999999999999984433


No 124
>PRK03001 M48 family peptidase; Provisional
Probab=29.76  E-value=25  Score=33.57  Aligned_cols=12  Identities=33%  Similarity=0.667  Sum_probs=10.3

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .++++||+||.-
T Consensus       125 ~aVlAHElgHi~  136 (283)
T PRK03001        125 RGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHh
Confidence            499999999964


No 125
>PRK03072 heat shock protein HtpX; Provisional
Probab=29.44  E-value=25  Score=33.70  Aligned_cols=21  Identities=5%  Similarity=0.048  Sum_probs=13.4

Q ss_pred             CHHHHHHHHHHHHHHHHcCCc
Q psy2949          39 TIYEKTLVENAIQDLRMHTCV   59 (367)
Q Consensus        39 ~~~~~~~I~~A~~~w~~~tci   59 (367)
                      ++.+...+.+.+++..+...+
T Consensus        65 ~~~~~p~L~~~v~~la~~~g~   85 (288)
T PRK03072         65 SEVQAPAMYRIVRELSTAARQ   85 (288)
T ss_pred             ChhhhHHHHHHHHHHHHHcCC
Confidence            344555677777777776665


No 126
>PRK01345 heat shock protein HtpX; Provisional
Probab=29.38  E-value=26  Score=34.23  Aligned_cols=12  Identities=33%  Similarity=0.581  Sum_probs=10.5

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .++++|||||..
T Consensus       125 ~aVlAHElgHi~  136 (317)
T PRK01345        125 AGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHH
Confidence            499999999965


No 127
>PRK05457 heat shock protein HtpX; Provisional
Probab=28.29  E-value=28  Score=33.43  Aligned_cols=11  Identities=36%  Similarity=0.555  Sum_probs=9.7

Q ss_pred             cchhHHHHHhh
Q psy2949         105 KGKIQHEILHS  115 (367)
Q Consensus       105 ~g~i~HEigHA  115 (367)
                      .|+++||+||.
T Consensus       135 ~aVlAHElgHi  145 (284)
T PRK05457        135 EAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHH
Confidence            49999999994


No 128
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=27.96  E-value=28  Score=37.70  Aligned_cols=18  Identities=17%  Similarity=0.169  Sum_probs=13.0

Q ss_pred             HHHHHHHHcCCcEEEEcC
Q psy2949          48 NAIQDLRMHTCVRFVPRT   65 (367)
Q Consensus        48 ~A~~~w~~~tci~F~~~~   65 (367)
                      -.+...++-=.|+|++++
T Consensus       369 Glf~l~~~LfGi~f~~~~  386 (681)
T PRK10280        369 GVFWTANQLFGIKFVERF  386 (681)
T ss_pred             hHHHHHHHHcCeEEEECC
Confidence            344556777899999865


No 129
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=27.94  E-value=29  Score=31.05  Aligned_cols=22  Identities=18%  Similarity=0.007  Sum_probs=18.0

Q ss_pred             chhHHHHHhhhcccccccCCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPDR  127 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpDR  127 (367)
                      ..+.||+||++-......+..+
T Consensus        43 ~l~iHElgH~~~A~~~G~~~~~   64 (183)
T cd06160          43 ILGIHEMGHYLAARRHGVKASL   64 (183)
T ss_pred             HHHHHHHHHHHHHHHCCCCccc
Confidence            5789999999988887777664


No 130
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=27.78  E-value=30  Score=32.67  Aligned_cols=21  Identities=10%  Similarity=0.077  Sum_probs=14.1

Q ss_pred             CCCCHHHHHHHHHHHHHHHHc
Q psy2949          36 SEFTIYEKTLVENAIQDLRMH   56 (367)
Q Consensus        36 ~~~~~~~~~~I~~A~~~w~~~   56 (367)
                      ..+++++++.++.-+..+-..
T Consensus        39 ~~L~~~~~~~l~~~~~~Fl~~   59 (253)
T PF06167_consen   39 RRLSPEEQRRLRQRVQLFLAE   59 (253)
T ss_dssp             TT--HHHHHHHHHHHHHHHHH
T ss_pred             hcCCHHHHHHHHHHHHHHhcc
Confidence            466888888888877776554


No 131
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=26.92  E-value=31  Score=33.32  Aligned_cols=11  Identities=27%  Similarity=0.274  Sum_probs=9.3

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      .+|+||+||.+
T Consensus       172 yVlAHEyGHHV  182 (292)
T PF04228_consen  172 YVLAHEYGHHV  182 (292)
T ss_pred             HHHHHHHHHHH
Confidence            48999999954


No 132
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=26.63  E-value=37  Score=24.68  Aligned_cols=17  Identities=29%  Similarity=0.450  Sum_probs=15.6

Q ss_pred             cchhHHHHHhhhccccc
Q psy2949         105 KGKIQHEILHSLGFWHE  121 (367)
Q Consensus       105 ~g~i~HEigHALG~~HE  121 (367)
                      ...++|.+.|-||+.|.
T Consensus        30 eRriih~la~~lGL~~~   46 (60)
T cd02639          30 ERRIVHLLASRLGLNHV   46 (60)
T ss_pred             HHHHHHHHHHHcCCceE
Confidence            37899999999999998


No 133
>PRK04897 heat shock protein HtpX; Provisional
Probab=26.00  E-value=31  Score=33.29  Aligned_cols=11  Identities=55%  Similarity=0.752  Sum_probs=9.7

Q ss_pred             cchhHHHHHhh
Q psy2949         105 KGKIQHEILHS  115 (367)
Q Consensus       105 ~g~i~HEigHA  115 (367)
                      .++++||+||.
T Consensus       138 ~aVlAHElgHi  148 (298)
T PRK04897        138 EGVIGHEISHI  148 (298)
T ss_pred             HHHHHHHHHHH
Confidence            49999999994


No 134
>KOG2556|consensus
Probab=25.17  E-value=34  Score=35.06  Aligned_cols=19  Identities=32%  Similarity=0.450  Sum_probs=15.6

Q ss_pred             chhHHHHHhhhcccccccC
Q psy2949         106 GKIQHEILHSLGFWHEHTR  124 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~r  124 (367)
                      .+|.||+.|||||.--+-.
T Consensus       257 stVkhE~~halgfsaGl~a  275 (666)
T KOG2556|consen  257 STVKHERDHALGFSAGLVA  275 (666)
T ss_pred             HHHHHHHHhhhcccceeeE
Confidence            5899999999999766543


No 135
>PRK02391 heat shock protein HtpX; Provisional
Probab=25.05  E-value=34  Score=33.01  Aligned_cols=11  Identities=27%  Similarity=0.543  Sum_probs=9.7

Q ss_pred             cchhHHHHHhh
Q psy2949         105 KGKIQHEILHS  115 (367)
Q Consensus       105 ~g~i~HEigHA  115 (367)
                      .++++||+||.
T Consensus       134 ~aVlaHElgHi  144 (296)
T PRK02391        134 EAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHH
Confidence            49999999994


No 136
>PRK01265 heat shock protein HtpX; Provisional
Probab=24.40  E-value=34  Score=33.51  Aligned_cols=11  Identities=36%  Similarity=0.368  Sum_probs=9.7

Q ss_pred             cchhHHHHHhh
Q psy2949         105 KGKIQHEILHS  115 (367)
Q Consensus       105 ~g~i~HEigHA  115 (367)
                      .++++||+||.
T Consensus       141 ~aVlAHElgHi  151 (324)
T PRK01265        141 KAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHH
Confidence            49999999994


No 137
>PF05506 DUF756:  Domain of unknown function (DUF756);  InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=23.77  E-value=77  Score=24.42  Aligned_cols=41  Identities=10%  Similarity=0.056  Sum_probs=35.7

Q ss_pred             cEEEEcccCCCccccchhhccccccccceeeecCCceeeec
Q psy2949         326 WFYKIGGPSDDRKIMDKFFNNTIHFSKMKVGRDNSYTYRYA  366 (367)
Q Consensus       326 c~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  366 (367)
                      =.|.+.++.+++...-=-+...-+|=..+|..+++|..|||
T Consensus        47 ~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~~~~~F~rr~a   87 (89)
T PF05506_consen   47 GPWTYTVAAGQTVSLTWPLAASGGWYDLTVTGPNGFLRRFA   87 (89)
T ss_pred             CCEEEEECCCCEEEEEEeecCCCCcEEEEEEcCCCEEEEec
Confidence            34899999998877776678899999999999999999987


No 138
>PF14891 Peptidase_M91:  Effector protein
Probab=23.45  E-value=36  Score=30.00  Aligned_cols=18  Identities=17%  Similarity=0.410  Sum_probs=14.8

Q ss_pred             chhHHHHHhhhccccccc
Q psy2949         106 GKIQHEILHSLGFWHEHT  123 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~  123 (367)
                      -++.|||.||.-+.+=-.
T Consensus       105 v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  105 VVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHCCCC
Confidence            579999999997777655


No 139
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=22.75  E-value=82  Score=21.31  Aligned_cols=30  Identities=17%  Similarity=0.254  Sum_probs=25.3

Q ss_pred             cchhHHHHHhhhcccccccCCCCCcceEEe
Q psy2949         105 KGKIQHEILHSLGFWHEHTRPDRDQFVRVL  134 (367)
Q Consensus       105 ~g~i~HEigHALG~~HEh~rpDRD~yv~I~  134 (367)
                      ..-++|++....|+.|+..-.+-+.+|.|.
T Consensus        29 ~R~~vH~la~~~~L~s~s~g~~~~r~v~i~   58 (59)
T cd02325          29 ERKLIHDLAEYYGLKSESEGEGPNRRVVIT   58 (59)
T ss_pred             HHHHHHHHHHHCCCEEEEecCCCCcEEEEe
Confidence            378999999999999999888877777663


No 140
>PF08810 KapB:  Kinase associated protein B;  InterPro: IPR014916 This bacterial protein forms an anti-parallel beta sheet with an extending alpha helical region. ; PDB: 1Y71_B.
Probab=22.38  E-value=21  Score=29.27  Aligned_cols=53  Identities=19%  Similarity=0.334  Sum_probs=18.6

Q ss_pred             CCccCCcccc-----ccccccccCCCCc-CCC-eeEEEEcCCCCCHHHHHHHHHHHHHHH
Q psy2949           2 EGDILIPRER-----DSRNLVLYQAQLW-PDK-TVYYNFEDSEFTIYEKTLVENAIQDLR   54 (367)
Q Consensus         2 ~GDi~~~~q~-----~~r~a~~~~~~~W-p~~-~VpY~~~~~~~~~~~~~~I~~A~~~w~   54 (367)
                      |||...|.|.     ..|+|++..++.| |.. +-||.-+-+++.++.+++|.+.++..+
T Consensus        31 QGDLHnP~q~dvpfFHERkALs~~Ek~~ip~~~vk~y~geip~Y~~SLq~A~~~~~~~L~   90 (112)
T PF08810_consen   31 QGDLHNPKQTDVPFFHERKALSYREKTNIPKSMVKPYEGEIPDYEESLQQALEKLKEKLK   90 (112)
T ss_dssp             ----------------------TT-EEEEEGGGEEE--S----HHHHHHHHHHHHHHTTT
T ss_pred             ccCCCCCCcCCCcceehhhhcchHHHhcCCHHhcccCCCCCCChHHHHHHHHHHHHHHHh
Confidence            7998865553     3688999888888 444 557766545566666766666655543


No 141
>PRK04351 hypothetical protein; Provisional
Probab=21.70  E-value=47  Score=28.77  Aligned_cols=12  Identities=42%  Similarity=0.584  Sum_probs=10.5

Q ss_pred             cchhHHHHHhhh
Q psy2949         105 KGKIQHEILHSL  116 (367)
Q Consensus       105 ~g~i~HEigHAL  116 (367)
                      .+||.||+.|++
T Consensus        62 ~~vv~HElcH~~   73 (149)
T PRK04351         62 IGIIKHELCHYH   73 (149)
T ss_pred             HhhHHHHHHHHH
Confidence            489999999975


No 142
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=21.22  E-value=44  Score=29.91  Aligned_cols=13  Identities=23%  Similarity=0.409  Sum_probs=10.4

Q ss_pred             chhHHHHHhhhcc
Q psy2949         106 GKIQHEILHSLGF  118 (367)
Q Consensus       106 g~i~HEigHALG~  118 (367)
                      .+++|||+||+.-
T Consensus        38 ~ilahel~hafd~   50 (206)
T PF01431_consen   38 FILAHELMHAFDP   50 (206)
T ss_dssp             HHHHHHHHHCTST
T ss_pred             HHHHHHHHHHHHH
Confidence            4799999997754


No 143
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=21.08  E-value=45  Score=33.80  Aligned_cols=11  Identities=36%  Similarity=0.250  Sum_probs=9.4

Q ss_pred             chhHHHHHhhh
Q psy2949         106 GKIQHEILHSL  116 (367)
Q Consensus       106 g~i~HEigHAL  116 (367)
                      -+++||+||.|
T Consensus        16 ~v~~HE~gH~~   26 (420)
T TIGR00054        16 LIFVHELGHFL   26 (420)
T ss_pred             HHHHHhHHHHH
Confidence            57899999965


No 144
>PF07108 PipA:  PipA protein;  InterPro: IPR010777 This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis [].
Probab=21.02  E-value=94  Score=27.24  Aligned_cols=52  Identities=17%  Similarity=0.056  Sum_probs=27.6

Q ss_pred             CceEEEecCCCCccCCCCC---CCCCcccccccCCCcccccchhHHHHHhhhccc-ccccCCCCC
Q psy2949          68 DTYLRFRNTGFGCASPVGY---FPIGTGIDIFLGGRVCFLKGKIQHEILHSLGFW-HEHTRPDRD  128 (367)
Q Consensus        68 ~~~I~i~~~~~gc~S~vG~---~~g~q~i~~~~l~~~c~~~g~i~HEigHALG~~-HEh~rpDRD  128 (367)
                      ...|-..-++..|.++--.   ..|+|.+.         .....+||+.|||-.. -+-...-|+
T Consensus       122 rkvIcLnlDd~dd~~~~E~Yes~~G~qlFd---------~~RsFiHEiVHALt~L~Dke~nhprg  177 (200)
T PF07108_consen  122 RKVICLNLDDSDDDSYPEHYESNEGPQLFD---------TKRSFIHEIVHALTHLQDKEDNHPRG  177 (200)
T ss_pred             ceEEEeecCCcccccchhhhhccCCccchh---------HHHHHHHHHHHHHhhcccccccCCCC
Confidence            3444444455566555433   22344332         2467899999999633 333333344


No 145
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=21.01  E-value=49  Score=27.20  Aligned_cols=15  Identities=33%  Similarity=0.616  Sum_probs=12.8

Q ss_pred             chhHHHHHhhhcccccc
Q psy2949         106 GKIQHEILHSLGFWHEH  122 (367)
Q Consensus       106 g~i~HEigHALG~~HEh  122 (367)
                      -|++|||-|++.  ||+
T Consensus        78 ktllhefrh~mQ--h~~   92 (126)
T PF13058_consen   78 KTLLHEFRHAMQ--HEK   92 (126)
T ss_pred             HHHHHHHHHHHH--hhh
Confidence            699999999987  655


No 146
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=20.70  E-value=44  Score=31.74  Aligned_cols=17  Identities=29%  Similarity=0.307  Sum_probs=14.3

Q ss_pred             ccchhHHHHHhhhcccc
Q psy2949         104 LKGKIQHEILHSLGFWH  120 (367)
Q Consensus       104 ~~g~i~HEigHALG~~H  120 (367)
                      ..-+++|||.|.|-+.+
T Consensus       142 g~NVvIHEFAH~LDm~~  158 (260)
T PRK15410        142 GFNLIIHEVAHKLDMRN  158 (260)
T ss_pred             CcchhHhHHHhHhhhhc
Confidence            34799999999998866


No 147
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=20.47  E-value=53  Score=31.27  Aligned_cols=20  Identities=40%  Similarity=0.677  Sum_probs=15.8

Q ss_pred             chhHHHHHhhhcccccccCCC
Q psy2949         106 GKIQHEILHSLGFWHEHTRPD  126 (367)
Q Consensus       106 g~i~HEigHALG~~HEh~rpD  126 (367)
                      ++++||++|.+ |.|-.-+..
T Consensus        62 ~~l~HevlH~~-~~H~~r~~~   81 (292)
T PF13203_consen   62 GLLLHEVLHCL-LRHPWRRGG   81 (292)
T ss_pred             HHHHHHHHHHH-ccchhhhcc
Confidence            78999999976 666666665


Done!