Query         psy1890
Match_columns 242
No_of_seqs    189 out of 1300
Neff          7.7 
Searched_HMMs 46136
Date          Fri Aug 16 18:53:08 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1890.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1890hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd04282 ZnMc_meprin Zinc-depen 100.0 7.8E-68 1.7E-72  449.4  19.5  204    5-217    24-230 (230)
  2 cd04281 ZnMc_BMP1_TLD Zinc-dep 100.0   4E-67 8.8E-72  437.9  19.8  199   17-218     2-200 (200)
  3 cd04283 ZnMc_hatching_enzyme Z 100.0   1E-61 2.2E-66  399.5  17.2  177   29-217     5-182 (182)
  4 PF01400 Astacin:  Astacin (Pep 100.0 6.4E-62 1.4E-66  406.0  15.4  191   23-219     1-191 (191)
  5 KOG3714|consensus              100.0 2.2E-60 4.7E-65  437.3  17.2  204   12-221    69-273 (411)
  6 cd04280 ZnMc_astacin_like Zinc 100.0 1.5E-59 3.3E-64  388.3  17.3  180   27-215     1-180 (180)
  7 cd04327 ZnMc_MMP_like_3 Zinc-d 100.0 7.9E-45 1.7E-49  304.7  13.3  170   23-216     1-198 (198)
  8 smart00235 ZnMc Zinc-dependent 100.0   8E-33 1.7E-37  219.5  13.5  136   22-166     2-140 (140)
  9 cd04268 ZnMc_MMP_like Zinc-dep 100.0   1E-32 2.2E-37  224.0  10.8  148   28-215     2-165 (165)
 10 cd00203 ZnMc Zinc-dependent me  99.9 2.5E-22 5.5E-27  163.0  12.8  148   29-215     2-167 (167)
 11 cd04278 ZnMc_MMP Zinc-dependen  99.8 1.2E-19 2.6E-24  146.7  11.3   98   24-123     1-125 (157)
 12 cd04277 ZnMc_serralysin_like Z  99.8 1.8E-17   4E-22  137.4  13.7  142   37-216    30-186 (186)
 13 PF00413 Peptidase_M10:  Matrix  99.7 2.2E-15 4.9E-20  120.6  14.1   98   24-123     1-123 (154)
 14 cd04279 ZnMc_MMP_like_1 Zinc-d  99.7 1.2E-15 2.6E-20  123.1  11.3   98   29-126     3-125 (156)
 15 cd04276 ZnMc_MMP_like_2 Zinc-d  99.1 3.9E-09 8.4E-14   88.4  14.0  158   26-216     6-196 (197)
 16 KOG1565|consensus               99.0 1.3E-09 2.8E-14  102.1  10.0  100   21-122   102-228 (469)
 17 PF12388 Peptidase_M57:  Dual-a  99.0 1.1E-08 2.5E-13   85.7  13.5  160   22-215    31-211 (211)
 18 PF02031 Peptidase_M7:  Strepto  98.4 4.9E-07 1.1E-11   69.6   4.6   85   29-120     3-92  (132)
 19 PF13583 Reprolysin_4:  Metallo  97.4 0.00021 4.5E-09   60.3   4.8   20  105-124   137-156 (206)
 20 COG5549 Predicted Zn-dependent  97.4 0.00037 8.1E-09   58.2   5.9   77   41-120   104-202 (236)
 21 PF13574 Reprolysin_2:  Metallo  96.6  0.0018 3.9E-08   53.1   2.9   21  105-125   111-131 (173)
 22 PF13582 Reprolysin_3:  Metallo  96.6   0.001 2.2E-08   51.0   1.4   19  104-122   106-124 (124)
 23 PF05548 Peptidase_M11:  Gameto  96.5  0.0055 1.2E-07   55.0   5.7   78   41-124    88-169 (314)
 24 PF05572 Peptidase_M43:  Pregna  96.3  0.0016 3.5E-08   52.5   1.4   21  106-126    70-90  (154)
 25 cd04267 ZnMc_ADAM_like Zinc-de  96.1  0.0032   7E-08   52.1   2.2   19  106-124   134-152 (192)
 26 PF13688 Reprolysin_5:  Metallo  96.0  0.0037 7.9E-08   51.8   1.8   23  105-127   142-164 (196)
 27 cd04275 ZnMc_pappalysin_like Z  95.4   0.002 4.3E-08   55.2  -1.8   22  105-126   137-158 (225)
 28 cd04272 ZnMc_salivary_gland_MP  95.2   0.014 2.9E-07   49.6   2.6   21  105-125   145-165 (220)
 29 cd04269 ZnMc_adamalysin_II_lik  95.2  0.0087 1.9E-07   49.6   1.3   19  106-124   132-150 (194)
 30 COG1913 Predicted Zn-dependent  95.0  0.0099 2.1E-07   48.5   1.1   19  107-125   126-144 (181)
 31 cd04270 ZnMc_TACE_like Zinc-de  94.7   0.015 3.2E-07   50.4   1.4   17  107-123   169-185 (244)
 32 cd04271 ZnMc_ADAM_fungal Zinc-  94.7   0.011 2.5E-07   50.6   0.7   18  107-124   147-164 (228)
 33 PRK13267 archaemetzincin-like   94.5   0.018 3.9E-07   47.6   1.4   15  107-121   127-141 (179)
 34 cd04273 ZnMc_ADAMTS_like Zinc-  94.3  0.0087 1.9E-07   50.3  -0.9   20  105-124   140-159 (207)
 35 PF07998 Peptidase_M54:  Peptid  94.3   0.021 4.5E-07   47.7   1.4   18  107-124   147-164 (194)
 36 PF01421 Reprolysin:  Reprolysi  93.7    0.03 6.6E-07   46.5   1.4   19  106-124   132-150 (199)
 37 PF11350 DUF3152:  Protein of u  93.0   0.038 8.3E-07   46.2   0.8   98   28-126    29-160 (203)
 38 TIGR03296 M6dom_TIGR03296 M6 f  93.0   0.028   6E-07   49.8  -0.1   20  105-124   165-184 (286)
 39 PF10462 Peptidase_M66:  Peptid  92.8   0.046 9.9E-07   48.8   1.1   16  106-121   194-209 (305)
 40 cd02641 R3H_Smubp-2_like R3H d  92.5    0.16 3.5E-06   34.2   3.2   40   95-134    18-58  (60)
 41 PF12044 Metallopep:  Putative   92.3   0.059 1.3E-06   50.2   1.1   20  106-125   317-336 (423)
 42 PF05547 Peptidase_M6:  Immune   89.6    0.14 3.1E-06   50.2   1.0   16  105-120   221-236 (645)
 43 TIGR02290 M3_fam_3 oligoendope  89.6    0.16 3.5E-06   49.4   1.4   77   41-120   311-390 (587)
 44 PF01457 Peptidase_M8:  Leishma  89.4    0.17 3.7E-06   48.6   1.3   17  106-122   211-227 (521)
 45 cd06459 M3B_Oligoendopeptidase  89.1    0.73 1.6E-05   42.5   5.3   25   96-120   213-237 (427)
 46 TIGR00181 pepF oligoendopeptid  89.0    0.74 1.6E-05   44.8   5.5   75   41-119   315-392 (591)
 47 PTZ00337 surface protease GP63  88.1    0.23 4.9E-06   48.2   1.2   17  105-121   229-245 (567)
 48 KOG4525|consensus               87.2    0.24 5.1E-06   46.1   0.7   20  106-125   302-321 (614)
 49 PTZ00257 Glycoprotein GP63 (le  86.1    0.35 7.7E-06   47.1   1.3   17  105-121   256-272 (622)
 50 cd02640 R3H_NRF R3H domain of   86.0       1 2.2E-05   30.4   3.1   40   95-134    18-58  (60)
 51 cd06007 R3H_DEXH_helicase R3H   85.0     1.5 3.3E-05   29.4   3.6   40   95-134    17-57  (59)
 52 KOG3607|consensus               82.7     0.6 1.3E-05   46.5   1.3   18  106-123   324-341 (716)
 53 cd02642 R3H_encore_like R3H do  81.8     2.4 5.3E-05   28.6   3.7   40   95-135    22-61  (63)
 54 cd02646 R3H_G-patch R3H domain  80.6     2.5 5.4E-05   28.0   3.4   40   95-134    17-56  (58)
 55 PF12725 DUF3810:  Protein of u  80.6    0.87 1.9E-05   41.0   1.4   23  102-124   193-215 (318)
 56 TIGR02289 M3_not_pepF oligoend  80.3     1.3 2.9E-05   42.7   2.7   24   97-120   329-352 (549)
 57 PF09471 Peptidase_M64:  IgA Pe  79.5    0.98 2.1E-05   39.6   1.4   15  106-120   217-231 (264)
 58 PF04298 Zn_peptidase_2:  Putat  77.6       2 4.4E-05   36.6   2.7   24   95-118    73-102 (222)
 59 PF03272 Enhancin:  Viral enhan  77.5     1.7 3.8E-05   43.8   2.6  134   11-151   114-277 (775)
 60 COG2856 Predicted Zn peptidase  77.4     1.2 2.6E-05   37.9   1.2   16  107-124    74-89  (213)
 61 PF13398 Peptidase_M50B:  Pepti  76.2     1.4 2.9E-05   37.0   1.3   22  106-127    23-44  (200)
 62 PF06262 DUF1025:  Possibl zinc  74.9     1.6 3.4E-05   32.4   1.1   19  106-124    74-92  (97)
 63 PF06114 DUF955:  Domain of unk  73.9     1.8   4E-05   31.8   1.4   23   95-117    29-54  (122)
 64 PF01863 DUF45:  Protein of unk  73.7     9.4  0.0002   31.4   5.8   19  105-123   164-182 (205)
 65 PF14247 DUF4344:  Domain of un  71.6     2.1 4.6E-05   36.5   1.3   19  106-126    93-111 (220)
 66 COG3824 Predicted Zn-dependent  68.3     2.2 4.7E-05   32.9   0.6   17  106-122   110-126 (136)
 67 PF01432 Peptidase_M3:  Peptida  68.3     2.5 5.4E-05   39.7   1.2   21  105-125   242-262 (458)
 68 PF11667 DUF3267:  Protein of u  67.4       5 0.00011   30.1   2.5   20  107-126     6-25  (111)
 69 COG1164 Oligoendopeptidase F [  66.0     3.1 6.8E-05   40.7   1.4   78   41-120   316-395 (598)
 70 PF02163 Peptidase_M50:  Peptid  65.9     3.2   7E-05   33.9   1.3   23  106-128     8-30  (192)
 71 cd06258 Peptidase_M3_like The   64.5     3.3 7.1E-05   37.5   1.2   14  105-118   154-167 (365)
 72 TIGR00043 metalloprotein, YbeY  62.9     4.5 9.7E-05   30.6   1.5   18  107-124    75-92  (110)
 73 COG5549 Predicted Zn-dependent  62.4     5.6 0.00012   33.6   2.1   76  107-218   155-235 (236)
 74 cd06455 M3A_TOP Peptidase M3 T  61.8     4.1 8.8E-05   38.6   1.3   14  105-118   263-276 (472)
 75 PF15639 Tox-MPTase3:  Metallop  61.2      19 0.00041   28.1   4.6   69   43-115    22-110 (135)
 76 cd05709 S2P-M50 Site-2 proteas  61.0     4.5 9.7E-05   32.8   1.3   22  106-127     9-30  (180)
 77 PHA02456 zinc metallopeptidase  61.0     3.9 8.4E-05   31.1   0.8   37   80-120    58-94  (141)
 78 COG4783 Putative Zn-dependent   59.9     4.5 9.8E-05   38.2   1.2   15  105-119   130-144 (484)
 79 PF01447 Peptidase_M4:  Thermol  59.6     5.2 0.00011   32.0   1.4   12  106-117   136-147 (150)
 80 cd06457 M3A_MIP Peptidase M3 m  59.4     4.7  0.0001   38.1   1.3   18  106-123   249-266 (458)
 81 COG4412 Uncharacterized protei  58.2     4.1 8.9E-05   39.1   0.6   24   39-62    234-257 (760)
 82 PF02130 UPF0054:  Uncharacteri  58.1       6 0.00013   31.4   1.5   16  107-122   110-125 (145)
 83 cd06163 S2P-M50_PDZ_RseP-like   55.2     6.6 0.00014   32.4   1.3   22  106-127    10-31  (182)
 84 COG0319 Predicted metal-depend  54.8     7.3 0.00016   31.4   1.5   18  107-124   110-127 (153)
 85 cd06456 M3A_DCP_Oligopeptidase  54.6     6.4 0.00014   36.8   1.3   14  105-118   208-221 (422)
 86 KOG3658|consensus               54.5       4 8.8E-05   40.1  -0.1   18  106-123   393-410 (764)
 87 PF13485 Peptidase_MA_2:  Pepti  54.1     7.3 0.00016   28.7   1.3   12  106-117    26-37  (128)
 88 COG2738 Predicted Zn-dependent  53.7     6.5 0.00014   32.9   1.0   24   95-118    76-105 (226)
 89 PRK00016 metal-binding heat sh  52.6     7.6 0.00017   31.3   1.3   17  107-123   115-131 (159)
 90 cd06162 S2P-M50_PDZ_SREBP Ster  52.4     7.5 0.00016   34.4   1.3   22  106-127   136-157 (277)
 91 cd06460 M32_Taq Peptidase fami  52.3     7.9 0.00017   36.0   1.5   28   96-126   148-177 (396)
 92 cd06159 S2P-M50_PDZ_Arch Uncha  52.0     7.6 0.00017   34.0   1.3   23  106-128   119-141 (263)
 93 cd06006 R3H_unknown_2 R3H doma  51.6      19 0.00041   24.1   2.9   38   97-134    20-57  (59)
 94 PF01435 Peptidase_M48:  Peptid  49.8       9 0.00019   31.8   1.3   12  106-117    90-101 (226)
 95 PRK13963 unkown domain/putativ  49.0     9.9 0.00021   33.1   1.5   16  107-122   217-232 (258)
 96 cd06158 S2P-M50_like_1 Unchara  48.4     9.6 0.00021   31.3   1.3   13  106-118    10-22  (181)
 97 cd06164 S2P-M50_SpoIVFB_CBS Sp  47.6     9.9 0.00021   32.5   1.3   23  106-128    54-76  (227)
 98 smart00731 SprT SprT homologue  46.8      11 0.00024   29.7   1.3   15  106-120    60-74  (146)
 99 PF13699 DUF4157:  Domain of un  46.7      12 0.00025   26.6   1.3   14  105-118    61-74  (79)
100 KOG2921|consensus               46.3      10 0.00022   35.2   1.1   34  106-142   132-165 (484)
101 cd02325 R3H R3H domain. The na  46.2      32 0.00068   21.6   3.3   40   95-134    18-57  (59)
102 PF10263 SprT-like:  SprT-like   46.0      11 0.00025   29.6   1.3   17  105-121    60-76  (157)
103 cd02639 R3H_RRM R3H domain of   45.8      21 0.00045   24.0   2.4   29   95-123    18-47  (60)
104 PRK10911 oligopeptidase A; Pro  45.4      10 0.00022   37.8   1.2   12  106-117   464-475 (680)
105 PRK02870 heat shock protein Ht  45.2     8.8 0.00019   34.9   0.6   12  105-116   173-184 (336)
106 PF12315 DUF3633:  Protein of u  43.1      13 0.00029   31.4   1.3   17  105-121    93-109 (212)
107 PF14521 Aspzincin_M35:  Lysine  41.7      12 0.00026   29.7   0.8   14  105-118    96-111 (148)
108 KOG3538|consensus               40.9     8.4 0.00018   39.3  -0.2   23  105-127   317-339 (845)
109 cd06160 S2P-M50_like_2 Unchara  40.9      15 0.00033   30.3   1.3   24  106-129    42-65  (183)
110 PRK03982 heat shock protein Ht  40.8      14  0.0003   32.6   1.2   13  105-117   125-137 (288)
111 PRK10280 dipeptidyl carboxypep  40.0      15 0.00032   36.7   1.3   13  106-118   465-477 (681)
112 COG0501 HtpX Zn-dependent prot  40.0      15 0.00032   32.1   1.2   12  106-117   158-169 (302)
113 PRK03001 M48 family peptidase;  39.0      16 0.00034   32.2   1.2   13  105-117   124-136 (283)
114 cd06161 S2P-M50_SpoIVFB SpoIVF  38.7      16 0.00035   30.6   1.2   24  105-128    38-61  (208)
115 PRK03072 heat shock protein Ht  38.2      16 0.00035   32.3   1.2   13  105-117   127-139 (288)
116 smart00393 R3H Putative single  38.1      44 0.00096   23.3   3.2   40   96-135    38-77  (79)
117 PRK01345 heat shock protein Ht  37.5      17 0.00037   32.7   1.2   13  105-117   124-136 (317)
118 PF04228 Zn_peptidase:  Putativ  37.1      18 0.00039   32.2   1.3   10  107-116   172-181 (292)
119 PRK05457 heat shock protein Ht  36.9      17 0.00036   32.2   1.1   12  105-116   134-145 (284)
120 PF06167 Peptidase_M90:  Glucos  35.1      22 0.00047   31.0   1.4   17  105-121   154-170 (253)
121 PRK04897 heat shock protein Ht  34.7      19 0.00041   32.0   1.1   12  106-117   138-149 (298)
122 PF14891 Peptidase_M91:  Effect  33.3      21 0.00045   29.0   1.0   19  106-124   104-122 (174)
123 PRK02391 heat shock protein Ht  33.0      22 0.00048   31.6   1.2   12  105-116   133-144 (296)
124 PRK01265 heat shock protein Ht  32.4      22 0.00047   32.2   1.1   11  105-115   140-150 (324)
125 KOG2556|consensus               31.6      23 0.00049   33.7   1.0   21  106-126   256-276 (666)
126 PLN02887 hydrolase family prot  30.5      27 0.00059   34.1   1.5   19  107-125   232-250 (580)
127 TIGR00054 RIP metalloprotease   30.5      25 0.00055   32.8   1.2   12  106-117    15-26  (420)
128 PLN02791 Nudix hydrolase homol  30.4      20 0.00044   36.2   0.6   21  106-126   586-606 (770)
129 PF01424 R3H:  R3H domain;  Int  30.2      70  0.0015   21.0   3.1   40   96-135    22-61  (63)
130 PF01431 Peptidase_M13:  Peptid  29.9      25 0.00054   29.0   0.9   14  107-120    38-51  (206)
131 PRK04351 hypothetical protein;  29.5      30 0.00065   27.6   1.3   16  105-120    61-76  (149)
132 PF02102 Peptidase_M35:  Deuter  29.4      29 0.00062   31.9   1.3   20  101-120   290-312 (359)
133 PF13058 DUF3920:  Protein of u  29.1      31 0.00066   26.3   1.2   31   35-65      3-34  (126)
134 COG0339 Dcp Zn-dependent oligo  28.0      28 0.00062   34.5   1.1   24   43-66    364-387 (683)
135 PF13203 DUF2201_N:  Putative m  27.8      34 0.00074   30.0   1.5   21  106-127    61-81  (292)
136 PRK10779 zinc metallopeptidase  27.1      32  0.0007   32.4   1.3   10  106-115    16-25  (449)
137 PRK04860 hypothetical protein;  26.6      36 0.00079   27.5   1.3   17  105-121    63-79  (160)
138 PRK15410 DgsA anti-repressor M  25.2      35 0.00077   29.9   1.1   17  105-121   142-158 (260)
139 cd06461 M2_ACE Peptidase famil  24.8      34 0.00074   32.7   1.0   17  105-121   247-263 (477)
140 KOG2090|consensus               24.5      39 0.00084   33.2   1.3   12  106-117   480-491 (704)
141 COG2317 Zn-dependent carboxype  24.0      35 0.00077   32.4   0.9   10  108-117   262-271 (497)
142 PF09768 Peptidase_M76:  Peptid  23.6      44 0.00094   27.4   1.3   25  106-140    72-96  (173)
143 KOG3624|consensus               23.3      43 0.00093   33.3   1.4   29  107-137   520-548 (687)
144 PF10026 DUF2268:  Predicted Zn  23.2   3E+02  0.0066   22.6   6.3   75   41-123     4-83  (195)
145 PF10023 DUF2265:  Predicted am  23.1      37 0.00081   30.9   0.8   15  106-120   166-180 (337)
146 PF01434 Peptidase_M41:  Peptid  23.0      46   0.001   27.9   1.4   13  106-118    29-41  (213)
147 KOG3314|consensus               22.7      49  0.0011   27.1   1.3   25  106-140    92-116 (194)
148 KOG0201|consensus               22.2      39 0.00085   31.9   0.8   30  106-139   113-142 (467)
149 PF04450 BSP:  Peptidase of pla  21.7      49  0.0011   27.8   1.2   16  106-121    97-112 (205)
150 COG3590 PepO Predicted metallo  20.7      29 0.00062   33.8  -0.4   21  107-129   489-509 (654)
151 PF10460 Peptidase_M30:  Peptid  20.6      53  0.0011   30.3   1.3   16  106-121   140-155 (366)

No 1  
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=100.00  E-value=7.8e-68  Score=449.45  Aligned_cols=204  Identities=37%  Similarity=0.621  Sum_probs=191.6

Q ss_pred             cCCCccCC--CCcccccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCC
Q psy1890           5 VDGFRDHD--GVKNLAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKS   82 (242)
Q Consensus         5 ~~g~~~~~--~~R~~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~g   82 (242)
                      ++||+.++  +.|+|+.....+||+ +|||.|+++|+...++.|++||++|++.|||+|+|++. +.+||.|..   +.|
T Consensus        24 ~eGDI~l~~~~~R~a~~~~~~~Wp~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~-e~~yi~i~~---~~G   98 (230)
T cd04282          24 FEGDILLDEGQSRNGLIGDTYRWPF-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEG-ESNYIFFFK---GSG   98 (230)
T ss_pred             ccccccCCcccccccccCcccCCCc-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCC-CCcEEEEEc---CCC
Confidence            78999987  689999999999999 89999999999999999999999999999999999995 889999985   589


Q ss_pred             CCCccCCCCCccceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcC
Q psy1890          83 CSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPY  162 (242)
Q Consensus        83 c~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~y  162 (242)
                      |||+||+.+|. |.|+|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||+|++...+.++|+||
T Consensus        99 C~S~vG~~gg~-q~isl~~~C~~~Gti~HEl~HalGf~HEqsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pY  177 (230)
T cd04282          99 CWSMVGDQQGG-QNLSIGAGCDYKATVEHEFLHALGFYHEQSRSDRDDYVKIWWDQILSGREHNFNKYDDSFSTDLNTPY  177 (230)
T ss_pred             eeeccCccCCe-EEEEECCCcCCCchHHHHHHHHhCCcccccccccccceEEeecccCchHHHHhhhcCccccccCCCCC
Confidence            99999999888 99999999999999999999999999999999999999999999999999999999888888899999


Q ss_pred             CCCcccccCCCccccCCCCCeeeeccCCCcccc-ccCCcCcccHHHHHHHHhHCCC
Q psy1890         163 DFGSIMHYSRFAFSRDSKSPTITPKKTLNVQAD-KMGQRINISKMDIAKLNTLYKC  217 (242)
Q Consensus       163 D~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~-~ig~r~~lS~~Di~~in~lY~C  217 (242)
                      ||+|||||++.+|++++..|||+|+++   ... .||||.+||+.|+++||+||+|
T Consensus       178 Dy~SIMHY~~~aFs~~~~~pTi~~~~~---~~~~~iGqr~~lS~~Di~~iN~~Y~C  230 (230)
T cd04282         178 DYESVMHYSPFSFNKGASEPTITTKIP---EFNDIIGQRLDFSDIDLERLNRMYNC  230 (230)
T ss_pred             CcccccccCCCccccCCCCceeeecCC---cccccccccCCCCHHHHHHHHHhcCC
Confidence            999999999999999844499999987   443 6999999999999999999999


No 2  
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=100.00  E-value=4e-67  Score=437.85  Aligned_cols=199  Identities=42%  Similarity=0.700  Sum_probs=181.4

Q ss_pred             cccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccce
Q psy1890          17 LAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSY   96 (242)
Q Consensus        17 ~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~   96 (242)
                      |......+||+++|||.|+++|+...++.|++||++|++.|||+|+|++. +.+||.|...  +.||||+||+.++..|.
T Consensus         2 ~~~~~~~~Wp~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~yi~f~~~--~~GC~S~vG~~~~g~q~   78 (200)
T cd04281           2 ATARKERIWPGGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTP-EENYIVFTYR--PCGCCSYVGRRGNGPQA   78 (200)
T ss_pred             CccCccCcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCC-CCCEEEEEEC--CCCeeEcCCCcCCCcee
Confidence            45567789999999999999999999999999999999999999999995 8899999865  57999999999733399


Q ss_pred             EEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccc
Q psy1890          97 VNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFS  176 (242)
Q Consensus        97 i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~  176 (242)
                      |+|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+.++|+||||+|||||++.+|+
T Consensus        79 isl~~~C~~~Gti~HEl~HaLGf~HEhsRpDRD~yV~I~~~nI~~~~~~nF~k~~~~~~~~~g~pYDy~SiMHY~~~aFs  158 (200)
T cd04281          79 ISIGKNCDKFGIVVHELGHVIGFWHEHTRPDRDDHVTIIRENIQPGQEYNFLKMEPEEVDSLGEPYDFDSIMHYARNTFS  158 (200)
T ss_pred             eecCCCcCcCchHHHHHHHHhcCcchhccccccceEEEeecccCcchhhhhhhcCccccccCCCccCCCccccCCCCccc
Confidence            99999999999999999999999999999999999999999999999999999988888889999999999999999999


Q ss_pred             cCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCCCC
Q psy1890         177 RDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCP  218 (242)
Q Consensus       177 ~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~  218 (242)
                      +++..+||+|+++.+.-...||||.+||+.|+++||+||+||
T Consensus       159 ~~~~~~Ti~p~~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~C~  200 (200)
T cd04281         159 RGMFLDTILPKRDPNGVRPEIGQRTRLSEGDIIQANKLYKCP  200 (200)
T ss_pred             cCCCCCceEECCCcccccccccccCCCCHHHHHHHHHhcCCC
Confidence            988779999987621101369999999999999999999997


No 3  
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=100.00  E-value=1e-61  Score=399.55  Aligned_cols=177  Identities=41%  Similarity=0.750  Sum_probs=165.0

Q ss_pred             eEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeC-CCCcccc
Q psy1890          29 LIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLG-KNCFVMG  107 (242)
Q Consensus        29 ~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~-~~c~~~~  107 (242)
                      +|||.|+++|++.+++.|++||++|++.|||+|++++. +.+||.|..   +.||||+||+.++. |.|+|+ .+|...|
T Consensus         5 ~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~~~---~~gC~S~vG~~gg~-q~i~l~~~~C~~~G   79 (182)
T cd04283           5 YVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNIES---RSGCWSYIGRQGGR-QTVSLQKQGCMYKG   79 (182)
T ss_pred             EEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEEEc---CCCceEecCccCCc-eeEecCCCCcCccc
Confidence            69999999999999999999999999999999999984 889999975   58999999999888 999997 6899999


Q ss_pred             hhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCCeeeec
Q psy1890         108 TVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITPK  187 (242)
Q Consensus       108 ~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~~  187 (242)
                      +|+|||||||||+|||+|||||+||.|+|+||.++..+||++++.   .++++||||+|||||++.+|++++. +||+|+
T Consensus        80 ~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~---~~~~~pYDy~SiMHY~~~afs~~g~-~Ti~~~  155 (182)
T cd04283          80 IIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDT---NNLGTPYDYSSVMHYGRYAFSINGK-PTIVPI  155 (182)
T ss_pred             hHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccc---cccCCcccceeeeccCCcccccCCC-CeEEEC
Confidence            999999999999999999999999999999999999999999853   4578999999999999999999887 999999


Q ss_pred             cCCCccccccCCcCcccHHHHHHHHhHCCC
Q psy1890         188 KTLNVQADKMGQRINISKMDIAKLNTLYKC  217 (242)
Q Consensus       188 ~~~~~~~~~ig~r~~lS~~Di~~in~lY~C  217 (242)
                      ++   ....||||.+||+.|+++||+||+|
T Consensus       156 ~~---~~~~iGqr~~lS~~Di~~iN~~Y~C  182 (182)
T cd04283         156 PD---PNVPIGQRQGMSNLDILRINKLYNC  182 (182)
T ss_pred             Cc---ccccccCCCCCCHHHHHHHHHhcCC
Confidence            76   3347999999999999999999998


No 4  
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=100.00  E-value=6.4e-62  Score=405.98  Aligned_cols=191  Identities=45%  Similarity=0.827  Sum_probs=148.2

Q ss_pred             CCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCCC
Q psy1890          23 KLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGKN  102 (242)
Q Consensus        23 ~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~~  102 (242)
                      ++||+++|||.|++++++.+++.|++||++|++.|||+|++++..+..+|.|..   +.||||+||+.++. |.|+|+.+
T Consensus         1 ~~Wp~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~~---~~gC~S~vG~~~g~-q~i~l~~~   76 (191)
T PF01400_consen    1 KKWPNGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERTENEDDYISFSN---GSGCWSYVGRQGGE-QTINLGDG   76 (191)
T ss_dssp             -S-GGGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-SSSSSSEEEEES---SSSEEEESS--SSE-EEEEE-TT
T ss_pred             CcCCCCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECCCCCceEEEeec---CccccchhhhcCcc-eeEEecce
Confidence            589999999999999999999999999999999999999999944678899875   58999999999777 99999999


Q ss_pred             CcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCC
Q psy1890         103 CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSP  182 (242)
Q Consensus       103 c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~  182 (242)
                      |...++|+|||||||||.|||+|||||+||.|+|+||.++...||.+.+...+.++++||||+|||||++.+|++++..+
T Consensus        77 c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIMHY~~~~~s~~~~~~  156 (191)
T PF01400_consen   77 CFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIMHYSSYAFSKNGSQP  156 (191)
T ss_dssp             C-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT---TTTTBSSTTS-
T ss_pred             eCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCeecccCcccccCCCCC
Confidence            99999999999999999999999999999999999999999999999988788889999999999999999999998679


Q ss_pred             eeeeccCCCccccccCCcCcccHHHHHHHHhHCCCCC
Q psy1890         183 TITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCPN  219 (242)
Q Consensus       183 ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~~  219 (242)
                      ||.|+++.  ....||++.+||+.|+++||++|+|+.
T Consensus       157 ti~~~~~~--~~~~iG~~~~lS~~D~~~in~~Y~C~~  191 (191)
T PF01400_consen  157 TITPKDPK--YQETIGQRNRLSFTDIKQINKMYNCPE  191 (191)
T ss_dssp             SEEESSTT--CTS-GGG-SS--HHHHHHHHHHTT---
T ss_pred             eEEecCCC--CcccccccCCCCHHHHHHHHHhcCCCC
Confidence            99999862  123799999999999999999999973


No 5  
>KOG3714|consensus
Probab=100.00  E-value=2.2e-60  Score=437.26  Aligned_cols=204  Identities=41%  Similarity=0.768  Sum_probs=190.2

Q ss_pred             CCCcccccCCCCCCCCCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCC
Q psy1890          12 DGVKNLAKTPAKLWKDGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTG   91 (242)
Q Consensus        12 ~~~R~~~~~~~~~Wp~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~   91 (242)
                      .+.|+++....++||+++|||.|+++|+...|+.|+.||++|++.|||+|+|.++.+.+++.|...   .||||+||+.+
T Consensus        69 ~~~r~~~~~~~~~Wp~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~---~gC~S~VGr~g  145 (411)
T KOG3714|consen   69 RSRRNGTSNPERRWPNGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTG---GGCYSYVGRRG  145 (411)
T ss_pred             hhhhhcccChhhcCCCCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCCCcceEEEeCC---CcceeeeCccC
Confidence            466888889999999999999999999999999999999999999999999998656788888765   49999999999


Q ss_pred             CccceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccC
Q psy1890          92 KKVSYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYS  171 (242)
Q Consensus        92 g~~q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~  171 (242)
                      +..|.++|+.+|...|+|+|||||||||+|||+|||||+||.|+|+||.++..+||.|++...+.++++||||+|||||.
T Consensus       146 g~~q~~sl~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDygSvMHY~  225 (411)
T KOG3714|consen  146 GGQQLLSLGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYGSVMHYA  225 (411)
T ss_pred             CCccceecCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCCcccccC
Confidence            86699999999999999999999999999999999999999999999999999999999988888999999999999999


Q ss_pred             CCccccCCCCCeeeeccCCCcccc-ccCCcCcccHHHHHHHHhHCCCCCCC
Q psy1890         172 RFAFSRDSKSPTITPKKTLNVQAD-KMGQRINISKMDIAKLNTLYKCPNKY  221 (242)
Q Consensus       172 ~~~f~~~g~~~ti~~~~~~~~~~~-~ig~r~~lS~~Di~~in~lY~C~~~~  221 (242)
                      +.+|++++..+||+|++.   ..+ .||||..||+.|+.+||+||.|+..+
T Consensus       226 ~~afs~~~~~~ti~~~~~---~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~~  273 (411)
T KOG3714|consen  226 PYAFSKNGSLPTIVPKDN---GFQNTMGQRERLSFYDIRKINKLYCCPECC  273 (411)
T ss_pred             CcccCcCCCCCceecccc---cccccccccCcCCHHHHHHHHhhcCCCccC
Confidence            999999999889999987   444 59999999999999999999997654


No 6  
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=100.00  E-value=1.5e-59  Score=388.28  Aligned_cols=180  Identities=44%  Similarity=0.792  Sum_probs=169.3

Q ss_pred             CCeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCCCCccc
Q psy1890          27 DGLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGKNCFVM  106 (242)
Q Consensus        27 ~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~~c~~~  106 (242)
                      +++|||.|+++|++.+++.|++||++|++.|||+|+|++. +.++|.|...   .||||+||+.++. |.|+|+.+|...
T Consensus         1 ~~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~-~~~~I~f~~~---~Gc~S~vG~~~~~-q~i~l~~~c~~~   75 (180)
T cd04280           1 NGTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT-EKDYIRIVKG---SGCWSYVGRVGGR-QVVSLGSGCFSL   75 (180)
T ss_pred             CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC-CCcEEEEEcC---CCcceecCccCCc-eeEEeCCCcCcC
Confidence            4789999999999999999999999999999999999996 7899999864   8999999999877 999999999999


Q ss_pred             chhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCCCeeee
Q psy1890         107 GTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITP  186 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~  186 (242)
                      ++++|||||||||.|||+|||||+||.|+|+||.++...||++++...+..+++||||+|||||++.+|++++. +||+|
T Consensus        76 g~v~HE~~HalG~~HEh~R~DRD~yv~i~~~ni~~~~~~nf~~~~~~~~~~~~~~YDy~SiMhY~~~~fs~~~~-~ti~~  154 (180)
T cd04280          76 GTIVHELMHALGFYHEQSRPDRDDYVTINWENIQPGYEHNFDKYSPDTVTTYGVPYDYGSVMHYGPTAFSKNGK-PTIVP  154 (180)
T ss_pred             chhHHHHHHHhcCcchhcccccCCeEEEeecccChhHHHHhhhcCcccccccCCcCCcceEeccCCcccccCCC-ceEEE
Confidence            99999999999999999999999999999999999999999999888877889999999999999999999987 99999


Q ss_pred             ccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890         187 KKTLNVQADKMGQRINISKMDIAKLNTLY  215 (242)
Q Consensus       187 ~~~~~~~~~~ig~r~~lS~~Di~~in~lY  215 (242)
                      +++   ....+||+.+||+.|+++||+||
T Consensus       155 ~~~---~~~~~g~~~~~S~~D~~~in~~Y  180 (180)
T cd04280         155 KDP---GYQIIGQREGLSFLDIKKINKMY  180 (180)
T ss_pred             CCc---hhhcccCCCCCCHHHHHHHHHhC
Confidence            987   44458999999999999999999


No 7  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=100.00  E-value=7.9e-45  Score=304.73  Aligned_cols=170  Identities=26%  Similarity=0.321  Sum_probs=135.8

Q ss_pred             CCCCCCe-EEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCC-CeEEEEccCCCCCCCCccCCCCC----ccce
Q psy1890          23 KLWKDGL-IYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQT-PYIVYKESPNGKSCSSNVGFTGK----KVSY   96 (242)
Q Consensus        23 ~~Wp~~~-I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~-~~I~i~~~~~~~gc~S~vG~~~g----~~q~   96 (242)
                      ++||+++ ++|.|.+++++.+|+.|++||++|++.|||+|++++..+. .+|.|..   +.||||+||+.+.    ..++
T Consensus         1 ~~W~~~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~---~~Gc~S~vG~~~~~~~~~~~t   77 (198)
T cd04327           1 KLWRNGTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTP---GDGYWSYVGTDALLIGADAPT   77 (198)
T ss_pred             CCCCCCCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEec---CCCCCCCcCCcccccCCCCce
Confidence            5899986 4577888788899999999999999999999999985222 2355543   4799999999642    2378


Q ss_pred             EEeCCC------CcccchhHHHHHHHhcccccccCCCCCceEEEeee--cccc-----------c-cc-ccccccCcccc
Q psy1890          97 VNLGKN------CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYE--LIKK-----------G-SE-INFKKRNFKEA  155 (242)
Q Consensus        97 i~l~~~------c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~--ni~~-----------~-~~-~nf~k~~~~~~  155 (242)
                      |+|+..      |...++|+|||||||||.|||+|||||    |.|+  +|.+           + .. .+|.++..  .
T Consensus        78 ~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh~rpdrd----i~w~~~~i~~~~~~~~~~~~~~~~~~~~f~~~~~--~  151 (198)
T cd04327          78 MNLGWFTDDTPDPEFSRVVLHEFGHALGFIHEHQSPAAN----IPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDD--G  151 (198)
T ss_pred             eeeeeecCCCchhhHHHHHHHHHHHHhcCcccccCCCCC----CCcCHHHHHHHHhcCCCCcHHHHHHhhhhhcccc--c
Confidence            888653      455799999999999999999999999    7887  3432           1 22 67888764  3


Q ss_pred             ccCCCcCCCCcccccC-CCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890         156 TTLDLPYDFGSIMHYS-RFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK  216 (242)
Q Consensus       156 ~~~g~~yD~~SIMhY~-~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~  216 (242)
                      ..+++||||+|||||+ +.+|+++|. +              .|++.+||+.|+++||+||+
T Consensus       152 ~~~~~~yD~~SIMHY~~~~~~t~~g~-~--------------t~~~~~lS~~D~~~i~~~Yp  198 (198)
T cd04327         152 DVAYSPYDPDSIMHYPFPGSLTLDGE-E--------------VPPNRTLSDKDKAFMRLLYP  198 (198)
T ss_pred             cccCCCCCcHHHHcCCCcHHhhcCCc-c--------------cCCCCCCCHHHHHHHHHhCc
Confidence            4578999999999999 789988876 3              25678899999999999995


No 8  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=100.00  E-value=8e-33  Score=219.46  Aligned_cols=136  Identities=37%  Similarity=0.710  Sum_probs=116.8

Q ss_pred             CCCCCCCeEEEEEC-CCCCHHH-HHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCCCCCCC-CccCCCCCccceEE
Q psy1890          22 AKLWKDGLIYYEID-KNFSVDR-KRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPNGKSCS-SNVGFTGKKVSYVN   98 (242)
Q Consensus        22 ~~~Wp~~~I~Y~~~-~~~~~~~-~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~~~gc~-S~vG~~~g~~q~i~   98 (242)
                      .++||+++|+|+|+ +.++..+ ++.|++||+.|++.+||+|++++.+...+|.|.... ..||| |++|+.++. +.++
T Consensus         2 ~~~W~~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~-~~g~~~a~~g~~~g~-~~~~   79 (140)
T smart00235        2 SKKWPKGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGD-GSGCTLSHAGRPGGD-QHFS   79 (140)
T ss_pred             CCcCCCCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECC-CCCcceeeeecCCCc-eEEE
Confidence            46899999999997 8888876 999999999999999999999885455678887651 13998 999999777 8999


Q ss_pred             eCCCCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCc
Q psy1890          99 LGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGS  166 (242)
Q Consensus        99 l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~S  166 (242)
                      ++.+|...++++|||||||||.|||+|||||+|+.|+|+++    ..+|.+..   ....+.+|||+|
T Consensus        80 ~~~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~----~~~~~~~~---~~~~~~~~~yg~  140 (140)
T smart00235       80 LGNGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNI----TRNFDLSN---DDSLGIPYDYGS  140 (140)
T ss_pred             ccCCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhh----hhcccccc---ccCCCchhccCc
Confidence            98899999999999999999999999999999999999987    35665542   245778999986


No 9  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.98  E-value=1e-32  Score=224.02  Aligned_cols=148  Identities=27%  Similarity=0.297  Sum_probs=121.4

Q ss_pred             CeEEEEECCCCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEcc---CCCCCCCCccCCC--CCccceEEeCCC
Q psy1890          28 GLIYYEIDKNFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKES---PNGKSCSSNVGFT--GKKVSYVNLGKN  102 (242)
Q Consensus        28 ~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~---~~~~gc~S~vG~~--~g~~q~i~l~~~  102 (242)
                      ..|+|.|++.++...|+.|++||+.|++.+||+|++++..+.+.|+|...   +.+.+||+++|..  ++. +.|.++..
T Consensus         2 ~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~-g~i~~~~~   80 (165)
T cd04268           2 KPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLT-GEILLARV   80 (165)
T ss_pred             CCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCC-ccEEeeEE
Confidence            47999999999999999999999999999999999998633455665432   1258899999986  334 77888765


Q ss_pred             Cc-----------ccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccC
Q psy1890         103 CF-----------VMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYS  171 (242)
Q Consensus       103 c~-----------~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~  171 (242)
                      |.           ..++++|||||||||.|+|+|+|||.|+                       ..++.+||+.|||||.
T Consensus        81 ~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~-----------------------~~~~~~~~~~SvM~y~  137 (165)
T cd04268          81 YLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNV-----------------------DLLAEKGDTSSVMDYA  137 (165)
T ss_pred             EEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcc-----------------------hhhccCCCCcccCCCC
Confidence            54           4699999999999999999999999776                       2356799999999999


Q ss_pred             CCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890         172 RFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY  215 (242)
Q Consensus       172 ~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY  215 (242)
                      ...|+.+          .      .+|++..||..|+++|+.||
T Consensus       138 ~~~~~~~----------~------~~~~~~~~~~~Di~ai~~lY  165 (165)
T cd04268         138 PSNFSIQ----------L------GDGQKYTIGPYDIAAIKKLY  165 (165)
T ss_pred             ccccccc----------c------ccccCCCCCHHHHHHHHhcC
Confidence            8777543          1      35677789999999999999


No 10 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.88  E-value=2.5e-22  Score=162.95  Aligned_cols=148  Identities=28%  Similarity=0.359  Sum_probs=109.5

Q ss_pred             eEEEEECCC--------CCHHHHHHHHHHHHHHhccccEEEEecCCC-CCCeEEEEcc----CCCCCCCCccCCCC-Ccc
Q psy1890          29 LIYYEIDKN--------FSVDRKRFIVKALNNIMTFSCVRFRRRTHN-QTPYIVYKES----PNGKSCSSNVGFTG-KKV   94 (242)
Q Consensus        29 ~I~Y~~~~~--------~~~~~~~~I~~A~~~w~~~tci~F~~~~~~-~~~~I~i~~~----~~~~gc~S~vG~~~-g~~   94 (242)
                      .|+|.|..+        ++...++.|+.|++.|++.+||+|+++... ....|.|...    ..+.+|+|++|... ...
T Consensus         2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~   81 (167)
T cd00203           2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLR   81 (167)
T ss_pred             EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCC
Confidence            467777543        567889999999999999999999999852 1444544332    12467889998863 222


Q ss_pred             ceEEeCCCC----cccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCccccc
Q psy1890          95 SYVNLGKNC----FVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHY  170 (242)
Q Consensus        95 q~i~l~~~c----~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY  170 (242)
                      +.+.+...+    ....+++|||||||||.|++.+++|++++.+.                   ......++|..|||||
T Consensus        82 ~~~~~~~~~~~~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~-------------------~~~~~~~~~~~siM~y  142 (167)
T cd00203          82 GVGVLQDNQSGTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTID-------------------DTLNAEDDDYYSVMSY  142 (167)
T ss_pred             CcEEEecCCcccccchhhHHHHHHHHhCCCccCcCCCCCCCcccc-------------------ccccCCCCCCCeEecc
Confidence            445554333    45689999999999999999999999988662                   1124568999999999


Q ss_pred             CCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890         171 SRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY  215 (242)
Q Consensus       171 ~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY  215 (242)
                      ....++                    ++++..+|..|+++|+++|
T Consensus       143 ~~~~~~--------------------~~~~~~fS~~d~~~i~~~Y  167 (167)
T cd00203         143 TKGSFS--------------------DGQRKDFSQCDIDQINKLY  167 (167)
T ss_pred             CccccC--------------------cccCCCcCHHHHHHHHhhC
Confidence            865332                    3345679999999999998


No 11 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.82  E-value=1.2e-19  Score=146.70  Aligned_cols=98  Identities=21%  Similarity=0.237  Sum_probs=67.0

Q ss_pred             CCCCCeEEEEECCCC----CHHHHHHHHHHHHHHhccccEEEEecCCCCCCeE--EEEccCCCCCC-CCccCCCCC----
Q psy1890          24 LWKDGLIYYEIDKNF----SVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYI--VYKESPNGKSC-SSNVGFTGK----   92 (242)
Q Consensus        24 ~Wp~~~I~Y~~~~~~----~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I--~i~~~~~~~gc-~S~vG~~~g----   92 (242)
                      +||+++|+|+|++..    ....+++|++||+.|++.+||+|++++......|  .|...  ..+| +++-|..+.    
T Consensus         1 kW~~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~--~~~~~~~~~~~~g~l~~a   78 (157)
T cd04278           1 KWSKTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARG--NHGDGYPFDGPGGTLAHA   78 (157)
T ss_pred             CCCCCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeec--ccCCCCCCCCCccccccc
Confidence            799999999997754    3678999999999999999999999985323334  44433  3444 333332110    


Q ss_pred             -----ccceEEeC-----------CCCcccchhHHHHHHHhcccccc
Q psy1890          93 -----KVSYVNLG-----------KNCFVMGTVQHETLHALGFWHEQ  123 (242)
Q Consensus        93 -----~~q~i~l~-----------~~c~~~~~i~HEigHALGl~HEh  123 (242)
                           ..-.+.|.           .++....+++|||||||||.|++
T Consensus        79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~  125 (157)
T cd04278          79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS  125 (157)
T ss_pred             cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence                 00112221           23445689999999999999974


No 12 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.75  E-value=1.8e-17  Score=137.43  Aligned_cols=142  Identities=20%  Similarity=0.152  Sum_probs=92.8

Q ss_pred             CCCHHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCC---ccceEEeC---------C
Q psy1890          37 NFSVDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGK---KVSYVNLG---------K  101 (242)
Q Consensus        37 ~~~~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g---~~q~i~l~---------~  101 (242)
                      .+++.+++.|++||+.|++.+||+|++++..+...|+|.....   +.+++++.+...+   ....+.|.         .
T Consensus        30 ~~~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~  109 (186)
T cd04277          30 ALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSP  109 (186)
T ss_pred             cCCHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCC
Confidence            3567899999999999999999999999865456788865521   2345555554321   11223332         2


Q ss_pred             CCcccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCcCCCCcccccCCCccccCCCC
Q psy1890         102 NCFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLPYDFGSIMHYSRFAFSRDSKS  181 (242)
Q Consensus       102 ~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~yD~~SIMhY~~~~f~~~g~~  181 (242)
                      ++....+++|||||||||.|+|..++.+.+         +.              .....-.-.|||.|.......    
T Consensus       110 g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~---------~~--------------~~~~~~~~~SVMSY~~~~~~~----  162 (186)
T cd04277         110 GSYGYQTIIHEIGHALGLEHPGDYNGGDPV---------PP--------------TYALDSREYTVMSYNSGYGNG----  162 (186)
T ss_pred             ChhhHHHHHHHHHHHhcCCCCCcCCCCCCC---------Cc--------------cccccCcceEEEeecCCCCCC----
Confidence            566679999999999999999998875421         00              000111345999997542110    


Q ss_pred             CeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890         182 PTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK  216 (242)
Q Consensus       182 ~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~  216 (242)
                      ..     .      .-+....|...||++|+.||+
T Consensus       163 ~~-----~------~~~~~~~~~~~DI~AlQ~lYG  186 (186)
T cd04277         163 AS-----A------GGGYPQTPMLLDIAALQYLYG  186 (186)
T ss_pred             cc-----c------cCcccCCccHHHHHHHHHhhC
Confidence            00     0      112234688999999999995


No 13 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.66  E-value=2.2e-15  Score=120.60  Aligned_cols=98  Identities=19%  Similarity=0.235  Sum_probs=63.2

Q ss_pred             CCCCCeEEEEECCC---CC-HHHHHHHHHHHHHHhccccEEEEecCCCCCCeEEE--EccCCCCCCCCccCCCC------
Q psy1890          24 LWKDGLIYYEIDKN---FS-VDRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVY--KESPNGKSCSSNVGFTG------   91 (242)
Q Consensus        24 ~Wp~~~I~Y~~~~~---~~-~~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i--~~~~~~~gc~S~vG~~~------   91 (242)
                      ||+..+|+|+|...   ++ +..+++|++||+.|+..+.|+|+++...+ .-|.|  .... ...+..+-|..+      
T Consensus         1 ~W~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~-adi~i~~~~~~-~~~~~~~~~~~~~~~~~~   78 (154)
T PF00413_consen    1 KWPKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGN-ADIRISFGSNN-HGDGYSFDGSGGTLAHAY   78 (154)
T ss_dssp             SSSSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSS-CSEEEEEESSS-SSSSS-CSSSSSESEEEE
T ss_pred             CCCCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCC-cceeeeeeccc-cCcccccccceeeeeccc
Confidence            79999999999763   23 46899999999999999999999998523 43444  3221 122222112111      


Q ss_pred             --C--ccceEEeCCC--------C-cccchhHHHHHHHhcccccc
Q psy1890          92 --K--KVSYVNLGKN--------C-FVMGTVQHETLHALGFWHEQ  123 (242)
Q Consensus        92 --g--~~q~i~l~~~--------c-~~~~~i~HEigHALGl~HEh  123 (242)
                        .  ..-.+.+...        . ....+++|||||||||.|++
T Consensus        79 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~~  123 (154)
T PF00413_consen   79 FPNNIVSGDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHSN  123 (154)
T ss_dssp             ESSSTTTTEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESSS
T ss_pred             cccccccccccccccccchhhhhhhhhhhhhhhccccccCcCcCC
Confidence              0  0123444211        1 23489999999999999975


No 14 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.65  E-value=1.2e-15  Score=123.07  Aligned_cols=98  Identities=20%  Similarity=0.216  Sum_probs=63.2

Q ss_pred             eEEEEECCC------CCHHHHHHHHHHHHHHhccccEEEEecCCC--CCCe-EEEEc----cCCCCCCCCccCCC-----
Q psy1890          29 LIYYEIDKN------FSVDRKRFIVKALNNIMTFSCVRFRRRTHN--QTPY-IVYKE----SPNGKSCSSNVGFT-----   90 (242)
Q Consensus        29 ~I~Y~~~~~------~~~~~~~~I~~A~~~w~~~tci~F~~~~~~--~~~~-I~i~~----~~~~~gc~S~vG~~-----   90 (242)
                      .|+|++++.      .....++.|++|++.|++.+||+|++....  +.+. |.|..    ...++.|.+..+..     
T Consensus         3 ~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~~   82 (156)
T cd04279           3 PIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGN   82 (156)
T ss_pred             CeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCCc
Confidence            578888764      245789999999999999999999998742  3332 44432    10123343333321     


Q ss_pred             C-Cc-cceEEeCC-----CCcccchhHHHHHHHhcccccccCC
Q psy1890          91 G-KK-VSYVNLGK-----NCFVMGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus        91 ~-g~-~q~i~l~~-----~c~~~~~i~HEigHALGl~HEh~rp  126 (242)
                      . .. ...++++.     .....++++|||||||||.||++++
T Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~  125 (156)
T cd04279          83 RKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRP  125 (156)
T ss_pred             ccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCc
Confidence            1 11 12333332     1234689999999999999999864


No 15 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.08  E-value=3.9e-09  Score=88.37  Aligned_cols=158  Identities=15%  Similarity=0.064  Sum_probs=85.2

Q ss_pred             CCCeEEEEECCCCCHHHHHHHHHHHHHHhcc------ccEEEEecCCCC--CCe-----EEEEccCCCCCCCCc----cC
Q psy1890          26 KDGLIYYEIDKNFSVDRKRFIVKALNNIMTF------SCVRFRRRTHNQ--TPY-----IVYKESPNGKSCSSN----VG   88 (242)
Q Consensus        26 p~~~I~Y~~~~~~~~~~~~~I~~A~~~w~~~------tci~F~~~~~~~--~~~-----I~i~~~~~~~gc~S~----vG   88 (242)
                      |+..|.|.|++.++...++.|++|++.|++.      ..+.++......  ..-     |+....  ..+-+++    ..
T Consensus         6 pk~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~--~~~~~~~gps~~d   83 (197)
T cd04276           6 PKEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHS--PNGGWAYGPSVVD   83 (197)
T ss_pred             CCCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEec--CCCcceecccccC
Confidence            5678999999887788999999999999975      344444443211  112     333333  2122222    11


Q ss_pred             CCCCc--cceEEeCCCCc--------------ccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCc
Q psy1890          89 FTGKK--VSYVNLGKNCF--------------VMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNF  152 (242)
Q Consensus        89 ~~~g~--~q~i~l~~~c~--------------~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~  152 (242)
                      -..|+  .-.+.|..+-.              ...+++||+||+|||.|.....-.           .+    .....++
T Consensus        84 PrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~-----------~~----~~~l~~~  148 (197)
T cd04276          84 PRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSD-----------GS----NEELEDP  148 (197)
T ss_pred             CCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccccccccc-----------Cc----hhhhcch
Confidence            11122  13344432111              236899999999999997654311           00    0000000


Q ss_pred             cccccCCCcCCCCcccccCCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHCC
Q psy1890         153 KEATTLDLPYDFGSIMHYSRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLYK  216 (242)
Q Consensus       153 ~~~~~~g~~yD~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~  216 (242)
                      ......+..   .|||-|.+......+.       +.      .-.-...+-..|+.+|+.+|+
T Consensus       149 ~~~~~~~~~---~SVMdY~~~~~~~~~~-------~~------~~~~~~~~g~yDi~Aiq~~Y~  196 (197)
T cd04276         149 LGTKEKGAT---SSVMDYPPPNVAAQGE-------DQ------GDYYPPTIGPYDKWAIEYGYT  196 (197)
T ss_pred             hhhhhcCCc---ceeecCccccccccCc-------cc------cccccCCCchHHHHHHHHHcC
Confidence            111122332   3999998654322211       10      001123477789999999995


No 16 
>KOG1565|consensus
Probab=99.03  E-value=1.3e-09  Score=102.10  Aligned_cols=100  Identities=16%  Similarity=0.177  Sum_probs=65.0

Q ss_pred             CCCCCCCCeEEEEECCC---CC-HHHHHHHHHHHHHHhccccEEEEecCC-CCCC-eEEEEccCCCCCC-CCccCCC---
Q psy1890          21 PAKLWKDGLIYYEIDKN---FS-VDRKRFIVKALNNIMTFSCVRFRRRTH-NQTP-YIVYKESPNGKSC-SSNVGFT---   90 (242)
Q Consensus        21 ~~~~Wp~~~I~Y~~~~~---~~-~~~~~~I~~A~~~w~~~tci~F~~~~~-~~~~-~I~i~~~~~~~gc-~S~vG~~---   90 (242)
                      ...+|++..|+|.|.+.   ++ ...+.++++|+..|+++|-++|.|+.. ...+ .|.|...  ..|+ +.+-|..   
T Consensus       102 ~~~kW~k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~--~h~d~~PFDG~~g~l  179 (469)
T KOG1565|consen  102 GKPKWNKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPG--DHGDGFPFDGPGGVL  179 (469)
T ss_pred             ccCcccccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeecc--CCCCCCcccCCCCce
Confidence            45699999999999653   33 578999999999999999999999986 3333 2444322  1221 1111111   


Q ss_pred             ----------CCc-----cceEEeC--CCCcccchhHHHHHHHhccccc
Q psy1890          91 ----------GKK-----VSYVNLG--KNCFVMGTVQHETLHALGFWHE  122 (242)
Q Consensus        91 ----------~g~-----~q~i~l~--~~c~~~~~i~HEigHALGl~HE  122 (242)
                                +|.     .....++  .+-....|++|||||||||.|.
T Consensus       180 aHAf~Pg~~~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS  228 (469)
T KOG1565|consen  180 AHAFFPGPGIGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHS  228 (469)
T ss_pred             ecccCCCCCCCCccccCcccceeccCCccchhHHHhhhhcccccccCCC
Confidence                      111     1222221  1334568999999999999997


No 17 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.00  E-value=1.1e-08  Score=85.74  Aligned_cols=160  Identities=23%  Similarity=0.257  Sum_probs=85.8

Q ss_pred             CCCCC--CCeEEEEECC----CCCHHHHHHHHHHHHHHhc-cccEEEEe--cCC-CCCCeEEEEccC---CCCCCCCccC
Q psy1890          22 AKLWK--DGLIYYEIDK----NFSVDRKRFIVKALNNIMT-FSCVRFRR--RTH-NQTPYIVYKESP---NGKSCSSNVG   88 (242)
Q Consensus        22 ~~~Wp--~~~I~Y~~~~----~~~~~~~~~I~~A~~~w~~-~tci~F~~--~~~-~~~~~I~i~~~~---~~~gc~S~vG   88 (242)
                      ..+|.  ..+|...+..    .++...+..+.+|++.|+. ...|+|+-  .+. ...+.+.+....   .+.+-.+...
T Consensus        31 tnlV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsAGFP  110 (211)
T PF12388_consen   31 TNLVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSAGFP  110 (211)
T ss_pred             eeeecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceeccCC
Confidence            34563  2467777766    5678899999999999998 45888841  121 123334443221   1222223222


Q ss_pred             CCCCc-cceEEeC---CCC--cccchhHHHHHHHhcccccccCCCCCceEEEeeecccccccccccccCccccccCCCc-
Q psy1890          89 FTGKK-VSYVNLG---KNC--FVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEINFKKRNFKEATTLDLP-  161 (242)
Q Consensus        89 ~~~g~-~q~i~l~---~~c--~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~nf~k~~~~~~~~~g~~-  161 (242)
                      ..+|. ...|.+.   ..-  ....+++|||||++||.|+.- -+|..   - -.+..++. ..+     ......|+| 
T Consensus       111 ~s~G~P~~~I~I~~~~~~~~~~~~hvi~HEiGH~IGfRHTD~-~~R~S---C-G~~~nEg~-~~v-----GAi~IpGTPt  179 (211)
T PF12388_consen  111 TSNGNPYKFIQIYGLSNYSVNVIEHVITHEIGHCIGFRHTDY-FNRSS---C-GSGGNEGS-AGV-----GAIHIPGTPT  179 (211)
T ss_pred             CCCCCCCceEEEEecCCCchhHHHHHHHHHhhhhccccccCc-CCccc---c-cccCCcCc-ccc-----ceEECCCCCC
Confidence            11232 1233331   111  124699999999999999752 22210   0 00000000 000     011122333 


Q ss_pred             -CCCCcccccCCCccccCCCCCeeeeccCCCccccccCCcCcccHHHHHHHHhHC
Q psy1890         162 -YDFGSIMHYSRFAFSRDSKSPTITPKKTLNVQADKMGQRINISKMDIAKLNTLY  215 (242)
Q Consensus       162 -yD~~SIMhY~~~~f~~~g~~~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY  215 (242)
                       +|..|||...   |+.                    +....+|..|+.+|+.||
T Consensus       180 ~~d~~SiM~ac---~~~--------------------~~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  180 GADPNSIMNAC---FSS--------------------GEDGEFTSNDITALNYLY  211 (211)
T ss_pred             CCCCchhhhcc---ccC--------------------CCCCCcChhhHHHHHhhC
Confidence             6999999975   111                    112359999999999999


No 18 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.37  E-value=4.9e-07  Score=69.64  Aligned_cols=85  Identities=22%  Similarity=0.246  Sum_probs=50.3

Q ss_pred             eEEEEECCCCCHHHHHHHHHHHHHHhc-cccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEEeCC----CC
Q psy1890          29 LIYYEIDKNFSVDRKRFIVKALNNIMT-FSCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVNLGK----NC  103 (242)
Q Consensus        29 ~I~Y~~~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~l~~----~c  103 (242)
                      +|.|.-+..  ...+..|.+|.+.|+. +++|+|++.+.   ..+.|...  ...-+|+-...+...-+|-|+.    +-
T Consensus         3 tv~Ydas~A--~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~--~~~~~sya~~~g~G~G~I~l~~~~~qgy   75 (132)
T PF02031_consen    3 TVYYDASRA--PEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEG--NDPRGSYASTDGLGSGYIFLDYQQNQGY   75 (132)
T ss_dssp             EEEEEEEE---GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE----TT--EEEE-SSS-EEEEEEHHHHHHS
T ss_pred             EEEEeCCCC--chhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEe--cCCCCcccccCCCCcEEEEechHHhhCC
Confidence            456655443  4578899999999986 78999999885   33444432  1222454444433225666642    34


Q ss_pred             cccchhHHHHHHHhccc
Q psy1890         104 FVMGTVQHETLHALGFW  120 (242)
Q Consensus       104 ~~~~~i~HEigHALGl~  120 (242)
                      ....+++|||||.|||.
T Consensus        76 ~~~RIaaHE~GHiLGLP   92 (132)
T PF02031_consen   76 NSTRIAAHELGHILGLP   92 (132)
T ss_dssp             -HHHHHHHHHHHHHT--
T ss_pred             ccceeeeehhccccCCC
Confidence            45689999999999994


No 19 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.41  E-value=0.00021  Score=60.33  Aligned_cols=20  Identities=35%  Similarity=0.418  Sum_probs=17.9

Q ss_pred             ccchhHHHHHHHhccccccc
Q psy1890         105 VMGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~  124 (242)
                      ...+++|||||+||+.|+..
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~  156 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFD  156 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcc
Confidence            34789999999999999997


No 20 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.40  E-value=0.00037  Score=58.16  Aligned_cols=77  Identities=17%  Similarity=0.125  Sum_probs=45.8

Q ss_pred             HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEE-ccCCCCCCCCccCCCC-------------Cc-----cceEEeCC
Q psy1890          41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYK-ESPNGKSCSSNVGFTG-------------KK-----VSYVNLGK  101 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~-~~~~~~gc~S~vG~~~-------------g~-----~q~i~l~~  101 (242)
                      ..+.++..|++.|.+.-.+..++..+ +.+ |.|. .++ +.-.|.-.|+..             +.     ...+.+.+
T Consensus       104 ~wq~a~~tava~wa~~fpl~ive~~e-eaD-Itie~~n~-pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~p  180 (236)
T COG5549         104 RWQGAYLTAVAGWAKTFPLIIVERFE-EAD-ITIEVGNP-PGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPP  180 (236)
T ss_pred             hHHHHHHHHHHHHHHhCCceeeecce-eee-EEEEecCC-CCCcccccchHHHHHHHHHHhhhccCcccccccccccCCc
Confidence            45789999999999987777777774 444 5553 332 222233333321             10     01223322


Q ss_pred             CC---cccchhHHHHHHHhccc
Q psy1890         102 NC---FVMGTVQHETLHALGFW  120 (242)
Q Consensus       102 ~c---~~~~~i~HEigHALGl~  120 (242)
                      +-   ...+++.||+|||||++
T Consensus       181 g~~~e~L~~tarhElGhaLgi~  202 (236)
T COG5549         181 GELRENLNPTARHELGHALGIW  202 (236)
T ss_pred             ccchhhhhHHHHHhhcchheec
Confidence            22   23489999999999987


No 21 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=96.56  E-value=0.0018  Score=53.12  Aligned_cols=21  Identities=33%  Similarity=0.332  Sum_probs=15.4

Q ss_pred             ccchhHHHHHHHhcccccccC
Q psy1890         105 VMGTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~r  125 (242)
                      ...+++|||||.||..|...-
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~  131 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDN  131 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS
T ss_pred             eeeeehhhhHhhcCCCCCCCC
Confidence            346899999999999999883


No 22 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.56  E-value=0.001  Score=50.98  Aligned_cols=19  Identities=37%  Similarity=0.457  Sum_probs=13.3

Q ss_pred             cccchhHHHHHHHhccccc
Q psy1890         104 FVMGTVQHETLHALGFWHE  122 (242)
Q Consensus       104 ~~~~~i~HEigHALGl~HE  122 (242)
                      ....+++|||||.||+.|.
T Consensus       106 ~~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  106 SGVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             TSTTHHHHHHHHHTT----
T ss_pred             ccceEeeehhhHhcCCCCC
Confidence            3468999999999999993


No 23 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=96.47  E-value=0.0055  Score=55.02  Aligned_cols=78  Identities=22%  Similarity=0.248  Sum_probs=45.5

Q ss_pred             HHHHHHHHHHHHHhccc-cE-EEEecCCCCCCeEEEEccCCCCCC-CCccCCCCCccceEEe-CCCCcccchhHHHHHHH
Q psy1890          41 DRKRFIVKALNNIMTFS-CV-RFRRRTHNQTPYIVYKESPNGKSC-SSNVGFTGKKVSYVNL-GKNCFVMGTVQHETLHA  116 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~t-ci-~F~~~~~~~~~~I~i~~~~~~~gc-~S~vG~~~g~~q~i~l-~~~c~~~~~i~HEigHA  116 (242)
                      ...-.+..+.+.+.+.. .+ .|..    -...|.|...  +..| |+-+|..+|....+.. +.++...++++||+||-
T Consensus        88 ~d~~~~~~~Ad~~a~~~lG~~~~s~----y~h~vyvlP~--~~~C~w~Gla~v~G~~~~~~~~~~~~~~~~~~~HElgHN  161 (314)
T PF05548_consen   88 DDWYGWADAADAAARAQLGVNAFSS----YTHRVYVLPP--GFACGWAGLATVPGSQSWLWISGYGVQDWATIMHELGHN  161 (314)
T ss_pred             HHHHHHHHHHHHHHHhhcCcccccc----cceEEEEcCC--CCCCCceEEeecCCcceeeeecCcccccHHHHHHHhhhh
Confidence            33445666666666544 33 2311    2244655543  3347 5544444444122333 33566778999999999


Q ss_pred             hccccccc
Q psy1890         117 LGFWHEQA  124 (242)
Q Consensus       117 LGl~HEh~  124 (242)
                      |||.|...
T Consensus       162 ~GL~Ha~~  169 (314)
T PF05548_consen  162 LGLWHAGR  169 (314)
T ss_pred             ccccccCC
Confidence            99999974


No 24 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.35  E-value=0.0016  Score=52.52  Aligned_cols=21  Identities=33%  Similarity=0.401  Sum_probs=14.1

Q ss_pred             cchhHHHHHHHhcccccccCC
Q psy1890         106 MGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rp  126 (242)
                      ..|++||+||-|||.|..+..
T Consensus        70 g~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   70 GKTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             SHHHHHHHHHHTT---TT---
T ss_pred             ccchhhhhhhhhcccccccCC
Confidence            479999999999999999764


No 25 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.14  E-value=0.0032  Score=52.06  Aligned_cols=19  Identities=37%  Similarity=0.363  Sum_probs=17.0

Q ss_pred             cchhHHHHHHHhccccccc
Q psy1890         106 MGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~  124 (242)
                      ..+++|||||.||+.|...
T Consensus       134 ~~~~aHElGH~lG~~HD~~  152 (192)
T cd04267         134 ALTMAHELGHNLGAEHDGG  152 (192)
T ss_pred             hhhhhhhHHhhcCCcCCCC
Confidence            4699999999999999865


No 26 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=95.97  E-value=0.0037  Score=51.81  Aligned_cols=23  Identities=30%  Similarity=0.205  Sum_probs=14.7

Q ss_pred             ccchhHHHHHHHhcccccccCCC
Q psy1890         105 VMGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ...+++|||||.||+.|.+..+.
T Consensus       142 ~~~~~AHEiGH~lGa~HD~~~~~  164 (196)
T PF13688_consen  142 GAITFAHEIGHNLGAPHDGDYSS  164 (196)
T ss_dssp             HHHHHHHHHHHHTT-----SSS-
T ss_pred             eehhhHHhHHHhcCCCCCCCCCC
Confidence            45799999999999999887653


No 27 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=95.45  E-value=0.002  Score=55.17  Aligned_cols=22  Identities=32%  Similarity=0.392  Sum_probs=19.2

Q ss_pred             ccchhHHHHHHHhcccccccCC
Q psy1890         105 VMGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~rp  126 (242)
                      ...|++||+||.|||.|.++.-
T Consensus       137 ~g~t~~HEvGH~lGL~HtF~~~  158 (225)
T cd04275         137 LGDTATHEVGHWLGLYHTFQGG  158 (225)
T ss_pred             ccceeEEeccceeeeeeeecCC
Confidence            3469999999999999999774


No 28 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.22  E-value=0.014  Score=49.60  Aligned_cols=21  Identities=33%  Similarity=0.343  Sum_probs=18.1

Q ss_pred             ccchhHHHHHHHhcccccccC
Q psy1890         105 VMGTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~r  125 (242)
                      ...+++|||||.||+.|....
T Consensus       145 ~~~~~AHElGH~lG~~HD~~~  165 (220)
T cd04272         145 GVYTMTHELAHLLGAPHDGSP  165 (220)
T ss_pred             cHHHHHHHHHHHhCCCCCCCC
Confidence            357999999999999998654


No 29 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.19  E-value=0.0087  Score=49.59  Aligned_cols=19  Identities=37%  Similarity=0.484  Sum_probs=16.9

Q ss_pred             cchhHHHHHHHhccccccc
Q psy1890         106 MGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~  124 (242)
                      ..+++|||||.||+.|...
T Consensus       132 a~~~AHElGH~lG~~HD~~  150 (194)
T cd04269         132 AVTMAHELGHNLGMEHDDG  150 (194)
T ss_pred             HHHHHHHHHhhcCCCcCCC
Confidence            4799999999999999864


No 30 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=95.00  E-value=0.0099  Score=48.51  Aligned_cols=19  Identities=37%  Similarity=0.356  Sum_probs=16.0

Q ss_pred             chhHHHHHHHhcccccccC
Q psy1890         107 GTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~r  125 (242)
                      --++||+||.|||.|=-++
T Consensus       126 KEv~HElGH~~GL~HC~N~  144 (181)
T COG1913         126 KEVLHELGHLLGLSHCPNP  144 (181)
T ss_pred             HHHHHHhhhhcCcccCCCC
Confidence            4689999999999986554


No 31 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=94.68  E-value=0.015  Score=50.40  Aligned_cols=17  Identities=29%  Similarity=0.282  Sum_probs=15.2

Q ss_pred             chhHHHHHHHhcccccc
Q psy1890         107 GTVQHETLHALGFWHEQ  123 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh  123 (242)
                      .+++|||||.||+.|--
T Consensus       169 ~t~AHElGHnlGm~HD~  185 (244)
T cd04270         169 LVTAHELGHNFGSPHDP  185 (244)
T ss_pred             HHHHHHHHHhcCCCCCC
Confidence            68999999999999754


No 32 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=94.67  E-value=0.011  Score=50.58  Aligned_cols=18  Identities=28%  Similarity=0.385  Sum_probs=15.7

Q ss_pred             chhHHHHHHHhccccccc
Q psy1890         107 GTVQHETLHALGFWHEQA  124 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~  124 (242)
                      .+++|||||.||+.|--.
T Consensus       147 ~t~AHElGHnLGm~HD~~  164 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCT  164 (228)
T ss_pred             eehhhhhhhhcCCCCCCC
Confidence            589999999999998653


No 33 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=94.47  E-value=0.018  Score=47.57  Aligned_cols=15  Identities=47%  Similarity=0.601  Sum_probs=13.6

Q ss_pred             chhHHHHHHHhcccc
Q psy1890         107 GTVQHETLHALGFWH  121 (242)
Q Consensus       107 ~~i~HEigHALGl~H  121 (242)
                      ..++||+||.|||.|
T Consensus       127 k~~~HElGH~lGL~H  141 (179)
T PRK13267        127 KEVTHELGHTLGLEH  141 (179)
T ss_pred             HHHHHHHHHHcCCcc
Confidence            458999999999998


No 34 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=94.31  E-value=0.0087  Score=50.30  Aligned_cols=20  Identities=35%  Similarity=0.386  Sum_probs=17.6

Q ss_pred             ccchhHHHHHHHhccccccc
Q psy1890         105 VMGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~  124 (242)
                      ...+++|||||.||+.|...
T Consensus       140 ~a~~~aHElGH~LG~~HD~~  159 (207)
T cd04273         140 SAFTIAHELGHVLGMPHDGD  159 (207)
T ss_pred             eEEeeeeechhhcCCCCCCC
Confidence            45799999999999999865


No 35 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=94.28  E-value=0.021  Score=47.74  Aligned_cols=18  Identities=28%  Similarity=0.296  Sum_probs=12.3

Q ss_pred             chhHHHHHHHhccccccc
Q psy1890         107 GTVQHETLHALGFWHEQA  124 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~  124 (242)
                      ..++||+||.|||.|=..
T Consensus       147 Kea~HElGH~~GL~HC~~  164 (194)
T PF07998_consen  147 KEAVHELGHLFGLDHCEN  164 (194)
T ss_dssp             HHHHHHHHHHTT----SS
T ss_pred             HHHHHHHHHHcCCcCCCC
Confidence            679999999999998654


No 36 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=93.75  E-value=0.03  Score=46.55  Aligned_cols=19  Identities=32%  Similarity=0.396  Sum_probs=14.1

Q ss_pred             cchhHHHHHHHhccccccc
Q psy1890         106 MGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~  124 (242)
                      ..+++|||||.||+.|...
T Consensus       132 a~~~AHelGH~lGm~HD~~  150 (199)
T PF01421_consen  132 AVIIAHELGHNLGMPHDGD  150 (199)
T ss_dssp             HHHHHHHHHHHTT---TTT
T ss_pred             HHHHHHHHHHhcCCCCCCC
Confidence            3688999999999999887


No 37 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=93.02  E-value=0.038  Score=46.19  Aligned_cols=98  Identities=21%  Similarity=0.334  Sum_probs=55.6

Q ss_pred             CeEEEEEC--CCCC-------HHHHHHHHHHHHH---HhccccEEEEecCCCCCCe-EEEEccC-CCCCCC--CccCC--
Q psy1890          28 GLIYYEID--KNFS-------VDRKRFIVKALNN---IMTFSCVRFRRRTHNQTPY-IVYKESP-NGKSCS--SNVGF--   89 (242)
Q Consensus        28 ~~I~Y~~~--~~~~-------~~~~~~I~~A~~~---w~~~tci~F~~~~~~~~~~-I~i~~~~-~~~gc~--S~vG~--   89 (242)
                      ..+.|.++  +.+.       ...-..|...+..   |....-++|+.+...+.++ |.+.... -..-|.  ..-|.  
T Consensus        29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~~~Df~I~Lasp~T~~~lC~g~~~~~e~S  108 (203)
T PF11350_consen   29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSGAPDFRISLASPGTTDRLCAGLDTSGETS  108 (203)
T ss_pred             ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCCCCCEEEEECCcchhhhhccCcCcCceeE
Confidence            45788873  3331       3455666666654   8888889999999755453 4443220 023351  11111  


Q ss_pred             --C--CCccceEEeCCC------------CcccchhHHHHHHHhcccccccCC
Q psy1890          90 --T--GKKVSYVNLGKN------------CFVMGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus        90 --~--~g~~q~i~l~~~------------c~~~~~i~HEigHALGl~HEh~rp  126 (242)
                        .  +++ -.||+..+            -...-+|-||.||+||..||-.--
T Consensus       109 C~~~~g~~-VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg  160 (203)
T PF11350_consen  109 CRNPAGGR-VVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPG  160 (203)
T ss_pred             eecCCCCe-EEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCC
Confidence              1  122 34555211            113468999999999998887753


No 38 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.96  E-value=0.028  Score=49.80  Aligned_cols=20  Identities=30%  Similarity=0.345  Sum_probs=16.9

Q ss_pred             ccchhHHHHHHHhccccccc
Q psy1890         105 VMGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~  124 (242)
                      ..+|++||+||+|||.+.-.
T Consensus       165 ~igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       165 GVGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             ceeeeehhhhcccCCCcccc
Confidence            46999999999999976654


No 39 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=92.84  E-value=0.046  Score=48.82  Aligned_cols=16  Identities=31%  Similarity=0.403  Sum_probs=13.0

Q ss_pred             cchhHHHHHHHhcccc
Q psy1890         106 MGTVQHETLHALGFWH  121 (242)
Q Consensus       106 ~~~i~HEigHALGl~H  121 (242)
                      .++++||+||++||.|
T Consensus       194 ~~~f~HE~GH~~GL~H  209 (305)
T PF10462_consen  194 GNEFSHELGHNFGLGH  209 (305)
T ss_dssp             HHHHHHHHHHTTT--S
T ss_pred             cceeehhhhhhcCCCC
Confidence            4789999999999999


No 40 
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=92.48  E-value=0.16  Score=34.21  Aligned_cols=40  Identities=20%  Similarity=0.265  Sum_probs=31.4

Q ss_pred             ceEEeCC-CCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          95 SYVNLGK-NCFVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        95 q~i~l~~-~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      ..+.|.+ -+.....++|++.+.+||.|+..-...+.||.|
T Consensus        18 ~~l~F~p~ls~~eR~~vH~lA~~~gL~s~S~G~g~~R~v~v   58 (60)
T cd02641          18 TELEFPPTLSSHDRLLVHELAEELGLRHESTGEGSDRVITV   58 (60)
T ss_pred             CcEECCCCCCHHHHHHHHHHHHHcCCceEeeCCCCceEEEe
Confidence            4456654 333458999999999999999998888888766


No 41 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=92.26  E-value=0.059  Score=50.23  Aligned_cols=20  Identities=30%  Similarity=0.476  Sum_probs=17.5

Q ss_pred             cchhHHHHHHHhcccccccC
Q psy1890         106 MGTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~r  125 (242)
                      .|.++|||||+||+.|.-+-
T Consensus       317 iGA~lHEiGH~fg~pH~~~G  336 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQEDG  336 (423)
T ss_pred             hHHHHHHHHHhcCCCCCCCC
Confidence            47899999999999998753


No 42 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=89.62  E-value=0.14  Score=50.16  Aligned_cols=16  Identities=38%  Similarity=0.459  Sum_probs=14.7

Q ss_pred             ccchhHHHHHHHhccc
Q psy1890         105 VMGTVQHETLHALGFW  120 (242)
Q Consensus       105 ~~~~i~HEigHALGl~  120 (242)
                      ..|+++|||||.|||.
T Consensus       221 giGVfaHEfGH~LGLP  236 (645)
T PF05547_consen  221 GIGVFAHEFGHDLGLP  236 (645)
T ss_pred             ceEEEEeeccccCCCC
Confidence            5799999999999996


No 43 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=89.60  E-value=0.16  Score=49.36  Aligned_cols=77  Identities=17%  Similarity=0.199  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHh
Q psy1890          41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHAL  117 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHAL  117 (242)
                      ..++.|.+|++.|...-. .|....- +..+|-+...++   +.-|++..+... ..-.+|+......+.|++||+|||+
T Consensus       311 ~a~~~v~~~~~~l~~e~~-~~~~~~~-~~~~iD~~~r~gK~~Ga~~~~~~~~~~-p~i~~N~~~~~~~v~TL~HE~GHa~  387 (587)
T TIGR02290       311 EAKELVLEAFGKFSPEMA-DFAEKAF-EEGWIDAEPRPGKRGGAFCTGFPPSKE-PRVLMNYDGSRRDVSTLAHELGHAY  387 (587)
T ss_pred             HHHHHHHHHHHhcCHHHH-HHHHHHH-HcCCcccCCCCCCCCCcccCCCCCCCC-CEEEEecCCCchhHHHHHHHhhHHH
Confidence            567888888877654311 1111111 223444433221   223444332211 1133455544555689999999999


Q ss_pred             ccc
Q psy1890         118 GFW  120 (242)
Q Consensus       118 Gl~  120 (242)
                      =..
T Consensus       388 H~~  390 (587)
T TIGR02290       388 HSE  390 (587)
T ss_pred             HHH
Confidence            443


No 44 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=89.38  E-value=0.17  Score=48.62  Aligned_cols=17  Identities=53%  Similarity=0.688  Sum_probs=14.0

Q ss_pred             cchhHHHHHHHhccccc
Q psy1890         106 MGTVQHETLHALGFWHE  122 (242)
Q Consensus       106 ~~~i~HEigHALGl~HE  122 (242)
                      ..+++|||+|||||.-.
T Consensus       211 ~~~~~HEi~HaLGFs~~  227 (521)
T PF01457_consen  211 FRTVIHEIAHALGFSSS  227 (521)
T ss_dssp             HHHHHHHHHHHTT-SHH
T ss_pred             cceeeeeeeeeeeeccc
Confidence            47999999999999763


No 45 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=89.09  E-value=0.73  Score=42.50  Aligned_cols=25  Identities=24%  Similarity=0.175  Sum_probs=17.7

Q ss_pred             eEEeCCCCcccchhHHHHHHHhccc
Q psy1890          96 YVNLGKNCFVMGTVQHETLHALGFW  120 (242)
Q Consensus        96 ~i~l~~~c~~~~~i~HEigHALGl~  120 (242)
                      .+|+........|+.||+|||+=..
T Consensus       213 ~~n~~~~~~~v~tl~HE~GHa~h~~  237 (427)
T cd06459         213 LMNFNGTLDDVFTLAHELGHAFHSY  237 (427)
T ss_pred             EecCCCChhhHHHHHHHhhHHHHHH
Confidence            3444555556789999999997444


No 46 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=89.02  E-value=0.74  Score=44.77  Aligned_cols=75  Identities=19%  Similarity=0.262  Sum_probs=36.3

Q ss_pred             HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC---CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHh
Q psy1890          41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN---GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHAL  117 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~---~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHAL  117 (242)
                      +.++.|.+|++.+.. ....++...- +..||-...+++   |.-|++..+ . ...--+|+...-..+.|++||+|||+
T Consensus       315 ~a~~~v~~~~~~~g~-~~~~~~~~~~-~~~wiD~~~r~gK~~Ga~~~~~~~-~-~p~il~N~~~~~~dv~TLaHElGHa~  390 (591)
T TIGR00181       315 EAKELILKSLEPLGE-EYIKILKRAF-NERWVDYAENKGKRSGAYSIGGYK-V-KPYILMNWDGTLNSVFTLAHELGHSM  390 (591)
T ss_pred             HHHHHHHHHHhccCH-HHHHHHHHHh-hCCCeeecCCCCCCCCcccCCCCC-C-CCeEEEecCCCcchHHHHHHHhhhHH
Confidence            456667777754321 1111111111 124665543321   233555433 1 11123455444455789999999998


Q ss_pred             cc
Q psy1890         118 GF  119 (242)
Q Consensus       118 Gl  119 (242)
                      =.
T Consensus       391 H~  392 (591)
T TIGR00181       391 HS  392 (591)
T ss_pred             HH
Confidence            44


No 47 
>PTZ00337 surface protease GP63; Provisional
Probab=88.07  E-value=0.23  Score=48.17  Aligned_cols=17  Identities=41%  Similarity=0.501  Sum_probs=14.7

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...+++|||.|||||..
T Consensus       229 ~~~v~~HEi~HALGFs~  245 (567)
T PTZ00337        229 DVRVAAHELGHALGFVR  245 (567)
T ss_pred             HHHHHHHHHHHHHccCH
Confidence            46899999999999944


No 48 
>KOG4525|consensus
Probab=87.16  E-value=0.24  Score=46.07  Aligned_cols=20  Identities=40%  Similarity=0.552  Sum_probs=17.4

Q ss_pred             cchhHHHHHHHhcccccccC
Q psy1890         106 MGTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~r  125 (242)
                      .|.++|||||.||+.|.-.-
T Consensus       302 lGA~~HElGH~lgcpHq~~G  321 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQSEG  321 (614)
T ss_pred             HHHHHHHhhhccCCCCCCCc
Confidence            48899999999999997653


No 49 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.14  E-value=0.35  Score=47.08  Aligned_cols=17  Identities=53%  Similarity=0.690  Sum_probs=14.6

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...+++|||.|||||.-
T Consensus       256 ~~rv~~HEi~HALGFS~  272 (622)
T PTZ00257        256 TTRTVTHEVAHALGFSS  272 (622)
T ss_pred             HHHHHHHHHHHHhcCCH
Confidence            46899999999999943


No 50 
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=85.98  E-value=1  Score=30.37  Aligned_cols=40  Identities=13%  Similarity=0.150  Sum_probs=31.6

Q ss_pred             ceEEeCCCC-cccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          95 SYVNLGKNC-FVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        95 q~i~l~~~c-~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      +.+.|.+.. .....++|++.+.+||.|+-.-...+.||.|
T Consensus        18 ~~l~f~p~lt~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v   58 (60)
T cd02640          18 RDMVFSPEFSKEERALIHQIAQKYGLKSRSYGSGNDRYLVI   58 (60)
T ss_pred             ceEEcCCCCCHHHHHHHHHHHHHcCCceeeEeCCCCeEEEE
Confidence            567776533 3358999999999999999888888888765


No 51 
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=84.97  E-value=1.5  Score=29.39  Aligned_cols=40  Identities=13%  Similarity=0.113  Sum_probs=30.7

Q ss_pred             ceEEeCCC-CcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          95 SYVNLGKN-CFVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        95 q~i~l~~~-c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      +.+.+.++ ......++|++.+.+||.|+-.-...+.||.|
T Consensus        17 ~~l~Fpp~ls~~eR~~vH~~a~~~gL~s~S~G~g~~R~v~v   57 (59)
T cd06007          17 EEYEFPSSLTNHERAVIHRLCRKLGLKSKSKGKGSNRRLSV   57 (59)
T ss_pred             cEEEcCCCCCHHHHHHHHHHHHHcCCCceeecCCCCeEEEE
Confidence            55666653 23458999999999999999877777777765


No 52 
>KOG3607|consensus
Probab=82.67  E-value=0.6  Score=46.51  Aligned_cols=18  Identities=33%  Similarity=0.519  Sum_probs=16.7

Q ss_pred             cchhHHHHHHHhcccccc
Q psy1890         106 MGTVQHETLHALGFWHEQ  123 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh  123 (242)
                      ..+++|||||.||+.|.-
T Consensus       324 a~v~AhelgH~lGm~hD~  341 (716)
T KOG3607|consen  324 AVVLAHELGHNLGMIHDE  341 (716)
T ss_pred             HHHHHHHHHhhcCccccc
Confidence            479999999999999987


No 53 
>cd02642 R3H_encore_like R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=81.75  E-value=2.4  Score=28.63  Aligned_cols=40  Identities=13%  Similarity=0.114  Sum_probs=31.4

Q ss_pred             ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890          95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH  135 (242)
Q Consensus        95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~  135 (242)
                      +.+.|.+-......++|++....||.||.+... +.+|.|.
T Consensus        22 ~~~~f~pm~sy~RllvH~la~~~gL~s~s~~~~-~r~vvv~   61 (63)
T cd02642          22 QSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSG-GKCVIVN   61 (63)
T ss_pred             CeeEcCCCCcHHHHHHHHHHHHhCCeeEeecCC-ceEEEEE
Confidence            445555534456899999999999999999987 7788764


No 54 
>cd02646 R3H_G-patch R3H domain of a group of fungal and plant proteins with unknown function, who also contain a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the R3H domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=80.59  E-value=2.5  Score=28.04  Aligned_cols=40  Identities=10%  Similarity=0.085  Sum_probs=31.1

Q ss_pred             ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      +.+.+.+=......++|+|.+++||.++..-...+.||.|
T Consensus        17 ~~~~fppm~~~~R~~vH~lA~~~~L~S~S~G~g~~R~v~v   56 (58)
T cd02646          17 DSLSFPPMDKHGRKTIHKLANCYNLKSKSRGKGKKRFVTV   56 (58)
T ss_pred             ceEecCCCCHHHHHHHHHHHHHcCCcccccccCCceEEEE
Confidence            4566655333347899999999999999988888888766


No 55 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=80.55  E-value=0.87  Score=41.02  Aligned_cols=23  Identities=35%  Similarity=0.355  Sum_probs=19.2

Q ss_pred             CCcccchhHHHHHHHhccccccc
Q psy1890         102 NCFVMGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       102 ~c~~~~~i~HEigHALGl~HEh~  124 (242)
                      .....-|++||++|-+|+..|.-
T Consensus       193 ~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  193 PYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             cccccHHHHHHHHHHhCCCCHHH
Confidence            34456899999999999999864


No 56 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=80.32  E-value=1.3  Score=42.69  Aligned_cols=24  Identities=21%  Similarity=0.279  Sum_probs=16.8

Q ss_pred             EEeCCCCcccchhHHHHHHHhccc
Q psy1890          97 VNLGKNCFVMGTVQHETLHALGFW  120 (242)
Q Consensus        97 i~l~~~c~~~~~i~HEigHALGl~  120 (242)
                      +|+...-..+.|++||+|||+=..
T Consensus       329 ~Nf~~t~~dv~TL~HElGHa~H~~  352 (549)
T TIGR02289       329 SNFNGTSGDIDVLTHEAGHAFHVY  352 (549)
T ss_pred             EeCCCChhHHHHHHHHhhHHHHHH
Confidence            455444445689999999998443


No 57 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=79.52  E-value=0.98  Score=39.63  Aligned_cols=15  Identities=33%  Similarity=0.275  Sum_probs=11.2

Q ss_pred             cchhHHHHHHHhccc
Q psy1890         106 MGTVQHETLHALGFW  120 (242)
Q Consensus       106 ~~~i~HEigHALGl~  120 (242)
                      ..+++|||||++|-.
T Consensus       217 ~~v~vHE~GHsf~~L  231 (264)
T PF09471_consen  217 KQVVVHEFGHSFGGL  231 (264)
T ss_dssp             HHHHHHHHHHHTT--
T ss_pred             cceeeeecccccccc
Confidence            478999999977643


No 58 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.58  E-value=2  Score=36.63  Aligned_cols=24  Identities=25%  Similarity=0.461  Sum_probs=17.5

Q ss_pred             ceEEeCCCCc------ccchhHHHHHHHhc
Q psy1890          95 SYVNLGKNCF------VMGTVQHETLHALG  118 (242)
Q Consensus        95 q~i~l~~~c~------~~~~i~HEigHALG  118 (242)
                      ..++|++...      ..++++||.|||+.
T Consensus        73 k~vrLS~~vy~~~SiaAvaVAAHEvGHAiQ  102 (222)
T PF04298_consen   73 KVVRLSEDVYNGRSIAAVAVAAHEVGHAIQ  102 (222)
T ss_pred             CEEEeCCccCCCCCHHHHHHHHHHHhHHHh
Confidence            6777754322      35899999999985


No 59 
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=77.47  E-value=1.7  Score=43.76  Aligned_cols=134  Identities=10%  Similarity=0.022  Sum_probs=69.8

Q ss_pred             CCCCcccccC--CCCCCCCCeEEEEECC-C-----CCHHHHHHHH---------------HHHHHHhccccEEEEecCCC
Q psy1890          11 HDGVKNLAKT--PAKLWKDGLIYYEIDK-N-----FSVDRKRFIV---------------KALNNIMTFSCVRFRRRTHN   67 (242)
Q Consensus        11 ~~~~R~~~~~--~~~~Wp~~~I~Y~~~~-~-----~~~~~~~~I~---------------~A~~~w~~~tci~F~~~~~~   67 (242)
                      +|+-|.+...  -...|.+..-+|.+-+ +     .+...+..++               +.++.+..-+.|.|.+....
T Consensus       114 LP~y~~g~~~~~F~~~w~~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~~  193 (775)
T PF03272_consen  114 LPVYRCGQNEEEFFSEWDDSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDPV  193 (775)
T ss_pred             CCEEecCCCHHHHhhhhhhcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCccc
Confidence            4444554443  2347988777888743 2     1333444444               45566666788888876531


Q ss_pred             C---CCeEEEEccCCCCCCCCccCCCC-CccceEEeCCC---CcccchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890          68 Q---TPYIVYKESPNGKSCSSNVGFTG-KKVSYVNLGKN---CFVMGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK  140 (242)
Q Consensus        68 ~---~~~I~i~~~~~~~gc~S~vG~~~-g~~q~i~l~~~---c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~  140 (242)
                      .   ...-.++.+.+|.| -+|.|..= +. ..-+++..   -..-..++|||||+--+.-.++.+    ++.=.|+||.
T Consensus       194 ~~n~~~kYF~KAD~~G~G-~AYY~~~w~a~-ss~s~~~~L~~~~~nW~~LHEIgHgYd~~F~~n~~----~~~EVWnNI~  267 (775)
T PF03272_consen  194 DKNFNNKYFAKADKSGPG-AAYYGSNWTAQ-SSSSLSFYLNPSPTNWGALHEIGHGYDFGFTRNGT----YLNEVWNNIL  267 (775)
T ss_pred             ccccccceEEEecCCCCC-Cccccccceec-CchhHHHHhCCCCCCchhhhhhhhhcceeEeeCCc----chhhhhhhhh
Confidence            1   11223333222322 34554421 11 11122100   112357899999998887776644    2336799998


Q ss_pred             ccccccccccC
Q psy1890         141 KGSEINFKKRN  151 (242)
Q Consensus       141 ~~~~~nf~k~~  151 (242)
                      +.. ++|..+.
T Consensus       268 ~d~-yQ~~~~~  277 (775)
T PF03272_consen  268 ADR-YQYTYMT  277 (775)
T ss_pred             hhh-hhhhhcC
Confidence            643 4444443


No 60 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=77.43  E-value=1.2  Score=37.88  Aligned_cols=16  Identities=44%  Similarity=0.420  Sum_probs=12.4

Q ss_pred             chhHHHHHHHhccccccc
Q psy1890         107 GTVQHETLHALGFWHEQA  124 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~  124 (242)
                      =|++|||||+|  .|...
T Consensus        74 FtlAHELGH~l--lH~~~   89 (213)
T COG2856          74 FTLAHELGHAL--LHTDL   89 (213)
T ss_pred             HHHHHHHhHHH--hcccc
Confidence            58999999997  44444


No 61 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=76.19  E-value=1.4  Score=36.96  Aligned_cols=22  Identities=32%  Similarity=0.234  Sum_probs=16.7

Q ss_pred             cchhHHHHHHHhcccccccCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ..++.||+|||+-=.-.-.|..
T Consensus        23 l~t~~HE~gHal~a~l~G~~v~   44 (200)
T PF13398_consen   23 LVTFVHELGHALAALLTGGRVK   44 (200)
T ss_pred             HHHHHHHHHHHHHHHHhCCCcc
Confidence            3799999999987666555544


No 62 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=74.95  E-value=1.6  Score=32.45  Aligned_cols=19  Identities=32%  Similarity=0.426  Sum_probs=14.0

Q ss_pred             cchhHHHHHHHhccccccc
Q psy1890         106 MGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~  124 (242)
                      ..|++||++|-+|+.-|.-
T Consensus        74 ~~tlvhEiah~fG~~~e~l   92 (97)
T PF06262_consen   74 RDTLVHEIAHHFGISDEDL   92 (97)
T ss_dssp             HHHHHHHHHHHTT--HHHH
T ss_pred             HHHHHHHHHHHcCCCHHHh
Confidence            3689999999999876643


No 63 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=73.86  E-value=1.8  Score=31.77  Aligned_cols=23  Identities=26%  Similarity=0.160  Sum_probs=15.2

Q ss_pred             ceEEeCCCC---cccchhHHHHHHHh
Q psy1890          95 SYVNLGKNC---FVMGTVQHETLHAL  117 (242)
Q Consensus        95 q~i~l~~~c---~~~~~i~HEigHAL  117 (242)
                      ..|-+....   ...-+++|||||.+
T Consensus        29 ~~I~in~~~~~~~~~f~laHELgH~~   54 (122)
T PF06114_consen   29 PIIFINSNLSPERQRFTLAHELGHIL   54 (122)
T ss_dssp             TEEEEESSS-HHHHHHHHHHHHHHHH
T ss_pred             CEEEECCCCCHHHHHHHHHHHHHHHH
Confidence            455554432   22469999999987


No 64 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=73.71  E-value=9.4  Score=31.41  Aligned_cols=19  Identities=26%  Similarity=0.113  Sum_probs=15.5

Q ss_pred             ccchhHHHHHHHhcccccc
Q psy1890         105 VMGTVQHETLHALGFWHEQ  123 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh  123 (242)
                      ...+|+|||.|..-..|.-
T Consensus       164 idYVvvHEL~Hl~~~nHs~  182 (205)
T PF01863_consen  164 IDYVVVHELCHLRHPNHSK  182 (205)
T ss_pred             HHHHHHHHHHHhccCCCCH
Confidence            4589999999999877654


No 65 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=71.58  E-value=2.1  Score=36.52  Aligned_cols=19  Identities=37%  Similarity=0.375  Sum_probs=13.4

Q ss_pred             cchhHHHHHHHhcccccccCC
Q psy1890         106 MGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rp  126 (242)
                      ..|+.||+||||  .++.+=|
T Consensus        93 ~~~l~HE~GHAl--I~~~~lP  111 (220)
T PF14247_consen   93 LFTLYHELGHAL--IDDLDLP  111 (220)
T ss_pred             HHHHHHHHHHHH--HHHhcCC
Confidence            478999999997  4444433


No 66 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=68.32  E-value=2.2  Score=32.91  Aligned_cols=17  Identities=24%  Similarity=0.356  Sum_probs=14.3

Q ss_pred             cchhHHHHHHHhccccc
Q psy1890         106 MGTVQHETLHALGFWHE  122 (242)
Q Consensus       106 ~~~i~HEigHALGl~HE  122 (242)
                      .-+++|||||-+|+.-+
T Consensus       110 thvliHEIgHhFGLsDd  126 (136)
T COG3824         110 THVLIHEIGHHFGLSDD  126 (136)
T ss_pred             hhhhhhhhhhhcCCChh
Confidence            36899999999999754


No 67 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=68.30  E-value=2.5  Score=39.68  Aligned_cols=21  Identities=24%  Similarity=0.192  Sum_probs=15.4

Q ss_pred             ccchhHHHHHHHhcccccccC
Q psy1890         105 VMGTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~r  125 (242)
                      ...|+.||||||+=..=-.++
T Consensus       242 ~v~tLfHE~GHa~H~~ls~~~  262 (458)
T PF01432_consen  242 DVETLFHEFGHAMHSLLSRTK  262 (458)
T ss_dssp             HHHHHHHHHHHHHHHHHCCCS
T ss_pred             hHHHHHHHHhHHHHHHHhccc
Confidence            357999999999866544443


No 68 
>PF11667 DUF3267:  Protein of unknown function (DUF3267);  InterPro: IPR021683  This family of proteins has no known function. 
Probab=67.40  E-value=5  Score=30.06  Aligned_cols=20  Identities=30%  Similarity=0.245  Sum_probs=18.1

Q ss_pred             chhHHHHHHHhcccccccCC
Q psy1890         107 GTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~rp  126 (242)
                      ..++||+.|+|.+...-.++
T Consensus         6 ~~~~HEliH~l~~~~~~~~~   25 (111)
T PF11667_consen    6 LIPLHELIHGLFFKLFGKKP   25 (111)
T ss_pred             eHHHHHHHHHHHHHHhCCCC
Confidence            46899999999999998877


No 69 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=65.98  E-value=3.1  Score=40.74  Aligned_cols=78  Identities=18%  Similarity=0.117  Sum_probs=37.0

Q ss_pred             HHHHHHHHHHHHHhccccEEEEecCCCCCCeEEEEccCC-CCCCCCccCCCCCc-cceEEeCCCCcccchhHHHHHHHhc
Q psy1890          41 DRKRFIVKALNNIMTFSCVRFRRRTHNQTPYIVYKESPN-GKSCSSNVGFTGKK-VSYVNLGKNCFVMGTVQHETLHALG  118 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~tci~F~~~~~~~~~~I~i~~~~~-~~gc~S~vG~~~g~-~q~i~l~~~c~~~~~i~HEigHALG  118 (242)
                      ..++.+.++++......+=.|++.-  +..||-+..+++ .+|-+|.-...+.. ---||+...-..+.|++||+||++=
T Consensus       316 ea~~~v~~~l~~lg~ey~~~~~~a~--~~~WiD~~~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~dV~TLaHElGHs~H  393 (598)
T COG1164         316 EAKELVLKALAPLGPEYAKIARRAF--DERWIDVYPRKGKRSGAYSIGFYKGDHPFILMNYDGTLRDVFTLAHELGHSVH  393 (598)
T ss_pred             HHHHHHHHHHHhhCHHHHHHHHHHH--hcCCeeccCCCCCCCCcccCCCCCCCCCeEEEeCCCchhHHHHHHHHccHHHH
Confidence            4566666666655543222122221  225555533321 13323211111122 1234544334457899999999986


Q ss_pred             cc
Q psy1890         119 FW  120 (242)
Q Consensus       119 l~  120 (242)
                      ..
T Consensus       394 s~  395 (598)
T COG1164         394 SY  395 (598)
T ss_pred             HH
Confidence            53


No 70 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=65.90  E-value=3.2  Score=33.94  Aligned_cols=23  Identities=30%  Similarity=0.234  Sum_probs=16.3

Q ss_pred             cchhHHHHHHHhcccccccCCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPDR  128 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDR  128 (242)
                      ..++.||+||++--.....+..+
T Consensus         8 i~i~~HE~gH~~~a~~~G~~~~~   30 (192)
T PF02163_consen    8 ISIVLHELGHALAARLYGDKVPR   30 (192)
T ss_dssp             HHHHHHHHHHHHHHHTTT--B--
T ss_pred             ccccccccccccccccccccccc
Confidence            36899999999998887776543


No 71 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=64.49  E-value=3.3  Score=37.54  Aligned_cols=14  Identities=36%  Similarity=0.323  Sum_probs=11.7

Q ss_pred             ccchhHHHHHHHhc
Q psy1890         105 VMGTVQHETLHALG  118 (242)
Q Consensus       105 ~~~~i~HEigHALG  118 (242)
                      ...|+.||+|||+=
T Consensus       154 ~v~tl~HE~GHa~h  167 (365)
T cd06258         154 DINTLFHEFGHAVH  167 (365)
T ss_pred             HHHHHHHHHhHHHH
Confidence            35789999999983


No 72 
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=62.86  E-value=4.5  Score=30.60  Aligned_cols=18  Identities=39%  Similarity=0.530  Sum_probs=15.9

Q ss_pred             chhHHHHHHHhccccccc
Q psy1890         107 GTVQHETLHALGFWHEQA  124 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~  124 (242)
                      ..++|=+.|.|||-|+..
T Consensus        75 ~l~iHG~LHLlGyDH~~~   92 (110)
T TIGR00043        75 HLTVHGLLHLLGYDHETE   92 (110)
T ss_pred             HHHHHHHHHHcCCCCCCh
Confidence            679999999999999853


No 73 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=62.44  E-value=5.6  Score=33.65  Aligned_cols=76  Identities=26%  Similarity=0.209  Sum_probs=45.1

Q ss_pred             chhHHHHHHHhcccccccCCCCCceEEEeeecccccccc-cccccCccc----cccCCCcCCCCcccccCCCccccCCCC
Q psy1890         107 GTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKGSEI-NFKKRNFKE----ATTLDLPYDFGSIMHYSRFAFSRDSKS  181 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~~~~-nf~k~~~~~----~~~~g~~yD~~SIMhY~~~~f~~~g~~  181 (242)
                      -+.-|+++|++|+.|--.|-|      |+   +.++... +....-.++    ....| --|..|.|.|+..        
T Consensus       155 l~~~efl~~~~~~~~lnhr~~------i~---~~pg~~~e~L~~tarhElGhaLgi~g-hsd~ks~aly~~s--------  216 (236)
T COG5549         155 LIAYEFLGHALGLGHLNHRGD------IM---YPPGELRENLNPTARHELGHALGIWG-HSDLKSDALYGIS--------  216 (236)
T ss_pred             HHHHHHhhhccCccccccccc------cc---CCcccchhhhhHHHHHhhcchheecc-cccchhhhhcchh--------
Confidence            356789999999999888866      22   2222211 111000000    11122 2478888888742        


Q ss_pred             CeeeeccCCCccccccCCcCcccHHHHHHHHhHCCCC
Q psy1890         182 PTITPKKTLNVQADKMGQRINISKMDIAKLNTLYKCP  218 (242)
Q Consensus       182 ~ti~~~~~~~~~~~~ig~r~~lS~~Di~~in~lY~C~  218 (242)
                                        ..+.|..|++.|.++|.-|
T Consensus       217 ------------------~~~is~rdv~tL~r~Yeqp  235 (236)
T COG5549         217 ------------------WQGISRRDVKTLERKYEQP  235 (236)
T ss_pred             ------------------hcccCHHHHHHHHHHhcCC
Confidence                              1138899999999999654


No 74 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=61.78  E-value=4.1  Score=38.63  Aligned_cols=14  Identities=29%  Similarity=0.226  Sum_probs=11.6

Q ss_pred             ccchhHHHHHHHhc
Q psy1890         105 VMGTVQHETLHALG  118 (242)
Q Consensus       105 ~~~~i~HEigHALG  118 (242)
                      ...|+.|||||||=
T Consensus       263 ~V~TLfHEfGHalH  276 (472)
T cd06455         263 EVETFFHEFGHVIH  276 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35789999999993


No 75 
>PF15639 Tox-MPTase3:  Metallopeptidase toxin 3
Probab=61.23  E-value=19  Score=28.07  Aligned_cols=69  Identities=20%  Similarity=0.265  Sum_probs=40.4

Q ss_pred             HHHHHHHHHHHhccccEEEEecCC-CCCCeEEEEccCCCCCCCCccCCCCC--ccceEEeC----------CC-------
Q psy1890          43 KRFIVKALNNIMTFSCVRFRRRTH-NQTPYIVYKESPNGKSCSSNVGFTGK--KVSYVNLG----------KN-------  102 (242)
Q Consensus        43 ~~~I~~A~~~w~~~tci~F~~~~~-~~~~~I~i~~~~~~~gc~S~vG~~~g--~~q~i~l~----------~~-------  102 (242)
                      ...|-+|+..|-..+-....+.-. +....|.|...  ..||  ..|....  ..|.|.+.          ++       
T Consensus        22 ~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~--~~~~--A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~   97 (135)
T PF15639_consen   22 VPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPL--VGGT--AYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNV   97 (135)
T ss_pred             ChHHHHHHHHHcccchhhhhhccccCCCCEEEEeec--ccce--eeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceE
Confidence            457778888877655444443322 34578999876  5677  3444321  13666652          11       


Q ss_pred             CcccchhHHHHHH
Q psy1890         103 CFVMGTVQHETLH  115 (242)
Q Consensus       103 c~~~~~i~HEigH  115 (242)
                      -....|++||+-|
T Consensus        98 ~~v~~TlLHEliH  110 (135)
T PF15639_consen   98 YLVGSTLLHELIH  110 (135)
T ss_pred             EEeehHHHHHHHH
Confidence            0123799999988


No 76 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=60.97  E-value=4.5  Score=32.83  Aligned_cols=22  Identities=23%  Similarity=0.123  Sum_probs=16.8

Q ss_pred             cchhHHHHHHHhcccccccCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ..++.||+||++=-.+.-.+..
T Consensus         9 i~i~iHE~gH~~~A~~~G~~~~   30 (180)
T cd05709           9 ISVTVHELGHALVARRLGVKVA   30 (180)
T ss_pred             HHHHHHHHHHHHHHHHcCCCch
Confidence            3678999999997776666544


No 77 
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=60.95  E-value=3.9  Score=31.11  Aligned_cols=37  Identities=30%  Similarity=0.438  Sum_probs=22.2

Q ss_pred             CCCCCCccCCCCCccceEEeCCCCcccchhHHHHHHHhccc
Q psy1890          80 GKSCSSNVGFTGKKVSYVNLGKNCFVMGTVQHETLHALGFW  120 (242)
Q Consensus        80 ~~gc~S~vG~~~g~~q~i~l~~~c~~~~~i~HEigHALGl~  120 (242)
                      ++-|.|++|..--.+.  +-..||  ..|+.||+.|+..|.
T Consensus        58 ~~LCG~~~~~i~IDP~--~~~KGC--~~TL~HEL~H~WQ~R   94 (141)
T PHA02456         58 QDLCGQFVGWIEIDPD--YANKGC--RDTLAHELNHAWQFR   94 (141)
T ss_pred             cchhhcceeEEEECCc--ccccch--HHHHHHHHHHHHhhh
Confidence            4557676665321100  012467  469999999998764


No 78 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=59.92  E-value=4.5  Score=38.25  Aligned_cols=15  Identities=27%  Similarity=0.355  Sum_probs=12.9

Q ss_pred             ccchhHHHHHHHhcc
Q psy1890         105 VMGTVQHETLHALGF  119 (242)
Q Consensus       105 ~~~~i~HEigHALGl  119 (242)
                      ..++++|||||..|=
T Consensus       130 lagViAHEigHv~qr  144 (484)
T COG4783         130 LAGVIAHEIGHVAQR  144 (484)
T ss_pred             HHHHHHHHHHHHhhh
Confidence            459999999998874


No 79 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=59.60  E-value=5.2  Score=32.02  Aligned_cols=12  Identities=33%  Similarity=0.595  Sum_probs=10.4

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .-+++|||.|++
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            469999999985


No 80 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=59.45  E-value=4.7  Score=38.08  Aligned_cols=18  Identities=28%  Similarity=0.222  Sum_probs=13.8

Q ss_pred             cchhHHHHHHHhcccccc
Q psy1890         106 MGTVQHETLHALGFWHEQ  123 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh  123 (242)
                      ..|+.|||||||=..-.+
T Consensus       249 v~TLfHEfGHalH~~ls~  266 (458)
T cd06457         249 VETLFHEMGHAMHSMLGR  266 (458)
T ss_pred             HHHHHHHHhHHHHHHHcC
Confidence            478999999998654443


No 81 
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=58.20  E-value=4.1  Score=39.13  Aligned_cols=24  Identities=13%  Similarity=0.375  Sum_probs=18.3

Q ss_pred             CHHHHHHHHHHHHHHhccccEEEE
Q psy1890          39 SVDRKRFIVKALNNIMTFSCVRFR   62 (242)
Q Consensus        39 ~~~~~~~I~~A~~~w~~~tci~F~   62 (242)
                      +...|+.+++|+.+-.++..|...
T Consensus       234 ~~~ar~lvkeal~~~ak~~~I~l~  257 (760)
T COG4412         234 PSGARKLVKEALNTAAKHQAIQLQ  257 (760)
T ss_pred             ChhHHHHHHHHHHHHhhcccCChh
Confidence            456899999999998886655543


No 82 
>PF02130 UPF0054:  Uncharacterized protein family UPF0054;  InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=58.15  E-value=6  Score=31.37  Aligned_cols=16  Identities=44%  Similarity=0.642  Sum_probs=14.1

Q ss_pred             chhHHHHHHHhccccc
Q psy1890         107 GTVQHETLHALGFWHE  122 (242)
Q Consensus       107 ~~i~HEigHALGl~HE  122 (242)
                      ..++|=+.|.||+-|+
T Consensus       110 ~l~vHG~LHLlGyDH~  125 (145)
T PF02130_consen  110 RLLVHGLLHLLGYDHE  125 (145)
T ss_dssp             HHHHHHHHHHTT-SST
T ss_pred             HHHHHHHHHHcCCCCC
Confidence            6799999999999999


No 83 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=55.17  E-value=6.6  Score=32.42  Aligned_cols=22  Identities=18%  Similarity=-0.065  Sum_probs=15.8

Q ss_pred             cchhHHHHHHHhcccccccCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ..++.||+||.+=-.-...+.+
T Consensus        10 ~~v~iHElGH~~~Ar~~Gv~v~   31 (182)
T cd06163          10 ILIFVHELGHFLVAKLFGVKVE   31 (182)
T ss_pred             HHHHHHHHHHHHHHHHcCCeee
Confidence            3579999999876665555543


No 84 
>COG0319 Predicted metal-dependent hydrolase [General function prediction only]
Probab=54.83  E-value=7.3  Score=31.35  Aligned_cols=18  Identities=39%  Similarity=0.562  Sum_probs=16.5

Q ss_pred             chhHHHHHHHhccccccc
Q psy1890         107 GTVQHETLHALGFWHEQA  124 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~  124 (242)
                      ..++|=+.|.||+-|+..
T Consensus       110 ~l~vHG~LHLlGYDH~~~  127 (153)
T COG0319         110 HLTIHGILHLLGYDHEED  127 (153)
T ss_pred             HHHHHHHHHHccCCCCCc
Confidence            679999999999999986


No 85 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=54.62  E-value=6.4  Score=36.82  Aligned_cols=14  Identities=43%  Similarity=0.342  Sum_probs=11.7

Q ss_pred             ccchhHHHHHHHhc
Q psy1890         105 VMGTVQHETLHALG  118 (242)
Q Consensus       105 ~~~~i~HEigHALG  118 (242)
                      ...|+.|||||||=
T Consensus       208 ~v~tLfHEfGHalH  221 (422)
T cd06456         208 EVTTLFHEFGHALH  221 (422)
T ss_pred             HHHHHHHHHHHHHH
Confidence            35789999999994


No 86 
>KOG3658|consensus
Probab=54.48  E-value=4  Score=40.08  Aligned_cols=18  Identities=33%  Similarity=0.303  Sum_probs=15.5

Q ss_pred             cchhHHHHHHHhcccccc
Q psy1890         106 MGTVQHETLHALGFWHEQ  123 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh  123 (242)
                      .-|++|||||.+|-.|.-
T Consensus       393 ~lt~AHEiGHNfGSpHDp  410 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDP  410 (764)
T ss_pred             heeehhhhccccCCCCCC
Confidence            368999999999999963


No 87 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=54.11  E-value=7.3  Score=28.72  Aligned_cols=12  Identities=25%  Similarity=0.274  Sum_probs=10.7

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      ..+++||+.|++
T Consensus        26 ~~~l~HE~~H~~   37 (128)
T PF13485_consen   26 DRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHH
Confidence            389999999996


No 88 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=53.69  E-value=6.5  Score=32.94  Aligned_cols=24  Identities=25%  Similarity=0.338  Sum_probs=16.5

Q ss_pred             ceEEeCC------CCcccchhHHHHHHHhc
Q psy1890          95 SYVNLGK------NCFVMGTVQHETLHALG  118 (242)
Q Consensus        95 q~i~l~~------~c~~~~~i~HEigHALG  118 (242)
                      +.+.|.+      .....++++||.|||+.
T Consensus        76 kvvrLSe~~y~g~Sia~~aVAAHEVGHAiQ  105 (226)
T COG2738          76 KVVRLSEANYYGPSIAAIAVAAHEVGHAIQ  105 (226)
T ss_pred             heeeccccccCCccHHHHHHHHHHhhHHHh
Confidence            5566643      22345899999999984


No 89 
>PRK00016 metal-binding heat shock protein; Provisional
Probab=52.59  E-value=7.6  Score=31.35  Aligned_cols=17  Identities=35%  Similarity=0.485  Sum_probs=15.4

Q ss_pred             chhHHHHHHHhcccccc
Q psy1890         107 GTVQHETLHALGFWHEQ  123 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh  123 (242)
                      ..++|=++|.||+-|+.
T Consensus       115 ~l~iHG~LHLlGYDH~~  131 (159)
T PRK00016        115 HLTVHGILHLLGYDHIE  131 (159)
T ss_pred             HHHHHhhHHhcCCCCCC
Confidence            57999999999999975


No 90 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=52.38  E-value=7.5  Score=34.38  Aligned_cols=22  Identities=32%  Similarity=0.370  Sum_probs=17.4

Q ss_pred             cchhHHHHHHHhcccccccCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ..++.||++||+=-..|..+-+
T Consensus       136 isvvvHElgHal~A~~~gi~V~  157 (277)
T cd06162         136 ISGVVHEMGHGVAAVREQVRVN  157 (277)
T ss_pred             HHHHHHHHHHHHHHHHcCCeec
Confidence            4789999999998887765544


No 91 
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=52.26  E-value=7.9  Score=36.01  Aligned_cols=28  Identities=36%  Similarity=0.310  Sum_probs=18.2

Q ss_pred             eEEeCCC--CcccchhHHHHHHHhcccccccCC
Q psy1890          96 YVNLGKN--CFVMGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus        96 ~i~l~~~--c~~~~~i~HEigHALGl~HEh~rp  126 (242)
                      +++....  ....-+++||+||||   |||+-|
T Consensus       148 tt~y~~~d~~~~l~t~iHE~GHal---ye~~l~  177 (396)
T cd06460         148 TTRYDENDFRSALFSTIHETGHAL---YEQGLP  177 (396)
T ss_pred             EeeeCCcchHHHHHHHHHHhhHHH---HHhcCC
Confidence            4444443  233468999999997   677544


No 92 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=51.96  E-value=7.6  Score=34.02  Aligned_cols=23  Identities=26%  Similarity=0.267  Sum_probs=19.0

Q ss_pred             cchhHHHHHHHhcccccccCCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPDR  128 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDR  128 (242)
                      ..++.||+|||+=-.++..+.+.
T Consensus       119 isv~iHElgHa~~Ar~~G~~V~~  141 (263)
T cd06159         119 VGVVVHELSHGILARVEGIKVKS  141 (263)
T ss_pred             HHHHHHHHHHHHHHHHcCCEECc
Confidence            46899999999988888877653


No 93 
>cd06006 R3H_unknown_2 R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA  or ssRNA in a sequence-specific manner.
Probab=51.57  E-value=19  Score=24.12  Aligned_cols=38  Identities=18%  Similarity=0.236  Sum_probs=29.8

Q ss_pred             EEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          97 VNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        97 i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      +.|.+=......++||+..+.||.=|..-+.-+.||-|
T Consensus        20 ~~f~pM~~~~R~~vHdla~~~gl~SeS~d~Ep~R~V~v   57 (59)
T cd06006          20 LRFPPMRSPQRAFIHELAKDYGLYSESQDPEPKRSVFV   57 (59)
T ss_pred             eeCCCCCHHHHHHHHHHHHHcCCeeEecCCCCCcEEEE
Confidence            44444333458899999999999999999888888765


No 94 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=49.84  E-value=9  Score=31.75  Aligned_cols=12  Identities=25%  Similarity=0.235  Sum_probs=10.9

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .++++||+||..
T Consensus        90 ~aVlaHElgH~~  101 (226)
T PF01435_consen   90 AAVLAHELGHIK  101 (226)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHH
Confidence            489999999987


No 95 
>PRK13963 unkown domain/putative metalloprotease fusion protein; Provisional
Probab=49.01  E-value=9.9  Score=33.15  Aligned_cols=16  Identities=44%  Similarity=0.632  Sum_probs=15.1

Q ss_pred             chhHHHHHHHhccccc
Q psy1890         107 GTVQHETLHALGFWHE  122 (242)
Q Consensus       107 ~~i~HEigHALGl~HE  122 (242)
                      ..++|=++|.|||-|+
T Consensus       217 ~LlIHGlLHLLGYDHe  232 (258)
T PRK13963        217 HLLVHGALHAQGYDHE  232 (258)
T ss_pred             HHHHHHHHHHcCCCCC
Confidence            6799999999999999


No 96 
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=48.38  E-value=9.6  Score=31.33  Aligned_cols=13  Identities=31%  Similarity=0.319  Sum_probs=11.0

Q ss_pred             cchhHHHHHHHhc
Q psy1890         106 MGTVQHETLHALG  118 (242)
Q Consensus       106 ~~~i~HEigHALG  118 (242)
                      .+.++||++||+=
T Consensus        10 ~~i~~HE~aHa~~   22 (181)
T cd06158          10 LAITLHEFAHAYV   22 (181)
T ss_pred             HHHHHHHHHHHHH
Confidence            4789999999983


No 97 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=47.59  E-value=9.9  Score=32.46  Aligned_cols=23  Identities=17%  Similarity=0.011  Sum_probs=19.0

Q ss_pred             cchhHHHHHHHhcccccccCCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPDR  128 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDR  128 (242)
                      ..++.||+||++=..+...+..+
T Consensus        54 ~~v~iHElgH~~~A~~~G~~v~~   76 (227)
T cd06164          54 ASVLLHELGHSLVARRYGIPVRS   76 (227)
T ss_pred             HHHHHHHHHHHHHHHHcCCeECe
Confidence            47899999999988888877754


No 98 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=46.80  E-value=11  Score=29.73  Aligned_cols=15  Identities=33%  Similarity=0.317  Sum_probs=12.7

Q ss_pred             cchhHHHHHHHhccc
Q psy1890         106 MGTVQHETLHALGFW  120 (242)
Q Consensus       106 ~~~i~HEigHALGl~  120 (242)
                      ..|++||+.|++=+.
T Consensus        60 ~~~l~HEm~H~~~~~   74 (146)
T smart00731       60 RETLLHELCHAALYL   74 (146)
T ss_pred             HhhHHHHHHHHHHHH
Confidence            479999999998664


No 99 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=46.73  E-value=12  Score=26.56  Aligned_cols=14  Identities=21%  Similarity=0.235  Sum_probs=11.8

Q ss_pred             ccchhHHHHHHHhc
Q psy1890         105 VMGTVQHETLHALG  118 (242)
Q Consensus       105 ~~~~i~HEigHALG  118 (242)
                      ....+.|||.|++-
T Consensus        61 ~~~llaHEl~Hv~Q   74 (79)
T PF13699_consen   61 GRALLAHELAHVVQ   74 (79)
T ss_pred             cchhHhHHHHHHHh
Confidence            45799999999974


No 100
>KOG2921|consensus
Probab=46.32  E-value=10  Score=35.23  Aligned_cols=34  Identities=29%  Similarity=0.214  Sum_probs=24.0

Q ss_pred             cchhHHHHHHHhcccccccCCCCCceEEEeeeccccc
Q psy1890         106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIKKG  142 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~~~  142 (242)
                      ...+.||+||||--.-|+-+-   +++-|.---|.|+
T Consensus       132 i~~vvHElGHalAA~segV~v---ngfgIfi~aiyPg  165 (484)
T KOG2921|consen  132 ITVVVHELGHALAAASEGVQV---NGFGIFIAAIYPG  165 (484)
T ss_pred             HHHHHHHhhHHHHHHhcCcee---eeeEEEEEEEcCc
Confidence            367899999999998888764   3445544445554


No 101
>cd02325 R3H R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=46.16  E-value=32  Score=21.61  Aligned_cols=40  Identities=18%  Similarity=0.092  Sum_probs=30.5

Q ss_pred             ceEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEE
Q psy1890          95 SYVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDI  134 (242)
Q Consensus        95 q~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i  134 (242)
                      ..+.|.+-.....-++|++.-..||.++....+-+.+|.|
T Consensus        18 ~~~~~~p~~~~~R~~vH~la~~~~L~s~s~g~~~~r~v~i   57 (59)
T cd02325          18 KSLELPPMNSYERKLIHDLAEYYGLKSESEGEGPNRRVVI   57 (59)
T ss_pred             CeEEcCCCCHHHHHHHHHHHHHCCCEEEEecCCCCcEEEE
Confidence            4455554333458899999999999999998887777765


No 102
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=45.96  E-value=11  Score=29.62  Aligned_cols=17  Identities=29%  Similarity=0.491  Sum_probs=14.3

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...|++||+.||+=+..
T Consensus        60 ~~~tL~HEm~H~~~~~~   76 (157)
T PF10263_consen   60 LIDTLLHEMAHAAAYVF   76 (157)
T ss_pred             HHHHHHHHHHHHHhhhc
Confidence            35899999999988765


No 103
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=45.79  E-value=21  Score=24.00  Aligned_cols=29  Identities=14%  Similarity=0.095  Sum_probs=20.8

Q ss_pred             ceEEeCCCCc-ccchhHHHHHHHhcccccc
Q psy1890          95 SYVNLGKNCF-VMGTVQHETLHALGFWHEQ  123 (242)
Q Consensus        95 q~i~l~~~c~-~~~~i~HEigHALGl~HEh  123 (242)
                      ..+.++++-. ....++|.+.|-|||.|.-
T Consensus        18 ~eL~Fp~~ls~~eRriih~la~~lGL~~~s   47 (60)
T cd02639          18 DELAFPSSLSPAERRIVHLLASRLGLNHVS   47 (60)
T ss_pred             eEEEcCCCCCHHHHHHHHHHHHHcCCceEE
Confidence            4455544322 3478999999999999983


No 104
>PRK10911 oligopeptidase A; Provisional
Probab=45.42  E-value=10  Score=37.75  Aligned_cols=12  Identities=42%  Similarity=0.343  Sum_probs=10.8

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      +.|+.|||||||
T Consensus       464 v~tlfHEfGHal  475 (680)
T PRK10911        464 VITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHhHHH
Confidence            478999999998


No 105
>PRK02870 heat shock protein HtpX; Provisional
Probab=45.22  E-value=8.8  Score=34.88  Aligned_cols=12  Identities=25%  Similarity=0.393  Sum_probs=10.5

Q ss_pred             ccchhHHHHHHH
Q psy1890         105 VMGTVQHETLHA  116 (242)
Q Consensus       105 ~~~~i~HEigHA  116 (242)
                      ..++++||+||.
T Consensus       173 L~aVlAHELgHi  184 (336)
T PRK02870        173 LQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHH
Confidence            359999999997


No 106
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=43.09  E-value=13  Score=31.39  Aligned_cols=17  Identities=24%  Similarity=0.190  Sum_probs=13.3

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...+++||+|||.=...
T Consensus        93 ~gsiLAHE~mHa~Lrl~  109 (212)
T PF12315_consen   93 TGSILAHELMHAWLRLN  109 (212)
T ss_pred             HhhHHHHHHHHHHhccc
Confidence            34789999999986553


No 107
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=41.70  E-value=12  Score=29.74  Aligned_cols=14  Identities=50%  Similarity=0.622  Sum_probs=11.4

Q ss_pred             ccchhHHHHHH--Hhc
Q psy1890         105 VMGTVQHETLH--ALG  118 (242)
Q Consensus       105 ~~~~i~HEigH--ALG  118 (242)
                      ..+|++||+-|  +.|
T Consensus        96 k~~TLiHE~SHf~~~~  111 (148)
T PF14521_consen   96 KEGTLIHEWSHFTAVG  111 (148)
T ss_dssp             HHHHHHHHHHHSCCCT
T ss_pred             hHHHHHHhhhhhhhhc
Confidence            45899999999  644


No 108
>KOG3538|consensus
Probab=40.90  E-value=8.4  Score=39.34  Aligned_cols=23  Identities=26%  Similarity=0.287  Sum_probs=18.6

Q ss_pred             ccchhHHHHHHHhcccccccCCC
Q psy1890         105 VMGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~rpD  127 (242)
                      ..-+++||+||.+|+.|--...+
T Consensus       317 ~aftiahE~GH~~gm~hd~~~~~  339 (845)
T KOG3538|consen  317 SAFTIAHELGHNFGMEHDGRGNS  339 (845)
T ss_pred             cceeeeeccccccCcccCCcCCC
Confidence            34689999999999999855543


No 109
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=40.88  E-value=15  Score=30.31  Aligned_cols=24  Identities=17%  Similarity=-0.025  Sum_probs=19.9

Q ss_pred             cchhHHHHHHHhcccccccCCCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPDRD  129 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDRD  129 (242)
                      ...+.||+||++--.....+..+-
T Consensus        42 ~~l~iHElgH~~~A~~~G~~~~~~   65 (183)
T cd06160          42 AILGIHEMGHYLAARRHGVKASLP   65 (183)
T ss_pred             HHHHHHHHHHHHHHHHCCCCccce
Confidence            467899999999999888887653


No 110
>PRK03982 heat shock protein HtpX; Provisional
Probab=40.76  E-value=14  Score=32.58  Aligned_cols=13  Identities=31%  Similarity=0.498  Sum_probs=10.8

Q ss_pred             ccchhHHHHHHHh
Q psy1890         105 VMGTVQHETLHAL  117 (242)
Q Consensus       105 ~~~~i~HEigHAL  117 (242)
                      -.++++||+||.-
T Consensus       125 l~AVlAHElgHi~  137 (288)
T PRK03982        125 LEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHH
Confidence            3589999999964


No 111
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=40.03  E-value=15  Score=36.74  Aligned_cols=13  Identities=38%  Similarity=0.284  Sum_probs=11.2

Q ss_pred             cchhHHHHHHHhc
Q psy1890         106 MGTVQHETLHALG  118 (242)
Q Consensus       106 ~~~i~HEigHALG  118 (242)
                      +.|+-|||||||=
T Consensus       465 V~TlFHEfGHalH  477 (681)
T PRK10280        465 VITLFHEFGHTLH  477 (681)
T ss_pred             HHHHHHHHHHHHH
Confidence            4689999999985


No 112
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=40.01  E-value=15  Score=32.08  Aligned_cols=12  Identities=25%  Similarity=0.199  Sum_probs=10.7

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .+++.||+||..
T Consensus       158 ~aVlaHElgHi~  169 (302)
T COG0501         158 EAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHh
Confidence            589999999975


No 113
>PRK03001 M48 family peptidase; Provisional
Probab=38.95  E-value=16  Score=32.21  Aligned_cols=13  Identities=31%  Similarity=0.526  Sum_probs=10.8

Q ss_pred             ccchhHHHHHHHh
Q psy1890         105 VMGTVQHETLHAL  117 (242)
Q Consensus       105 ~~~~i~HEigHAL  117 (242)
                      -.++++||+||.-
T Consensus       124 l~aVlAHElgHi~  136 (283)
T PRK03001        124 IRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHh
Confidence            3589999999964


No 114
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.69  E-value=16  Score=30.58  Aligned_cols=24  Identities=25%  Similarity=0.156  Sum_probs=19.6

Q ss_pred             ccchhHHHHHHHhcccccccCCCC
Q psy1890         105 VMGTVQHETLHALGFWHEQARPDR  128 (242)
Q Consensus       105 ~~~~i~HEigHALGl~HEh~rpDR  128 (242)
                      ...++.||+||++=..+...+..+
T Consensus        38 ~~~v~iHElgH~~~A~~~G~~v~~   61 (208)
T cd06161          38 FLSVLLHELGHALVARRYGIRVRS   61 (208)
T ss_pred             HHHHHHHHHHHHHHHHHcCCCccc
Confidence            347899999999988888887654


No 115
>PRK03072 heat shock protein HtpX; Provisional
Probab=38.20  E-value=16  Score=32.29  Aligned_cols=13  Identities=31%  Similarity=0.509  Sum_probs=10.6

Q ss_pred             ccchhHHHHHHHh
Q psy1890         105 VMGTVQHETLHAL  117 (242)
Q Consensus       105 ~~~~i~HEigHAL  117 (242)
                      -.++++||+||.-
T Consensus       127 l~aVlAHElgHi~  139 (288)
T PRK03072        127 LRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHh
Confidence            3599999999943


No 116
>smart00393 R3H Putative single-stranded nucleic acids-binding domain.
Probab=38.09  E-value=44  Score=23.25  Aligned_cols=40  Identities=23%  Similarity=0.100  Sum_probs=28.8

Q ss_pred             eEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890          96 YVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH  135 (242)
Q Consensus        96 ~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~  135 (242)
                      .+.|.+=.....-++|++...+||.|+.....-+.+|.|.
T Consensus        38 ~~~~~pm~~~~R~~iH~~a~~~~l~s~S~g~g~~R~vvv~   77 (79)
T smart00393       38 SVELPPMNSYERKIVHELAEKYGLESESFGEGPKRRVVIS   77 (79)
T ss_pred             eEEcCCCCHHHHHHHHHHHHHcCCEEEEEcCCCCcEEEEE
Confidence            4455442234578999999999999998776666676653


No 117
>PRK01345 heat shock protein HtpX; Provisional
Probab=37.52  E-value=17  Score=32.70  Aligned_cols=13  Identities=38%  Similarity=0.560  Sum_probs=10.6

Q ss_pred             ccchhHHHHHHHh
Q psy1890         105 VMGTVQHETLHAL  117 (242)
Q Consensus       105 ~~~~i~HEigHAL  117 (242)
                      ..++++|||||..
T Consensus       124 L~aVlAHElgHi~  136 (317)
T PRK01345        124 VAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHH
Confidence            3589999999943


No 118
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=37.13  E-value=18  Score=32.24  Aligned_cols=10  Identities=30%  Similarity=0.205  Sum_probs=8.7

Q ss_pred             chhHHHHHHH
Q psy1890         107 GTVQHETLHA  116 (242)
Q Consensus       107 ~~i~HEigHA  116 (242)
                      .+++||+||.
T Consensus       172 yVlAHEyGHH  181 (292)
T PF04228_consen  172 YVLAHEYGHH  181 (292)
T ss_pred             HHHHHHHHHH
Confidence            5899999995


No 119
>PRK05457 heat shock protein HtpX; Provisional
Probab=36.91  E-value=17  Score=32.17  Aligned_cols=12  Identities=33%  Similarity=0.434  Sum_probs=10.2

Q ss_pred             ccchhHHHHHHH
Q psy1890         105 VMGTVQHETLHA  116 (242)
Q Consensus       105 ~~~~i~HEigHA  116 (242)
                      -.++++||+||.
T Consensus       134 l~aVlAHElgHi  145 (284)
T PRK05457        134 VEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHH
Confidence            359999999995


No 120
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=35.08  E-value=22  Score=31.02  Aligned_cols=17  Identities=29%  Similarity=0.216  Sum_probs=12.6

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...+++|||.|+|-+..
T Consensus       154 g~NVviHEfAH~LD~~~  170 (253)
T PF06167_consen  154 GHNVVIHEFAHKLDMED  170 (253)
T ss_dssp             S--HHHHHHHHHHHCTT
T ss_pred             CcchHHHHHHHHHHhhc
Confidence            45899999999997653


No 121
>PRK04897 heat shock protein HtpX; Provisional
Probab=34.71  E-value=19  Score=32.01  Aligned_cols=12  Identities=33%  Similarity=0.462  Sum_probs=10.2

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .++++||+||.-
T Consensus       138 ~aVlAHElgHi~  149 (298)
T PRK04897        138 EGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHh
Confidence            589999999953


No 122
>PF14891 Peptidase_M91:  Effector protein
Probab=33.33  E-value=21  Score=29.01  Aligned_cols=19  Identities=21%  Similarity=0.384  Sum_probs=15.7

Q ss_pred             cchhHHHHHHHhccccccc
Q psy1890         106 MGTVQHETLHALGFWHEQA  124 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~  124 (242)
                      .-++.|||.||.-+.+=-.
T Consensus       104 ~v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  104 FVVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHHCCCC
Confidence            3689999999998877666


No 123
>PRK02391 heat shock protein HtpX; Provisional
Probab=32.95  E-value=22  Score=31.61  Aligned_cols=12  Identities=25%  Similarity=0.396  Sum_probs=10.1

Q ss_pred             ccchhHHHHHHH
Q psy1890         105 VMGTVQHETLHA  116 (242)
Q Consensus       105 ~~~~i~HEigHA  116 (242)
                      -.++++||+||.
T Consensus       133 l~aVlaHElgHi  144 (296)
T PRK02391        133 LEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHH
Confidence            358999999993


No 124
>PRK01265 heat shock protein HtpX; Provisional
Probab=32.37  E-value=22  Score=32.17  Aligned_cols=11  Identities=27%  Similarity=0.341  Sum_probs=9.7

Q ss_pred             ccchhHHHHHH
Q psy1890         105 VMGTVQHETLH  115 (242)
Q Consensus       105 ~~~~i~HEigH  115 (242)
                      -.++++||+||
T Consensus       140 l~aVlAHElgH  150 (324)
T PRK01265        140 IKAVAGHELGH  150 (324)
T ss_pred             HHHHHHHHHHH
Confidence            35899999999


No 125
>KOG2556|consensus
Probab=31.63  E-value=23  Score=33.71  Aligned_cols=21  Identities=43%  Similarity=0.441  Sum_probs=16.8

Q ss_pred             cchhHHHHHHHhcccccccCC
Q psy1890         106 MGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rp  126 (242)
                      ..++.||+.|||||.--+-..
T Consensus       256 ~stVkhE~~halgfsaGl~af  276 (666)
T KOG2556|consen  256 ASTVKHERDHALGFSAGLVAF  276 (666)
T ss_pred             HHHHHHHHHhhhcccceeeEE
Confidence            368999999999998665543


No 126
>PLN02887 hydrolase family protein
Probab=30.52  E-value=27  Score=34.15  Aligned_cols=19  Identities=47%  Similarity=0.532  Sum_probs=16.7

Q ss_pred             chhHHHHHHHhcccccccC
Q psy1890         107 GTVQHETLHALGFWHEQAR  125 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~r  125 (242)
                      ..++|=++|.|||.||...
T Consensus       232 ~l~vHG~LHLlGyDH~~~~  250 (580)
T PLN02887        232 ILVVHGLLHLLGFDHEISN  250 (580)
T ss_pred             HHHHHHHHHhcCCCCCCCH
Confidence            5799999999999999753


No 127
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=30.52  E-value=25  Score=32.79  Aligned_cols=12  Identities=33%  Similarity=0.191  Sum_probs=9.4

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .-++.||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            357899999943


No 128
>PLN02791 Nudix hydrolase homolog
Probab=30.38  E-value=20  Score=36.21  Aligned_cols=21  Identities=24%  Similarity=0.229  Sum_probs=17.3

Q ss_pred             cchhHHHHHHHhcccccccCC
Q psy1890         106 MGTVQHETLHALGFWHEQARP  126 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rp  126 (242)
                      ..|++||+||.+|=.+....+
T Consensus       586 ~~v~lHElgHGsG~~~~~~~~  606 (770)
T PLN02791        586 THTICHECCHGIGPHTITLPD  606 (770)
T ss_pred             HHHHHHHhhccccccceecCC
Confidence            478999999999998866543


No 129
>PF01424 R3H:  R3H domain;  InterPro: IPR001374 The R3H motif: a domain that binds single-stranded nucleic acids. The most prominent feature of the R3H motif is the presence of an invariant arginine residue and a highly conserved histidine residue that are separated by three residues. The motif also displays a conserved pattern of hydrophobic residues, prolines and glycines. The R3H motif is present in proteins from a diverse range of organisms that includes Eubacteria, green plants, fungi and various groups of metazoans. Intriguingly, it has not yet been identified in Archaea and Escherichia coli. The sequences that contain the R3H domain, many of which are hypothetical proteins predicted from genome sequencing projects, can be grouped into eight families on the basis of similarities outside the R3H region. Three of the families contain ATPase domains either upstream (families II and VII) or downstream of the R3H domain (family VIII). The N-terminal part of members of family VII contains an SF1 helicase domain5. The C-terminal part of family VIII contains an SF2 DEAH helicase domain5. The ATPase domain in the members of family II is similar to the stage-III sporulation protein AA (S3AA_BACSU), the proteasome ATPase, bacterial transcription-termination factor r and the mitochondrial F1-ATPase b subunit (the F5 helicase family5). Family VI contains Cys-rich repeats6, as well as a ring-type zinc finger upstream of the R3H domain. JAG bacterial proteins (family I) contain a KH domain N-terminal to the R3H domain. The functions of other domains in R3H proteins support the notion that the R3H domain might be involved in interactions with single-stranded nucleic acids [].; GO: 0003676 nucleic acid binding; PDB: 1WHR_A 1MSZ_A 1UG8_A 3GKU_B 2CPM_A.
Probab=30.21  E-value=70  Score=20.97  Aligned_cols=40  Identities=13%  Similarity=0.063  Sum_probs=28.6

Q ss_pred             eEEeCCCCcccchhHHHHHHHhcccccccCCCCCceEEEe
Q psy1890          96 YVNLGKNCFVMGTVQHETLHALGFWHEQARPDRDRFVDIH  135 (242)
Q Consensus        96 ~i~l~~~c~~~~~i~HEigHALGl~HEh~rpDRD~yv~i~  135 (242)
                      .+.|.+-.....-++|++..-.||.|+-...+-+.+|.|.
T Consensus        22 ~~~f~pm~~~~R~~iH~~a~~~gL~s~S~g~~~~R~vvv~   61 (63)
T PF01424_consen   22 SLEFPPMNSFERKLIHELAEYYGLKSKSEGEGPNRRVVVS   61 (63)
T ss_dssp             EEEEEC--SHHHHHHHHHHHHCTEEEEEESSSSSSEEEEE
T ss_pred             EEEECCCCHHHHHHHHHHHHHCCCEEEEecCCCCeEEEEE
Confidence            3444432223578999999999999999877767777664


No 130
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=29.95  E-value=25  Score=29.01  Aligned_cols=14  Identities=29%  Similarity=0.223  Sum_probs=10.7

Q ss_pred             chhHHHHHHHhccc
Q psy1890         107 GTVQHETLHALGFW  120 (242)
Q Consensus       107 ~~i~HEigHALGl~  120 (242)
                      .+++|||+||+.-.
T Consensus        38 ~ilahel~hafd~~   51 (206)
T PF01431_consen   38 FILAHELMHAFDPE   51 (206)
T ss_dssp             HHHHHHHHHCTSTT
T ss_pred             HHHHHHHHHHHHHh
Confidence            57999999966443


No 131
>PRK04351 hypothetical protein; Provisional
Probab=29.47  E-value=30  Score=27.63  Aligned_cols=16  Identities=25%  Similarity=0.384  Sum_probs=12.3

Q ss_pred             ccchhHHHHHHHhccc
Q psy1890         105 VMGTVQHETLHALGFW  120 (242)
Q Consensus       105 ~~~~i~HEigHALGl~  120 (242)
                      ..+||.||+.|.+=+.
T Consensus        61 l~~vv~HElcH~~~~~   76 (149)
T PRK04351         61 LIGIIKHELCHYHLHL   76 (149)
T ss_pred             HHhhHHHHHHHHHHHH
Confidence            3589999999986443


No 132
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=29.36  E-value=29  Score=31.91  Aligned_cols=20  Identities=30%  Similarity=0.504  Sum_probs=13.8

Q ss_pred             CCCcc---cchhHHHHHHHhccc
Q psy1890         101 KNCFV---MGTVQHETLHALGFW  120 (242)
Q Consensus       101 ~~c~~---~~~i~HEigHALGl~  120 (242)
                      ..|..   .++++||+-|+.+..
T Consensus       290 ~~C~~qDqatt~LHE~TH~~~V~  312 (359)
T PF02102_consen  290 NRCHAQDQATTTLHEMTHAPAVY  312 (359)
T ss_dssp             -STT---HHHHHHHHHHT-TTTS
T ss_pred             ccccCCCccchhhhhhhcccccc
Confidence            45754   379999999999864


No 133
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=29.06  E-value=31  Score=26.25  Aligned_cols=31  Identities=23%  Similarity=0.208  Sum_probs=20.0

Q ss_pred             CCCCCHHHHHHHHHHHHHHhc-cccEEEEecC
Q psy1890          35 DKNFSVDRKRFIVKALNNIMT-FSCVRFRRRT   65 (242)
Q Consensus        35 ~~~~~~~~~~~I~~A~~~w~~-~tci~F~~~~   65 (242)
                      ++.|+-..+.+-.+-++.++. .+.|.|-..-
T Consensus         3 d~E~~wd~~~l~~DvFs~i~k~~~pviFcdTc   34 (126)
T PF13058_consen    3 DSEFPWDVQRLRNDVFSLIEKCHIPVIFCDTC   34 (126)
T ss_pred             cccccchHHHHHHHHHHHHHhhcCCeEEeccc
Confidence            344555566666777777776 5778786554


No 134
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=28.04  E-value=28  Score=34.50  Aligned_cols=24  Identities=8%  Similarity=0.158  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHHhccccEEEEecCC
Q psy1890          43 KRFIVKALNNIMTFSCVRFRRRTH   66 (242)
Q Consensus        43 ~~~I~~A~~~w~~~tci~F~~~~~   66 (242)
                      -++|.--++..++-=.|+|+|+..
T Consensus       364 ~~Vl~GlF~~~~rLfGI~~~e~~~  387 (683)
T COG0339         364 NKVLEGLFEVAKRLFGITFVERKD  387 (683)
T ss_pred             hHHHHHHHHHHHHHcCeEEEECCC
Confidence            346655556666777999999853


No 135
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=27.84  E-value=34  Score=29.98  Aligned_cols=21  Identities=38%  Similarity=0.588  Sum_probs=16.5

Q ss_pred             cchhHHHHHHHhcccccccCCC
Q psy1890         106 MGTVQHETLHALGFWHEQARPD  127 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpD  127 (242)
                      .++++||++|.+ |.|..-+..
T Consensus        61 ~~~l~HevlH~~-~~H~~r~~~   81 (292)
T PF13203_consen   61 VGLLLHEVLHCL-LRHPWRRGG   81 (292)
T ss_pred             HHHHHHHHHHHH-ccchhhhcc
Confidence            478999999987 666666666


No 136
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=27.14  E-value=32  Score=32.36  Aligned_cols=10  Identities=30%  Similarity=0.139  Sum_probs=8.3

Q ss_pred             cchhHHHHHH
Q psy1890         106 MGTVQHETLH  115 (242)
Q Consensus       106 ~~~i~HEigH  115 (242)
                      .-++.||+||
T Consensus        16 ~li~vHElGH   25 (449)
T PRK10779         16 VLITVHEFGH   25 (449)
T ss_pred             HHHHHHHHHH
Confidence            3578999999


No 137
>PRK04860 hypothetical protein; Provisional
Probab=26.60  E-value=36  Score=27.51  Aligned_cols=17  Identities=29%  Similarity=0.307  Sum_probs=13.2

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ..+||.||+.|.+=+.+
T Consensus        63 l~~~v~HEl~H~~~~~~   79 (160)
T PRK04860         63 IDEVVPHELAHLLVYQL   79 (160)
T ss_pred             HHhHHHHHHHHHHHHHH
Confidence            35899999999875543


No 138
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=25.17  E-value=35  Score=29.87  Aligned_cols=17  Identities=24%  Similarity=0.353  Sum_probs=15.0

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...+++|||.|.|-+.+
T Consensus       142 g~NVvIHEFAH~LDm~~  158 (260)
T PRK15410        142 GFNLIIHEVAHKLDMRN  158 (260)
T ss_pred             CcchhHhHHHhHhhhhc
Confidence            46899999999999876


No 139
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=24.77  E-value=34  Score=32.66  Aligned_cols=17  Identities=29%  Similarity=0.243  Sum_probs=13.7

Q ss_pred             ccchhHHHHHHHhcccc
Q psy1890         105 VMGTVQHETLHALGFWH  121 (242)
Q Consensus       105 ~~~~i~HEigHALGl~H  121 (242)
                      ...|+.||+|||..+..
T Consensus       247 D~~t~~HE~GH~~yy~~  263 (477)
T cd06461         247 DFVTVHHEMGHIQYYLQ  263 (477)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            45799999999996544


No 140
>KOG2090|consensus
Probab=24.55  E-value=39  Score=33.18  Aligned_cols=12  Identities=42%  Similarity=0.509  Sum_probs=10.4

Q ss_pred             cchhHHHHHHHh
Q psy1890         106 MGTVQHETLHAL  117 (242)
Q Consensus       106 ~~~i~HEigHAL  117 (242)
                      .-++-||+|||+
T Consensus       480 vetLFHEmGHAM  491 (704)
T KOG2090|consen  480 VETLFHEMGHAM  491 (704)
T ss_pred             HHHHHHHHHHHH
Confidence            468999999996


No 141
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=24.05  E-value=35  Score=32.40  Aligned_cols=10  Identities=60%  Similarity=0.571  Sum_probs=8.7

Q ss_pred             hhHHHHHHHh
Q psy1890         108 TVQHETLHAL  117 (242)
Q Consensus       108 ~i~HEigHAL  117 (242)
                      -++||.||||
T Consensus       262 g~iHE~GHAl  271 (497)
T COG2317         262 GTIHETGHAL  271 (497)
T ss_pred             HHHHhhhhHH
Confidence            4789999986


No 142
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=23.63  E-value=44  Score=27.44  Aligned_cols=25  Identities=24%  Similarity=0.454  Sum_probs=17.7

Q ss_pred             cchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890         106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK  140 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~  140 (242)
                      ..+++|||-||.-.    .|-      .|+|.|..
T Consensus        72 ~~~l~HELIHayD~----cr~------kvD~~n~~   96 (173)
T PF09768_consen   72 EDTLTHELIHAYDH----CRA------KVDWTNCR   96 (173)
T ss_pred             HHHHHHHHHHHHHH----Hhc------cCCcccch
Confidence            47999999999753    332      36777654


No 143
>KOG3624|consensus
Probab=23.29  E-value=43  Score=33.26  Aligned_cols=29  Identities=21%  Similarity=0.259  Sum_probs=21.1

Q ss_pred             chhHHHHHHHhcccccccCCCCCceEEEeee
Q psy1890         107 GTVQHETLHALGFWHEQARPDRDRFVDIHYE  137 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~rpDRD~yv~i~~~  137 (242)
                      .+|.|||+|  ||.......|..-.+.--|+
T Consensus       520 ~vigHEl~H--~FD~~G~~~D~~Gn~~~ww~  548 (687)
T KOG3624|consen  520 FVIGHELTH--GFDDQGRQYDESGNLRDWWD  548 (687)
T ss_pred             HHHHHHHhh--cccccccccCCCcccccCCC
Confidence            578999999  77777777776655555444


No 144
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=23.23  E-value=3e+02  Score=22.57  Aligned_cols=75  Identities=11%  Similarity=0.077  Sum_probs=38.1

Q ss_pred             HHHHHHHHHHHHHhcc-ccEEEEecCCCCCCeEEEEccCCCCCCCCccCCCCCccceEE-eCCCCc---ccchhHHHHHH
Q psy1890          41 DRKRFIVKALNNIMTF-SCVRFRRRTHNQTPYIVYKESPNGKSCSSNVGFTGKKVSYVN-LGKNCF---VMGTVQHETLH  115 (242)
Q Consensus        41 ~~~~~I~~A~~~w~~~-tci~F~~~~~~~~~~I~i~~~~~~~gc~S~vG~~~g~~q~i~-l~~~c~---~~~~i~HEigH  115 (242)
                      ...+.+.++.+.|... ..|...+... ....+...     .|..+..+..+.  -.+. +.+...   ...+++||+-|
T Consensus         4 ~i~~~~~~~~~~~~~~~i~v~i~p~~~-~~~~~~~~-----~g~~g~~~~~~~--i~l~~~~~~~~~~~l~~~iaHE~hH   75 (195)
T PF10026_consen    4 IIEEALEKSIELLPGPDIPVFIFPADP-ENPFLIPE-----LGGKGGGAIPGY--IFLFLLPNDYSLEELPALIAHEYHH   75 (195)
T ss_pred             HHHHHHHHHHHHcCCCCCCEEEEeccC-CCcccccc-----cCcccccCCCCE--EEEEecCCcccHHHHHHHHHHHHHH
Confidence            3456777777888753 3444444442 22222211     222233333222  2333 333222   24899999999


Q ss_pred             Hhcccccc
Q psy1890         116 ALGFWHEQ  123 (242)
Q Consensus       116 ALGl~HEh  123 (242)
                      ++=+..-.
T Consensus        76 ~~r~~~~~   83 (195)
T PF10026_consen   76 NCRYEQIG   83 (195)
T ss_pred             HHHHhccC
Confidence            98776443


No 145
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=23.10  E-value=37  Score=30.88  Aligned_cols=15  Identities=20%  Similarity=0.162  Sum_probs=11.5

Q ss_pred             cchhHHHHHHHhccc
Q psy1890         106 MGTVQHETLHALGFW  120 (242)
Q Consensus       106 ~~~i~HEigHALGl~  120 (242)
                      ...|.||+.|..=+.
T Consensus       166 A~LIfHELaHq~~Yv  180 (337)
T PF10023_consen  166 ARLIFHELAHQTLYV  180 (337)
T ss_pred             HHHHHHHHhhceeec
Confidence            478999999975443


No 146
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=23.01  E-value=46  Score=27.90  Aligned_cols=13  Identities=31%  Similarity=0.202  Sum_probs=11.2

Q ss_pred             cchhHHHHHHHhc
Q psy1890         106 MGTVQHETLHALG  118 (242)
Q Consensus       106 ~~~i~HEigHALG  118 (242)
                      ..++.||-||||=
T Consensus        29 ~~~A~HEAGhAvv   41 (213)
T PF01434_consen   29 RRIAYHEAGHAVV   41 (213)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHH
Confidence            5789999999974


No 147
>KOG3314|consensus
Probab=22.71  E-value=49  Score=27.06  Aligned_cols=25  Identities=24%  Similarity=0.489  Sum_probs=18.3

Q ss_pred             cchhHHHHHHHhcccccccCCCCCceEEEeeeccc
Q psy1890         106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELIK  140 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni~  140 (242)
                      ..++.|||.||+-    +.|      ..|+|.|+.
T Consensus        92 n~vv~HElIH~fD----d~r------~~vDW~Nlr  116 (194)
T KOG3314|consen   92 NQVVIHELIHAFD----DCR------AKVDWTNLR  116 (194)
T ss_pred             HHHHHHHHHHHHH----hhh------hhcccchhh
Confidence            3799999999973    233      248898876


No 148
>KOG0201|consensus
Probab=22.20  E-value=39  Score=31.86  Aligned_cols=30  Identities=27%  Similarity=0.480  Sum_probs=25.0

Q ss_pred             cchhHHHHHHHhcccccccCCCCCceEEEeeecc
Q psy1890         106 MGTVQHETLHALGFWHEQARPDRDRFVDIHYELI  139 (242)
Q Consensus       106 ~~~i~HEigHALGl~HEh~rpDRD~yv~i~~~ni  139 (242)
                      .++|++|+++||+|.|.+.---||    |.-.||
T Consensus       113 i~~ilre~l~~l~ylH~~~kiHrD----IKaani  142 (467)
T KOG0201|consen  113 IAVILREVLKGLDYLHSEKKIHRD----IKAANI  142 (467)
T ss_pred             eeeehHHHHHHhhhhhhcceeccc----ccccce
Confidence            478999999999999999988777    555554


No 149
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=21.66  E-value=49  Score=27.85  Aligned_cols=16  Identities=25%  Similarity=0.455  Sum_probs=13.5

Q ss_pred             cchhHHHHHHHhcccc
Q psy1890         106 MGTVQHETLHALGFWH  121 (242)
Q Consensus       106 ~~~i~HEigHALGl~H  121 (242)
                      .|++.||+.|+.-..-
T Consensus        97 ~Gvl~HE~~H~~Q~~~  112 (205)
T PF04450_consen   97 IGVLYHEMVHCWQWDG  112 (205)
T ss_pred             HHHHHHHHHHHhhcCC
Confidence            6999999999976554


No 150
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=20.73  E-value=29  Score=33.84  Aligned_cols=21  Identities=33%  Similarity=0.419  Sum_probs=14.4

Q ss_pred             chhHHHHHHHhcccccccCCCCC
Q psy1890         107 GTVQHETLHALGFWHEQARPDRD  129 (242)
Q Consensus       107 ~~i~HEigHALGl~HEh~rpDRD  129 (242)
                      ++|.|||+|+  |--.-.+.|.|
T Consensus       489 aVIgHEI~Hg--FDdqGakfD~~  509 (654)
T COG3590         489 AVIGHEIGHG--FDDQGAKFDGD  509 (654)
T ss_pred             ceehhhhccc--ccCCccccCCC
Confidence            6899999996  44444555544


No 151
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=20.56  E-value=53  Score=30.30  Aligned_cols=16  Identities=31%  Similarity=0.644  Sum_probs=13.7

Q ss_pred             cchhHHHHHHHhcccc
Q psy1890         106 MGTVQHETLHALGFWH  121 (242)
Q Consensus       106 ~~~i~HEigHALGl~H  121 (242)
                      .+|++|||-|.+-|..
T Consensus       140 ~sTlAHEfQHmInfy~  155 (366)
T PF10460_consen  140 YSTLAHEFQHMINFYQ  155 (366)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5899999999997764


Done!