Query         021444
Match_columns 312
No_of_seqs    270 out of 1540
Neff          7.1 
Searched_HMMs 46136
Date          Fri Mar 29 03:00:43 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/021444.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/021444hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 2.8E-56   6E-61  438.2  20.4  235    3-275    29-268 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen 100.0 3.9E-36 8.5E-41  257.2  15.4  153  108-268     1-157 (157)
  3 PF00413 Peptidase_M10:  Matrix 100.0 2.2E-34 4.7E-39  244.2  15.1  149  108-268     1-154 (154)
  4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 5.7E-28 1.2E-32  206.4  13.9  148  114-268     2-156 (156)
  5 cd04268 ZnMc_MMP_like Zinc-dep  99.9   5E-24 1.1E-28  182.7   9.3  143  113-267     1-165 (165)
  6 smart00235 ZnMc Zinc-dependent  99.9 1.6E-23 3.5E-28  175.6  10.5  134  107-269     3-140 (140)
  7 cd04277 ZnMc_serralysin_like Z  99.9 6.7E-23 1.5E-27  180.0  13.4  125  127-268    32-186 (186)
  8 cd04327 ZnMc_MMP_like_3 Zinc-d  99.7 1.8E-17 3.9E-22  147.2  12.3  109  108-236     2-110 (198)
  9 cd00203 ZnMc Zinc-dependent me  99.7 6.7E-17 1.5E-21  138.5  13.0  121  126-267    19-167 (167)
 10 cd04276 ZnMc_MMP_like_2 Zinc-d  99.6 2.1E-14 4.6E-19  127.2  11.1  146  112-269     6-197 (197)
 11 cd04280 ZnMc_astacin_like Zinc  99.5 1.1E-13 2.4E-18  121.2  11.7   92  114-237     2-93  (180)
 12 PF12388 Peptidase_M57:  Dual-a  99.5 1.5E-13 3.2E-18  122.0  12.2  135  112-267    37-211 (211)
 13 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.5 5.7E-13 1.2E-17  118.4  12.9   99  106-236     7-105 (200)
 14 PF01400 Astacin:  Astacin (Pep  99.4 2.6E-13 5.7E-18  119.9   8.4   96  108-236     2-97  (191)
 15 cd04282 ZnMc_meprin Zinc-depen  99.3 1.5E-11 3.3E-16  111.2  12.6   98  105-237    42-139 (230)
 16 cd04283 ZnMc_hatching_enzyme Z  99.3 1.2E-11 2.6E-16  108.4  11.0   92  114-236     4-95  (182)
 17 KOG3714 Meprin A metalloprotea  99.3 3.1E-11 6.8E-16  118.4  11.6  106   99-236    72-177 (411)
 18 PF01471 PG_binding_1:  Putativ  99.3 1.4E-11 3.1E-16   87.5   6.2   55    2-59      3-57  (57)
 19 COG5549 Predicted Zn-dependent  98.9 6.8E-09 1.5E-13   91.1   9.1  123  128-270   103-235 (236)
 20 PF02031 Peptidase_M7:  Strepto  98.9 5.7E-09 1.2E-13   85.1   8.1  124  115-268     3-130 (132)
 21 TIGR02869 spore_SleB spore cor  97.7 6.5E-05 1.4E-09   67.0   6.6   56    3-61     15-70  (201)
 22 cd04267 ZnMc_ADAM_like Zinc-de  97.7 0.00019   4E-09   63.0   9.3   49  216-265   131-189 (192)
 23 cd04273 ZnMc_ADAMTS_like Zinc-  97.7 0.00019 4.1E-09   64.0   9.0  119  135-269    73-200 (207)
 24 PF13583 Reprolysin_4:  Metallo  97.7   5E-05 1.1E-09   67.9   5.2   48  218-267   137-198 (206)
 25 PF11350 DUF3152:  Protein of u  97.7 0.00013 2.9E-09   64.4   7.4  139  113-263    28-183 (203)
 26 PF05572 Peptidase_M43:  Pregna  97.4 6.3E-05 1.4E-09   64.3   2.0   25  215-239    66-90  (154)
 27 PF13574 Reprolysin_2:  Metallo  97.4 4.8E-05   1E-09   66.2   1.0   47  218-266   111-170 (173)
 28 COG1913 Predicted Zn-dependent  97.3 9.3E-05   2E-09   63.8   1.7   28  218-245   124-151 (181)
 29 cd04275 ZnMc_pappalysin_like Z  97.3 0.00011 2.4E-09   66.6   1.9   75  215-299   134-209 (225)
 30 PF13688 Reprolysin_5:  Metallo  97.3 9.5E-05 2.1E-09   65.0   1.0   23  216-238   140-162 (196)
 31 PRK13267 archaemetzincin-like   97.1 0.00023 4.9E-09   62.4   2.0   27  218-244   125-151 (179)
 32 PF13582 Reprolysin_3:  Metallo  97.1 0.00015 3.2E-09   58.9   0.8   18  218-235   107-124 (124)
 33 PRK10594 murein L,D-transpepti  97.0  0.0018 3.8E-08   66.3   7.1   37   26-62    307-343 (608)
 34 PF07998 Peptidase_M54:  Peptid  96.9 0.00036 7.8E-09   61.8   1.4   28  217-245   144-171 (194)
 35 cd04271 ZnMc_ADAM_fungal Zinc-  96.9  0.0011 2.4E-08   60.3   4.2   49  220-270   147-220 (228)
 36 cd04269 ZnMc_adamalysin_II_lik  96.8  0.0017 3.6E-08   57.1   4.8   50  217-268   130-185 (194)
 37 cd04270 ZnMc_TACE_like Zinc-de  96.0  0.0083 1.8E-07   55.1   4.4   50  219-268   168-230 (244)
 38 PF01421 Reprolysin:  Reprolysi  95.9   0.011 2.3E-07   52.2   4.5   51  216-267   129-187 (199)
 39 cd04272 ZnMc_salivary_gland_MP  95.5   0.018 3.8E-07   51.8   4.5   51  217-268   144-212 (220)
 40 COG3409 Putative peptidoglycan  95.5   0.022 4.8E-07   49.0   4.8   59    3-63     47-106 (185)
 41 COG3023 ampD N-acetyl-anhydrom  95.3   0.046   1E-06   50.1   6.4   54    2-60    198-252 (257)
 42 COG2989 Uncharacterized protei  95.0   0.048   1E-06   54.8   5.9   58    3-61    240-298 (561)
 43 KOG4525 Jacalin-like lectin do  94.9   0.034 7.4E-07   54.4   4.6   74  218-293   301-397 (614)
 44 PF10462 Peptidase_M66:  Peptid  94.6   0.014   3E-07   55.2   1.2   21  216-236   191-211 (305)
 45 PF05548 Peptidase_M11:  Gameto  94.5   0.019 4.1E-07   54.7   1.9   19  218-236   150-168 (314)
 46 PF12044 Metallopep:  Putative   93.4    0.04 8.8E-07   54.4   1.7   57  218-276   316-394 (423)
 47 TIGR03296 M6dom_TIGR03296 M6 f  93.2   0.028 6.1E-07   52.8   0.3   20  218-237   165-184 (286)
 48 PF11150 DUF2927:  Protein of u  91.9     3.7 7.9E-05   37.0  12.2  150  107-271    28-197 (213)
 49 PTZ00337 surface protease GP63  91.9    0.18   4E-06   51.7   4.2   36  188-235   211-246 (567)
 50 PF01457 Peptidase_M8:  Leishma  91.4   0.093   2E-06   53.4   1.6   60  157-234   159-226 (521)
 51 PF08823 PG_binding_2:  Putativ  91.4    0.66 1.4E-05   34.7   5.7   53    4-59     18-74  (74)
 52 COG3824 Predicted Zn-dependent  91.0   0.091   2E-06   42.8   0.8   19  216-234   107-125 (136)
 53 PTZ00257 Glycoprotein GP63 (le  90.6    0.48   1E-05   48.9   5.8   97  156-270   204-313 (622)
 54 PF04298 Zn_peptidase_2:  Putat  90.2    0.35 7.6E-06   43.7   3.9   20  216-237    87-106 (222)
 55 PF05547 Peptidase_M6:  Immune   88.1    0.13 2.8E-06   53.5  -0.4   20  218-237   221-240 (645)
 56 COG2738 Predicted Zn-dependent  87.3    0.45 9.8E-06   42.1   2.5   16  216-231    90-105 (226)
 57 PF09471 Peptidase_M64:  IgA Pe  87.0    0.25 5.5E-06   45.9   0.8   21  217-237   215-236 (264)
 58 KOG3607 Meltrins, fertilins an  85.4    0.76 1.7E-05   48.5   3.5   46  217-263   322-374 (716)
 59 KOG3658 Tumor necrosis factor-  83.2    0.85 1.8E-05   47.2   2.6   48  219-266   393-452 (764)
 60 PF06262 DUF1025:  Possibl zinc  83.1    0.51 1.1E-05   37.2   0.8   17  217-233    72-88  (97)
 61 PF09374 PG_binding_3:  Predict  78.2     2.5 5.3E-05   31.4   3.0   26   34-59      1-30  (72)
 62 PF13398 Peptidase_M50B:  Pepti  78.0    0.84 1.8E-05   40.6   0.5   15  217-231    21-35  (200)
 63 PF14247 DUF4344:  Domain of un  77.8    0.82 1.8E-05   41.4   0.4   14  217-230    91-104 (220)
 64 COG4783 Putative Zn-dependent   77.3    0.82 1.8E-05   45.7   0.2   19  216-235   128-146 (484)
 65 KOG3538 Disintegrin metallopro  76.7     3.3 7.2E-05   44.7   4.7   34  215-248   314-353 (845)
 66 PF06114 DUF955:  Domain of unk  76.7    0.99 2.2E-05   35.2   0.6   14  217-230    41-54  (122)
 67 COG2856 Predicted Zn peptidase  76.4       1 2.2E-05   40.6   0.6   19  217-237    71-89  (213)
 68 COG5549 Predicted Zn-dependent  75.0     1.6 3.5E-05   39.1   1.4   30  218-247   153-182 (236)
 69 PF02163 Peptidase_M50:  Peptid  70.9     1.6 3.5E-05   37.9   0.5   22  218-239     7-28  (192)
 70 cd05709 S2P-M50 Site-2 proteas  70.6     1.7 3.6E-05   37.5   0.5   20  218-237     8-27  (180)
 71 cd06163 S2P-M50_PDZ_RseP-like   69.8     1.8   4E-05   37.9   0.6   12  219-230    10-21  (182)
 72 PF04228 Zn_peptidase:  Putativ  68.8     1.5 3.2E-05   41.4  -0.2   40  186-229   141-181 (292)
 73 PF12725 DUF3810:  Protein of u  68.6     3.3 7.1E-05   39.6   2.1   38  186-237   178-215 (318)
 74 cd06161 S2P-M50_SpoIVFB SpoIVF  67.0     2.2 4.8E-05   38.1   0.5   17  217-233    37-53  (208)
 75 PF01435 Peptidase_M48:  Peptid  66.5     2.4 5.1E-05   37.4   0.6   20  216-236    87-106 (226)
 76 cd06459 M3B_Oligoendopeptidase  62.6     2.9 6.3E-05   40.8   0.5   15  216-230   220-234 (427)
 77 cd06164 S2P-M50_SpoIVFB_CBS Sp  62.5       3 6.5E-05   37.8   0.5   17  218-234    53-69  (227)
 78 cd06258 Peptidase_M3_like The   62.4       3 6.6E-05   40.0   0.6   16  215-230   151-166 (365)
 79 PRK03982 heat shock protein Ht  62.1     3.2 6.9E-05   38.9   0.6   15  216-230   123-137 (288)
 80 COG0501 HtpX Zn-dependent prot  61.6     3.3 7.1E-05   38.4   0.6   15  216-230   155-169 (302)
 81 PF01432 Peptidase_M3:  Peptida  61.0     3.3 7.1E-05   41.2   0.6   25  130-154   136-160 (458)
 82 PRK03001 M48 family peptidase;  59.9     3.7 7.9E-05   38.4   0.6   15  216-230   122-136 (283)
 83 COG3409 Putative peptidoglycan  59.6      13 0.00028   31.6   4.0   37   24-60    148-184 (185)
 84 cd06162 S2P-M50_PDZ_SREBP Ster  59.5     3.6 7.9E-05   38.5   0.5   14  218-231   135-148 (277)
 85 cd06159 S2P-M50_PDZ_Arch Uncha  59.2     3.7   8E-05   38.2   0.5   19  218-236   118-136 (263)
 86 PRK03072 heat shock protein Ht  59.2     3.8 8.3E-05   38.5   0.6   15  216-230   125-139 (288)
 87 PRK01345 heat shock protein Ht  59.0     3.9 8.4E-05   39.0   0.6   18  216-234   122-139 (317)
 88 TIGR02289 M3_not_pepF oligoend  58.8     3.7 8.1E-05   42.0   0.5   15  216-230   335-349 (549)
 89 PF13699 DUF4157:  Domain of un  58.7     8.6 0.00019   29.0   2.4   30  190-230    44-73  (79)
 90 cd06455 M3A_TOP Peptidase M3 T  58.3     3.9 8.5E-05   41.1   0.6   14  217-230   262-275 (472)
 91 TIGR02290 M3_fam_3 oligoendope  56.5     4.3 9.3E-05   41.9   0.5   15  216-230   373-387 (587)
 92 TIGR00181 pepF oligoendopeptid  55.6     5.1 0.00011   41.3   0.9   15  216-230   376-390 (591)
 93 PRK04897 heat shock protein Ht  55.5     4.6  0.0001   38.1   0.5   15  216-230   135-149 (298)
 94 PF01447 Peptidase_M4:  Thermol  55.3     4.9 0.00011   34.2   0.6   12  219-230   136-147 (150)
 95 cd06160 S2P-M50_like_2 Unchara  55.0     4.8  0.0001   35.3   0.5   23  218-240    41-63  (183)
 96 PF13485 Peptidase_MA_2:  Pepti  54.8       5 0.00011   31.4   0.6   14  217-230    24-37  (128)
 97 cd06457 M3A_MIP Peptidase M3 m  54.3     5.1 0.00011   40.1   0.6   33  130-162   132-168 (458)
 98 PRK05457 heat shock protein Ht  54.3       5 0.00011   37.7   0.5   14  216-229   132-145 (284)
 99 PRK02870 heat shock protein Ht  53.9     5.3 0.00011   38.5   0.6   14  216-229   171-184 (336)
100 PRK02391 heat shock protein Ht  53.7     5.1 0.00011   37.8   0.5   14  216-229   131-144 (296)
101 PRK01265 heat shock protein Ht  53.0     5.3 0.00012   38.3   0.5   14  216-229   138-151 (324)
102 KOG2921 Intramembrane metallop  52.7     5.5 0.00012   39.1   0.5   18  218-235   131-148 (484)
103 cd06456 M3A_DCP_Oligopeptidase  52.5     5.7 0.00012   39.4   0.6   32  131-162   107-140 (422)
104 PF08823 PG_binding_2:  Putativ  52.0      39 0.00084   25.2   4.9   29   31-60     20-48  (74)
105 PLN02791 Nudix hydrolase homol  51.8     5.8 0.00013   42.4   0.6   21  217-237   584-604 (770)
106 cd06158 S2P-M50_like_1 Unchara  48.4     7.1 0.00015   34.1   0.5   13  218-230     9-21  (181)
107 PRK10779 zinc metallopeptidase  47.4     7.4 0.00016   38.8   0.5   12  219-230    16-27  (449)
108 PHA00024 IX minor coat protein  47.3       8 0.00017   24.3   0.5   15  291-305    10-24  (33)
109 TIGR00054 RIP metalloprotease   47.0     7.6 0.00016   38.5   0.5   12  219-230    15-26  (420)
110 cd06460 M32_Taq Peptidase fami  46.9     7.6 0.00017   38.3   0.5   13  218-230   159-171 (396)
111 COG1164 Oligoendopeptidase F [  46.7     7.6 0.00016   40.4   0.5   14  217-230   379-392 (598)
112 COG1059 Thermostable 8-oxoguan  46.2      27 0.00059   31.0   3.7   47    7-54    153-199 (210)
113 PF01434 Peptidase_M41:  Peptid  43.5     8.9 0.00019   34.3   0.4   15  216-230    26-40  (213)
114 COG4227 Antirestriction protei  41.5      12 0.00027   34.7   1.0   23  215-237   200-222 (316)
115 PF01863 DUF45:  Protein of unk  41.4      14  0.0003   32.2   1.3   28  215-242   161-188 (205)
116 PRK10911 oligopeptidase A; Pro  40.7      11 0.00023   39.9   0.5   14  217-230   462-475 (680)
117 PF10281 Ish1:  Putative stress  37.7      82  0.0018   20.0   4.2   29    5-37      7-35  (38)
118 PF03571 Peptidase_M49:  Peptid  37.5      12 0.00027   38.3   0.4   16  217-232   292-308 (549)
119 KOG2719 Metalloprotease [Gener  37.1      13 0.00029   36.8   0.5   26  216-242   278-303 (428)
120 PF03272 Enhancin:  Viral enhan  35.8      52  0.0011   35.4   4.6   22  132-153   170-191 (775)
121 cd02641 R3H_Smubp-2_like R3H d  33.2      19 0.00042   25.6   0.7   21  217-237    28-48  (60)
122 PRK03094 hypothetical protein;  33.2      31 0.00066   26.3   1.8   16    2-17      8-23  (80)
123 COG5570 Uncharacterized small   32.5      43 0.00094   23.3   2.3   31   32-67      3-33  (57)
124 PF10463 Peptidase_U49:  Peptid  32.3      14 0.00031   33.1  -0.1   46  218-272   101-146 (206)
125 PF06167 Peptidase_M90:  Glucos  31.1      21 0.00046   33.0   0.8   21  215-235   151-171 (253)
126 PF03698 UPF0180:  Uncharacteri  29.8      38 0.00081   25.8   1.8   16    2-17      8-23  (80)
127 PRK10280 dipeptidyl carboxypep  29.5      21 0.00046   37.7   0.6   32  131-162   363-397 (681)
128 KOG2661 Peptidase family M48 [  28.2      23 0.00051   34.1   0.5   19  217-236   274-292 (424)
129 COG3590 PepO Predicted metallo  28.1      19 0.00042   37.0  -0.1   13  218-230   487-499 (654)
130 PF03801 Ndc80_HEC:  HEC/Ndc80p  28.1 1.2E+02  0.0027   25.7   5.0   63    4-66     42-108 (157)
131 PF08434 CLCA_N:  Calcium-activ  25.5      43 0.00094   31.2   1.8   12  218-229   149-160 (262)
132 COG2321 Predicted metalloprote  25.4      28 0.00061   32.6   0.5   82  185-269   137-224 (295)
133 PRK14749 hypothetical protein;  24.6      47   0.001   20.4   1.2   18  292-309     5-22  (30)
134 PF12315 DUF3633:  Protein of u  24.1      33 0.00072   30.8   0.7   14  217-230    92-105 (212)
135 COG2317 Zn-dependent carboxype  23.6      31 0.00067   34.7   0.5   44  254-301   374-424 (497)
136 cd06461 M2_ACE Peptidase famil  23.2      35 0.00075   34.6   0.7   16  217-232   246-261 (477)
137 TIGR02106 cyd_oper_ybgT cyd op  22.7      53  0.0012   20.2   1.2   17  293-309     6-22  (30)
138 PF01431 Peptidase_M13:  Peptid  22.7      34 0.00075   29.9   0.5   16  218-233    36-51  (206)
139 TIGR03141 cytochro_ccmD heme e  22.6      61  0.0013   21.6   1.6   22  288-310     2-23  (45)
140 COG5504 Predicted Zn-dependent  22.1 1.1E+02  0.0025   28.4   3.7   78  128-238    79-160 (280)
141 cd02639 R3H_RRM R3H domain of   22.0      42 0.00091   24.0   0.8   22  218-239    29-50  (60)
142 CHL00176 ftsH cell division pr  21.9      35 0.00076   35.8   0.5   14  217-230   437-450 (638)
143 PRK10733 hflB ATP-dependent me  21.8      35 0.00075   35.8   0.4   14  217-230   407-420 (644)
144 PF14891 Peptidase_M91:  Effect  21.6      40 0.00088   29.0   0.7   18  217-234   102-119 (174)
145 PF02074 Peptidase_M32:  Carbox  20.9      38 0.00082   34.5   0.5   17  219-235   260-279 (494)
146 COG4412 Uncharacterized protei  20.6      34 0.00073   35.1   0.0   51  218-268   346-411 (760)
147 PF13402 M60-like:  Peptidase M  20.6      39 0.00085   31.4   0.5   11  220-230   221-231 (307)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=2.8e-56  Score=438.18  Aligned_cols=235  Identities=47%  Similarity=0.793  Sum_probs=210.2

Q ss_pred             ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcCCCCCCCCCCCCccccCCCCCC
Q 021444            3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILRPRCGNPDIVNGTTTMNSGKPT   82 (312)
Q Consensus         3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~pRCG~pD~~~~~~~~~~~~~~   82 (312)
                      ++..++.||++|||+++.+...+...++.+++||+.||++++|++||++|.+|++.|++|||||||.             
T Consensus        29 ~~~~~~~yl~~~~y~~~~~~~~~~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~-------------   95 (469)
T KOG1565|consen   29 DKVALQDYLECYGYLPPTDLTATRASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG-------------   95 (469)
T ss_pred             chhHHHHHhhhcccCCCccccccccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------------
Confidence            3577899999999998765432222578999999999999999999999999999999999999992             


Q ss_pred             CCCCCCCCCCCcccccccccCCCCCCCCCCCCceEEeecCC-CCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccE
Q 021444           83 SYNATSSTNGHLHSVGHYSFFPGMPRWPSNKRDLTYAFLPD-NQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADI  160 (312)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~~~~kW~~~~~~LtY~i~~~-~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI  160 (312)
                                      +|++++++.||  ++.+|||+|.++ +.++ .+++.++++||+.|+++++++|.|+.....|||
T Consensus        96 ----------------~~~~~~~~~kW--~k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi  157 (469)
T KOG1565|consen   96 ----------------RYRYFPGKPKW--NKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADI  157 (469)
T ss_pred             ----------------ccccCcccCcc--cccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCce
Confidence                            25567788999  788999999987 3344 899999999999999999999999996679999


Q ss_pred             EEEeeecCCCCCCCcCCCCCcceeeeCCC---CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444          161 RIGFFTGDHGDGEPFDGPMGTLAHAFSPP---NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       161 ~I~f~~~~hgd~~~fdG~gg~lAhA~~P~---~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      +|.|..+.|||++||||++|+|||||+|+   .|++|||++|.|++.       ...+++++.||+|||||+|||.||.+
T Consensus       158 ~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-------~~~g~~l~~Va~HEiGH~LGL~HS~~  230 (469)
T KOG1565|consen  158 RISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLHFDKDETWTYG-------DSNGVDLFLVAAHEIGHALGLGHSSD  230 (469)
T ss_pred             eeeeeccCCCCCCcccCCCCceecccCCCCCCCCccccCcccceecc-------CCccchhHHHhhhhcccccccCCCCC
Confidence            99999999999999999999999999994   689999999999995       13488999999999999999999999


Q ss_pred             CCCccccccCCCCCcCCCCHHHHHHHHHHhCCCCCCCC
Q 021444          238 EDAIMYPSISSGSRKVELANDDIQGIQALYGNNPNFNG  275 (312)
Q Consensus       238 ~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~~~~~~  275 (312)
                      +++||||+|++...++.|+.|||.|||.|||.+++...
T Consensus       231 ~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~~~~~~~  268 (469)
T KOG1565|consen  231 PDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGGPPRDYF  268 (469)
T ss_pred             cccccccccccCCCCcccChhhhhhhHHHhCCCccccc
Confidence            99999999997667889999999999999999888643


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00  E-value=3.9e-36  Score=257.24  Aligned_cols=153  Identities=54%  Similarity=0.876  Sum_probs=135.5

Q ss_pred             CCCCCCCceEEeecCCC--CChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444          108 RWPSNKRDLTYAFLPDN--QLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA  185 (312)
Q Consensus       108 kW~~~~~~LtY~i~~~~--~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA  185 (312)
                      ||  .+++|||+|.+..  .....++++|++||+.|+++++|+|+|+.+...|||+|.|....++++.+|+|.+|+++||
T Consensus         1 kW--~~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a   78 (157)
T cd04278           1 KW--SKTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA   78 (157)
T ss_pred             CC--CCCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence            79  6899999998543  2348899999999999999999999999865589999999998899888999999999999


Q ss_pred             eCC--CCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHHH
Q 021444          186 FSP--PNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQGI  263 (312)
Q Consensus       186 ~~P--~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~I  263 (312)
                      ++|  ..|+|+||.++.|....      ...+.+++.|++|||||||||+|+.++.+||||.+.+......|+.+||.+|
T Consensus        79 ~~p~~~~g~i~~~~~~~~~~~~------~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~  152 (157)
T cd04278          79 FFPGGIGGDIHFDDDEQWTLGS------DSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGI  152 (157)
T ss_pred             cCCCCcceeEEECCCcccccCC------CCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHH
Confidence            999  67999999999998753      1346789999999999999999999999999999986434678999999999


Q ss_pred             HHHhC
Q 021444          264 QALYG  268 (312)
Q Consensus       264 q~LYG  268 (312)
                      |+|||
T Consensus       153 ~~lYg  157 (157)
T cd04278         153 QALYG  157 (157)
T ss_pred             HHhcC
Confidence            99998


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00  E-value=2.2e-34  Score=244.22  Aligned_cols=149  Identities=46%  Similarity=0.822  Sum_probs=131.6

Q ss_pred             CCCCCCCceEEeecCCC-CCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444          108 RWPSNKRDLTYAFLPDN-QLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA  185 (312)
Q Consensus       108 kW~~~~~~LtY~i~~~~-~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA  185 (312)
                      ||  ++++|||+|.+.+ ..+ +++++++++||+.|+++++++|+++.+ ..+||+|.|....+++...|++..++++++
T Consensus         1 ~W--~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~~~~   77 (154)
T PF00413_consen    1 KW--PKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTLAHA   77 (154)
T ss_dssp             SS--SSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred             CC--CCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeeeecc
Confidence            79  6799999999754 333 789999999999999999999999985 579999999988888888899999999999


Q ss_pred             eCCC---CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHH
Q 021444          186 FSPP---NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQG  262 (312)
Q Consensus       186 ~~P~---~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~  262 (312)
                      +.|.   .|+++|+.++.|...        ....++..|++|||||||||+|++++.|||||.+.+.. ...|+.+||.+
T Consensus        78 ~~~~~~~~~~i~~~~~~~~~~~--------~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~-~~~l~~~Di~~  148 (154)
T PF00413_consen   78 YFPNNIVSGDIHFNDDESWTID--------DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPD-NKTLSEDDIDG  148 (154)
T ss_dssp             EESSSTTTTEEEEETTSHEESS--------SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSS-STSTTHHHHHH
T ss_pred             ccccccccccccccccccchhh--------hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCC-CCCCCHHHHHH
Confidence            9997   689999999999874        23578999999999999999999999999999998644 24899999999


Q ss_pred             HHHHhC
Q 021444          263 IQALYG  268 (312)
Q Consensus       263 Iq~LYG  268 (312)
                      ||+|||
T Consensus       149 i~~lYg  154 (154)
T PF00413_consen  149 IQYLYG  154 (154)
T ss_dssp             HHHHHS
T ss_pred             HHHHhC
Confidence            999998


No 4  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.95  E-value=5.7e-28  Score=206.44  Aligned_cols=148  Identities=26%  Similarity=0.311  Sum_probs=113.2

Q ss_pred             CceEEeecCCCC----ChHHHHHHHHHHHHHHhhcccceEEEcccC-CcccEEEEeeecCCCCCCCcCCCCCcceeeeCC
Q 021444          114 RDLTYAFLPDNQ----LTDEVKNVFATAFDRWSEATSLTFTRTDVY-TTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSP  188 (312)
Q Consensus       114 ~~LtY~i~~~~~----~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P  188 (312)
                      ..|+|++.+...    ....++++|++||+.|+++++|+|++++.. .++||+|.|....     +|+|.++++|||++|
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p   76 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP   76 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence            478999987543    247899999999999999999999998743 3899999997532     678999999999999


Q ss_pred             CCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCC-CccccccCCCC-CcCCCCHHHHHHHHHH
Q 021444          189 PNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVED-AIMYPSISSGS-RKVELANDDIQGIQAL  266 (312)
Q Consensus       189 ~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-sIM~p~y~~~~-~~~~L~~dDi~~Iq~L  266 (312)
                      ..|.  ++....|...-+..........+++.|++|||||+|||.|++++. +||||++.... .+..|+.+|+++||.|
T Consensus        77 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~l  154 (156)
T cd04279          77 LISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRL  154 (156)
T ss_pred             ccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence            7543  333322222110000000124689999999999999999999999 99999997632 3568999999999999


Q ss_pred             hC
Q 021444          267 YG  268 (312)
Q Consensus       267 YG  268 (312)
                      ||
T Consensus       155 Y~  156 (156)
T cd04279         155 YG  156 (156)
T ss_pred             hC
Confidence            97


No 5  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.90  E-value=5e-24  Score=182.74  Aligned_cols=143  Identities=25%  Similarity=0.348  Sum_probs=105.8

Q ss_pred             CCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCc
Q 021444          113 KRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGR  192 (312)
Q Consensus       113 ~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~  192 (312)
                      ++.|+|.|++.  .+..++++|++||+.|+++++|+|+++.....+||+|.+.+..+..    +|..+..++...|..|+
T Consensus         1 ~~~I~y~i~~~--~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~~g~   74 (165)
T cd04268           1 KKPITYYIDDS--VPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPLTGE   74 (165)
T ss_pred             CCCEEEEEcCC--CCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCCCcc
Confidence            46899999864  3478999999999999999999999998655899999987643111    11112222333466789


Q ss_pred             eEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC----------------CCCccccccCCC------C
Q 021444          193 FHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV----------------EDAIMYPSISSG------S  250 (312)
Q Consensus       193 ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~sIM~p~y~~~------~  250 (312)
                      |+++....|....      ...+..+..|++|||||||||.|+++                ..|||++.....      .
T Consensus        75 i~~~~~~~~~~~~------~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~  148 (165)
T cd04268          75 ILLARVYLYSSFV------EYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDG  148 (165)
T ss_pred             EEeeEEEEchhHH------HHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccc
Confidence            9998765443210      01245689999999999999999998                899999865421      2


Q ss_pred             CcCCCCHHHHHHHHHHh
Q 021444          251 RKVELANDDIQGIQALY  267 (312)
Q Consensus       251 ~~~~L~~dDi~~Iq~LY  267 (312)
                      ....|+.+||.+||.||
T Consensus       149 ~~~~~~~~Di~ai~~lY  165 (165)
T cd04268         149 QKYTIGPYDIAAIKKLY  165 (165)
T ss_pred             cCCCCCHHHHHHHHhcC
Confidence            35689999999999998


No 6  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.90  E-value=1.6e-23  Score=175.55  Aligned_cols=134  Identities=39%  Similarity=0.597  Sum_probs=107.7

Q ss_pred             CCCCCCCCceEEeecCCCCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444          107 PRWPSNKRDLTYAFLPDNQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA  185 (312)
Q Consensus       107 ~kW~~~~~~LtY~i~~~~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA  185 (312)
                      .+|  .+.+|+|+|.. +.++ .+++++|++||+.|+++++|+|+|+.+  ++||.|.|...+++        +..++++
T Consensus         3 ~~W--~~~~v~Y~i~~-~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~~~a~~   69 (140)
T smart00235        3 KKW--PKGTVPYVIDS-SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GCTLSHA   69 (140)
T ss_pred             CcC--CCCcEEEEEcC-CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Ccceeee
Confidence            589  56899999972 2233 446999999999999999999999874  89999999986641        1126889


Q ss_pred             eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCC---ccccccCCCCCcCCCCHHHHHH
Q 021444          186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDA---IMYPSISSGSRKVELANDDIQG  262 (312)
Q Consensus       186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---IM~p~y~~~~~~~~L~~dDi~~  262 (312)
                      +.| .|+++++. +.|...              ..|++|||||||||.|++++.+   +|++.+.+....+.++.+|..+
T Consensus        70 g~~-~g~~~~~~-~~~~~~--------------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~  133 (140)
T smart00235       70 GRP-GGDQHFSL-GNGCIN--------------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLG  133 (140)
T ss_pred             ecC-CCceEEEc-cCCcCC--------------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCC
Confidence            888 78899976 556542              3499999999999999998887   9999997644467788889999


Q ss_pred             HHHHhCC
Q 021444          263 IQALYGN  269 (312)
Q Consensus       263 Iq~LYG~  269 (312)
                      ++++||+
T Consensus       134 ~~~~yg~  140 (140)
T smart00235      134 IPYDYGS  140 (140)
T ss_pred             chhccCc
Confidence            9999984


No 7  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.89  E-value=6.7e-23  Score=179.96  Aligned_cols=125  Identities=23%  Similarity=0.301  Sum_probs=99.0

Q ss_pred             hHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCC-------CCceEecCCC
Q 021444          127 TDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPP-------NGRFHLDGDE  199 (312)
Q Consensus       127 ~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~-------~G~ihfd~~e  199 (312)
                      +...+++|++||+.|+++++|+|+|+.+...+||+|.+.....         +++.+.|++|.       .|+|++|.+.
T Consensus        32 ~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~  102 (186)
T cd04277          32 SAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSY  102 (186)
T ss_pred             CHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCc
Confidence            4789999999999999999999999987668999998764321         35678888885       3689999877


Q ss_pred             CceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCC----------------CCccccccCCC-------CCcCCCC
Q 021444          200 NWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVE----------------DAIMYPSISSG-------SRKVELA  256 (312)
Q Consensus       200 ~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~sIM~p~y~~~-------~~~~~L~  256 (312)
                      .|...        ..+.....|++|||||+|||.|+++.                .|||.+.....       .....++
T Consensus       103 ~~~~~--------~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~  174 (186)
T cd04277         103 DTNSD--------SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPM  174 (186)
T ss_pred             ccccC--------CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCcc
Confidence            66532        23556899999999999999999853                46897654332       2346889


Q ss_pred             HHHHHHHHHHhC
Q 021444          257 NDDIQGIQALYG  268 (312)
Q Consensus       257 ~dDi~~Iq~LYG  268 (312)
                      ..||+|||+|||
T Consensus       175 ~~DI~AlQ~lYG  186 (186)
T cd04277         175 LLDIAALQYLYG  186 (186)
T ss_pred             HHHHHHHHHhhC
Confidence            999999999998


No 8  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74  E-value=1.8e-17  Score=147.17  Aligned_cols=109  Identities=29%  Similarity=0.456  Sum_probs=76.2

Q ss_pred             CCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeC
Q 021444          108 RWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFS  187 (312)
Q Consensus       108 kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~  187 (312)
                      +|| +.++|+|.|...  .+...++.|++||+.|+++++|+|++++. .++||+|.|..+. | ++++.|..+..   ..
T Consensus         2 ~W~-~~~~~~~~f~~~--~~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-G-c~S~vG~~~~~---~~   72 (198)
T cd04327           2 LWR-NGTVLRIAFLGG--PDAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-G-YWSYVGTDALL---IG   72 (198)
T ss_pred             CCC-CCCeEEEEeCCC--CcHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-C-CCCCcCCcccc---cC
Confidence            697 477899998754  24788999999999999999999999985 5799999998643 3 45555543221   11


Q ss_pred             CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444          188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      +....+.++    |...       ...+.+...|++|||||||||.|++
T Consensus        73 ~~~~t~~l~----~~~~-------~~~~~~~~~~i~HElgHaLG~~HEh  110 (198)
T cd04327          73 ADAPTMNLG----WFTD-------DTPDPEFSRVVLHEFGHALGFIHEH  110 (198)
T ss_pred             CCCceeeee----eecC-------CCchhhHHHHHHHHHHHHhcCcccc
Confidence            112234442    2111       0123467789999999999999984


No 9  
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.72  E-value=6.7e-17  Score=138.49  Aligned_cols=121  Identities=24%  Similarity=0.309  Sum_probs=91.8

Q ss_pred             ChHHHHHHHHHHHHHHhhcccceEEEcccC-CcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCC-----CceEecCCC
Q 021444          126 LTDEVKNVFATAFDRWSEATSLTFTRTDVY-TTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPN-----GRFHLDGDE  199 (312)
Q Consensus       126 ~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-----G~ihfd~~e  199 (312)
                      +...++++|+.|++.|+++++++|+++... ..+||.+.+...+.        .++.+++||.|..     +.++++...
T Consensus        19 ~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~~~~c~~~~~~~~~~~~~   90 (167)
T cd00203          19 LSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYLGRVCDSLRGVGVLQDNQ   90 (167)
T ss_pred             hHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEecCCccCCCCCcEEEecCC
Confidence            447899999999999999999999999853 38999998865432        2235667776653     455554432


Q ss_pred             CceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC--------------------CCCccccccCCC--CCcCCCCH
Q 021444          200 NWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV--------------------EDAIMYPSISSG--SRKVELAN  257 (312)
Q Consensus       200 ~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~sIM~p~y~~~--~~~~~L~~  257 (312)
                      .|             +.....|++|||||+|||.|..+                    ..+||+|.....  .....++.
T Consensus        91 ~~-------------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~  157 (167)
T cd00203          91 SG-------------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQ  157 (167)
T ss_pred             cc-------------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCH
Confidence            22             23578899999999999999876                    368999876532  34568999


Q ss_pred             HHHHHHHHHh
Q 021444          258 DDIQGIQALY  267 (312)
Q Consensus       258 dDi~~Iq~LY  267 (312)
                      .||+.||.+|
T Consensus       158 ~d~~~i~~~Y  167 (167)
T cd00203         158 CDIDQINKLY  167 (167)
T ss_pred             HHHHHHHhhC
Confidence            9999999998


No 10 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.56  E-value=2.1e-14  Score=127.21  Aligned_cols=146  Identities=18%  Similarity=0.196  Sum_probs=90.1

Q ss_pred             CCCceEEeecCCCCChHHHHHHHHHHHHHHhhc------ccceEEEcccC--CcccEE---EEeeecCCCCCCCcCCCCC
Q 021444          112 NKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEA------TSLTFTRTDVY--TTADIR---IGFFTGDHGDGEPFDGPMG  180 (312)
Q Consensus       112 ~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~------s~l~F~ev~~~--~~aDI~---I~f~~~~hgd~~~fdG~gg  180 (312)
                      .|+.|||+|+++.  +.+++++|++||+.|+++      .++.++++...  ...||+   |+|....       ++..+
T Consensus         6 pk~pItyyI~~~~--p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~-------~~~~~   76 (197)
T cd04276           6 PKEPIVYYLDNTF--PEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP-------NGGWA   76 (197)
T ss_pred             CCCCEEEEecCCC--cHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC-------CCcce
Confidence            4789999999753  388999999999999987      35555555321  246888   5554322       12223


Q ss_pred             cceeeeCCCCC-----ceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC------------------
Q 021444          181 TLAHAFSPPNG-----RFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV------------------  237 (312)
Q Consensus       181 ~lAhA~~P~~G-----~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~------------------  237 (312)
                      ...+...|..|     +|+++...........   ....+..+..+++||+||+|||.|...                  
T Consensus        77 ~gps~~dPrTGeIl~a~V~l~~~~~~~~~~~~---~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~  153 (197)
T cd04276          77 YGPSVVDPRTGEILKADVILYSGFLRQDQLWY---EDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKE  153 (197)
T ss_pred             ecccccCCCCCCeEEEEEEeCchhhccchhHH---HHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhh
Confidence            33444556555     4556543221110000   001123577899999999999999752                  


Q ss_pred             ---CCCccccc---cCC------CCCcCCCCHHHHHHHHHHhCC
Q 021444          238 ---EDAIMYPS---ISS------GSRKVELANDDIQGIQALYGN  269 (312)
Q Consensus       238 ---~~sIM~p~---y~~------~~~~~~L~~dDi~~Iq~LYG~  269 (312)
                         ..|||.+.   +..      ......+...||.+||++||.
T Consensus       154 ~~~~~SVMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~  197 (197)
T cd04276         154 KGATSSVMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP  197 (197)
T ss_pred             cCCcceeecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence               35777432   210      012457899999999999984


No 11 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.52  E-value=1.1e-13  Score=121.20  Aligned_cols=92  Identities=18%  Similarity=0.249  Sum_probs=64.4

Q ss_pred             CceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444          114 RDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF  193 (312)
Q Consensus       114 ~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i  193 (312)
                      ..|.|.|++.  ++...++.|++||+.|+++++|+|++.+.   ++..|.|..+ .| ++.+.|..+        +...+
T Consensus         2 ~~VpY~i~~~--~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~~--------~~q~i   66 (180)
T cd04280           2 GTVPYVIDGS--FDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRVG--------GRQVV   66 (180)
T ss_pred             CEEEEEECCC--CCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCccC--------CceeE
Confidence            4789999863  34788999999999999999999999874   4556666654 33 333333211        11223


Q ss_pred             EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444          194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      .+..                 ++....+++|||||||||.|++.
T Consensus        67 ~l~~-----------------~c~~~g~v~HE~~HalG~~HEh~   93 (180)
T cd04280          67 SLGS-----------------GCFSLGTIVHELMHALGFYHEQS   93 (180)
T ss_pred             EeCC-----------------CcCcCchhHHHHHHHhcCcchhc
Confidence            3321                 34447899999999999999873


No 12 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.51  E-value=1.5e-13  Score=121.97  Aligned_cols=135  Identities=28%  Similarity=0.422  Sum_probs=91.7

Q ss_pred             CCCceEEeecCCC--CChHHHHHHHHHHHHHHhhcc-cceEEEc--ccCCcccEEEEeeecCCCCCCCcCCCCCcceeee
Q 021444          112 NKRDLTYAFLPDN--QLTDEVKNVFATAFDRWSEAT-SLTFTRT--DVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAF  186 (312)
Q Consensus       112 ~~~~LtY~i~~~~--~~~~~~~~~i~~A~~~Ws~~s-~l~F~ev--~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~  186 (312)
                      ..++|+..+.+..  .+...++.++++|++.|+++. .|+|+-+  .....+||.|.-...        +.+.|..+-|.
T Consensus        37 ~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~ggsAG  108 (211)
T PF12388_consen   37 SPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAGGSAG  108 (211)
T ss_pred             CCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcceecc
Confidence            3568888887632  344789999999999999973 5677522  223466776632211        12334567788


Q ss_pred             CC-CCC----ceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC-------------------------
Q 021444          187 SP-PNG----RFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS-------------------------  236 (312)
Q Consensus       187 ~P-~~G----~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~-------------------------  236 (312)
                      || ..|    .|.+     +.+.       .........|++|||||+|||.|++                         
T Consensus       109 FP~s~G~P~~~I~I-----~~~~-------~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpG  176 (211)
T PF12388_consen  109 FPTSNGNPYKFIQI-----YGLS-------NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPG  176 (211)
T ss_pred             CCCCCCCCCceEEE-----EecC-------CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCC
Confidence            99 544    3555     1111       1122346789999999999999986                         


Q ss_pred             -----CCCCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444          237 -----VEDAIMYPSISSGSRKVELANDDIQGIQALY  267 (312)
Q Consensus       237 -----~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY  267 (312)
                           |++|||...+.. .....++..||.|+++||
T Consensus       177 TPt~~d~~SiM~ac~~~-~~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  177 TPTGADPNSIMNACFSS-GEDGEFTSNDITALNYLY  211 (211)
T ss_pred             CCCCCCCchhhhccccC-CCCCCcChhhHHHHHhhC
Confidence                 356889887654 345689999999999999


No 13 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.47  E-value=5.7e-13  Score=118.38  Aligned_cols=99  Identities=23%  Similarity=0.400  Sum_probs=66.5

Q ss_pred             CCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444          106 MPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA  185 (312)
Q Consensus       106 ~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA  185 (312)
                      ..+||  +..|.|.|...  .+...+.+|++||+.|++.++|+|++... .  .-.|.|..+..| ++++.|..+.-.+ 
T Consensus         7 ~~~Wp--~~~VpY~i~~~--~~~~~~~~I~~Am~~~~~~TCirF~~r~~-~--~~yi~f~~~~~G-C~S~vG~~~~g~q-   77 (200)
T cd04281           7 ERIWP--GGVIPYVIDGN--FTGSQRAMFKQAMRHWENFTCVTFVERTP-E--ENYIVFTYRPCG-CCSYVGRRGNGPQ-   77 (200)
T ss_pred             cCcCC--CCEEEEEECCC--CCHHHHHHHHHHHHHHHhCCceEEEECCC-C--CCEEEEEECCCC-eeEcCCCcCCCce-
Confidence            46894  67999999752  34667899999999999999999998764 2  234455543334 3444554221011 


Q ss_pred             eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444          186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                            .+.+                 ..++....+++|||||||||-|++
T Consensus        78 ------~isl-----------------~~~C~~~Gti~HEl~HaLGf~HEh  105 (200)
T cd04281          78 ------AISI-----------------GKNCDKFGIVVHELGHVIGFWHEH  105 (200)
T ss_pred             ------eeec-----------------CCCcCcCchHHHHHHHHhcCcchh
Confidence                  1211                 123445689999999999999986


No 14 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.44  E-value=2.6e-13  Score=119.93  Aligned_cols=96  Identities=25%  Similarity=0.492  Sum_probs=60.0

Q ss_pred             CCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeC
Q 021444          108 RWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFS  187 (312)
Q Consensus       108 kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~  187 (312)
                      |||  ..+|.|.|.+.  .+...++.|++||+.|++.++|+|+++. . .....|.|..+ .| ++++.|..+       
T Consensus         2 ~Wp--~~~IpY~~~~~--~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~g-C~S~vG~~~-------   66 (191)
T PF01400_consen    2 KWP--NGTIPYYIDPS--FSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SG-CWSYVGRQG-------   66 (191)
T ss_dssp             S-G--GGEEEEEEETT--S-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SS-EEEESS--S-------
T ss_pred             cCC--CCEEEEEECCC--CCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-cc-ccchhhhcC-------
Confidence            794  56999999863  3477889999999999999999999998 3 34444556432 23 233333210       


Q ss_pred             CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444          188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                       +.-.|.+.                 .++....|++|||||||||.|++
T Consensus        67 -g~q~i~l~-----------------~~c~~~~~i~HEl~HaLG~~HEh   97 (191)
T PF01400_consen   67 -GEQTINLG-----------------DGCFSVGTILHELGHALGFWHEH   97 (191)
T ss_dssp             -SEEEEEE------------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred             -cceeEEec-----------------ceeCCccchHHHHHHHHhhhhhh
Confidence             00122222                 13345679999999999999986


No 15 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.33  E-value=1.5e-11  Score=111.25  Aligned_cols=98  Identities=23%  Similarity=0.380  Sum_probs=66.0

Q ss_pred             CCCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCccee
Q 021444          105 GMPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAH  184 (312)
Q Consensus       105 ~~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAh  184 (312)
                      ...+||  + .|.|.|.+.  .+...+..|++||+.|++.++|+|++.+. .. | .|.|..+. | |+++.|..|    
T Consensus        42 ~~~~Wp--~-~VPY~i~~~--~~~~~~~~I~~Am~~~~~~TCirF~~rt~-e~-~-yi~i~~~~-G-C~S~vG~~g----  107 (230)
T cd04282          42 DTYRWP--F-PIPYILDDS--LDLNAKGVILKAFEMYRLKSCVDFKPYEG-ES-N-YIFFFKGS-G-CWSMVGDQQ----  107 (230)
T ss_pred             cccCCC--c-ceeEEECCC--CCHHHHHHHHHHHHHHHhCCCeeEEECCC-CC-c-EEEEEcCC-C-eeeccCccC----
Confidence            356995  3 899999753  34678899999999999999999998763 22 2 34555432 3 344444311    


Q ss_pred             eeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444          185 AFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       185 A~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                          +...+.+.                 .++....+++|||||||||-|++.
T Consensus       108 ----g~q~isl~-----------------~~C~~~Gti~HEl~HalGf~HEqs  139 (230)
T cd04282         108 ----GGQNLSIG-----------------AGCDYKATVEHEFLHALGFYHEQS  139 (230)
T ss_pred             ----CeEEEEEC-----------------CCcCCCchHHHHHHHHhCCccccc
Confidence                01122221                 245557899999999999999873


No 16 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.32  E-value=1.2e-11  Score=108.37  Aligned_cols=92  Identities=17%  Similarity=0.288  Sum_probs=60.0

Q ss_pred             CceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444          114 RDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF  193 (312)
Q Consensus       114 ~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i  193 (312)
                      -.|.|.|+..  .+...++.|++||+.|++.++|+|++.+. ...-  |.|..+ .| ++++.|..|.        ...|
T Consensus         4 ~~IpY~i~~~--~~~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~y--i~~~~~-~g-C~S~vG~~gg--------~q~i   68 (182)
T cd04283           4 VYVPYVISPQ--YSENERAVIEKAMQEFETLTCVRFVPRTT-ERDY--LNIESR-SG-CWSYIGRQGG--------RQTV   68 (182)
T ss_pred             EEEEEEECCC--CCHHHHHHHHHHHHHHHhCCceeeEECCC-CCcE--EEEEcC-CC-ceEecCccCC--------ceeE
Confidence            3688999742  34677899999999999999999998763 3333  444432 23 3333343210        0112


Q ss_pred             EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444          194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      .+.                ..++....+++|||||||||.|++
T Consensus        69 ~l~----------------~~~C~~~G~i~HEl~HaLG~~HEh   95 (182)
T cd04283          69 SLQ----------------KQGCMYKGIIQHELLHALGFYHEQ   95 (182)
T ss_pred             ecC----------------CCCcCccchHHHHHHHHhCCcccc
Confidence            221                123445689999999999999987


No 17 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.26  E-value=3.1e-11  Score=118.37  Aligned_cols=106  Identities=22%  Similarity=0.273  Sum_probs=73.4

Q ss_pred             ccccCCCCCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCC
Q 021444           99 HYSFFPGMPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGP  178 (312)
Q Consensus        99 ~~~~~~~~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~  178 (312)
                      |........+||  ...|.|.|...  .+...+.+|+.||+.|++.++|+|++..+ .+.+..+.+...  | ++.+.|.
T Consensus        72 r~~~~~~~~~Wp--~~~ipY~i~~~--~~~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--g-C~S~VGr  143 (411)
T KOG3714|consen   72 RNGTSNPERRWP--NGVIPYYIDGS--FTSSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--G-CYSYVGR  143 (411)
T ss_pred             hhcccChhhcCC--CCeeeeEECCC--CCHHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--c-ceeeeCc
Confidence            333444567995  48999999864  44889999999999999999999999874 345555544332  3 3444454


Q ss_pred             CCcceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444          179 MGTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       179 gg~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      .|.--.       .+        +         ...++.-..|++|||+||||+-|.+
T Consensus       144 ~gg~~q-------~~--------s---------l~~~C~~~G~i~HEl~HaLGf~Heh  177 (411)
T KOG3714|consen  144 RGGGQQ-------LL--------S---------LGDGCDRFGTIVHELMHALGFWHEH  177 (411)
T ss_pred             cCCCcc-------ce--------e---------cCCCcCcCchhHHHHHHHhhhhhcc
Confidence            221100       11        1         1235666899999999999999987


No 18 
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.25  E-value=1.4e-11  Score=87.47  Aligned_cols=55  Identities=31%  Similarity=0.513  Sum_probs=47.3

Q ss_pred             CChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHh
Q 021444            2 DGLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHI   59 (312)
Q Consensus         2 ~~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M   59 (312)
                      +.|..++.+|.++||++.   ..+..+++.+++||+.||+++||++||++|.+|+++|
T Consensus         3 ~~v~~lq~~L~~~gy~~~---~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen    3 PDVKALQQYLNRLGYYPG---PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             HHHHHHHHHHHHTTTT-S---STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             HHHHHHHHHHHHcCCCCC---CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence            357899999999999943   2346789999999999999999999999999999987


No 19 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.90  E-value=6.8e-09  Score=91.12  Aligned_cols=123  Identities=24%  Similarity=0.296  Sum_probs=75.9

Q ss_pred             HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCC---------cceeeeCCCCCceEecCC
Q 021444          128 DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMG---------TLAHAFSPPNGRFHLDGD  198 (312)
Q Consensus       128 ~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg---------~lAhA~~P~~G~ihfd~~  198 (312)
                      ..+..++..|++.|++.-|+..+|-.  ..|||+|...+. .|.+  .++.+.         .+.|+-.|+    |++..
T Consensus       103 p~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~-pgtg--~~~~grArtal~~~efl~~~~~~~----~lnhr  173 (236)
T COG5549         103 PRWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNP-PGTG--WRQYGRARTALIAYEFLGHALGLG----HLNHR  173 (236)
T ss_pred             hhHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCC-CCCc--ccccchHHHHHHHHHHhhhccCcc----ccccc
Confidence            45788999999999999666665554  689999976542 2222  222211         122232222    22110


Q ss_pred             CCceecCCCccccCCccccchhhhhhhhhhhcCCC-CCCCCCCccccccCCCCCcCCCCHHHHHHHHHHhCCC
Q 021444          199 ENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLG-HSSVEDAIMYPSISSGSRKVELANDDIQGIQALYGNN  270 (312)
Q Consensus       199 e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~  270 (312)
                        ..+..    .+.....++..+|.||+|||||+. |+.- .++|.+..+    ...++..|++.+..+|-.+
T Consensus       174 --~~i~~----~pg~~~e~L~~tarhElGhaLgi~ghsd~-ks~aly~~s----~~~is~rdv~tL~r~Yeqp  235 (236)
T COG5549         174 --GDIMY----PPGELRENLNPTARHELGHALGIWGHSDL-KSDALYGIS----WQGISRRDVKTLERKYEQP  235 (236)
T ss_pred             --ccccC----CcccchhhhhHHHHHhhcchheecccccc-hhhhhcchh----hcccCHHHHHHHHHHhcCC
Confidence              00000    012335579999999999999997 8774 455544333    2348999999999999754


No 20 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.90  E-value=5.7e-09  Score=85.10  Aligned_cols=124  Identities=22%  Similarity=0.296  Sum_probs=71.8

Q ss_pred             ceEEeecCCCCChHHHHHHHHHHHHHHh-hcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444          115 DLTYAFLPDNQLTDEVKNVFATAFDRWS-EATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF  193 (312)
Q Consensus       115 ~LtY~i~~~~~~~~~~~~~i~~A~~~Ws-~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i  193 (312)
                      +|+|.-...    .+++..|.+|.+.|+ .|.+++|++...   ++|+|.-..    +      +++..+..-..+.|.|
T Consensus         3 tv~Ydas~A----~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~----~------~~~sya~~~g~G~G~I   65 (132)
T PF02031_consen    3 TVYYDASRA----PEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN----D------PRGSYASTDGLGSGYI   65 (132)
T ss_dssp             EEEEEEEE-----GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE------------TT--EEEE-SSS-EEE
T ss_pred             EEEEeCCCC----chhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec----C------CCCcccccCCCCcEEE
Confidence            466654421    568899999999997 578999999863   888876432    1      1233333333456788


Q ss_pred             EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCC-C-CCCCccccccCCCC-CcCCCCHHHHHHHHHHhC
Q 021444          194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHS-S-VEDAIMYPSISSGS-RKVELANDDIQGIQALYG  268 (312)
Q Consensus       194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs-~-~~~sIM~p~y~~~~-~~~~L~~dDi~~Iq~LYG  268 (312)
                      .+|..  +           ..+++-..|++|||||.|||..- . .-..+|.-.-.+-. .+..+...++..+++||.
T Consensus        66 ~l~~~--~-----------~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~a  130 (132)
T PF02031_consen   66 FLDYQ--Q-----------NQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLFA  130 (132)
T ss_dssp             EEEHH--H-----------HHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHCT
T ss_pred             EechH--H-----------hhCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHhc
Confidence            88742  2           13567788999999999999543 2 34578874332211 234678899999999986


No 21 
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.75  E-value=6.5e-05  Score=67.03  Aligned_cols=56  Identities=16%  Similarity=0.230  Sum_probs=48.8

Q ss_pred             ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcC
Q 021444            3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILR   61 (312)
Q Consensus         3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~   61 (312)
                      .+..++..|.+.||....   .+..++..+.+||+.||+.+||+++|++|..|...|..
T Consensus        15 ~V~~LQ~~L~~lG~~~g~---idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~   70 (201)
T TIGR02869        15 DVIEIQRRLKAWGYYNGK---VDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGI   70 (201)
T ss_pred             HHHHHHHHHHHcCCCCCC---CCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCC
Confidence            477899999999997532   24568899999999999999999999999999999944


No 22 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.74  E-value=0.00019  Score=63.05  Aligned_cols=49  Identities=27%  Similarity=0.347  Sum_probs=34.0

Q ss_pred             ccchhhhhhhhhhhcCCCCCCCC----------CCccccccCCCCCcCCCCHHHHHHHHH
Q 021444          216 IDLESVAVHEIGHLLGLGHSSVE----------DAIMYPSISSGSRKVELANDDIQGIQA  265 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~~~----------~sIM~p~y~~~~~~~~L~~dDi~~Iq~  265 (312)
                      .....+++|||||.|||.|....          .-||.|.... .....++.--+.-|+.
T Consensus       131 ~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~  189 (192)
T cd04267         131 LLTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE  189 (192)
T ss_pred             eeehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence            34567999999999999998753          2489886643 2334566666665554


No 23 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.71  E-value=0.00019  Score=64.04  Aligned_cols=119  Identities=18%  Similarity=0.234  Sum_probs=68.4

Q ss_pred             HHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCceEecCCCCceecCCCccccCCc
Q 021444          135 ATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMS  214 (312)
Q Consensus       135 ~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~  214 (312)
                      -+.|..|.......-..  +....|+.+-+...+....   ++.+++++.||.-+   + ........+..       ..
T Consensus        73 L~~F~~w~~~~~~~~~~--~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~gg---i-C~~~~s~~i~~-------~~  136 (207)
T cd04273          73 LKSFCRWQKKLNPPNDS--DPEHHDHAILLTRQDICRS---NGNCDTLGLAPVGG---M-CSPSRSCSINE-------DT  136 (207)
T ss_pred             HHHHHHHHHHcCCcccc--cccccceEEEEeeeccccc---CCCCCceEEecccc---C-CCCCcceEEEc-------CC
Confidence            57888997643221111  1236777777765443211   35566777776422   1 00001111111       12


Q ss_pred             cccchhhhhhhhhhhcCCCCCCC---------CCCccccccCCCCCcCCCCHHHHHHHHHHhCC
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSV---------EDAIMYPSISSGSRKVELANDDIQGIQALYGN  269 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~---------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~  269 (312)
                      +.....|++|||||.||+.|-.+         ..-||+|..........++.-=+..++.+...
T Consensus       137 ~~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~~  200 (207)
T cd04273         137 GLSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLDT  200 (207)
T ss_pred             CceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHcc
Confidence            34556799999999999999875         26899998764334456677667667665543


No 24 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.70  E-value=5e-05  Score=67.95  Aligned_cols=48  Identities=27%  Similarity=0.265  Sum_probs=31.6

Q ss_pred             chhhhhhhhhhhcCCCCCCCC--------------CCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444          218 LESVAVHEIGHLLGLGHSSVE--------------DAIMYPSISSGSRKVELANDDIQGIQALY  267 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~--------------~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY  267 (312)
                      ...+++|||||+|||.|..+.              .+||.+.-..  ....++.-+|.-|+...
T Consensus       137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~~--~~~~FS~~~i~~i~~~~  198 (206)
T PF13583_consen  137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASRP--SRPYFSPPSIEYIREVV  198 (206)
T ss_pred             cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCCC--CCCCCCchhHHHHhCCC
Confidence            457899999999999998742              3688632111  11236777777775443


No 25 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.68  E-value=0.00013  Score=64.37  Aligned_cols=139  Identities=21%  Similarity=0.330  Sum_probs=83.3

Q ss_pred             CCceEEeecCCCC---Ch----HHHHHHHHHHHH---HHhhcccceEEEcccCCcccEEEEeeecCCCCCC--CcCCCCC
Q 021444          113 KRDLTYAFLPDNQ---LT----DEVKNVFATAFD---RWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGE--PFDGPMG  180 (312)
Q Consensus       113 ~~~LtY~i~~~~~---~~----~~~~~~i~~A~~---~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~--~fdG~gg  180 (312)
                      .+.++|++.-...   ..    ++..++|...+.   -|.....+.|+.|.. ..+|++|...+...-+..  +.+-.+.
T Consensus        28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e  106 (203)
T PF11350_consen   28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGE  106 (203)
T ss_pred             CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCce
Confidence            3789999873221   11    445555555554   688888899999986 459999999875432211  1221111


Q ss_pred             cceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC-----CCCccccccCCCCCcCCC
Q 021444          181 TLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV-----EDAIMYPSISSGSRKVEL  255 (312)
Q Consensus       181 ~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~-----~~sIM~p~y~~~~~~~~L  255 (312)
                        ..-+.+..++|.+|.+ .|..+.....  ..-..+.+.|+=||+||+||-+|..=     +..||-.      +.+.|
T Consensus       107 --~SC~~~~g~~VvIN~~-RWv~Ga~~f~--gdl~~YRqYvINHEVGH~LGh~H~~Cpg~G~~APVMmQ------QT~gL  175 (203)
T PF11350_consen  107 --TSCRNPAGGRVVINLA-RWVRGAPAFG--GDLASYRQYVINHEVGHALGHGHEPCPGPGRPAPVMMQ------QTLGL  175 (203)
T ss_pred             --eEeecCCCCeEEEehH-HhhccCcccC--CcHHHHHHHhhhhhhhhhcccCCCcCCCCCCcCCcchh------hhccc
Confidence              1223444567777763 6776533210  12234678899999999999988764     3346642      23345


Q ss_pred             CHHHHHHH
Q 021444          256 ANDDIQGI  263 (312)
Q Consensus       256 ~~dDi~~I  263 (312)
                      .-++|..|
T Consensus       176 ~g~~~~~~  183 (203)
T PF11350_consen  176 DGCELAKL  183 (203)
T ss_pred             CCcccccc
Confidence            55555444


No 26 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.44  E-value=6.3e-05  Score=64.33  Aligned_cols=25  Identities=32%  Similarity=0.498  Sum_probs=16.1

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCC
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSVED  239 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~~~  239 (312)
                      ..+...|++||+||-|||.|.....
T Consensus        66 ~~~~g~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT---TT---
T ss_pred             ccccccchhhhhhhhhcccccccCC
Confidence            4567889999999999999998654


No 27 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.42  E-value=4.8e-05  Score=66.19  Aligned_cols=47  Identities=26%  Similarity=0.219  Sum_probs=19.7

Q ss_pred             chhhhhhhhhhhcCCCCCCCCC-------------CccccccCCCCCcCCCCHHHHHHHHHH
Q 021444          218 LESVAVHEIGHLLGLGHSSVED-------------AIMYPSISSGSRKVELANDDIQGIQAL  266 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~-------------sIM~p~y~~~~~~~~L~~dDi~~Iq~L  266 (312)
                      ...|++|||||.||+.|..+..             .||++..  ......+++--|..|+.+
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~--~~~~~~fS~cS~~~i~~~  170 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAG--ICNNDKFSPCSIRQIRAV  170 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG--------------
T ss_pred             eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCC--CCCCcccccccccccccc
Confidence            4568999999999999987642             6898552  233445666666666654


No 28 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.33  E-value=9.3e-05  Score=63.83  Aligned_cols=28  Identities=39%  Similarity=0.624  Sum_probs=25.2

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccccc
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMYPS  245 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~  245 (312)
                      ...=++||+||.|||.|=.++.-||++.
T Consensus       124 v~KEv~HElGH~~GL~HC~N~~CVM~FS  151 (181)
T COG1913         124 VVKEVLHELGHLLGLSHCPNPRCVMNFS  151 (181)
T ss_pred             HHHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence            4567899999999999999999999865


No 29 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.30  E-value=0.00011  Score=66.64  Aligned_cols=75  Identities=27%  Similarity=0.377  Sum_probs=42.3

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCC-cCCCCHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCCCCcch
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSR-KVELANDDIQGIQALYGNNPNFNGSSVPTDQQRDTSGAHGAY  293 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~-~~~L~~dDi~~Iq~LYG~~~~~~~~~~~~~~~~~~~~~~~~~  293 (312)
                      .+++-.|++|||||.|||.|.+...+   +.....+. .-++...     ...||.+.....+  |..+..|...|.=+|
T Consensus       134 ~~n~g~t~~HEvGH~lGL~HtF~~~~---~c~~~gD~v~DTP~~~-----~~~~~cp~~~dsC--~~~~~~d~~~N~MdY  203 (225)
T cd04275         134 PYNLGDTATHEVGHWLGLYHTFQGGS---PCCTTGDYVADTPAEA-----SPSYGCPAGRDTC--PGQPGLDPIHNYMDY  203 (225)
T ss_pred             cccccceeEEeccceeeeeeeecCCC---CCcCCCCccCCCcccc-----ccccCCCCCCCCC--CCCCCCCchhhcccc
Confidence            45788899999999999999998764   11111100 0011100     1157754433333  333456666776666


Q ss_pred             hcccch
Q 021444          294 FLGSCL  299 (312)
Q Consensus       294 ~~~~~~  299 (312)
                      ---+|.
T Consensus       204 s~d~C~  209 (225)
T cd04275         204 SDDSCM  209 (225)
T ss_pred             cCcchh
Confidence            666664


No 30 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.25  E-value=9.5e-05  Score=65.00  Aligned_cols=23  Identities=35%  Similarity=0.369  Sum_probs=15.2

Q ss_pred             ccchhhhhhhhhhhcCCCCCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGHSSVE  238 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~~~  238 (312)
                      .....|++|||||.||+.|..+.
T Consensus       140 ~~~~~~~AHEiGH~lGa~HD~~~  162 (196)
T PF13688_consen  140 YNGAITFAHEIGHNLGAPHDGDY  162 (196)
T ss_dssp             HHHHHHHHHHHHHHTT-----SS
T ss_pred             CceehhhHHhHHHhcCCCCCCCC
Confidence            45677999999999999998775


No 31 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.13  E-value=0.00023  Score=62.41  Aligned_cols=27  Identities=33%  Similarity=0.548  Sum_probs=24.3

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCcccc
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMYP  244 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p  244 (312)
                      +..+++|||||.|||.|-.++.-||+.
T Consensus       125 ~~k~~~HElGH~lGL~HC~~~~CvM~~  151 (179)
T PRK13267        125 VRKEVTHELGHTLGLEHCDNPRCVMNF  151 (179)
T ss_pred             HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence            455799999999999999999999985


No 32 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.13  E-value=0.00015  Score=58.94  Aligned_cols=18  Identities=50%  Similarity=0.680  Sum_probs=12.7

Q ss_pred             chhhhhhhhhhhcCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHS  235 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs  235 (312)
                      ...+++|||||.|||.|.
T Consensus       107 ~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  107 GVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             STTHHHHHHHHHTT----
T ss_pred             cceEeeehhhHhcCCCCC
Confidence            458999999999999995


No 33 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=96.98  E-value=0.0018  Score=66.31  Aligned_cols=37  Identities=24%  Similarity=0.311  Sum_probs=33.9

Q ss_pred             CcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcCC
Q 021444           26 DDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILRP   62 (312)
Q Consensus        26 ~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~p   62 (312)
                      ..+|.++.+|||+||+..||..+|+++..|++.|+.|
T Consensus       307 ~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        307 AAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             cccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            4578999999999999999999999999999998754


No 34 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.92  E-value=0.00036  Score=61.81  Aligned_cols=28  Identities=43%  Similarity=0.630  Sum_probs=20.0

Q ss_pred             cchhhhhhhhhhhcCCCCCCCCCCccccc
Q 021444          217 DLESVAVHEIGHLLGLGHSSVEDAIMYPS  245 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~  245 (312)
                      .+..+++||+||.+||.|=.+ .-||++.
T Consensus       144 R~~Kea~HElGH~~GL~HC~~-~CvM~~s  171 (194)
T PF07998_consen  144 RVCKEAVHELGHLFGLDHCEN-RCVMNFS  171 (194)
T ss_dssp             HHHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred             HHHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence            456799999999999999999 9999974


No 35 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.87  E-value=0.0011  Score=60.27  Aligned_cols=49  Identities=41%  Similarity=0.514  Sum_probs=37.3

Q ss_pred             hhhhhhhhhhcCCCCCCCC-------------------------CCccccccCCCCCcCCCCHHHHHHHHHHhCCC
Q 021444          220 SVAVHEIGHLLGLGHSSVE-------------------------DAIMYPSISSGSRKVELANDDIQGIQALYGNN  270 (312)
Q Consensus       220 ~Va~HEiGHaLGL~Hs~~~-------------------------~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~  270 (312)
                      .|++|||||.||+.|-.+.                         .=||+|....  ....++.-=+.-|+.+...+
T Consensus       147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~  220 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN  220 (228)
T ss_pred             eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence            6999999999999996431                         1199997554  45678888888888887643


No 36 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.81  E-value=0.0017  Score=57.15  Aligned_cols=50  Identities=30%  Similarity=0.384  Sum_probs=33.5

Q ss_pred             cchhhhhhhhhhhcCCCCCCC------CCCccccccCCCCCcCCCCHHHHHHHHHHhC
Q 021444          217 DLESVAVHEIGHLLGLGHSSV------EDAIMYPSISSGSRKVELANDDIQGIQALYG  268 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG  268 (312)
                      ....|++|||||.||+.|-.+      ..-||+|....  ....++.-=+.-++.+..
T Consensus       130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~  185 (194)
T cd04269         130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLS  185 (194)
T ss_pred             HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHh
Confidence            346799999999999999653      35799987553  233455544555555444


No 37 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=95.99  E-value=0.0083  Score=55.06  Aligned_cols=50  Identities=24%  Similarity=0.388  Sum_probs=34.4

Q ss_pred             hhhhhhhhhhhcCCCCCCC-----C------CCccccccCC--CCCcCCCCHHHHHHHHHHhC
Q 021444          219 ESVAVHEIGHLLGLGHSSV-----E------DAIMYPSISS--GSRKVELANDDIQGIQALYG  268 (312)
Q Consensus       219 ~~Va~HEiGHaLGL~Hs~~-----~------~sIM~p~y~~--~~~~~~L~~dDi~~Iq~LYG  268 (312)
                      ..|++|||||.||+.|-..     +      .=||+|.-..  ......++.-=+..++.+.-
T Consensus       168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~  230 (244)
T cd04270         168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE  230 (244)
T ss_pred             HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence            4699999999999999665     1      3499987532  12345677766666665544


No 38 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=95.88  E-value=0.011  Score=52.23  Aligned_cols=51  Identities=25%  Similarity=0.487  Sum_probs=31.1

Q ss_pred             ccchhhhhhhhhhhcCCCCCCC--------CCCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444          216 IDLESVAVHEIGHLLGLGHSSV--------EDAIMYPSISSGSRKVELANDDIQGIQALY  267 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~~--------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY  267 (312)
                      ..+-.+++|||||.||+.|-..        ..-||.+...+. ....++.-=+..++.+.
T Consensus       129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~-~~~~fS~CS~~~~~~~l  187 (199)
T PF01421_consen  129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPS-SSDKFSNCSRRQFEEFL  187 (199)
T ss_dssp             HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSS-STSSS-HHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCC-CCCCcCHHHHHHHHHHH
Confidence            4566799999999999999776        256899865432 22356665555555543


No 39 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.51  E-value=0.018  Score=51.83  Aligned_cols=51  Identities=22%  Similarity=0.320  Sum_probs=34.0

Q ss_pred             cchhhhhhhhhhhcCCCCCCC------------------CCCccccccCCCCCcCCCCHHHHHHHHHHhC
Q 021444          217 DLESVAVHEIGHLLGLGHSSV------------------EDAIMYPSISSGSRKVELANDDIQGIQALYG  268 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~------------------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG  268 (312)
                      ....+++|||||.||+.|-.+                  ..-||.+.... .....++.--+.-++.+..
T Consensus       144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~-~~~~~fS~CS~~~~~~~l~  212 (220)
T cd04272         144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNG-ERQYRFSQCSQRQIRNVFR  212 (220)
T ss_pred             ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCC-CCCCccCcchHHHHHHHHc
Confidence            456899999999999999532                  23688875432 2344566666666665543


No 40 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=95.46  E-value=0.022  Score=48.98  Aligned_cols=59  Identities=19%  Similarity=0.300  Sum_probs=49.0

Q ss_pred             ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCCCCCHHHHHHhcCCC
Q 021444            3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNL-EPTGQLDSQTMQHILRPR   63 (312)
Q Consensus         3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gL-pvTG~lD~~T~~~M~~pR   63 (312)
                      -+..++.-|.+.||.+.  ...+..+......||+.||+.++| +++|.++++|+..+...+
T Consensus        47 ~v~~lq~~L~~~g~~~~--~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~~  106 (185)
T COG3409          47 SVRILQAALNALGYYPD--GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQL  106 (185)
T ss_pred             hHHHHHHHHHhcCCCCC--CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHHh
Confidence            36789999999999873  122345678889999999999999 899999999999987754


No 41 
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=95.31  E-value=0.046  Score=50.07  Aligned_cols=54  Identities=22%  Similarity=0.429  Sum_probs=45.5

Q ss_pred             CChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCCCCCHHHHHHhc
Q 021444            2 DGLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNL-EPTGQLDSQTMQHIL   60 (312)
Q Consensus         2 ~~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gL-pvTG~lD~~T~~~M~   60 (312)
                      +++..+++-|.+|||--+.     ..+|+..+.+|+.||.-|+= .++|+.|.+|+..+.
T Consensus       198 ~~v~~lq~~L~~YGY~v~~-----~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l~  252 (257)
T COG3023         198 EDVAALQEMLARYGYGVEI-----GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAILQ  252 (257)
T ss_pred             CCHHHHHHHHHHhCcCCCc-----chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHHH
Confidence            4688999999999996432     46899999999999988865 689999999988763


No 42 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=94.96  E-value=0.048  Score=54.80  Aligned_cols=58  Identities=21%  Similarity=0.292  Sum_probs=46.0

Q ss_pred             ChHHHHHHHHhcC-CCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcC
Q 021444            3 GLAKLKKYFHYFG-YIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILR   61 (312)
Q Consensus         3 ~~~~~~~yL~~fG-Yl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~   61 (312)
                      .+..+.+-|.+-| -++... ..+..++.++.+|||+||+.+||+..|++++.|+++|+-
T Consensus       240 ~v~aL~~~L~~~~~d~~~a~-~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~  298 (561)
T COG2989         240 DVPALRARLARSGMDLPSAA-GSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV  298 (561)
T ss_pred             hHHHHHHHHHhcCccchhhc-cCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence            3566677777777 344322 234478999999999999999999999999999999976


No 43 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=94.92  E-value=0.034  Score=54.44  Aligned_cols=74  Identities=20%  Similarity=0.284  Sum_probs=46.8

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccc---cccCCC-------------------CCcCCCCHHHHHHHHHHhCCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMY---PSISSG-------------------SRKVELANDDIQGIQALYGNNPNFNG  275 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~---p~y~~~-------------------~~~~~L~~dDi~~Iq~LYG~~~~~~~  275 (312)
                      -...++|||||.||+.|-.  +-||-   ++++..                   -.+-+.++.-+++++-+|-+.-..++
T Consensus       301 tlGA~~HElGH~lgcpHq~--~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~  378 (614)
T KOG4525|consen  301 TLGAVCHELGHCLGCPHQS--EGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPG  378 (614)
T ss_pred             HHHHHHHHhhhccCCCCCC--CceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCC
Confidence            3557899999999999965  56773   332210                   01335677888999999987666555


Q ss_pred             -CCCCCCCCCCCCCCCcch
Q 021444          276 -SSVPTDQQRDTSGAHGAY  293 (312)
Q Consensus       276 -~~~~~~~~~~~~~~~~~~  293 (312)
                       +|=|+....-++...++-
T Consensus       379 dptyps~~~s~y~~~~~s~  397 (614)
T KOG4525|consen  379 DPTYPSSSSSSYSSSSSSA  397 (614)
T ss_pred             CCCCCCccccceecCCCcc
Confidence             333555555554444443


No 44 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.62  E-value=0.014  Score=55.22  Aligned_cols=21  Identities=38%  Similarity=0.642  Sum_probs=15.3

Q ss_pred             ccchhhhhhhhhhhcCCCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      .....+++||+||++||+|..
T Consensus       191 ~s~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  191 YSYGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             S-SHHHHHHHHHHTTT--SST
T ss_pred             cCccceeehhhhhhcCCCCCC
Confidence            346789999999999999943


No 45 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.53  E-value=0.019  Score=54.72  Aligned_cols=19  Identities=42%  Similarity=0.592  Sum_probs=17.0

Q ss_pred             chhhhhhhhhhhcCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~  236 (312)
                      -..|++||+||-|||.|+.
T Consensus       150 ~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  150 DWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             cHHHHHHHhhhhccccccC
Confidence            3569999999999999996


No 46 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=93.37  E-value=0.04  Score=54.40  Aligned_cols=57  Identities=23%  Similarity=0.296  Sum_probs=36.2

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccccccCC-------------CCC---------cCCCCHHHHHHHHHHhCCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMYPSISS-------------GSR---------KVELANDDIQGIQALYGNNPNFNG  275 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~-------------~~~---------~~~L~~dDi~~Iq~LYG~~~~~~~  275 (312)
                      .....+|||||+|||.|..+  .||---|..             .+.         +.+-.+.....++.+|.+.-..+.
T Consensus       316 ~iGA~lHEiGH~fg~pH~~~--GIM~Rdy~~~NR~F~~~E~~~~rtk~~g~~~~~~~~ec~whrld~lrf~~hP~F~lp~  393 (423)
T PF12044_consen  316 GIGAFLHEIGHLFGCPHQED--GIMLRDYDRLNRSFTTREPYSTRTKSKGLRPCLPKDECHWHRLDALRFRYHPSFRLPS  393 (423)
T ss_pred             hhHHHHHHHHHhcCCCCCCC--CeeeCCchhhcceeeccccccccccCCCcCcccCCccccccHhhhHhhhcCccccCCC
Confidence            35578999999999999876  777433311             000         112345667778888876655544


Q ss_pred             C
Q 021444          276 S  276 (312)
Q Consensus       276 ~  276 (312)
                      .
T Consensus       394 D  394 (423)
T PF12044_consen  394 D  394 (423)
T ss_pred             c
Confidence            4


No 47 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.15  E-value=0.028  Score=52.78  Aligned_cols=20  Identities=40%  Similarity=0.406  Sum_probs=17.2

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      ...|++||+||.|||.+--+
T Consensus       165 ~igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       165 GVGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             ceeeeehhhhcccCCCcccc
Confidence            36899999999999988654


No 48 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=91.94  E-value=3.7  Score=37.03  Aligned_cols=150  Identities=15%  Similarity=0.095  Sum_probs=81.8

Q ss_pred             CCCCCCCCceEEeecCCCCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCC---------CCcC
Q 021444          107 PRWPSNKRDLTYAFLPDNQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDG---------EPFD  176 (312)
Q Consensus       107 ~kW~~~~~~LtY~i~~~~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~---------~~fd  176 (312)
                      .||   ...++..+....... ....+.+..-++.-+.++.+.+..+... +|+|.|.|...+....         ..+.
T Consensus        28 ~Rw---~~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~  103 (213)
T PF11150_consen   28 RRW---EGPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV  103 (213)
T ss_pred             ccC---CCCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence            489   357888887322111 3344555555777788899999888864 8999999987654110         0000


Q ss_pred             C-------CC-CcceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC--CCCCcccccc
Q 021444          177 G-------PM-GTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS--VEDAIMYPSI  246 (312)
Q Consensus       177 G-------~g-g~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~--~~~sIM~p~y  246 (312)
                      -       .. .-+..++....+.+.....-- .+.      ......-+...+..||-.+|||.--+  .+.||-+-  
T Consensus       104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~------~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFND--  174 (213)
T PF11150_consen  104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIR------ADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFND--  174 (213)
T ss_pred             chhhhcccccceEEEEEEecCCCceEEEEEEE-EEe------ccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeC--
Confidence            0       00 001122221111111100000 000      01112346678888899999995433  23444331  


Q ss_pred             CCCCCcCCCCHHHHHHHHHHhCCCC
Q 021444          247 SSGSRKVELANDDIQGIQALYGNNP  271 (312)
Q Consensus       247 ~~~~~~~~L~~dDi~~Iq~LYG~~~  271 (312)
                        ......|+.-|..-|+.||-+.-
T Consensus       175 --d~~~~~LT~~D~lLLr~LYdP~L  197 (213)
T PF11150_consen  175 --DNEFALLTGFDELLLRMLYDPRL  197 (213)
T ss_pred             --CCccccccHHHHHHHHHHcCccC
Confidence              23445789999999999998543


No 49 
>PTZ00337 surface protease GP63; Provisional
Probab=91.86  E-value=0.18  Score=51.68  Aligned_cols=36  Identities=28%  Similarity=0.371  Sum_probs=24.5

Q ss_pred             CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCC
Q 021444          188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHS  235 (312)
Q Consensus       188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs  235 (312)
                      |..|.|.|+....   .         .......|++|||.||||+...
T Consensus       211 Pi~G~in~np~~i---~---------~~~~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        211 PFAAAVNFDPRQI---A---------VTNGDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             ceEEEEEECHHHc---c---------chhHHHHHHHHHHHHHHccCHH
Confidence            4457787876321   1         1234678999999999999654


No 50 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.44  E-value=0.093  Score=53.40  Aligned_cols=60  Identities=27%  Similarity=0.331  Sum_probs=29.2

Q ss_pred             cccEEEEeeecCCCCCCCcCCCCCcceeee--------CCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhh
Q 021444          157 TADIRIGFFTGDHGDGEPFDGPMGTLAHAF--------SPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGH  228 (312)
Q Consensus       157 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGH  228 (312)
                      ++|+.|.....+.+        .+++|.|.        .|..|.|.|+...   +.       ......+..|++|||.|
T Consensus       159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~-------~~~~~~~~~~~~HEi~H  220 (521)
T PF01457_consen  159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSY---IP-------SFYFQEFFRTVIHEIAH  220 (521)
T ss_dssp             S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG-------------S--HHHHHHHHHHHHH
T ss_pred             CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhH---cc-------chhhhcccceeeeeeee
Confidence            67887765543221        15677763        2334677776531   11       00123577899999999


Q ss_pred             hcCCCC
Q 021444          229 LLGLGH  234 (312)
Q Consensus       229 aLGL~H  234 (312)
                      |||+.-
T Consensus       221 aLGFs~  226 (521)
T PF01457_consen  221 ALGFSS  226 (521)
T ss_dssp             HTT-SH
T ss_pred             eeeecc
Confidence            999976


No 51 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=91.36  E-value=0.66  Score=34.71  Aligned_cols=53  Identities=17%  Similarity=0.349  Sum_probs=41.1

Q ss_pred             hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC----CCCCCCHHHHHHh
Q 021444            4 LAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLE----PTGQLDSQTMQHI   59 (312)
Q Consensus         4 ~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLp----vTG~lD~~T~~~M   59 (312)
                      ..+|+.=|++-||+...   .+..+++++.+|++.|+..-|+.    .-|++|...++.|
T Consensus        18 ~~evq~~L~~lGyy~g~---~~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L   74 (74)
T PF08823_consen   18 AREVQEALKRLGYYKGE---ADGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL   74 (74)
T ss_pred             HHHHHHHHHHcCCccCC---CCCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence            46789999999998743   24567899999999999988873    3455888877654


No 52 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=90.99  E-value=0.091  Score=42.80  Aligned_cols=19  Identities=42%  Similarity=0.730  Sum_probs=16.0

Q ss_pred             ccchhhhhhhhhhhcCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGH  234 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~H  234 (312)
                      -....|++|||||-.||+.
T Consensus       107 d~vthvliHEIgHhFGLsD  125 (136)
T COG3824         107 DQVTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             hHhhhhhhhhhhhhcCCCh
Confidence            3567899999999999964


No 53 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=90.64  E-value=0.48  Score=48.92  Aligned_cols=97  Identities=21%  Similarity=0.235  Sum_probs=50.1

Q ss_pred             CcccEEEEeeecCCCCCCCcCCCCCcceeee--------CCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhh
Q 021444          156 TTADIRIGFFTGDHGDGEPFDGPMGTLAHAF--------SPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIG  227 (312)
Q Consensus       156 ~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiG  227 (312)
                      .++|+.|.......      .  .+++|.|-        .|..|.|.|+....   .+       ........|++|||.
T Consensus       204 ~naD~vLYVta~P~------~--~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-------~~~~~~~rv~~HEi~  265 (622)
T PTZ00257        204 SNADFVLYVASVPS------E--PGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-------RYDQGTTRTVTHEVA  265 (622)
T ss_pred             cccCEEEEEEEecC------C--CCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-------ccchHHHHHHHHHHH
Confidence            36777776654322      1  24555552        35557777775321   00       011246789999999


Q ss_pred             hhcCCCCCCCCCCcccccc---CCCCCc--CCCCHHHHHHHHHHhCCC
Q 021444          228 HLLGLGHSSVEDAIMYPSI---SSGSRK--VELANDDIQGIQALYGNN  270 (312)
Q Consensus       228 HaLGL~Hs~~~~sIM~p~y---~~~~~~--~~L~~dDi~~Iq~LYG~~  270 (312)
                      ||||+.-+.-..--|.-..   .+....  ...++.-++-.+.-||.+
T Consensus       266 HALGFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~  313 (622)
T PTZ00257        266 HALGFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP  313 (622)
T ss_pred             HHhcCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence            9999965432221111111   111111  112556666677777754


No 54 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=90.20  E-value=0.35  Score=43.71  Aligned_cols=20  Identities=40%  Similarity=0.625  Sum_probs=15.2

Q ss_pred             ccchhhhhhhhhhhcCCCCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      ..-..||+||.|||  +.|...
T Consensus        87 iaAvaVAAHEvGHA--iQ~a~~  106 (222)
T PF04298_consen   87 IAAVAVAAHEVGHA--IQHAEG  106 (222)
T ss_pred             HHHHHHHHHHHhHH--Hhcccc
Confidence            34567999999999  566653


No 55 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=88.11  E-value=0.13  Score=53.46  Aligned_cols=20  Identities=40%  Similarity=0.338  Sum_probs=16.5

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      -..|++||+||.|||.-.-+
T Consensus       221 giGVfaHEfGH~LGLPDlYD  240 (645)
T PF05547_consen  221 GIGVFAHEFGHDLGLPDLYD  240 (645)
T ss_pred             ceEEEEeeccccCCCCCccc
Confidence            47799999999999966443


No 56 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=87.33  E-value=0.45  Score=42.08  Aligned_cols=16  Identities=44%  Similarity=0.681  Sum_probs=12.9

Q ss_pred             ccchhhhhhhhhhhcC
Q 021444          216 IDLESVAVHEIGHLLG  231 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLG  231 (312)
                      ..-..|++||.|||+=
T Consensus        90 ia~~aVAAHEVGHAiQ  105 (226)
T COG2738          90 IAAIAVAAHEVGHAIQ  105 (226)
T ss_pred             HHHHHHHHHHhhHHHh
Confidence            3457799999999964


No 57 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=87.01  E-value=0.25  Score=45.95  Aligned_cols=21  Identities=38%  Similarity=0.676  Sum_probs=13.0

Q ss_pred             cchhhhhhhhhhhcC-CCCCCC
Q 021444          217 DLESVAVHEIGHLLG-LGHSSV  237 (312)
Q Consensus       217 ~l~~Va~HEiGHaLG-L~Hs~~  237 (312)
                      ....|++||+||++| |+-+-.
T Consensus       215 ~~~~v~vHE~GHsf~~LaDEY~  236 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADEYY  236 (264)
T ss_dssp             THHHHHHHHHHHHTT------E
T ss_pred             cccceeeeeccccccccccccc
Confidence            567899999999654 565543


No 58 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=85.40  E-value=0.76  Score=48.53  Aligned_cols=46  Identities=28%  Similarity=0.344  Sum_probs=30.1

Q ss_pred             cchhhhhhhhhhhcCCCCCCC-------CCCccccccCCCCCcCCCCHHHHHHH
Q 021444          217 DLESVAVHEIGHLLGLGHSSV-------EDAIMYPSISSGSRKVELANDDIQGI  263 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~-------~~sIM~p~y~~~~~~~~L~~dDi~~I  263 (312)
                      ....|++|||||.||+.|-..       ..=|| +.+.+...+..++.--++-+
T Consensus       322 ~~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~  374 (716)
T KOG3607|consen  322 AFAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF  374 (716)
T ss_pred             hHHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence            467799999999999999772       22478 66553223334555444444


No 59 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=83.20  E-value=0.85  Score=47.19  Aligned_cols=48  Identities=27%  Similarity=0.457  Sum_probs=33.5

Q ss_pred             hhhhhhhhhhhcCCCCCCC----------CCCccccccCCC--CCcCCCCHHHHHHHHHH
Q 021444          219 ESVAVHEIGHLLGLGHSSV----------EDAIMYPSISSG--SRKVELANDDIQGIQAL  266 (312)
Q Consensus       219 ~~Va~HEiGHaLGL~Hs~~----------~~sIM~p~y~~~--~~~~~L~~dDi~~Iq~L  266 (312)
                      ..|++|||||..|=.|-.+          .+=|||++-..+  .++..+++--+.-|...
T Consensus       393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~V  452 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKV  452 (764)
T ss_pred             heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHH
Confidence            4599999999999999765          234899886543  23445666666666544


No 60 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=83.07  E-value=0.51  Score=37.23  Aligned_cols=17  Identities=35%  Similarity=0.626  Sum_probs=13.1

Q ss_pred             cchhhhhhhhhhhcCCC
Q 021444          217 DLESVAVHEIGHLLGLG  233 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~  233 (312)
                      -+..|++|||||.+|+.
T Consensus        72 ~I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   72 LIRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHHcCCC
Confidence            35679999999999985


No 61 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=78.21  E-value=2.5  Score=31.36  Aligned_cols=26  Identities=19%  Similarity=0.391  Sum_probs=20.7

Q ss_pred             HHHHHHHHhcC----CCCCCCCCHHHHHHh
Q 021444           34 SAVKTYQKNFN----LEPTGQLDSQTMQHI   59 (312)
Q Consensus        34 ~Ai~~fQ~~~g----LpvTG~lD~~T~~~M   59 (312)
                      .|+|.+|+..|    +.+.|++-+.|+.++
T Consensus         1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al   30 (72)
T PF09374_consen    1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL   30 (72)
T ss_dssp             HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred             CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence            48999999999    999999999999998


No 62 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=78.02  E-value=0.84  Score=40.56  Aligned_cols=15  Identities=47%  Similarity=0.651  Sum_probs=12.7

Q ss_pred             cchhhhhhhhhhhcC
Q 021444          217 DLESVAVHEIGHLLG  231 (312)
Q Consensus       217 ~l~~Va~HEiGHaLG  231 (312)
                      .+..|+.||+||++-
T Consensus        21 ~~l~t~~HE~gHal~   35 (200)
T PF13398_consen   21 RLLVTFVHELGHALA   35 (200)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456799999999975


No 63 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=77.81  E-value=0.82  Score=41.42  Aligned_cols=14  Identities=36%  Similarity=0.570  Sum_probs=12.4

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      ++..|+.||+||||
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            57889999999995


No 64 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=77.30  E-value=0.82  Score=45.65  Aligned_cols=19  Identities=42%  Similarity=0.644  Sum_probs=15.5

Q ss_pred             ccchhhhhhhhhhhcCCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGHS  235 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs  235 (312)
                      ..+-.|++|||||..| +|-
T Consensus       128 sElagViAHEigHv~q-rH~  146 (484)
T COG4783         128 SELAGVIAHEIGHVAQ-RHL  146 (484)
T ss_pred             HHHHHHHHHHHHHHhh-hhH
Confidence            4689999999999976 453


No 65 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=76.74  E-value=3.3  Score=44.74  Aligned_cols=34  Identities=21%  Similarity=0.407  Sum_probs=26.6

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCC------CccccccCC
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSVED------AIMYPSISS  248 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~~~------sIM~p~y~~  248 (312)
                      +.....+++||+||.+|+.|-...+      .||.|....
T Consensus       314 gl~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~  353 (845)
T KOG3538|consen  314 GLPSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG  353 (845)
T ss_pred             CCccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence            4446679999999999999986543      589887654


No 66 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=76.70  E-value=0.99  Score=35.22  Aligned_cols=14  Identities=36%  Similarity=0.629  Sum_probs=11.8

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      ....+++|||||.+
T Consensus        41 ~~~f~laHELgH~~   54 (122)
T PF06114_consen   41 RQRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHH
Confidence            45679999999995


No 67 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=76.43  E-value=1  Score=40.64  Aligned_cols=19  Identities=32%  Similarity=0.480  Sum_probs=14.3

Q ss_pred             cchhhhhhhhhhhcCCCCCCC
Q 021444          217 DLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      .-.-|++||+||+  |-|...
T Consensus        71 r~rFtlAHELGH~--llH~~~   89 (213)
T COG2856          71 RKRFTLAHELGHA--LLHTDL   89 (213)
T ss_pred             HHHHHHHHHHhHH--Hhcccc
Confidence            3466999999999  556554


No 68 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=74.96  E-value=1.6  Score=39.11  Aligned_cols=30  Identities=40%  Similarity=0.619  Sum_probs=26.0

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccccccC
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMYPSIS  247 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~  247 (312)
                      ...++-|++||++||+|.+.+..||+|.+.
T Consensus       153 tal~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         153 TALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             HHHHHHHHhhhccCcccccccccccCCccc
Confidence            455788999999999999999999987653


No 69 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=70.93  E-value=1.6  Score=37.90  Aligned_cols=22  Identities=32%  Similarity=0.404  Sum_probs=15.7

Q ss_pred             chhhhhhhhhhhcCCCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVED  239 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~  239 (312)
                      +..+++||+||++=-....++.
T Consensus         7 ~i~i~~HE~gH~~~a~~~G~~~   28 (192)
T PF02163_consen    7 LISIVLHELGHALAARLYGDKV   28 (192)
T ss_dssp             HHHHHHHHHHHHHHHHTTT--B
T ss_pred             cccccccccccccccccccccc
Confidence            4578999999998776665544


No 70 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=70.60  E-value=1.7  Score=37.53  Aligned_cols=20  Identities=50%  Similarity=0.592  Sum_probs=14.5

Q ss_pred             chhhhhhhhhhhcCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      +..|.+||+||++=-..-.+
T Consensus         8 ~i~i~iHE~gH~~~A~~~G~   27 (180)
T cd05709           8 LISVTVHELGHALVARRLGV   27 (180)
T ss_pred             HHHHHHHHHHHHHHHHHcCC
Confidence            45789999999985444443


No 71 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=69.83  E-value=1.8  Score=37.93  Aligned_cols=12  Identities=50%  Similarity=0.844  Sum_probs=10.4

Q ss_pred             hhhhhhhhhhhc
Q 021444          219 ESVAVHEIGHLL  230 (312)
Q Consensus       219 ~~Va~HEiGHaL  230 (312)
                      ..|++||+||.+
T Consensus        10 ~~v~iHElGH~~   21 (182)
T cd06163          10 ILIFVHELGHFL   21 (182)
T ss_pred             HHHHHHHHHHHH
Confidence            468999999986


No 72 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=68.80  E-value=1.5  Score=41.43  Aligned_cols=40  Identities=20%  Similarity=0.241  Sum_probs=23.0

Q ss_pred             eCCCCCceEecCCCCce-ecCCCccccCCccccchhhhhhhhhhh
Q 021444          186 FSPPNGRFHLDGDENWV-ITSGTTTSISMSAIDLESVAVHEIGHL  229 (312)
Q Consensus       186 ~~P~~G~ihfd~~e~wt-~~~~~~~~~~~~~~~l~~Va~HEiGHa  229 (312)
                      |-|.+..|.+|... |. +....   ......-...|++||+||.
T Consensus       141 YCp~D~tIYlD~~f-~~~L~~~~---ga~G~~a~ayVlAHEyGHH  181 (292)
T PF04228_consen  141 YCPADQTIYLDLSF-FDELQQRF---GASGDFAQAYVLAHEYGHH  181 (292)
T ss_pred             eCCCCCEEEechHH-HHHHHHHh---CCccHHHHHHHHHHHHHHH
Confidence            56888889998752 11 11000   0011223566999999996


No 73 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=68.57  E-value=3.3  Score=39.60  Aligned_cols=38  Identities=24%  Similarity=0.443  Sum_probs=27.9

Q ss_pred             eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444          186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      ++|=.++.+.|.+-              ....+..|++||++|-+|...+..
T Consensus       178 y~PFT~EA~vN~~~--------------p~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  178 YNPFTGEANVNTDL--------------PPYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             EcCCcceeecCCCC--------------CcccccHHHHHHHHHHhCCCCHHH
Confidence            35556666666532              235678899999999999999874


No 74 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=66.97  E-value=2.2  Score=38.07  Aligned_cols=17  Identities=41%  Similarity=0.466  Sum_probs=13.0

Q ss_pred             cchhhhhhhhhhhcCCC
Q 021444          217 DLESVAVHEIGHLLGLG  233 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~  233 (312)
                      -+..|++||+||++=-.
T Consensus        37 l~~~v~iHElgH~~~A~   53 (208)
T cd06161          37 LFLSVLLHELGHALVAR   53 (208)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            35679999999986433


No 75 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=66.47  E-value=2.4  Score=37.39  Aligned_cols=20  Identities=35%  Similarity=0.521  Sum_probs=16.0

Q ss_pred             ccchhhhhhhhhhhcCCCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      ..+..|++||+||.. .+|..
T Consensus        87 ~el~aVlaHElgH~~-~~h~~  106 (226)
T PF01435_consen   87 DELAAVLAHELGHIK-HRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHHH-TTHCC
T ss_pred             HHHHHHHHHHHHHHH-cCCcc
Confidence            368899999999997 45554


No 76 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=62.58  E-value=2.9  Score=40.79  Aligned_cols=15  Identities=33%  Similarity=0.523  Sum_probs=12.2

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      .+-..+++||+|||+
T Consensus       220 ~~~v~tl~HE~GHa~  234 (427)
T cd06459         220 LDDVFTLAHELGHAF  234 (427)
T ss_pred             hhhHHHHHHHhhHHH
Confidence            445789999999985


No 77 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=62.48  E-value=3  Score=37.83  Aligned_cols=17  Identities=41%  Similarity=0.593  Sum_probs=12.8

Q ss_pred             chhhhhhhhhhhcCCCC
Q 021444          218 LESVAVHEIGHLLGLGH  234 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~H  234 (312)
                      +..|++||+||++=-.+
T Consensus        53 ~~~v~iHElgH~~~A~~   69 (227)
T cd06164          53 FASVLLHELGHSLVARR   69 (227)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            56799999999864433


No 78 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=62.43  E-value=3  Score=40.02  Aligned_cols=16  Identities=31%  Similarity=0.416  Sum_probs=13.1

Q ss_pred             cccchhhhhhhhhhhc
Q 021444          215 AIDLESVAVHEIGHLL  230 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaL  230 (312)
                      ..+-..++.||+|||+
T Consensus       151 ~~~~v~tl~HE~GHa~  166 (365)
T cd06258         151 GHDDINTLFHEFGHAV  166 (365)
T ss_pred             CHHHHHHHHHHHhHHH
Confidence            3455679999999998


No 79 
>PRK03982 heat shock protein HtpX; Provisional
Probab=62.13  E-value=3.2  Score=38.92  Aligned_cols=15  Identities=40%  Similarity=0.614  Sum_probs=13.0

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      ..+..|++|||||.-
T Consensus       123 ~El~AVlAHElgHi~  137 (288)
T PRK03982        123 DELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            468999999999974


No 80 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=61.58  E-value=3.3  Score=38.39  Aligned_cols=15  Identities=47%  Similarity=0.791  Sum_probs=13.0

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      ..+..|+.||+||..
T Consensus       155 dEl~aVlaHElgHi~  169 (302)
T COG0501         155 DELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            368999999999984


No 81 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=60.99  E-value=3.3  Score=41.16  Aligned_cols=25  Identities=12%  Similarity=0.199  Sum_probs=17.8

Q ss_pred             HHHHHHHHHHHHhhcccceEEEccc
Q 021444          130 VKNVFATAFDRWSEATSLTFTRTDV  154 (312)
Q Consensus       130 ~~~~i~~A~~~Ws~~s~l~F~ev~~  154 (312)
                      +..+++.-+...+.+-+|+|+++..
T Consensus       136 ~~~~~~g~~~l~~~lfgi~~~~~~~  160 (458)
T PF01432_consen  136 LEEVLEGLFELAERLFGIRFEEVPD  160 (458)
T ss_dssp             HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred             HHHHHHHHHHHHHHHhcEEEEeccc
Confidence            3455566666778888899998864


No 82 
>PRK03001 M48 family peptidase; Provisional
Probab=59.90  E-value=3.7  Score=38.39  Aligned_cols=15  Identities=27%  Similarity=0.563  Sum_probs=13.0

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      ..+..|++||+||.-
T Consensus       122 ~El~aVlAHElgHi~  136 (283)
T PRK03001        122 REIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            468999999999974


No 83 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=59.60  E-value=13  Score=31.58  Aligned_cols=37  Identities=27%  Similarity=0.401  Sum_probs=31.9

Q ss_pred             CCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhc
Q 021444           24 FTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHIL   60 (312)
Q Consensus        24 ~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~   60 (312)
                      .+..+....+.|++.||+.++|+++|+.-..|...+.
T Consensus       148 ~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~  184 (185)
T COG3409         148 VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR  184 (185)
T ss_pred             CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence            3356778899999999999999999999999987663


No 84 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=59.47  E-value=3.6  Score=38.54  Aligned_cols=14  Identities=50%  Similarity=0.627  Sum_probs=11.7

Q ss_pred             chhhhhhhhhhhcC
Q 021444          218 LESVAVHEIGHLLG  231 (312)
Q Consensus       218 l~~Va~HEiGHaLG  231 (312)
                      +..|++||+||++=
T Consensus       135 ~isvvvHElgHal~  148 (277)
T cd06162         135 LISGVVHEMGHGVA  148 (277)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45799999999963


No 85 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=59.24  E-value=3.7  Score=38.19  Aligned_cols=19  Identities=26%  Similarity=0.289  Sum_probs=13.8

Q ss_pred             chhhhhhhhhhhcCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~  236 (312)
                      ...|++||+||++=-.+..
T Consensus       118 ~isv~iHElgHa~~Ar~~G  136 (263)
T cd06159         118 VVGVVVHELSHGILARVEG  136 (263)
T ss_pred             HHHHHHHHHHHHHHHHHcC
Confidence            4569999999997544433


No 86 
>PRK03072 heat shock protein HtpX; Provisional
Probab=59.19  E-value=3.8  Score=38.48  Aligned_cols=15  Identities=33%  Similarity=0.547  Sum_probs=13.0

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      ..+..|++||+||.-
T Consensus       125 ~El~aVlAHElgHi~  139 (288)
T PRK03072        125 RELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            468999999999974


No 87 
>PRK01345 heat shock protein HtpX; Provisional
Probab=59.01  E-value=3.9  Score=39.05  Aligned_cols=18  Identities=22%  Similarity=0.372  Sum_probs=14.6

Q ss_pred             ccchhhhhhhhhhhcCCCC
Q 021444          216 IDLESVAVHEIGHLLGLGH  234 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~H  234 (312)
                      ..+..|++|||||.- .+|
T Consensus       122 dEL~aVlAHElgHi~-~~d  139 (317)
T PRK01345        122 EEVAGVMAHELAHVK-NRD  139 (317)
T ss_pred             HHHHHHHHHHHHHHH-cCC
Confidence            368999999999986 344


No 88 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=58.81  E-value=3.7  Score=42.02  Aligned_cols=15  Identities=33%  Similarity=0.368  Sum_probs=12.6

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      .+-..|++||+|||+
T Consensus       335 ~~dv~TL~HElGHa~  349 (549)
T TIGR02289       335 SGDIDVLTHEAGHAF  349 (549)
T ss_pred             hhHHHHHHHHhhHHH
Confidence            455779999999997


No 89 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=58.67  E-value=8.6  Score=28.97  Aligned_cols=30  Identities=10%  Similarity=0.187  Sum_probs=19.4

Q ss_pred             CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhc
Q 021444          190 NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLL  230 (312)
Q Consensus       190 ~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaL  230 (312)
                      ..+|+|..... .          .....-..+++||+.|++
T Consensus        44 G~~I~f~~g~~-~----------~~s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen   44 GNDIYFAPGKY-N----------PDSPEGRALLAHELAHVV   73 (79)
T ss_pred             CCEEEEcCCCc-C----------CCCCCcchhHhHHHHHHH
Confidence            46788865431 0          112245789999999985


No 90 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=58.32  E-value=3.9  Score=41.07  Aligned_cols=14  Identities=36%  Similarity=0.615  Sum_probs=11.8

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      +-..|+.||+||+|
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44668999999998


No 91 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=56.48  E-value=4.3  Score=41.89  Aligned_cols=15  Identities=33%  Similarity=0.364  Sum_probs=12.4

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      .+-..|++||+|||+
T Consensus       373 ~~~v~TL~HE~GHa~  387 (587)
T TIGR02290       373 RRDVSTLAHELGHAY  387 (587)
T ss_pred             chhHHHHHHHhhHHH
Confidence            345679999999998


No 92 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=55.61  E-value=5.1  Score=41.28  Aligned_cols=15  Identities=27%  Similarity=0.507  Sum_probs=12.7

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      .+-..|++||+||++
T Consensus       376 ~~dv~TLaHElGHa~  390 (591)
T TIGR00181       376 LNSVFTLAHELGHSM  390 (591)
T ss_pred             cchHHHHHHHhhhHH
Confidence            455789999999997


No 93 
>PRK04897 heat shock protein HtpX; Provisional
Probab=55.47  E-value=4.6  Score=38.12  Aligned_cols=15  Identities=47%  Similarity=0.620  Sum_probs=12.9

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      ..+..|++||+||.-
T Consensus       135 ~El~aVlAHElgHi~  149 (298)
T PRK04897        135 EELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            468999999999974


No 94 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=55.27  E-value=4.9  Score=34.18  Aligned_cols=12  Identities=33%  Similarity=0.445  Sum_probs=10.3

Q ss_pred             hhhhhhhhhhhc
Q 021444          219 ESVAVHEIGHLL  230 (312)
Q Consensus       219 ~~Va~HEiGHaL  230 (312)
                      .-|++|||+|++
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            559999999974


No 95 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=55.01  E-value=4.8  Score=35.31  Aligned_cols=23  Identities=30%  Similarity=0.592  Sum_probs=16.8

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDA  240 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~s  240 (312)
                      +..+.+||+||++--..-.++..
T Consensus        41 ~~~l~iHElgH~~~A~~~G~~~~   63 (183)
T cd06160          41 LAILGIHEMGHYLAARRHGVKAS   63 (183)
T ss_pred             HHHHHHHHHHHHHHHHHCCCCcc
Confidence            45688999999987666555443


No 96 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=54.82  E-value=5  Score=31.39  Aligned_cols=14  Identities=36%  Similarity=0.482  Sum_probs=12.2

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      .+..|++||+.|++
T Consensus        24 ~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   24 WLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46799999999996


No 97 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=54.32  E-value=5.1  Score=40.12  Aligned_cols=33  Identities=12%  Similarity=0.130  Sum_probs=22.5

Q ss_pred             HHHHHHHHHHHHhhcccceEEEccc----CCcccEEE
Q 021444          130 VKNVFATAFDRWSEATSLTFTRTDV----YTTADIRI  162 (312)
Q Consensus       130 ~~~~i~~A~~~Ws~~s~l~F~ev~~----~~~aDI~I  162 (312)
                      +..+++-.|..-+..-+|+|++++.    ...+|++.
T Consensus       132 l~~vl~gl~~~~~~lfgi~~~~~~~~~~~~Wh~dV~~  168 (458)
T cd06457         132 LGTVMEGLSRLFSRLYGIRLVPVPLAPGEVWHPDVRK  168 (458)
T ss_pred             HHHHHHHHHHHHHHHhCeEEEecCCCCCCCcCccceE
Confidence            3455666677777888999999853    23567654


No 98 
>PRK05457 heat shock protein HtpX; Provisional
Probab=54.26  E-value=5  Score=37.69  Aligned_cols=14  Identities=43%  Similarity=0.726  Sum_probs=12.5

Q ss_pred             ccchhhhhhhhhhh
Q 021444          216 IDLESVAVHEIGHL  229 (312)
Q Consensus       216 ~~l~~Va~HEiGHa  229 (312)
                      ..+..|++||+||.
T Consensus       132 ~El~aVlAHElgHi  145 (284)
T PRK05457        132 DEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46899999999996


No 99 
>PRK02870 heat shock protein HtpX; Provisional
Probab=53.86  E-value=5.3  Score=38.51  Aligned_cols=14  Identities=36%  Similarity=0.717  Sum_probs=12.7

Q ss_pred             ccchhhhhhhhhhh
Q 021444          216 IDLESVAVHEIGHL  229 (312)
Q Consensus       216 ~~l~~Va~HEiGHa  229 (312)
                      ..+..|++||+||.
T Consensus       171 dEL~aVlAHELgHi  184 (336)
T PRK02870        171 DELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46899999999998


No 100
>PRK02391 heat shock protein HtpX; Provisional
Probab=53.70  E-value=5.1  Score=37.84  Aligned_cols=14  Identities=43%  Similarity=0.726  Sum_probs=12.4

Q ss_pred             ccchhhhhhhhhhh
Q 021444          216 IDLESVAVHEIGHL  229 (312)
Q Consensus       216 ~~l~~Va~HEiGHa  229 (312)
                      ..+..|++||+||.
T Consensus       131 ~El~aVlaHElgHi  144 (296)
T PRK02391        131 DELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46899999999995


No 101
>PRK01265 heat shock protein HtpX; Provisional
Probab=53.05  E-value=5.3  Score=38.30  Aligned_cols=14  Identities=50%  Similarity=0.845  Sum_probs=12.4

Q ss_pred             ccchhhhhhhhhhh
Q 021444          216 IDLESVAVHEIGHL  229 (312)
Q Consensus       216 ~~l~~Va~HEiGHa  229 (312)
                      ..+..|++||+||.
T Consensus       138 ~El~aVlAHElgHi  151 (324)
T PRK01265        138 DEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46899999999995


No 102
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=52.75  E-value=5.5  Score=39.07  Aligned_cols=18  Identities=39%  Similarity=0.650  Sum_probs=13.4

Q ss_pred             chhhhhhhhhhhcCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHS  235 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs  235 (312)
                      +..+.+||+||||--.-+
T Consensus       131 vi~~vvHElGHalAA~se  148 (484)
T KOG2921|consen  131 VITVVVHELGHALAAASE  148 (484)
T ss_pred             HHHHHHHHhhHHHHHHhc
Confidence            456889999999854443


No 103
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=52.48  E-value=5.7  Score=39.41  Aligned_cols=32  Identities=19%  Similarity=0.286  Sum_probs=19.4

Q ss_pred             HHHHHHHHHHHhhcccceEEEccc--CCcccEEE
Q 021444          131 KNVFATAFDRWSEATSLTFTRTDV--YTTADIRI  162 (312)
Q Consensus       131 ~~~i~~A~~~Ws~~s~l~F~ev~~--~~~aDI~I  162 (312)
                      ..+++.-|..-+..-+++|+++..  ....|+++
T Consensus       107 ~~v~~gl~~~~~~lfgi~~~~~~~~~~Wh~dV~~  140 (422)
T cd06456         107 EKVLDGLFELAERLYGITFKERTDLPVWHPDVRV  140 (422)
T ss_pred             HHHHHHHHHHHHHHcCeeEEECCCCCCCCCCceE
Confidence            344555566667777888888753  23455543


No 104
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=51.97  E-value=39  Score=25.18  Aligned_cols=29  Identities=21%  Similarity=0.256  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHhcCCCCCCCCCHHHHHHhc
Q 021444           31 MLESAVKTYQKNFNLEPTGQLDSQTMQHIL   60 (312)
Q Consensus        31 ~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~   60 (312)
                      ++.++|++. -++.=+++|..|++|.++|.
T Consensus        20 evq~~L~~l-Gyy~g~~~g~~d~a~~~Al~   48 (74)
T PF08823_consen   20 EVQEALKRL-GYYKGEADGVWDEATEDALR   48 (74)
T ss_pred             HHHHHHHHc-CCccCCCCCcccHHHHHHHH
Confidence            444444432 22334999999999999984


No 105
>PLN02791 Nudix hydrolase homolog
Probab=51.79  E-value=5.8  Score=42.36  Aligned_cols=21  Identities=19%  Similarity=0.183  Sum_probs=17.3

Q ss_pred             cchhhhhhhhhhhcCCCCCCC
Q 021444          217 DLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      .+..|++||+||.+|=.+.-.
T Consensus       584 f~~~v~lHElgHGsG~~~~~~  604 (770)
T PLN02791        584 FFTHTICHECCHGIGPHTITL  604 (770)
T ss_pred             HHHHHHHHHhhccccccceec
Confidence            457899999999999877544


No 106
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=48.44  E-value=7.1  Score=34.09  Aligned_cols=13  Identities=31%  Similarity=0.608  Sum_probs=11.3

Q ss_pred             chhhhhhhhhhhc
Q 021444          218 LESVAVHEIGHLL  230 (312)
Q Consensus       218 l~~Va~HEiGHaL  230 (312)
                      +..+.+||+||++
T Consensus         9 ~~~i~~HE~aHa~   21 (181)
T cd06158           9 LLAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHHH
Confidence            4578999999997


No 107
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=47.43  E-value=7.4  Score=38.82  Aligned_cols=12  Identities=42%  Similarity=0.767  Sum_probs=9.6

Q ss_pred             hhhhhhhhhhhc
Q 021444          219 ESVAVHEIGHLL  230 (312)
Q Consensus       219 ~~Va~HEiGHaL  230 (312)
                      ..|++||+||-|
T Consensus        16 ~li~vHElGHfl   27 (449)
T PRK10779         16 VLITVHEFGHFW   27 (449)
T ss_pred             HHHHHHHHHHHH
Confidence            358899999963


No 108
>PHA00024 IX minor coat protein
Probab=47.33  E-value=8  Score=24.28  Aligned_cols=15  Identities=47%  Similarity=0.693  Sum_probs=11.9

Q ss_pred             cchhcccchhHHHHH
Q 021444          291 GAYFLGSCLGLSFFV  305 (312)
Q Consensus       291 ~~~~~~~~~~~~~~~  305 (312)
                      |||+||-||+.+..+
T Consensus        10 gA~ilG~~l~~~Il~   24 (33)
T PHA00024         10 GAYILGWALFYGILV   24 (33)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            789999999877554


No 109
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=47.03  E-value=7.6  Score=38.46  Aligned_cols=12  Identities=50%  Similarity=0.866  Sum_probs=10.1

Q ss_pred             hhhhhhhhhhhc
Q 021444          219 ESVAVHEIGHLL  230 (312)
Q Consensus       219 ~~Va~HEiGHaL  230 (312)
                      ..|++||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            469999999974


No 110
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=46.93  E-value=7.6  Score=38.29  Aligned_cols=13  Identities=38%  Similarity=0.521  Sum_probs=10.6

Q ss_pred             chhhhhhhhhhhc
Q 021444          218 LESVAVHEIGHLL  230 (312)
Q Consensus       218 l~~Va~HEiGHaL  230 (312)
                      -..+++||+||||
T Consensus       159 ~l~t~iHE~GHal  171 (396)
T cd06460         159 ALFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHHhhHHH
Confidence            3568899999986


No 111
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=46.65  E-value=7.6  Score=40.39  Aligned_cols=14  Identities=29%  Similarity=0.349  Sum_probs=11.7

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      +-..|++||+||++
T Consensus       379 ~dV~TLaHElGHs~  392 (598)
T COG1164         379 RDVFTLAHELGHSV  392 (598)
T ss_pred             hHHHHHHHHccHHH
Confidence            44679999999996


No 112
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=46.19  E-value=27  Score=30.99  Aligned_cols=47  Identities=17%  Similarity=0.383  Sum_probs=33.6

Q ss_pred             HHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHH
Q 021444            7 LKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQ   54 (312)
Q Consensus         7 ~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~   54 (312)
                      +.+||.++||+.+.....+...=-++++.++..-+..|+++ |+||-=
T Consensus       153 Ilr~l~r~g~i~e~~kt~t~K~YLe~E~ilr~iae~~g~s~-gelDL~  199 (210)
T COG1059         153 ILRWLVRYGLIDENPKTLTRKLYLEIEEILRSIAEEVGISL-GELDLY  199 (210)
T ss_pred             HHHHHHHhcccccCcccccHHHHHHHHHHHHHHHHHhCCCc-chhHHH
Confidence            56899999999764222211112378899999999999996 888853


No 113
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.48  E-value=8.9  Score=34.26  Aligned_cols=15  Identities=33%  Similarity=0.545  Sum_probs=12.9

Q ss_pred             ccchhhhhhhhhhhc
Q 021444          216 IDLESVAVHEIGHLL  230 (312)
Q Consensus       216 ~~l~~Va~HEiGHaL  230 (312)
                      .....+|.||-||||
T Consensus        26 ~~~~~~A~HEAGhAv   40 (213)
T PF01434_consen   26 EEKRRIAYHEAGHAV   40 (213)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457889999999996


No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=41.47  E-value=12  Score=34.73  Aligned_cols=23  Identities=30%  Similarity=0.414  Sum_probs=17.4

Q ss_pred             cccchhhhhhhhhhhcCCCCCCC
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      ..+...++.||+||.-|=.+--+
T Consensus       200 a~~yyaTl~HElghwtgh~~rl~  222 (316)
T COG4227         200 AINYYATLLHELGHWTGHEARLD  222 (316)
T ss_pred             hHhHHHHHHHHhccccCchhhhh
Confidence            45788999999999977554433


No 115
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=41.41  E-value=14  Score=32.23  Aligned_cols=28  Identities=32%  Similarity=0.289  Sum_probs=22.5

Q ss_pred             cccchhhhhhhhhhhcCCCCCCCCCCcc
Q 021444          215 AIDLESVAVHEIGHLLGLGHSSVEDAIM  242 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs~~~~sIM  242 (312)
                      ..-+..|++||+=|..-..||..--+.|
T Consensus       161 ~~~idYVvvHEL~Hl~~~nHs~~Fw~~v  188 (205)
T PF01863_consen  161 PEVIDYVVVHELCHLRHPNHSKRFWALV  188 (205)
T ss_pred             ccHHHHHHHHHHHHhccCCCCHHHHHHH
Confidence            3457889999999999999998655544


No 116
>PRK10911 oligopeptidase A; Provisional
Probab=40.65  E-value=11  Score=39.88  Aligned_cols=14  Identities=43%  Similarity=0.475  Sum_probs=11.6

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      +-..|+.||+||+|
T Consensus       462 ~~v~tlfHEfGHal  475 (680)
T PRK10911        462 DEVITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHHHhHHH
Confidence            34568999999998


No 117
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=37.70  E-value=82  Score=19.97  Aligned_cols=29  Identities=21%  Similarity=0.208  Sum_probs=18.4

Q ss_pred             HHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHH
Q 021444            5 AKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVK   37 (312)
Q Consensus         5 ~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~   37 (312)
                      .++++||+++|.-.+...    ...++|.+.++
T Consensus         7 ~~L~~wL~~~gi~~~~~~----~~rd~Ll~~~k   35 (38)
T PF10281_consen    7 SDLKSWLKSHGIPVPKSA----KTRDELLKLAK   35 (38)
T ss_pred             HHHHHHHHHcCCCCCCCC----CCHHHHHHHHH
Confidence            579999999999654321    13455554444


No 118
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=37.48  E-value=12  Score=38.34  Aligned_cols=16  Identities=38%  Similarity=0.511  Sum_probs=11.7

Q ss_pred             cchhhhhhhh-hhhcCC
Q 021444          217 DLESVAVHEI-GHLLGL  232 (312)
Q Consensus       217 ~l~~Va~HEi-GHaLGL  232 (312)
                      ....|++||+ ||..|=
T Consensus       292 f~~~v~lHEllGHGsGk  308 (549)
T PF03571_consen  292 FFVQVGLHELLGHGSGK  308 (549)
T ss_dssp             HHHHHHHHHHCCCCS--
T ss_pred             HhhhhhHHhhccCcCcc
Confidence            3567999998 999885


No 119
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=37.14  E-value=13  Score=36.84  Aligned_cols=26  Identities=38%  Similarity=0.500  Sum_probs=17.4

Q ss_pred             ccchhhhhhhhhhhcCCCCCCCCCCcc
Q 021444          216 IDLESVAVHEIGHLLGLGHSSVEDAIM  242 (312)
Q Consensus       216 ~~l~~Va~HEiGHaLGL~Hs~~~~sIM  242 (312)
                      ..+..|+.||+||. =++|.--.-.+|
T Consensus       278 eel~AVl~HELGHW-~~~H~~K~~ii~  303 (428)
T KOG2719|consen  278 EELVAVLAHELGHW-KLNHVLKNIIIM  303 (428)
T ss_pred             HHHHHHHHHHhhHH-HHhhHHHHHHHH
Confidence            46899999999998 355543333333


No 120
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=35.76  E-value=52  Score=35.44  Aligned_cols=22  Identities=18%  Similarity=0.394  Sum_probs=15.7

Q ss_pred             HHHHHHHHHHhhcccceEEEcc
Q 021444          132 NVFATAFDRWSEATSLTFTRTD  153 (312)
Q Consensus       132 ~~i~~A~~~Ws~~s~l~F~ev~  153 (312)
                      +--...|+...+.+.|.|....
T Consensus       170 ~~Y~~i~~~Yd~l~Gl~~~~~~  191 (775)
T PF03272_consen  170 DFYNDIFKFYDDLTGLSDDPSD  191 (775)
T ss_pred             HHHHHHHHHHHhhhCCCCCCCc
Confidence            3446678888888888887543


No 121
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=33.22  E-value=19  Score=25.59  Aligned_cols=21  Identities=38%  Similarity=0.597  Sum_probs=17.6

Q ss_pred             cchhhhhhhhhhhcCCCCCCC
Q 021444          217 DLESVAVHEIGHLLGLGHSSV  237 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~~  237 (312)
                      ...--++||+.+.+||.|.+.
T Consensus        28 ~~eR~~vH~lA~~~gL~s~S~   48 (60)
T cd02641          28 SHDRLLVHELAEELGLRHEST   48 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEee
Confidence            456689999999999999763


No 122
>PRK03094 hypothetical protein; Provisional
Probab=33.20  E-value=31  Score=26.25  Aligned_cols=16  Identities=19%  Similarity=0.382  Sum_probs=14.5

Q ss_pred             CChHHHHHHHHhcCCC
Q 021444            2 DGLAKLKKYFHYFGYI   17 (312)
Q Consensus         2 ~~~~~~~~yL~~fGYl   17 (312)
                      +|+..+++||++-||-
T Consensus         8 ~~Ls~i~~~L~~~GYe   23 (80)
T PRK03094          8 QSLTDVQQALKQKGYE   23 (80)
T ss_pred             cCcHHHHHHHHHCCCE
Confidence            5899999999999995


No 123
>COG5570 Uncharacterized small protein [Function unknown]
Probab=32.45  E-value=43  Score=23.32  Aligned_cols=31  Identities=19%  Similarity=0.392  Sum_probs=24.7

Q ss_pred             HHHHHHHHHHhcCCCCCCCCCHHHHHHhcCCCCCCC
Q 021444           32 LESAVKTYQKNFNLEPTGQLDSQTMQHILRPRCGNP   67 (312)
Q Consensus        32 ~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~pRCG~p   67 (312)
                      +++-|...|+-.     |.|+++..+.|+.|-|---
T Consensus         3 ieshl~eL~kkH-----g~le~ei~ea~n~Ps~dd~   33 (57)
T COG5570           3 IESHLAELEKKH-----GNLEREIQEAMNSPSSDDL   33 (57)
T ss_pred             HHHHHHHHHHhh-----chHHHHHHHHhcCCCcchH
Confidence            455677788765     7999999999999988543


No 124
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=32.27  E-value=14  Score=33.06  Aligned_cols=46  Identities=26%  Similarity=0.368  Sum_probs=27.7

Q ss_pred             chhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHHHHHHhCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQGIQALYGNNPN  272 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~~~  272 (312)
                      +.-++.||++| +=|+|....    .|...   ..-++ ..|..++..|+....+
T Consensus       101 ~~fil~HE~~H-v~~~h~~~~----~~~~~---~~eE~-~AD~~A~~~il~~~~~  146 (206)
T PF10463_consen  101 IAFILLHELAH-VVLGHEGDS----SPSQS---IQEEK-EADSYATEMILSDVGN  146 (206)
T ss_pred             HHHHHHHHHHH-HHHcCcccc----ccchh---HHHHH-hhhHHHHHHHHHHHHh
Confidence            34589999999 569998865    11111   11122 4677777777665433


No 125
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=31.11  E-value=21  Score=32.96  Aligned_cols=21  Identities=29%  Similarity=0.439  Sum_probs=15.0

Q ss_pred             cccchhhhhhhhhhhcCCCCC
Q 021444          215 AIDLESVAVHEIGHLLGLGHS  235 (312)
Q Consensus       215 ~~~l~~Va~HEiGHaLGL~Hs  235 (312)
                      ..+..+|++||+-|.|=+..-
T Consensus       151 ~~dg~NVviHEfAH~LD~~~g  171 (253)
T PF06167_consen  151 PNDGHNVVIHEFAHKLDMEDG  171 (253)
T ss_dssp             SSSS--HHHHHHHHHHHCTTS
T ss_pred             CCCCcchHHHHHHHHHHhhcC
Confidence            456789999999999966543


No 126
>PF03698 UPF0180:  Uncharacterised protein family (UPF0180);  InterPro: IPR005370 The members of this family are small uncharacterised proteins.
Probab=29.79  E-value=38  Score=25.75  Aligned_cols=16  Identities=31%  Similarity=0.573  Sum_probs=14.5

Q ss_pred             CChHHHHHHHHhcCCC
Q 021444            2 DGLAKLKKYFHYFGYI   17 (312)
Q Consensus         2 ~~~~~~~~yL~~fGYl   17 (312)
                      +|+..+++||++-||=
T Consensus         8 ~~Ls~v~~~L~~~Gye   23 (80)
T PF03698_consen    8 EGLSNVKEALREKGYE   23 (80)
T ss_pred             CCchHHHHHHHHCCCE
Confidence            5899999999999994


No 127
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=29.47  E-value=21  Score=37.72  Aligned_cols=32  Identities=16%  Similarity=0.196  Sum_probs=21.6

Q ss_pred             HHHH-HHHHHHHhhcccceEEEccc--CCcccEEE
Q 021444          131 KNVF-ATAFDRWSEATSLTFTRTDV--YTTADIRI  162 (312)
Q Consensus       131 ~~~i-~~A~~~Ws~~s~l~F~ev~~--~~~aDI~I  162 (312)
                      ..++ +-.|..-+..=+|+|+++++  ....|+++
T Consensus       363 ~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~  397 (681)
T PRK10280        363 NTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV  397 (681)
T ss_pred             HHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence            3455 35667777888999999753  34667764


No 128
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=28.18  E-value=23  Score=34.11  Aligned_cols=19  Identities=37%  Similarity=0.548  Sum_probs=15.3

Q ss_pred             cchhhhhhhhhhhcCCCCCC
Q 021444          217 DLESVAVHEIGHLLGLGHSS  236 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~Hs~  236 (312)
                      .+..|+.||++|++ ++|+.
T Consensus       274 glAtvLgHE~aHaV-arH~A  292 (424)
T KOG2661|consen  274 GLATVLGHEIAHAV-ARHAA  292 (424)
T ss_pred             HHHHHHHHHHHHHH-HHHHH
Confidence            47789999999997 56655


No 129
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=28.12  E-value=19  Score=37.02  Aligned_cols=13  Identities=46%  Similarity=0.672  Sum_probs=11.2

Q ss_pred             chhhhhhhhhhhc
Q 021444          218 LESVAVHEIGHLL  230 (312)
Q Consensus       218 l~~Va~HEiGHaL  230 (312)
                      +-.|+.|||||+.
T Consensus       487 IGaVIgHEI~HgF  499 (654)
T COG3590         487 IGAVIGHEIGHGF  499 (654)
T ss_pred             ccceehhhhcccc
Confidence            5679999999974


No 130
>PF03801 Ndc80_HEC:  HEC/Ndc80p family;  InterPro: IPR005550 Members of this family are components of the mitotic spindle. It has been shown that Ndc80 from yeast is part of a complex called the Ndc80p complex []. This complex is thought to bind to the microtubules of the spindle.; PDB: 3IZ0_E 2VE7_B 2IGP_A.
Probab=28.06  E-value=1.2e+02  Score=25.70  Aligned_cols=63  Identities=3%  Similarity=0.071  Sum_probs=37.4

Q ss_pred             hHHHHHHHHhcCCC-CCCCC-CCCCcchHHHHHHHHHHHHhcC--CCCCCCCCHHHHHHhcCCCCCC
Q 021444            4 LAKLKKYFHYFGYI-PDLPS-NFTDDFDDMLESAVKTYQKNFN--LEPTGQLDSQTMQHILRPRCGN   66 (312)
Q Consensus         4 ~~~~~~yL~~fGYl-~~~~~-~~~~~~~~~~~~Ai~~fQ~~~g--LpvTG~lD~~T~~~M~~pRCG~   66 (312)
                      +.+|.+||...||. .+-.. .......+++.+-++-+=++..  .+++.+++++....|+.=+|=.
T Consensus        42 ~~~I~~fL~~~~~~~~~is~k~l~~Pt~kdf~~I~~fL~~~idp~~~~~~k~eeev~~~lK~L~YP~  108 (157)
T PF03801_consen   42 IRKIYEFLSEHGFESHPISPKTLKSPTQKDFVEIFNFLFRQIDPNFKFGKKFEEEVPFLLKALGYPF  108 (157)
T ss_dssp             HHHHHHHHHHTT--SS---TTTTSS--HHHHHHHHHHHHHTTSTT---SSTHHHHHHHHHHHTT-SS
T ss_pred             HHHHHHHHHHcCCCCccccccccCCCCHHHHHHHHHHHHHHhCCCCCcCcCHHHHHHHHHHHhCCCc
Confidence            56789999999992 11111 1223345677766666655555  6787799999999998777655


No 131
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=25.52  E-value=43  Score=31.17  Aligned_cols=12  Identities=50%  Similarity=0.545  Sum_probs=9.9

Q ss_pred             chhhhhhhhhhh
Q 021444          218 LESVAVHEIGHL  229 (312)
Q Consensus       218 l~~Va~HEiGHa  229 (312)
                      .-.|++||-+|-
T Consensus       149 ~grv~VhEWAhl  160 (262)
T PF08434_consen  149 RGRVFVHEWAHL  160 (262)
T ss_pred             cceeeeehhhhh
Confidence            356999999996


No 132
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=25.37  E-value=28  Score=32.58  Aligned_cols=82  Identities=22%  Similarity=0.178  Sum_probs=39.6

Q ss_pred             eeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcC--CCCCCCCC---CccccccC-CCCCcCCCCHH
Q 021444          185 AFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLG--LGHSSVED---AIMYPSIS-SGSRKVELAND  258 (312)
Q Consensus       185 A~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLG--L~Hs~~~~---sIM~p~y~-~~~~~~~L~~d  258 (312)
                      -|-|.+.++.||....=......   ......--..|++||.||.+=  |+|...-+   .-|.-.-. ...-+.+|+.|
T Consensus       137 FYCP~D~kvYlDlsFf~~m~~~f---ga~GdfAqaYViAHEVGHHVQnllGi~~kv~~~~q~~s~~~an~~sVr~ELQAD  213 (295)
T COG2321         137 FYCPADTKVYLDLSFFDEMKTKF---GASGDFAQAYVIAHEVGHHVQNLLGILPKVNQNQQGMSQAEANQLSVRRELQAD  213 (295)
T ss_pred             eecCCCceEEEehhHHHHHHHHh---cCCccHHHHHHHHhhhhHHHHHHhhhhHHHHHHHhhhhhhhccCcceeeeehhh
Confidence            35688888888874211010000   011123346699999999631  12211000   00211111 11234677777


Q ss_pred             HHHHHHHHhCC
Q 021444          259 DIQGIQALYGN  269 (312)
Q Consensus       259 Di~~Iq~LYG~  269 (312)
                      =-++|+.-|-.
T Consensus       214 CfAGVW~~~~~  224 (295)
T COG2321         214 CFAGVWANYVQ  224 (295)
T ss_pred             HHHHHHHHHhc
Confidence            77777777754


No 133
>PRK14749 hypothetical protein; Provisional
Probab=24.63  E-value=47  Score=20.37  Aligned_cols=18  Identities=28%  Similarity=0.362  Sum_probs=13.3

Q ss_pred             chhcccchhHHHHHHHHH
Q 021444          292 AYFLGSCLGLSFFVVAAV  309 (312)
Q Consensus       292 ~~~~~~~~~~~~~~~~~~  309 (312)
                      +.+||-+|..+|-|+.|+
T Consensus         5 aWiLG~~lAc~f~ilna~   22 (30)
T PRK14749          5 LWFVGILLMCSLSTLVLV   22 (30)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            356788888888777765


No 134
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=24.05  E-value=33  Score=30.81  Aligned_cols=14  Identities=29%  Similarity=0.266  Sum_probs=11.4

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      -.-.|++||++||-
T Consensus        92 l~gsiLAHE~mHa~  105 (212)
T PF12315_consen   92 LTGSILAHELMHAW  105 (212)
T ss_pred             HHhhHHHHHHHHHH
Confidence            34679999999993


No 135
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=23.62  E-value=31  Score=34.73  Aligned_cols=44  Identities=32%  Similarity=0.516  Sum_probs=24.6

Q ss_pred             CCCHHHHHHH-----HHHhCCCCCCCCCCCCCCCCCCCCCCCcc--hhcccchhH
Q 021444          254 ELANDDIQGI-----QALYGNNPNFNGSSVPTDQQRDTSGAHGA--YFLGSCLGL  301 (312)
Q Consensus       254 ~L~~dDi~~I-----q~LYG~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~  301 (312)
                      .|..+|+-.+     +..-|.+|+...+..    =||--=.||+  ||=.-.||.
T Consensus       374 ~l~v~DlP~lWn~kme~yLGirp~~d~eG~----LQDIHWs~GsfGYFPsYtlG~  424 (497)
T COG2317         374 ELKVDDLPELWNDKMEEYLGIRPKNDAEGV----LQDIHWSHGSFGYFPTYTLGN  424 (497)
T ss_pred             CcchhhhHHHHHHHHHHHcCCCCCCccccc----cccccccCCCcCcCchhhhHH
Confidence            5666777655     344487777554443    4565555664  555444443


No 136
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=23.20  E-value=35  Score=34.56  Aligned_cols=16  Identities=25%  Similarity=0.289  Sum_probs=13.0

Q ss_pred             cchhhhhhhhhhhcCC
Q 021444          217 DLESVAVHEIGHLLGL  232 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL  232 (312)
                      .-+.|+.||+||+...
T Consensus       246 ~D~~t~~HE~GH~~yy  261 (477)
T cd06461         246 EDFVTVHHEMGHIQYY  261 (477)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4477899999999854


No 137
>TIGR02106 cyd_oper_ybgT cyd operon protein YbgT. This model describes a very small (as short as 33 amino acids) protein of unknown function, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It begins with an aromatic motif MWYFXW and appears to contain a membrane-spanning helix. This protein appears to be restricted to the Proteobacteria and exist in a single copy only. We suggest it may be a membrane subunit of the terminal oxidase. The family is named after the E. coli member YbgT. This model excludes the apparently related protein YccB.
Probab=22.74  E-value=53  Score=20.22  Aligned_cols=17  Identities=35%  Similarity=0.598  Sum_probs=11.7

Q ss_pred             hhcccchhHHHHHHHHH
Q 021444          293 YFLGSCLGLSFFVVAAV  309 (312)
Q Consensus       293 ~~~~~~~~~~~~~~~~~  309 (312)
                      .+||.+|..+|-|+.|+
T Consensus         6 WilG~~lA~~~~v~~a~   22 (30)
T TIGR02106         6 WILGTLLACAFGVLNAM   22 (30)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            46777777777776665


No 138
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=22.68  E-value=34  Score=29.85  Aligned_cols=16  Identities=25%  Similarity=0.316  Sum_probs=12.8

Q ss_pred             chhhhhhhhhhhcCCC
Q 021444          218 LESVAVHEIGHLLGLG  233 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~  233 (312)
                      +-.|++|||.|++.-.
T Consensus        36 lG~ilahel~hafd~~   51 (206)
T PF01431_consen   36 LGFILAHELMHAFDPE   51 (206)
T ss_dssp             HHHHHHHHHHHCTSTT
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            5679999999987553


No 139
>TIGR03141 cytochro_ccmD heme exporter protein CcmD. The model for this protein family describes a small, hydrophobic, and only moderately well-conserved protein, tricky to identify accurately for all of these reasons. However, members are found as part of large operons involved in heme export across the inner membrane for assembly of c-type cytochromes in a large number of bacteria. The gray zone between the trusted cutoff (13.0) and noise cutoff (4.75) includes both low-scoring examples and false-positive matches to hydrophobic domains of longer proteins.
Probab=22.62  E-value=61  Score=21.57  Aligned_cols=22  Identities=23%  Similarity=0.451  Sum_probs=15.3

Q ss_pred             CCCcchhcccchhHHHHHHHHHh
Q 021444          288 GAHGAYFLGSCLGLSFFVVAAVL  310 (312)
Q Consensus       288 ~~~~~~~~~~~~~~~~~~~~~~~  310 (312)
                      +.||.|..+ +-|++|+++++.+
T Consensus         2 Ggy~~yVW~-sYg~t~l~l~~li   23 (45)
T TIGR03141         2 GGYAFYVWL-AYGITALVLAGLI   23 (45)
T ss_pred             CCccHHHHH-HHHHHHHHHHHHH
Confidence            457777664 5688888877654


No 140
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=22.08  E-value=1.1e+02  Score=28.35  Aligned_cols=78  Identities=17%  Similarity=0.207  Sum_probs=47.2

Q ss_pred             HHHHHHHHHHHHHHhh--cccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCceEe--cCCCCcee
Q 021444          128 DEVKNVFATAFDRWSE--ATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRFHL--DGDENWVI  203 (312)
Q Consensus       128 ~~~~~~i~~A~~~Ws~--~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~ihf--d~~e~wt~  203 (312)
                      ..+....+.+...|+.  |..+.|.-..  .++.|+..++..  |+       ++      +  .|.|++  ..+     
T Consensus        79 ~~~ek~l~~~~~k~~~pnv~~v~f~V~~--~dp~irlqkg~k--gg-------g~------~--~~ki~l~l~p~-----  134 (280)
T COG5504          79 SNCEKYLPELIVKWYGPNVPIVDFPVTP--GDPQIRLQKGRK--GG-------GG------I--PGKIMLWLVPS-----  134 (280)
T ss_pred             HHHHHHHHHHHHhhcCCCCceeeecccC--CCchHHHhhccc--CC-------CC------c--CceEEEEEecC-----
Confidence            5567778888899985  4556665443  355566555332  21       11      1  233332  222     


Q ss_pred             cCCCccccCCccccchhhhhhhhhhhcCCCCCCCC
Q 021444          204 TSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVE  238 (312)
Q Consensus       204 ~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~  238 (312)
                               ..-.+..++++||+=|..-|.|..-.
T Consensus       135 ---------~~~~~v~aliaHE~HH~~R~~~i~~~  160 (280)
T COG5504         135 ---------STITSVPALIAHEYHHNCRLRYIDYG  160 (280)
T ss_pred             ---------CCccchHHHHHHHHHhhheecccccC
Confidence                     11236789999999999999887743


No 141
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.97  E-value=42  Score=23.98  Aligned_cols=22  Identities=36%  Similarity=0.386  Sum_probs=18.3

Q ss_pred             chhhhhhhhhhhcCCCCCCCCC
Q 021444          218 LESVAVHEIGHLLGLGHSSVED  239 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~~~  239 (312)
                      -..-++|.+-|-|||.|.++..
T Consensus        29 ~eRriih~la~~lGL~~~s~G~   50 (60)
T cd02639          29 AERRIVHLLASRLGLNHVSDGT   50 (60)
T ss_pred             HHHHHHHHHHHHcCCceEEeCC
Confidence            4568899999999999987544


No 142
>CHL00176 ftsH cell division protein; Validated
Probab=21.91  E-value=35  Score=35.84  Aligned_cols=14  Identities=36%  Similarity=0.707  Sum_probs=12.1

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      ....||.||.||||
T Consensus       437 ~~~~vA~hEaGhA~  450 (638)
T CHL00176        437 NKRLIAYHEVGHAI  450 (638)
T ss_pred             HHHHHHHHhhhhHH
Confidence            45679999999998


No 143
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=21.82  E-value=35  Score=35.81  Aligned_cols=14  Identities=50%  Similarity=0.688  Sum_probs=12.0

Q ss_pred             cchhhhhhhhhhhc
Q 021444          217 DLESVAVHEIGHLL  230 (312)
Q Consensus       217 ~l~~Va~HEiGHaL  230 (312)
                      ....|+.||+||||
T Consensus       407 ~~~~~a~he~gha~  420 (644)
T PRK10733        407 QKESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45779999999997


No 144
>PF14891 Peptidase_M91:  Effector protein
Probab=21.57  E-value=40  Score=29.01  Aligned_cols=18  Identities=22%  Similarity=0.071  Sum_probs=13.5

Q ss_pred             cchhhhhhhhhhhcCCCC
Q 021444          217 DLESVAVHEIGHLLGLGH  234 (312)
Q Consensus       217 ~l~~Va~HEiGHaLGL~H  234 (312)
                      .-+.|+.||+.||.-..+
T Consensus       102 ~p~v~L~HEL~HA~~~~~  119 (174)
T PF14891_consen  102 PPFVVLYHELIHAYDYMN  119 (174)
T ss_pred             HHHHHHHHHHHHHHHHHC
Confidence            446799999999965433


No 145
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=20.93  E-value=38  Score=34.46  Aligned_cols=17  Identities=41%  Similarity=0.581  Sum_probs=11.8

Q ss_pred             hhhhhhhhhhhc---CCCCC
Q 021444          219 ESVAVHEIGHLL---GLGHS  235 (312)
Q Consensus       219 ~~Va~HEiGHaL---GL~Hs  235 (312)
                      ..-++||.||||   ||...
T Consensus       260 l~s~iHE~GHalYEq~~~~~  279 (494)
T PF02074_consen  260 LFSTIHETGHALYEQGLPPE  279 (494)
T ss_dssp             HHHHHHHHHHHHHHHTS-GG
T ss_pred             HHHHHHHHHHHHHHcCCChh
Confidence            446789999996   66443


No 146
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.56  E-value=34  Score=35.07  Aligned_cols=51  Identities=27%  Similarity=0.188  Sum_probs=30.4

Q ss_pred             chhhhhhhhhhhcCCCCCCC-----------CCCccccc-cCC---CCCcCCCCHHHHHHHHHHhC
Q 021444          218 LESVAVHEIGHLLGLGHSSV-----------EDAIMYPS-ISS---GSRKVELANDDIQGIQALYG  268 (312)
Q Consensus       218 l~~Va~HEiGHaLGL~Hs~~-----------~~sIM~p~-y~~---~~~~~~L~~dDi~~Iq~LYG  268 (312)
                      -..|.+||+||-|||..+.+           ..|||..- +.+   ......++.-|.+-.|.-||
T Consensus       346 aaGv~ahEyghdlglPDEydtqy~GtgEPVgYwsImssGsWaGki~Gt~p~gFs~wdkef~q~~~G  411 (760)
T COG4412         346 AAGVEAHEYGHDLGLPDEYDTQYKGTGEPVGYWSIMSSGSWAGKIKGTLPGGFSFWDKEFPQPSIG  411 (760)
T ss_pred             ccceeeeecccccCCCcccccccccCCCcccceeeecccccccccCCcCCcccchhhhhccCcccC
Confidence            34599999999999987665           23677421 111   12223355556665665554


No 147
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=20.56  E-value=39  Score=31.40  Aligned_cols=11  Identities=36%  Similarity=0.341  Sum_probs=9.0

Q ss_pred             hhhhhhhhhhc
Q 021444          220 SVAVHEIGHLL  230 (312)
Q Consensus       220 ~Va~HEiGHaL  230 (312)
                      --..|||||-+
T Consensus       221 WG~~HE~GH~~  231 (307)
T PF13402_consen  221 WGPWHELGHNH  231 (307)
T ss_dssp             HHHHHHHHHHH
T ss_pred             eeehhhhhhhc
Confidence            35899999985


Done!