Query         035681
Match_columns 320
No_of_seqs    259 out of 1595
Neff          7.1 
Searched_HMMs 46136
Date          Fri Mar 29 05:18:36 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/035681.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/035681hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0   2E-53 4.2E-58  419.7  18.9  226   59-320    29-262 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen 100.0 4.3E-36 9.3E-41  258.1  15.3  150  165-319     1-157 (157)
  3 PF00413 Peptidase_M10:  Matrix 100.0 4.4E-34 9.6E-39  243.4  15.0  147  165-319     1-154 (154)
  4 cd04279 ZnMc_MMP_like_1 Zinc-d  99.9 3.2E-27 6.8E-32  202.7  13.7  139  171-319     2-156 (156)
  5 cd04268 ZnMc_MMP_like Zinc-dep  99.9 4.5E-24 9.7E-29  183.8   9.1  142  170-318     1-165 (165)
  6 smart00235 ZnMc Zinc-dependent  99.9 1.1E-22 2.5E-27  171.1  10.3  133  163-320     2-140 (140)
  7 cd04277 ZnMc_serralysin_like Z  99.9 6.3E-21 1.4E-25  168.1  13.8  122  184-319    34-186 (186)
  8 cd04327 ZnMc_MMP_like_3 Zinc-d  99.7 6.4E-17 1.4E-21  144.3  13.2  108  165-287     2-109 (198)
  9 cd00203 ZnMc Zinc-dependent me  99.7 7.2E-17 1.6E-21  138.9  12.5  119  182-318    20-167 (167)
 10 PF12388 Peptidase_M57:  Dual-a  99.6 1.5E-14 3.2E-19  128.8  12.6  137  165-318    33-211 (211)
 11 cd04276 ZnMc_MMP_like_2 Zinc-d  99.5 1.1E-13 2.5E-18  123.0  12.4  144  170-320     7-197 (197)
 12 cd04280 ZnMc_astacin_like Zinc  99.5 1.1E-13 2.5E-18  121.6  11.2   91  171-288     2-92  (180)
 13 PF01400 Astacin:  Astacin (Pep  99.5 7.5E-14 1.6E-18  123.9   9.2   94  165-288     2-97  (191)
 14 cd04281 ZnMc_BMP1_TLD Zinc-dep  99.5 4.4E-13 9.6E-18  119.5  12.5   99  163-288     7-105 (200)
 15 PF01471 PG_binding_1:  Putativ  99.5 1.8E-13 3.8E-18   97.7   6.6   56   57-121     2-57  (57)
 16 cd04282 ZnMc_meprin Zinc-depen  99.4   2E-12 4.3E-17  117.5  12.4   96  163-288    43-138 (230)
 17 cd04283 ZnMc_hatching_enzyme Z  99.4 4.6E-12 9.9E-17  111.4  11.0   92  171-288     4-95  (182)
 18 KOG3714 Meprin A metalloprotea  99.3 1.4E-11   3E-16  121.4  10.8  100  162-288    78-177 (411)
 19 PF02031 Peptidase_M7:  Strepto  99.0 1.3E-09 2.9E-14   89.0   7.0  112  184-319    13-130 (132)
 20 TIGR02869 spore_SleB spore cor  98.9 2.8E-09   6E-14   95.4   7.1   72   49-130     5-76  (201)
 21 COG5549 Predicted Zn-dependent  98.9 2.5E-09 5.3E-14   94.2   6.4  127  184-319   104-233 (236)
 22 PRK10594 murein L,D-transpepti  98.3 1.1E-06 2.4E-11   89.5   7.0   84   41-124   225-343 (608)
 23 COG3409 Putative peptidoglycan  97.8 2.5E-05 5.3E-10   67.9   5.4   69   49-125    37-106 (185)
 24 cd04267 ZnMc_ADAM_like Zinc-de  97.7 0.00022 4.8E-09   62.8   9.6   48  269-316   132-189 (192)
 25 PF11350 DUF3152:  Protein of u  97.7 0.00017 3.8E-09   63.9   7.9  113  170-289    28-158 (203)
 26 PF13583 Reprolysin_4:  Metallo  97.6   6E-05 1.3E-09   67.7   3.5   46  271-317   138-197 (206)
 27 COG2989 Uncharacterized protei  97.5 0.00014 3.1E-09   72.7   6.2   71   46-123   227-298 (561)
 28 COG1913 Predicted Zn-dependent  97.4 6.7E-05 1.4E-09   64.9   2.0   30  270-299   124-153 (181)
 29 PF05572 Peptidase_M43:  Pregna  97.4 0.00011 2.4E-09   63.1   3.0   23  267-289    66-88  (154)
 30 cd04273 ZnMc_ADAMTS_like Zinc-  97.4 0.00051 1.1E-08   61.5   6.9  116  188-316    71-196 (207)
 31 PF13574 Reprolysin_2:  Metallo  97.3 0.00012 2.5E-09   64.0   1.8   47  270-317   111-170 (173)
 32 COG3023 ampD N-acetyl-anhydrom  97.2 0.00071 1.5E-08   62.0   6.2   58   54-122   194-252 (257)
 33 PRK13267 archaemetzincin-like   97.2 0.00024 5.1E-09   62.5   2.9   29  270-298   125-153 (179)
 34 PF13688 Reprolysin_5:  Metallo  97.1 0.00025 5.4E-09   62.6   1.8   23  268-290   140-162 (196)
 35 PF07998 Peptidase_M54:  Peptid  97.1 0.00032 6.9E-09   62.4   2.5   30  269-299   144-173 (194)
 36 PF13582 Reprolysin_3:  Metallo  97.1 0.00022 4.9E-09   58.1   1.3   17  271-287   108-124 (124)
 37 cd04271 ZnMc_ADAM_fungal Zinc-  96.8  0.0011 2.5E-08   60.4   3.7   45  272-317   147-216 (228)
 38 cd04269 ZnMc_adamalysin_II_lik  96.8  0.0016 3.5E-08   57.4   4.4   31  269-299   130-166 (194)
 39 cd04275 ZnMc_pappalysin_like Z  96.7 0.00035 7.5E-09   63.7  -0.6   24  268-291   135-158 (225)
 40 PF08823 PG_binding_2:  Putativ  96.6  0.0056 1.2E-07   46.1   5.3   55   57-120    15-73  (74)
 41 COG3409 Putative peptidoglycan  96.5  0.0054 1.2E-07   53.1   5.8   62   53-122   123-184 (185)
 42 PF01421 Reprolysin:  Reprolysi  96.1   0.009   2E-07   52.9   5.3   48  268-315   129-184 (199)
 43 cd04270 ZnMc_TACE_like Zinc-de  96.1  0.0075 1.6E-07   55.6   4.9   46  270-315   167-226 (244)
 44 cd04272 ZnMc_salivary_gland_MP  95.8   0.011 2.4E-07   53.3   4.3   47  269-315   144-208 (220)
 45 PF05548 Peptidase_M11:  Gameto  94.1   0.031 6.7E-07   53.5   2.3   18  271-288   151-168 (314)
 46 PF10462 Peptidase_M66:  Peptid  93.9   0.027 5.9E-07   53.5   1.6   20  269-288   192-211 (305)
 47 PF12044 Metallopep:  Putative   93.2   0.052 1.1E-06   53.9   2.3   26  271-298   317-342 (423)
 48 PF07172 GRP:  Glycine rich pro  92.7    0.11 2.3E-06   41.1   2.9   28    1-29      1-28  (95)
 49 PF01457 Peptidase_M8:  Leishma  92.6   0.085 1.8E-06   53.9   2.9   60  212-286   159-226 (521)
 50 TIGR03296 M6dom_TIGR03296 M6 f  92.1    0.04 8.7E-07   52.0  -0.2   19  271-289   166-184 (286)
 51 PF11150 DUF2927:  Protein of u  91.2     4.5 9.8E-05   36.6  12.0  148  164-319    28-194 (213)
 52 PTZ00337 surface protease GP63  90.3    0.25 5.3E-06   51.0   3.4   36  246-287   211-246 (567)
 53 COG3824 Predicted Zn-dependent  89.5    0.16 3.5E-06   41.6   1.1   18  269-286   108-125 (136)
 54 PF09374 PG_binding_3:  Predict  89.0    0.37 8.1E-06   35.9   2.6   26   96-121     1-30  (72)
 55 KOG4525 Jacalin-like lectin do  88.8    0.24 5.2E-06   48.9   1.9   26  271-298   302-327 (614)
 56 PF04298 Zn_peptidase_2:  Putat  88.2    0.73 1.6E-05   41.9   4.5   18  269-288    88-105 (222)
 57 PTZ00257 Glycoprotein GP63 (le  87.8    0.72 1.6E-05   47.9   4.7   38  246-287   236-273 (622)
 58 KOG3658 Tumor necrosis factor-  87.3    0.41 8.9E-06   49.6   2.6   46  271-316   393-451 (764)
 59 COG2738 Predicted Zn-dependent  85.8       1 2.2E-05   40.1   3.8   17  184-200    19-35  (226)
 60 PF06262 DUF1025:  Possibl zinc  85.7    0.48   1E-05   37.5   1.7   16  270-285    73-88  (97)
 61 PF09471 Peptidase_M64:  IgA Pe  85.7    0.44 9.6E-06   44.5   1.7   19  269-287   215-234 (264)
 62 COG4783 Putative Zn-dependent   84.0    0.52 1.1E-05   47.2   1.5   19  267-286   127-145 (484)
 63 PF05547 Peptidase_M6:  Immune   83.3    0.22 4.8E-06   52.0  -1.6   17  271-287   222-238 (645)
 64 KOG3607 Meltrins, fertilins an  83.3     0.6 1.3E-05   49.5   1.6   20  269-288   322-341 (716)
 65 PF04228 Zn_peptidase:  Putativ  77.3     3.4 7.3E-05   39.2   4.4   41  244-285   141-189 (292)
 66 PF14247 DUF4344:  Domain of un  76.7     1.4   3E-05   40.1   1.5   14  269-282    91-104 (220)
 67 PF13398 Peptidase_M50B:  Pepti  76.3     1.4 3.1E-05   39.3   1.5   15  269-283    21-35  (200)
 68 PF06114 DUF955:  Domain of unk  75.9     1.3 2.9E-05   34.6   1.1   27  248-282    28-54  (122)
 69 COG5549 Predicted Zn-dependent  74.4     1.8   4E-05   38.9   1.6   30  270-299   153-182 (236)
 70 COG2856 Predicted Zn peptidase  74.0     1.5 3.3E-05   39.7   1.0   26  270-297    72-98  (213)
 71 PF01435 Peptidase_M48:  Peptid  71.5     2.3   5E-05   37.6   1.7   20  268-288    87-106 (226)
 72 COG0501 HtpX Zn-dependent prot  68.6     2.7 5.9E-05   39.1   1.5   15  268-282   155-169 (302)
 73 PF12725 DUF3810:  Protein of u  68.4     4.6  0.0001   38.7   3.1   37  245-289   179-215 (318)
 74 PRK03982 heat shock protein Ht  68.3     2.8   6E-05   39.5   1.5   15  268-282   123-137 (288)
 75 cd06258 Peptidase_M3_like The   68.2     8.3 0.00018   37.1   4.9   15  268-282   152-166 (365)
 76 PRK03001 M48 family peptidase;  66.9     3.1 6.7E-05   39.1   1.5   15  268-282   122-136 (283)
 77 PRK01345 heat shock protein Ht  66.5     3.1 6.8E-05   39.8   1.5   19  268-287   122-140 (317)
 78 PRK03072 heat shock protein Ht  66.3     3.2 6.9E-05   39.2   1.5   15  268-282   125-139 (288)
 79 cd06161 S2P-M50_SpoIVFB SpoIVF  64.5     3.7   8E-05   36.8   1.5   16  269-284    37-52  (208)
 80 PRK04897 heat shock protein Ht  62.7     4.1 8.9E-05   38.6   1.5   15  268-282   135-149 (298)
 81 PRK02870 heat shock protein Ht  61.5     4.4 9.6E-05   39.2   1.5   14  268-281   171-184 (336)
 82 PRK05457 heat shock protein Ht  61.1     4.6 9.9E-05   38.1   1.5   14  268-281   132-145 (284)
 83 PRK01265 heat shock protein Ht  61.0     4.6 9.9E-05   38.9   1.5   14  268-281   138-151 (324)
 84 PRK02391 heat shock protein Ht  60.6     4.7  0.0001   38.2   1.5   14  268-281   131-144 (296)
 85 PF13485 Peptidase_MA_2:  Pepti  59.1       6 0.00013   31.1   1.7   14  269-282    24-37  (128)
 86 cd06164 S2P-M50_SpoIVFB_CBS Sp  59.1     5.2 0.00011   36.4   1.5   15  270-284    53-67  (227)
 87 KOG3538 Disintegrin metallopro  58.8       3 6.5E-05   45.3  -0.2   32  269-300   316-353 (845)
 88 cd06459 M3B_Oligoendopeptidase  56.7       5 0.00011   39.3   1.0   14  269-282   221-234 (427)
 89 cd06162 S2P-M50_PDZ_SREBP Ster  55.9     6.3 0.00014   37.1   1.5   13  270-282   135-147 (277)
 90 PF13699 DUF4157:  Domain of un  55.0      13 0.00028   28.1   2.8   30  248-282    44-73  (79)
 91 cd06159 S2P-M50_PDZ_Arch Uncha  54.7     6.8 0.00015   36.6   1.5   16  270-285   118-133 (263)
 92 TIGR00181 pepF oligoendopeptid  54.6     7.9 0.00017   40.1   2.1   15  268-282   376-390 (591)
 93 KOG2921 Intramembrane metallop  54.5     6.4 0.00014   38.8   1.3   15  270-284   131-145 (484)
 94 PF01432 Peptidase_M3:  Peptida  54.5     6.2 0.00013   39.4   1.3   24  186-209   137-160 (458)
 95 TIGR02289 M3_not_pepF oligoend  53.9     5.9 0.00013   40.8   1.0   15  268-282   335-349 (549)
 96 cd06455 M3A_TOP Peptidase M3 T  52.3       7 0.00015   39.4   1.2   14  269-282   262-275 (472)
 97 cd06160 S2P-M50_like_2 Unchara  51.6     8.4 0.00018   34.0   1.5   21  270-290    41-61  (183)
 98 TIGR02290 M3_fam_3 oligoendope  50.2     7.3 0.00016   40.4   1.0   14  269-282   374-387 (587)
 99 PF01447 Peptidase_M4:  Thermol  49.8     8.2 0.00018   32.9   1.1   12  271-282   136-147 (150)
100 PLN02791 Nudix hydrolase homol  48.7     8.9 0.00019   41.2   1.4   21  269-289   584-604 (770)
101 PF01863 DUF45:  Protein of unk  48.5      12 0.00026   32.8   2.0   38  248-290   147-184 (205)
102 cd06457 M3A_MIP Peptidase M3 m  48.4     8.7 0.00019   38.6   1.2   24  186-209   133-156 (458)
103 cd06456 M3A_DCP_Oligopeptidase  46.6     9.7 0.00021   37.9   1.2   15  268-282   206-220 (422)
104 PF11337 DUF3139:  Protein of u  46.1      42 0.00091   25.5   4.5   23   52-74     29-51  (85)
105 PF01434 Peptidase_M41:  Peptid  45.4      12 0.00026   33.5   1.6   15  268-282    26-40  (213)
106 TIGR00054 RIP metalloprotease   42.6      11 0.00025   37.4   1.0   12  271-282    15-26  (420)
107 PRK10779 zinc metallopeptidase  42.4      12 0.00025   37.6   1.1   12  271-282    16-27  (449)
108 cd06460 M32_Taq Peptidase fami  42.1      12 0.00026   37.1   1.0   12  271-282   160-171 (396)
109 KOG2719 Metalloprotease [Gener  42.0      13 0.00029   37.0   1.3   19  268-287   278-296 (428)
110 KOG3988 Protein-tyrosine sulfo  40.1      46 0.00099   31.7   4.4   61   42-104   297-357 (378)
111 COG1164 Oligoendopeptidase F [  39.5      13 0.00029   38.7   1.0   14  269-282   379-392 (598)
112 COG4784 Putative Zn-dependent   39.3      21 0.00045   34.8   2.1   15  268-282   122-136 (479)
113 PF03272 Enhancin:  Viral enhan  37.0      38 0.00082   36.6   3.9   42  165-208   129-191 (775)
114 COG4227 Antirestriction protei  36.8      22 0.00047   33.3   1.8   22  267-288   200-221 (316)
115 PRK10911 oligopeptidase A; Pro  35.0      18  0.0004   38.3   1.2   14  269-282   462-475 (680)
116 PF14891 Peptidase_M91:  Effect  34.7      42 0.00091   29.0   3.2   18  269-286   102-119 (174)
117 PF03571 Peptidase_M49:  Peptid  33.7      19 0.00042   37.1   1.0   16  269-284   292-308 (549)
118 COG2321 Predicted metalloprote  33.5      19 0.00042   33.8   0.9   39  244-282   138-179 (295)
119 CHL00132 psaF photosystem I su  31.8   1E+02  0.0023   27.1   5.1   38   59-106    45-82  (185)
120 COG3926 zliS Lysozyme family p  29.0      52  0.0011   30.1   2.8   30   94-123    93-126 (252)
121 KOG2661 Peptidase family M48 [  28.6      31 0.00068   33.4   1.4   19  269-288   274-292 (424)
122 PF11576 DUF3236:  Protein of u  27.5      42 0.00091   28.5   1.9   27   44-70      9-35  (154)
123 PF08434 CLCA_N:  Calcium-activ  26.2      38 0.00082   31.7   1.5   14  271-284   150-164 (262)
124 PF06167 Peptidase_M90:  Glucos  25.5      32  0.0007   31.9   0.9   19  268-286   152-170 (253)
125 PRK10280 dipeptidyl carboxypep  24.3      35 0.00077   36.2   1.1   31  187-217   364-397 (681)
126 PF13721 SecD-TM1:  SecD export  23.7 1.5E+02  0.0032   23.5   4.3   29   48-76     36-64  (101)
127 cd02641 R3H_Smubp-2_like R3H d  23.6      55  0.0012   23.3   1.6   21  269-289    28-48  (60)
128 PF01431 Peptidase_M13:  Peptid  22.8      47   0.001   29.1   1.4   16  269-284    35-50  (206)
129 cd06461 M2_ACE Peptidase famil  22.7      44 0.00095   34.0   1.4   16  269-284   246-261 (477)
130 PF12315 DUF3633:  Protein of u  22.4      47   0.001   30.0   1.4   14  269-282    92-105 (212)
131 PRK10733 hflB ATP-dependent me  22.1      47   0.001   35.0   1.5   14  269-282   407-420 (644)
132 PF10026 DUF2268:  Predicted Zn  20.6 3.3E+02  0.0072   23.8   6.4   18  269-286    64-81  (195)
133 smart00731 SprT SprT homologue  20.2      56  0.0012   27.3   1.3   16  268-283    57-72  (146)
134 cd02639 R3H_RRM R3H domain of   20.0      67  0.0015   23.0   1.5   23  269-291    28-50  (60)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=2e-53  Score=419.74  Aligned_cols=226  Identities=43%  Similarity=0.737  Sum_probs=203.8

Q ss_pred             cHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccCCccccccCCCcccc
Q 035681           59 GIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIPRCGVADIVNGTTRM  138 (320)
Q Consensus        59 ~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~PRCgv~D~~~~~~~~  138 (320)
                      ++..++.||++|||+...+.....      .++..+++||+.||++++|++||++|.+|++.|.+|||||||.       
T Consensus        29 ~~~~~~~yl~~~~y~~~~~~~~~~------~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~-------   95 (469)
T KOG1565|consen   29 DKVALQDYLECYGYLPPTDLTATR------ASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG-------   95 (469)
T ss_pred             chhHHHHHhhhcccCCCccccccc------cCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------
Confidence            789999999999999876422111      1578899999999999999999999999999999999999991       


Q ss_pred             cCCcccccCCCccccccceecCCCCCCCCCCcccceEeccCCCc----hhHHHHHHHHHHHhhhcCcceEEEecCCCCCc
Q 035681          139 RSGKKRQQHNHHFRTVSHYSFFPRSPKWPASKFSLTYAFRPGTR----SDAITPVARAFQTWAANTQFKFSKIEDFANAD  214 (320)
Q Consensus       139 ~~~~~~~~~~~~~~~~~~y~~~~~~~kW~~~k~~Lty~~~~~~~----~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aD  214 (320)
                                       +|+++++++||+  +.+|||+|.+.++    .+++.++++||+.|+++++++|.|+.....||
T Consensus        96 -----------------~~~~~~~~~kW~--k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aD  156 (469)
T KOG1565|consen   96 -----------------RYRYFPGKPKWN--KEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEAD  156 (469)
T ss_pred             -----------------ccccCcccCccc--ccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCc
Confidence                             477888999999  9999999998753    37899999999999999999999999866999


Q ss_pred             eEEEeeeCCCCCCCCCCCCCCCCcceeeeeCC---CCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCC
Q 035681          215 IKISFESGDHGDGNPFDGRGGPEGIIAHAFAP---TNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEG  291 (320)
Q Consensus       215 I~I~f~~~~hgd~~~fdG~g~~~g~laha~~P---~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~  291 (320)
                      |+|.|..+.|||++||||++   |+|||||.|   .+|++|||++|.|+++ ...++|+..||+|||||+|||.||.+++
T Consensus       157 i~i~F~~~~h~d~~PFDG~~---g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~~~~  232 (469)
T KOG1565|consen  157 IRISFFPGDHGDGFPFDGPG---GVLAHAFFPGPGIGGDLHFDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSSDPD  232 (469)
T ss_pred             eeeeeeccCCCCCCcccCCC---CceecccCCCCCCCCccccCcccceecc-CCccchhHHHhhhhcccccccCCCCCcc
Confidence            99999999999999999999   999999999   6899999999999997 3457899999999999999999999999


Q ss_pred             CcccccccC-CCCCCCCHHHHHHHHHHhCC
Q 035681          292 AIMYPSISA-GVTKGLHGDDIQGIRDLYNV  320 (320)
Q Consensus       292 siMyp~~~~-~~~~~L~~dDi~~iq~LYg~  320 (320)
                      +||||+|.+ .....|++|||.|||.|||.
T Consensus       233 aiM~P~y~~~~~~~~L~~DDv~giq~lYG~  262 (469)
T KOG1565|consen  233 AIMYPFYQPDSGNFDLSQDDVRGIQHLYGG  262 (469)
T ss_pred             cccccccccCCCCcccChhhhhhhHHHhCC
Confidence            999999994 23458999999999999984


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00  E-value=4.3e-36  Score=258.06  Aligned_cols=150  Identities=48%  Similarity=0.827  Sum_probs=135.4

Q ss_pred             CCCCCcccceEeccCCC----chhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681          165 KWPASKFSLTYAFRPGT----RSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII  240 (320)
Q Consensus       165 kW~~~k~~Lty~~~~~~----~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l  240 (320)
                      ||+  +++|||+|.+..    ..+++++|++||+.|+++++|+|+|+.+...|||+|.|....|+|+++|+|.+   |++
T Consensus         1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~---g~l   75 (157)
T cd04278           1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPG---GTL   75 (157)
T ss_pred             CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCc---ccc
Confidence            799  999999998754    24788999999999999999999999876689999999999999999999999   999


Q ss_pred             eeeeCC--CCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCCcccccccCC-CCCCCCHHHHHHHHHH
Q 035681          241 AHAFAP--TNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGAIMYPSISAG-VTKGLHGDDIQGIRDL  317 (320)
Q Consensus       241 aha~~P--~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~~~-~~~~L~~dDi~~iq~L  317 (320)
                      +||++|  ..|+|+||.++.|++.....+.+++.|++|||||+|||.|+.++.+||||.+... ....|+++||.+||+|
T Consensus        76 ~~a~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~~~l  155 (157)
T cd04278          76 AHAFFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGIQAL  155 (157)
T ss_pred             ccccCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHHHHh
Confidence            999998  7899999999999886534567899999999999999999999999999999843 2459999999999999


Q ss_pred             hC
Q 035681          318 YN  319 (320)
Q Consensus       318 Yg  319 (320)
                      ||
T Consensus       156 Yg  157 (157)
T cd04278         156 YG  157 (157)
T ss_pred             cC
Confidence            98


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00  E-value=4.4e-34  Score=243.37  Aligned_cols=147  Identities=45%  Similarity=0.843  Sum_probs=130.8

Q ss_pred             CCCCCcccceEeccCCCc----hhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681          165 KWPASKFSLTYAFRPGTR----SDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII  240 (320)
Q Consensus       165 kW~~~k~~Lty~~~~~~~----~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l  240 (320)
                      ||+  +++|||+|.+..+    .+.++++++||+.|+++.+++|+++.+. .+||+|.|....++++..|++.+   +++
T Consensus         1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~---~~~   74 (154)
T PF00413_consen    1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSG---GTL   74 (154)
T ss_dssp             SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSS---SES
T ss_pred             CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccce---eee
Confidence            799  8999999998653    2688999999999999999999999963 69999999998899888899988   999


Q ss_pred             eeeeCCC---CceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCCcccccccCCCCCCCCHHHHHHHHHH
Q 035681          241 AHAFAPT---NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGAIMYPSISAGVTKGLHGDDIQGIRDL  317 (320)
Q Consensus       241 aha~~P~---~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~~~~~~~L~~dDi~~iq~L  317 (320)
                      ++++.|.   .|+++|+.++.|...  ....++..|++|||||+|||.|+.++++||||.+.......|+.+||.+||+|
T Consensus        75 ~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~~~l~~~Di~~i~~l  152 (154)
T PF00413_consen   75 AHAYFPNNIVSGDIHFNDDESWTID--DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDNKTLSEDDIDGIQYL  152 (154)
T ss_dssp             EEEEESSSTTTTEEEEETTSHEESS--SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSSTSTTHHHHHHHHHH
T ss_pred             eccccccccccccccccccccchhh--hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCCCCCCHHHHHHHHHH
Confidence            9999985   799999999999875  23568999999999999999999999999999998544359999999999999


Q ss_pred             hC
Q 035681          318 YN  319 (320)
Q Consensus       318 Yg  319 (320)
                      ||
T Consensus       153 Yg  154 (154)
T PF00413_consen  153 YG  154 (154)
T ss_dssp             HS
T ss_pred             hC
Confidence            98


No 4  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.95  E-value=3.2e-27  Score=202.67  Aligned_cols=139  Identities=30%  Similarity=0.427  Sum_probs=110.0

Q ss_pred             ccceEeccCCC------chhHHHHHHHHHHHhhhcCcceEEEecCC-CCCceEEEeeeCCCCCCCCCCCCCCCCcceeee
Q 035681          171 FSLTYAFRPGT------RSDAITPVARAFQTWAANTQFKFSKIEDF-ANADIKISFESGDHGDGNPFDGRGGPEGIIAHA  243 (320)
Q Consensus       171 ~~Lty~~~~~~------~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~-~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha  243 (320)
                      ..|+|++.+..      +.+++++|++||+.|+++++|+|+++.+. ..+||+|.|....     +|+|.|   +++||+
T Consensus         2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g---~~~a~a   73 (156)
T cd04279           2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAG---GGLARA   73 (156)
T ss_pred             CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCC---CceEEe
Confidence            46888888753      33788999999999999999999998753 2799999998632     788899   999999


Q ss_pred             eCCCCceeEecCCCCCcc------CCccccchhHHHHHHhhhhhhCCCCCCCCC-CcccccccCCCC--CCCCHHHHHHH
Q 035681          244 FAPTNGRFHYDADEPWAV------GAVQGAFDLETVALHEIGHLLGLHHSSVEG-AIMYPSISAGVT--KGLHGDDIQGI  314 (320)
Q Consensus       244 ~~P~~G~i~~d~~e~w~~------~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-siMyp~~~~~~~--~~L~~dDi~~i  314 (320)
                      ++|..|.  ++....|..      .....+.+++.|++|||||+|||.|+.++. +||||++..+..  ..|+.+||++|
T Consensus        74 ~~p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i  151 (156)
T cd04279          74 GFPLISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATL  151 (156)
T ss_pred             cccccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHH
Confidence            9997533  222222221      111124689999999999999999999998 999999985432  48999999999


Q ss_pred             HHHhC
Q 035681          315 RDLYN  319 (320)
Q Consensus       315 q~LYg  319 (320)
                      ++|||
T Consensus       152 ~~lY~  156 (156)
T cd04279         152 KRLYG  156 (156)
T ss_pred             HHHhC
Confidence            99997


No 5  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.90  E-value=4.5e-24  Score=183.85  Aligned_cols=142  Identities=23%  Similarity=0.279  Sum_probs=106.8

Q ss_pred             cccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCc
Q 035681          170 KFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNG  249 (320)
Q Consensus       170 k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G  249 (320)
                      ++.|+|+|.+..+...+++|++||+.|+++++|+|+++...+.+||+|.+....+..    +|..   +.+++...|..|
T Consensus         1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~---~~~~~~~~~~~g   73 (165)
T cd04268           1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTW---SYGPSQVDPLTG   73 (165)
T ss_pred             CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCcc---ccCCccCCCCCc
Confidence            468999999887778999999999999999999999998766799999988643211    0111   222222245678


Q ss_pred             eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCC----------------CCCcccccccC-------CCCCCC
Q 035681          250 RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV----------------EGAIMYPSISA-------GVTKGL  306 (320)
Q Consensus       250 ~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~siMyp~~~~-------~~~~~L  306 (320)
                      +|+++....|.......+.++..|++|||||+|||.|+.+                ..|||++....       +....|
T Consensus        74 ~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~  153 (165)
T cd04268          74 EILLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTI  153 (165)
T ss_pred             cEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCC
Confidence            9998876554321011234688999999999999999998                89999987652       124589


Q ss_pred             CHHHHHHHHHHh
Q 035681          307 HGDDIQGIRDLY  318 (320)
Q Consensus       307 ~~dDi~~iq~LY  318 (320)
                      +.+||.+||.||
T Consensus       154 ~~~Di~ai~~lY  165 (165)
T cd04268         154 GPYDIAAIKKLY  165 (165)
T ss_pred             CHHHHHHHHhcC
Confidence            999999999999


No 6  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.88  E-value=1.1e-22  Score=171.07  Aligned_cols=133  Identities=34%  Similarity=0.528  Sum_probs=108.0

Q ss_pred             CCCCCCCcccceEecc-CCCch-hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcce
Q 035681          163 SPKWPASKFSLTYAFR-PGTRS-DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGII  240 (320)
Q Consensus       163 ~~kW~~~k~~Lty~~~-~~~~~-~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~l  240 (320)
                      ..+|+  +.+|+|+|. +..+. +++++|++||+.|+++++|+|+|+.+  .+||.|.|...++        .|   ..+
T Consensus         2 ~~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~--------~g---~~~   66 (140)
T smart00235        2 SKKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDG--------SG---CTL   66 (140)
T ss_pred             CCcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCC--------CC---cce
Confidence            36899  889999997 54443 45899999999999999999999875  7999999998765        12   236


Q ss_pred             eeeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCCCC---cccccccCCC-CCCCCHHHHHHHHH
Q 035681          241 AHAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVEGA---IMYPSISAGV-TKGLHGDDIQGIRD  316 (320)
Q Consensus       241 aha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---iMyp~~~~~~-~~~L~~dDi~~iq~  316 (320)
                      ++++.| .|+++++. +.|..+        ..|++|||||+|||.|+.++.+   +|++.+..-. ...|+.+|..++++
T Consensus        67 a~~g~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~  136 (140)
T smart00235       67 SHAGRP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPY  136 (140)
T ss_pred             eeeecC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchh
Confidence            888888 88999976 666543        3499999999999999998877   9999997321 34778889999999


Q ss_pred             HhCC
Q 035681          317 LYNV  320 (320)
Q Consensus       317 LYg~  320 (320)
                      +||.
T Consensus       137 ~yg~  140 (140)
T smart00235      137 DYGS  140 (140)
T ss_pred             ccCc
Confidence            9984


No 7  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.86  E-value=6.3e-21  Score=168.15  Aligned_cols=122  Identities=22%  Similarity=0.310  Sum_probs=96.0

Q ss_pred             hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCC-------CceeEecCC
Q 035681          184 DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPT-------NGRFHYDAD  256 (320)
Q Consensus       184 ~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~-------~G~i~~d~~  256 (320)
                      +.++++++||+.|+++++|+|+|+.+...+||+|.+.....         +   +..+.+++|.       .|+++++.+
T Consensus        34 ~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~---~~~g~a~~p~~~~~~~~~g~i~~~~~  101 (186)
T cd04277          34 AQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------G---NTAGYAYYPGSGSGTAYGGDIWFNSS  101 (186)
T ss_pred             HHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------C---CccEEEECCCCCccccccceeEEecC
Confidence            67899999999999999999999997768999998875321         2   5677888873       368999988


Q ss_pred             CCCccCCccccchhHHHHHHhhhhhhCCCCCCCC----------------CCcccccccCC--------CCCCCCHHHHH
Q 035681          257 EPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVE----------------GAIMYPSISAG--------VTKGLHGDDIQ  312 (320)
Q Consensus       257 e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~siMyp~~~~~--------~~~~L~~dDi~  312 (320)
                      ..+...  ..+.....|++|||||+|||.|+.+.                .+||.+....+        ....+...||+
T Consensus       102 ~~~~~~--~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~  179 (186)
T cd04277         102 YDTNSD--SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIA  179 (186)
T ss_pred             cccccC--CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHH
Confidence            776543  22346789999999999999998752                46887554432        22378899999


Q ss_pred             HHHHHhC
Q 035681          313 GIRDLYN  319 (320)
Q Consensus       313 ~iq~LYg  319 (320)
                      +||+|||
T Consensus       180 AlQ~lYG  186 (186)
T cd04277         180 ALQYLYG  186 (186)
T ss_pred             HHHHhhC
Confidence            9999998


No 8  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.72  E-value=6.4e-17  Score=144.25  Aligned_cols=108  Identities=29%  Similarity=0.495  Sum_probs=76.3

Q ss_pred             CCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeee
Q 035681          165 KWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAF  244 (320)
Q Consensus       165 kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~  244 (320)
                      +|++ .+.|+|.|..+.+.+.++.|++||+.|+++++|+|++++.. .+||+|.|..+. | |+++.|..   +..   .
T Consensus         2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-G-c~S~vG~~---~~~---~   71 (198)
T cd04327           2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-G-YWSYVGTD---ALL---I   71 (198)
T ss_pred             CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-C-CCCCcCCc---ccc---c
Confidence            6985 56789999887666788999999999999999999999864 799999998742 4 45666653   221   1


Q ss_pred             CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681          245 APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS  287 (320)
Q Consensus       245 ~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs  287 (320)
                      .+....+.+.    |.... ..+.+...|++|||||||||.|.
T Consensus        72 ~~~~~t~~l~----~~~~~-~~~~~~~~~i~HElgHaLG~~HE  109 (198)
T cd04327          72 GADAPTMNLG----WFTDD-TPDPEFSRVVLHEFGHALGFIHE  109 (198)
T ss_pred             CCCCceeeee----eecCC-CchhhHHHHHHHHHHHHhcCccc
Confidence            1112233332    21110 02346678999999999999996


No 9  
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.72  E-value=7.2e-17  Score=138.89  Aligned_cols=119  Identities=24%  Similarity=0.296  Sum_probs=90.3

Q ss_pred             chhHHHHHHHHHHHhhhcCcceEEEecCC-CCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCC-----CceeEecC
Q 035681          182 RSDAITPVARAFQTWAANTQFKFSKIEDF-ANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPT-----NGRFHYDA  255 (320)
Q Consensus       182 ~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~-~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~-----~G~i~~d~  255 (320)
                      +..++++|+.|++.|+++++++|+++... ..+||.+.+...++        .+   +.++.++.|.     .+.+.+..
T Consensus        20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~---~~~g~a~~~~~c~~~~~~~~~~~   88 (167)
T cd00203          20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DG---GTGGWAYLGRVCDSLRGVGVLQD   88 (167)
T ss_pred             HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CC---CceEEEecCCccCCCCCcEEEec
Confidence            33688999999999999999999999864 47999988875432        12   4556677653     35565544


Q ss_pred             CCCCccCCccccchhHHHHHHhhhhhhCCCCCCC--------------------CCCcccccccC---CCCCCCCHHHHH
Q 035681          256 DEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV--------------------EGAIMYPSISA---GVTKGLHGDDIQ  312 (320)
Q Consensus       256 ~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~siMyp~~~~---~~~~~L~~dDi~  312 (320)
                      ...+       ......|++|||||+|||.|...                    ..+||+|....   .....++..||.
T Consensus        89 ~~~~-------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~~  161 (167)
T cd00203          89 NQSG-------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDID  161 (167)
T ss_pred             CCcc-------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHHH
Confidence            3322       23567899999999999999765                    46899998774   245689999999


Q ss_pred             HHHHHh
Q 035681          313 GIRDLY  318 (320)
Q Consensus       313 ~iq~LY  318 (320)
                      .||++|
T Consensus       162 ~i~~~Y  167 (167)
T cd00203         162 QINKLY  167 (167)
T ss_pred             HHHhhC
Confidence            999998


No 10 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.59  E-value=1.5e-14  Score=128.83  Aligned_cols=137  Identities=25%  Similarity=0.292  Sum_probs=95.8

Q ss_pred             CCCCCcccceEeccC----CCchhHHHHHHHHHHHhhhc-CcceEEEe--cCCCCCceEEEeeeCCCCCCCCCCCCCCCC
Q 035681          165 KWPASKFSLTYAFRP----GTRSDAITPVARAFQTWAAN-TQFKFSKI--EDFANADIKISFESGDHGDGNPFDGRGGPE  237 (320)
Q Consensus       165 kW~~~k~~Lty~~~~----~~~~~~~~~i~~A~~~Ws~v-~~l~F~e~--~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~  237 (320)
                      -|....++|+..+.+    ..+...+.++.+|++.|+.+ ..|+|+-+  .....+||.|.-...        +.+.   
T Consensus        33 lV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~---  101 (211)
T PF12388_consen   33 LVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPS---  101 (211)
T ss_pred             eecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCC---
Confidence            453236788888887    33446889999999999996 35788522  233466776632211        1123   


Q ss_pred             cceeeeeCC-CCc----eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC------------------------
Q 035681          238 GIIAHAFAP-TNG----RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS------------------------  288 (320)
Q Consensus       238 g~laha~~P-~~G----~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~------------------------  288 (320)
                      |..+.|.+| .+|    .|.+...+.      ......++|++|||||+|||.|++                        
T Consensus       102 G~ggsAGFP~s~G~P~~~I~I~~~~~------~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~Ip  175 (211)
T PF12388_consen  102 GAGGSAGFPTSNGNPYKFIQIYGLSN------YSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIP  175 (211)
T ss_pred             CcceeccCCCCCCCCCceEEEEecCC------CchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECC
Confidence            567788889 554    455511111      112246789999999999999986                        


Q ss_pred             ------CCCCcccccccCCCCCCCCHHHHHHHHHHh
Q 035681          289 ------VEGAIMYPSISAGVTKGLHGDDIQGIRDLY  318 (320)
Q Consensus       289 ------~~~siMyp~~~~~~~~~L~~dDi~~iq~LY  318 (320)
                            |++|||..++..+...+++..||.++++||
T Consensus       176 GTPt~~d~~SiM~ac~~~~~~~~f~~~Di~Al~~lY  211 (211)
T PF12388_consen  176 GTPTGADPNSIMNACFSSGEDGEFTSNDITALNYLY  211 (211)
T ss_pred             CCCCCCCCchhhhccccCCCCCCcChhhHHHHHhhC
Confidence                  467999999887777899999999999999


No 11 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.52  E-value=1.1e-13  Score=123.03  Aligned_cols=144  Identities=17%  Similarity=0.121  Sum_probs=89.5

Q ss_pred             cccceEeccCCCchhHHHHHHHHHHHhhhc------CcceEEEecCC--CCCceEEEeeeCCCCCCCCCCCCCCCCccee
Q 035681          170 KFSLTYAFRPGTRSDAITPVARAFQTWAAN------TQFKFSKIEDF--ANADIKISFESGDHGDGNPFDGRGGPEGIIA  241 (320)
Q Consensus       170 k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v------~~l~F~e~~~~--~~aDI~I~f~~~~hgd~~~fdG~g~~~g~la  241 (320)
                      |+.|||+|.+..|.+++++|++||+.|+++      .++.+++....  ...||++..-+--+.+    ++..   +...
T Consensus         7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~---~~gp   79 (197)
T cd04276           7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGW---AYGP   79 (197)
T ss_pred             CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcc---eecc
Confidence            889999999998888999999999999997      35555555422  2468984433322322    1111   2333


Q ss_pred             eeeCCCCc-----eeEecCCCCCccCC---ccccchhHHHHHHhhhhhhCCCCCCC---------------------CCC
Q 035681          242 HAFAPTNG-----RFHYDADEPWAVGA---VQGAFDLETVALHEIGHLLGLHHSSV---------------------EGA  292 (320)
Q Consensus       242 ha~~P~~G-----~i~~d~~e~w~~~~---~~~~~~l~~Va~HEiGHaLGL~Hs~~---------------------~~s  292 (320)
                      +..-|..|     +|.++.........   ...+..+..+++||+||+|||.|...                     ..|
T Consensus        80 s~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~S  159 (197)
T cd04276          80 SVVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSS  159 (197)
T ss_pred             cccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcce
Confidence            44446444     55665533211110   01123577899999999999999631                     245


Q ss_pred             cccc---cccC-------CCCCCCCHHHHHHHHHHhCC
Q 035681          293 IMYP---SISA-------GVTKGLHGDDIQGIRDLYNV  320 (320)
Q Consensus       293 iMyp---~~~~-------~~~~~L~~dDi~~iq~LYg~  320 (320)
                      ||-.   .+..       .....+...||.+||.+||.
T Consensus       160 VMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~  197 (197)
T cd04276         160 VMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP  197 (197)
T ss_pred             eecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence            5532   1100       01236788999999999984


No 12 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.51  E-value=1.1e-13  Score=121.62  Aligned_cols=91  Identities=20%  Similarity=0.230  Sum_probs=64.3

Q ss_pred             ccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCce
Q 035681          171 FSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGR  250 (320)
Q Consensus       171 ~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~  250 (320)
                      ..|.|.|++..+...++.|++||+.|++++||+|++.+.   ++..|.|..+ .| |++.-|.-   +        ...+
T Consensus         2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~---~--------~~q~   65 (180)
T cd04280           2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRV---G--------GRQV   65 (180)
T ss_pred             CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCcc---C--------Ccee
Confidence            478999998766688899999999999999999999874   5556677654 34 34433321   0        1123


Q ss_pred             eEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          251 FHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       251 i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      +.+..           ++....+++|||||||||.|..
T Consensus        66 i~l~~-----------~c~~~g~v~HE~~HalG~~HEh   92 (180)
T cd04280          66 VSLGS-----------GCFSLGTIVHELMHALGFYHEQ   92 (180)
T ss_pred             EEeCC-----------CcCcCchhHHHHHHHhcCcchh
Confidence            33322           1233579999999999999974


No 13 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.50  E-value=7.5e-14  Score=123.90  Aligned_cols=94  Identities=30%  Similarity=0.513  Sum_probs=62.1

Q ss_pred             CCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeee
Q 035681          165 KWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAF  244 (320)
Q Consensus       165 kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~  244 (320)
                      +|+  ..+|.|.|....+...++.|++|+..|++.++|+|+++. . .....|.|.. ..|| +++-|..          
T Consensus         2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~-~~gC-~S~vG~~----------   65 (191)
T PF01400_consen    2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSN-GSGC-WSYVGRQ----------   65 (191)
T ss_dssp             S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEES-SSSE-EEESS------------
T ss_pred             cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeec-Cccc-cchhhhc----------
Confidence            799  889999999877767889999999999999999999998 3 4444556654 2333 3333321          


Q ss_pred             CCCCc--eeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          245 APTNG--RFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       245 ~P~~G--~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                         .|  .+.+.  .         ++....|++|||||||||.|..
T Consensus        66 ---~g~q~i~l~--~---------~c~~~~~i~HEl~HaLG~~HEh   97 (191)
T PF01400_consen   66 ---GGEQTINLG--D---------GCFSVGTILHELGHALGFWHEH   97 (191)
T ss_dssp             ---SSEEEEEE---T---------TC-SHHHHHHHHHHHHTB--GG
T ss_pred             ---CcceeEEec--c---------eeCCccchHHHHHHHHhhhhhh
Confidence               12  23332  1         1223579999999999999974


No 14 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.48  E-value=4.4e-13  Score=119.54  Aligned_cols=99  Identities=20%  Similarity=0.317  Sum_probs=67.5

Q ss_pred             CCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceee
Q 035681          163 SPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAH  242 (320)
Q Consensus       163 ~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~lah  242 (320)
                      ..+|+  +..|.|.|........++.|++||+.|++.+||+|++..+.   ...|.|..+..|| +++-|.-   +.-. 
T Consensus         7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~---~~yi~f~~~~~GC-~S~vG~~---~~g~-   76 (200)
T cd04281           7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPE---ENYIVFTYRPCGC-CSYVGRR---GNGP-   76 (200)
T ss_pred             cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCCC---CCEEEEEECCCCe-eEcCCCc---CCCc-
Confidence            46999  88999999877665678999999999999999999997642   3345555443443 4555542   1000 


Q ss_pred             eeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          243 AFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       243 a~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                            -.+.+..           ++....|++|||||||||.|..
T Consensus        77 ------q~isl~~-----------~C~~~Gti~HEl~HaLGf~HEh  105 (200)
T cd04281          77 ------QAISIGK-----------NCDKFGIVVHELGHVIGFWHEH  105 (200)
T ss_pred             ------eeeecCC-----------CcCcCchHHHHHHHHhcCcchh
Confidence                  0122211           1223469999999999999864


No 15 
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.45  E-value=1.8e-13  Score=97.72  Aligned_cols=56  Identities=29%  Similarity=0.444  Sum_probs=50.2

Q ss_pred             hhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhh
Q 035681           57 IKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKM  121 (320)
Q Consensus        57 ~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m  121 (320)
                      ++.|..+|++|.++||++.         ..++.|++.+++||++||+.+||++||++|.+|+++|
T Consensus         2 ~~~v~~lq~~L~~~gy~~~---------~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen    2 GPDVKALQQYLNRLGYYPG---------PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             SHHHHHHHHHHHHTTTT-S---------STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             cHHHHHHHHHHHHcCCCCC---------CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence            5789999999999999953         3789999999999999999999999999999999987


No 16 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.41  E-value=2e-12  Score=117.47  Aligned_cols=96  Identities=22%  Similarity=0.337  Sum_probs=66.4

Q ss_pred             CCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceee
Q 035681          163 SPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAH  242 (320)
Q Consensus       163 ~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~lah  242 (320)
                      ..+|+  + .|.|.|....+...++.|++||+.|++.+||+|++.... ..  .|.|..+ .|| +++-|..   |    
T Consensus        43 ~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~-~GC-~S~vG~~---g----  107 (230)
T cd04282          43 TYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKG-SGC-WSMVGDQ---Q----  107 (230)
T ss_pred             ccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcC-CCe-eeccCcc---C----
Confidence            46999  7 899999887766788999999999999999999987642 22  3455543 243 4444432   1    


Q ss_pred             eeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          243 AFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       243 a~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                          ..-.+.+..           ++....+++|||||||||.|..
T Consensus       108 ----g~q~isl~~-----------~C~~~Gti~HEl~HalGf~HEq  138 (230)
T cd04282         108 ----GGQNLSIGA-----------GCDYKATVEHEFLHALGFYHEQ  138 (230)
T ss_pred             ----CeEEEEECC-----------CcCCCchHHHHHHHHhCCcccc
Confidence                011233321           2334579999999999999974


No 17 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=99.36  E-value=4.6e-12  Score=111.43  Aligned_cols=92  Identities=20%  Similarity=0.273  Sum_probs=60.5

Q ss_pred             ccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCce
Q 035681          171 FSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGR  250 (320)
Q Consensus       171 ~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~  250 (320)
                      -.|.|.|+...+...++.|++|++.|++.++|+|++.... ..-|  .|..+ .| |+++-|..   |-        .-.
T Consensus         4 ~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~yi--~~~~~-~g-C~S~vG~~---gg--------~q~   67 (182)
T cd04283           4 VYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RDYL--NIESR-SG-CWSYIGRQ---GG--------RQT   67 (182)
T ss_pred             EEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-CcEE--EEEcC-CC-ceEecCcc---CC--------cee
Confidence            3689999877666778999999999999999999987532 3334  44432 23 34444432   10        011


Q ss_pred             eEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          251 FHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       251 i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      +.+..          .++....+++|||||||||.|..
T Consensus        68 i~l~~----------~~C~~~G~i~HEl~HaLG~~HEh   95 (182)
T cd04283          68 VSLQK----------QGCMYKGIIQHELLHALGFYHEQ   95 (182)
T ss_pred             EecCC----------CCcCccchHHHHHHHHhCCcccc
Confidence            22211          12233579999999999999974


No 18 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.29  E-value=1.4e-11  Score=121.40  Aligned_cols=100  Identities=24%  Similarity=0.412  Sum_probs=69.8

Q ss_pred             CCCCCCCCcccceEeccCCCchhHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCccee
Q 035681          162 RSPKWPASKFSLTYAFRPGTRSDAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIA  241 (320)
Q Consensus       162 ~~~kW~~~k~~Lty~~~~~~~~~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~la  241 (320)
                      ...+|+  ...|.|.|........+.+|+.||+.|++.++|+|++..+. ..+..+.+...  || +++-|..   |   
T Consensus        78 ~~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC-~S~VGr~---g---  145 (411)
T KOG3714|consen   78 PERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GC-YSYVGRR---G---  145 (411)
T ss_pred             hhhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cc-eeeeCcc---C---
Confidence            457999  78999999988666889999999999999999999998744 34444444332  43 4444443   1   


Q ss_pred             eeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCC
Q 035681          242 HAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       242 ha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                            +++      +.-+++   .+.....|++|||+||||+.|.+
T Consensus       146 ------g~~------q~~sl~---~~C~~~G~i~HEl~HaLGf~Heh  177 (411)
T KOG3714|consen  146 ------GGQ------QLLSLG---DGCDRFGTIVHELMHALGFWHEH  177 (411)
T ss_pred             ------CCc------cceecC---CCcCcCchhHHHHHHHhhhhhcc
Confidence                  111      011111   13344779999999999999975


No 19 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.97  E-value=1.3e-09  Score=89.05  Aligned_cols=112  Identities=21%  Similarity=0.312  Sum_probs=67.0

Q ss_pred             hHHHHHHHHHHHhhh-cCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCceeEecCCCCCccC
Q 035681          184 DAITPVARAFQTWAA-NTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGRFHYDADEPWAVG  262 (320)
Q Consensus       184 ~~~~~i~~A~~~Ws~-v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~i~~d~~e~w~~~  262 (320)
                      +.+..|.+|.+.|++ |.+++|++..   .++|+|.-...          ++   +..+..-....|.|.+|..  +   
T Consensus        13 ~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~----------~~---~sya~~~g~G~G~I~l~~~--~---   71 (132)
T PF02031_consen   13 EFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGND----------PR---GSYASTDGLGSGYIFLDYQ--Q---   71 (132)
T ss_dssp             GGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE------------TT-----EEEE-SSS-EEEEEEHH--H---
T ss_pred             hhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecC----------CC---CcccccCCCCcEEEEechH--H---
Confidence            688999999999986 8999999976   38888754321          12   3333333334688888742  2   


Q ss_pred             CccccchhHHHHHHhhhhhhCCC-CCCC-CCCcccccccCCC---CCCCCHHHHHHHHHHhC
Q 035681          263 AVQGAFDLETVALHEIGHLLGLH-HSSV-EGAIMYPSISAGV---TKGLHGDDIQGIRDLYN  319 (320)
Q Consensus       263 ~~~~~~~l~~Va~HEiGHaLGL~-Hs~~-~~siMyp~~~~~~---~~~L~~dDi~~iq~LYg  319 (320)
                        ..+++-..|++|||||+|||. |-+- ...+|.-... |.   +......++..+..||.
T Consensus        72 --~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~a-G~sCtN~~Pna~E~a~V~~~~a  130 (132)
T PF02031_consen   72 --NQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSA-GTSCTNAYPNAAERARVNSLFA  130 (132)
T ss_dssp             --HHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT--TT----S--HHHHHHHHHHCT
T ss_pred             --hhCCccceeeeehhccccCCCCCCCCccHHhhcCCCC-CCCCCCCCCCHHHHHHHHHHhc
Confidence              124677889999999999995 5333 4567875433 42   22567889999999884


No 20 
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=98.91  E-value=2.8e-09  Score=95.43  Aligned_cols=72  Identities=21%  Similarity=0.314  Sum_probs=63.8

Q ss_pred             ccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccCCccc
Q 035681           49 KGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIPRCGV  128 (320)
Q Consensus        49 ~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~PRCgv  128 (320)
                      ..++.|+.++.|..||+.|...||...         ..|+.|++.|+.||+.||+.+||+++|++|..|+..|..| |.+
T Consensus         5 ~~l~~G~~g~~V~~LQ~~L~~lG~~~g---------~idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~   74 (201)
T TIGR02869         5 QTYQRGSTGSDVIEIQRRLKAWGYYNG---------KVDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQ   74 (201)
T ss_pred             ccCCCCCCcHHHHHHHHHHHHcCCCCC---------CCCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-ccc
Confidence            478899999999999999999999863         3689999999999999999999999999999999999554 444


Q ss_pred             cc
Q 035681          129 AD  130 (320)
Q Consensus       129 ~D  130 (320)
                      +.
T Consensus        75 ~~   76 (201)
T TIGR02869        75 QS   76 (201)
T ss_pred             Cc
Confidence            43


No 21 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.90  E-value=2.5e-09  Score=94.21  Aligned_cols=127  Identities=23%  Similarity=0.273  Sum_probs=76.0

Q ss_pred             hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCC-CCcceeeeeCC-CCceeEecCCCCCcc
Q 035681          184 DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGG-PEGIIAHAFAP-TNGRFHYDADEPWAV  261 (320)
Q Consensus       184 ~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~-~~g~laha~~P-~~G~i~~d~~e~w~~  261 (320)
                      ..++++..|++.|++.-|+..+|-.  ..|||+|...+. -|  .+.++.|. ....++--|+- ..|--|++.-..-..
T Consensus       104 ~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~-pg--tg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~  178 (236)
T COG5549         104 RWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNP-PG--TGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMY  178 (236)
T ss_pred             hHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCC-CC--CcccccchHHHHHHHHHHhhhccCcccccccccccC
Confidence            4578999999999999777776654  489999987642 22  22222220 00111111110 111112221111111


Q ss_pred             CCccccchhHHHHHHhhhhhhCCC-CCCCCCCcccccccCCCCCCCCHHHHHHHHHHhC
Q 035681          262 GAVQGAFDLETVALHEIGHLLGLH-HSSVEGAIMYPSISAGVTKGLHGDDIQGIRDLYN  319 (320)
Q Consensus       262 ~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~siMyp~~~~~~~~~L~~dDi~~iq~LYg  319 (320)
                      .......++..+|.||+|||||+. ||+-+ +.|.+..+   ...++..|++.+..+|-
T Consensus       179 ~pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s---~~~is~rdv~tL~r~Ye  233 (236)
T COG5549         179 PPGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS---WQGISRRDVKTLERKYE  233 (236)
T ss_pred             CcccchhhhhHHHHHhhcchheecccccch-hhhhcchh---hcccCHHHHHHHHHHhc
Confidence            112234578999999999999997 97754 55655444   34599999999999994


No 22 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=98.31  E-value=1.1e-06  Score=89.54  Aligned_cols=84  Identities=19%  Similarity=0.272  Sum_probs=65.8

Q ss_pred             chhhhhhcccCCCCCchhcHHHHHHHHHhcCCCCCCCc---CCC--------------------------------CCCC
Q 035681           41 PFKFIEHLKGCHKGDKIKGIHELKEYMEKFGYLNYDNY---KNQ--------------------------------STHA   85 (320)
Q Consensus        41 ~~~~~~~~~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~---~~~--------------------------------~~~~   85 (320)
                      +|..+..-..++.||..+.|..|.+.|...|++.....   ..+                                ....
T Consensus       225 ~Wp~i~~~~~LrpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~  304 (608)
T PRK10594        225 PWPQLTGKATLRPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPA  304 (608)
T ss_pred             CCcccCCCCCCCCCCCCchHHHHHHHHHHcCCcccccccccccccccccccccccccccccccccccccccccccccccc
Confidence            56666656689999999999999999999999853210   000                                0001


Q ss_pred             CCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhccC
Q 035681           86 DDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTIP  124 (320)
Q Consensus        86 ~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~P  124 (320)
                      .+..||+++.+|||+||+.+||+.+|++++.|+..|+.|
T Consensus       305 ~~~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        305 VRAAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             cccccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            245699999999999999999999999999999998755


No 23 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=97.84  E-value=2.5e-05  Score=67.89  Aligned_cols=69  Identities=16%  Similarity=0.205  Sum_probs=60.0

Q ss_pred             ccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCC-CcccccCHHHHHhhccCC
Q 035681           49 KGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHL-KSTGVLDANTVAKMTIPR  125 (320)
Q Consensus        49 ~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L-~~TG~ld~~T~~~m~~PR  125 (320)
                      .....+..++.|..+|..|...||...        ...++.|+..++.||++||+.++| +++|+++++|+..+...+
T Consensus        37 ~~~~~~~~~~~v~~lq~~L~~~g~~~~--------~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~~  106 (185)
T COG3409          37 PVLTLGAEGPSVRILQAALNALGYYPD--------GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQL  106 (185)
T ss_pred             cccccCCCCchHHHHHHHHHhcCCCCC--------CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHHh
Confidence            345566789999999999999999972        127899999999999999999999 899999999999887753


No 24 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.73  E-value=0.00022  Score=62.85  Aligned_cols=48  Identities=29%  Similarity=0.308  Sum_probs=34.4

Q ss_pred             hhHHHHHHhhhhhhCCCCCCCC----------CCcccccccCCCCCCCCHHHHHHHHH
Q 035681          269 DLETVALHEIGHLLGLHHSSVE----------GAIMYPSISAGVTKGLHGDDIQGIRD  316 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~~----------~siMyp~~~~~~~~~L~~dDi~~iq~  316 (320)
                      ....|++|||||.||+.|....          .-||.|.........++.--+..|+.
T Consensus       132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~~~~~~FS~CS~~~i~~  189 (192)
T cd04267         132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSGLNSYRFSQCSIGSIRE  189 (192)
T ss_pred             eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccCCCCCccChhhHHHHHH
Confidence            4567999999999999997652          24899877631234677766666654


No 25 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.67  E-value=0.00017  Score=63.90  Aligned_cols=113  Identities=25%  Similarity=0.371  Sum_probs=72.8

Q ss_pred             cccceEeccC--CC---c----hhHHHHHHHHHH---HhhhcCcceEEEecCCCCCceEEEeeeCCCCCC--CCCCCCCC
Q 035681          170 KFSLTYAFRP--GT---R----SDAITPVARAFQ---TWAANTQFKFSKIEDFANADIKISFESGDHGDG--NPFDGRGG  235 (320)
Q Consensus       170 k~~Lty~~~~--~~---~----~~~~~~i~~A~~---~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~--~~fdG~g~  235 (320)
                      .+.++|++.-  +.   +    .+..+.|...+.   -|.....+.|+.|.+. .+|++|...+..--+.  .+.+-.  
T Consensus        28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~~--  104 (203)
T PF11350_consen   28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDTS--  104 (203)
T ss_pred             CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCcC--
Confidence            3678998873  21   1    233344444443   4999899999999976 4999999887654332  122221  


Q ss_pred             CCcceeeeeCCCCceeEecCCCCCccCCccc----cchhHHHHHHhhhhhhCCCCCCC
Q 035681          236 PEGIIAHAFAPTNGRFHYDADEPWAVGAVQG----AFDLETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       236 ~~g~laha~~P~~G~i~~d~~e~w~~~~~~~----~~~l~~Va~HEiGHaLGL~Hs~~  289 (320)
                        +. ...+.|..+++.+|. ..|..+...-    +...+-|+=||.||+||-+|..=
T Consensus       105 --~e-~SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~C  158 (203)
T PF11350_consen  105 --GE-TSCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPC  158 (203)
T ss_pred             --ce-eEeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcC
Confidence              22 233444577888877 4687765432    23456799999999999999764


No 26 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=97.55  E-value=6e-05  Score=67.72  Aligned_cols=46  Identities=30%  Similarity=0.290  Sum_probs=31.3

Q ss_pred             HHHHHHhhhhhhCCCCCCCC--------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681          271 ETVALHEIGHLLGLHHSSVE--------------GAIMYPSISAGVTKGLHGDDIQGIRDL  317 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~~~--------------~siMyp~~~~~~~~~L~~dDi~~iq~L  317 (320)
                      ..|++|||||+|||.|..+.              .+||.+.-. ....-++.-+|.-|+..
T Consensus       138 ~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~-~~~~~FS~~~i~~i~~~  197 (206)
T PF13583_consen  138 YQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASR-PSRPYFSPPSIEYIREV  197 (206)
T ss_pred             chHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCC-CCCCCCCchhHHHHhCC
Confidence            46799999999999997642              268863222 22234778888877643


No 27 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.54  E-value=0.00014  Score=72.66  Aligned_cols=71  Identities=20%  Similarity=0.250  Sum_probs=60.7

Q ss_pred             hhcccCCCCCchhcHHHHHHHHHhcCC-CCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhcc
Q 035681           46 EHLKGCHKGDKIKGIHELKEYMEKFGY-LNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMTI  123 (320)
Q Consensus        46 ~~~~~~~~g~~~~~v~~l~~yL~~fGY-l~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~~  123 (320)
                      ..-..++.|+....|..|.+.|..-|= ++..       ......||.++.+|||+||+.+||+.+|++++.|++.|.-
T Consensus       227 ~~~~~LrpG~~~~~v~aL~~~L~~~~~d~~~a-------~~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~  298 (561)
T COG2989         227 PAGALLRPGVTSPDVPALRARLARSGMDLPSA-------AGSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV  298 (561)
T ss_pred             CCccccCCCCCchhHHHHHHHHHhcCccchhh-------ccCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence            555678999999999999999999984 3322       2244589999999999999999999999999999999964


No 28 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.43  E-value=6.7e-05  Score=64.94  Aligned_cols=30  Identities=43%  Similarity=0.586  Sum_probs=26.2

Q ss_pred             hHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681          270 LETVALHEIGHLLGLHHSSVEGAIMYPSIS  299 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~  299 (320)
                      +..=++||+||+|||.|=+++.-||++..+
T Consensus       124 v~KEv~HElGH~~GL~HC~N~~CVM~FSnS  153 (181)
T COG1913         124 VVKEVLHELGHLLGLSHCPNPRCVMNFSNS  153 (181)
T ss_pred             HHHHHHHHhhhhcCcccCCCCCcEEeCCcc
Confidence            445689999999999999999999998754


No 29 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.40  E-value=0.00011  Score=63.14  Aligned_cols=23  Identities=35%  Similarity=0.642  Sum_probs=15.6

Q ss_pred             cchhHHHHHHhhhhhhCCCCCCC
Q 035681          267 AFDLETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       267 ~~~l~~Va~HEiGHaLGL~Hs~~  289 (320)
                      ..+...|++||+||-|||.|...
T Consensus        66 ~~~~g~TltHEvGH~LGL~HtF~   88 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLGLYHTFG   88 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT---TT-
T ss_pred             ccccccchhhhhhhhhccccccc
Confidence            34667899999999999999864


No 30 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.35  E-value=0.00051  Score=61.53  Aligned_cols=116  Identities=18%  Similarity=0.269  Sum_probs=63.2

Q ss_pred             HHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCCCCCCCCCCCCcceeeeeCCCCceeEecCCCCCccCCcccc
Q 035681          188 PVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDGNPFDGRGGPEGIIAHAFAPTNGRFHYDADEPWAVGAVQGA  267 (320)
Q Consensus       188 ~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~~P~~G~i~~d~~e~w~~~~~~~~  267 (320)
                      ..-+.|..|.......-.  .+....|+.+-+...+....   ++++   ++++.|+.-  | + ......+.+.. ..+
T Consensus        71 ~~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~---~~~G~A~~g--g-i-C~~~~s~~i~~-~~~  137 (207)
T cd04273          71 KSLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNC---DTLGLAPVG--G-M-CSPSRSCSINE-DTG  137 (207)
T ss_pred             HHHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCC---CceEEeccc--c-C-CCCCcceEEEc-CCC
Confidence            346889999774332110  11235777766665433211   3555   777777641  1 1 00000111110 012


Q ss_pred             chhHHHHHHhhhhhhCCCCCCC---------CCCcccccccCC-CCCCCCHHHHHHHHH
Q 035681          268 FDLETVALHEIGHLLGLHHSSV---------EGAIMYPSISAG-VTKGLHGDDIQGIRD  316 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs~~---------~~siMyp~~~~~-~~~~L~~dDi~~iq~  316 (320)
                      .....|++|||||.||+.|-.+         ..-||+|..... ....++.-=+..++.
T Consensus       138 ~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~  196 (207)
T cd04273         138 LSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTS  196 (207)
T ss_pred             ceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHH
Confidence            3345799999999999999765         268999988742 233566544444443


No 31 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.28  E-value=0.00012  Score=64.04  Aligned_cols=47  Identities=28%  Similarity=0.255  Sum_probs=19.3

Q ss_pred             hHHHHHHhhhhhhCCCCCCCC-------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681          270 LETVALHEIGHLLGLHHSSVE-------------GAIMYPSISAGVTKGLHGDDIQGIRDL  317 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~-------------~siMyp~~~~~~~~~L~~dDi~~iq~L  317 (320)
                      ...|++|||||.||+.|..+.             ..||++.... ....++.--|..|+++
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~~~-~~~~fS~cS~~~i~~~  170 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAGIC-NNDKFSPCSIRQIRAV  170 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS-GG-GG--------------
T ss_pred             eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCCCC-CCcccccccccccccc
Confidence            457899999999999998654             2789865321 2236777777777654


No 32 
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=97.21  E-value=0.00071  Score=61.97  Aligned_cols=58  Identities=21%  Similarity=0.339  Sum_probs=49.4

Q ss_pred             CCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCC-CcccccCHHHHHhhc
Q 035681           54 GDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHL-KSTGVLDANTVAKMT  122 (320)
Q Consensus        54 g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L-~~TG~ld~~T~~~m~  122 (320)
                      +-+..+|..||+.|.+|||..+          . +.||..++.+|++||.-|+= ..+|+.|.+|++.+.
T Consensus       194 ~~~~~~v~~lq~~L~~YGY~v~----------~-~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l~  252 (257)
T COG3023         194 LLKGEDVAALQEMLARYGYGVE----------I-GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAILQ  252 (257)
T ss_pred             hhccCCHHHHHHHHHHhCcCCC----------c-chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHHH
Confidence            3344889999999999999862          3 78999999999999998875 569999999998763


No 33 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.20  E-value=0.00024  Score=62.55  Aligned_cols=29  Identities=34%  Similarity=0.494  Sum_probs=25.2

Q ss_pred             hHHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681          270 LETVALHEIGHLLGLHHSSVEGAIMYPSI  298 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~  298 (320)
                      +..+++||+||.|||.|-.++.-+|+..-
T Consensus       125 ~~k~~~HElGH~lGL~HC~~~~CvM~~s~  153 (179)
T PRK13267        125 VRKEVTHELGHTLGLEHCDNPRCVMNFSN  153 (179)
T ss_pred             HHHHHHHHHHHHcCCccCCCCCccCCCCC
Confidence            45679999999999999999999999643


No 34 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.08  E-value=0.00025  Score=62.62  Aligned_cols=23  Identities=35%  Similarity=0.459  Sum_probs=14.9

Q ss_pred             chhHHHHHHhhhhhhCCCCCCCC
Q 035681          268 FDLETVALHEIGHLLGLHHSSVE  290 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs~~~  290 (320)
                      .....|++|||||.||+.|..+.
T Consensus       140 ~~~~~~~AHEiGH~lGa~HD~~~  162 (196)
T PF13688_consen  140 YNGAITFAHEIGHNLGAPHDGDY  162 (196)
T ss_dssp             HHHHHHHHHHHHHHTT-----SS
T ss_pred             CceehhhHHhHHHhcCCCCCCCC
Confidence            34567999999999999997764


No 35 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=97.07  E-value=0.00032  Score=62.36  Aligned_cols=30  Identities=40%  Similarity=0.559  Sum_probs=20.8

Q ss_pred             hhHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681          269 DLETVALHEIGHLLGLHHSSVEGAIMYPSIS  299 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~  299 (320)
                      .+..+++||+||++||.|=.+ .-+|++..+
T Consensus       144 R~~Kea~HElGH~~GL~HC~~-~CvM~~s~s  173 (194)
T PF07998_consen  144 RVCKEAVHELGHLFGLDHCEN-RCVMNFSNS  173 (194)
T ss_dssp             HHHHHHHHHHHHHTT----SS-TSTTS--SS
T ss_pred             HHHHHHHHHHHHHcCCcCCCC-CCccCCCCC
Confidence            456799999999999999999 999998643


No 36 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.06  E-value=0.00022  Score=58.12  Aligned_cols=17  Identities=53%  Similarity=0.786  Sum_probs=12.1

Q ss_pred             HHHHHHhhhhhhCCCCC
Q 035681          271 ETVALHEIGHLLGLHHS  287 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs  287 (320)
                      ..|++|||||.|||.|+
T Consensus       108 ~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  108 VDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             TTHHHHHHHHHTT----
T ss_pred             ceEeeehhhHhcCCCCC
Confidence            47999999999999995


No 37 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.80  E-value=0.0011  Score=60.43  Aligned_cols=45  Identities=36%  Similarity=0.402  Sum_probs=32.0

Q ss_pred             HHHHHhhhhhhCCCCCCCC-------------------------CCcccccccCCCCCCCCHHHHHHHHHH
Q 035681          272 TVALHEIGHLLGLHHSSVE-------------------------GAIMYPSISAGVTKGLHGDDIQGIRDL  317 (320)
Q Consensus       272 ~Va~HEiGHaLGL~Hs~~~-------------------------~siMyp~~~~~~~~~L~~dDi~~iq~L  317 (320)
                      .|++|||||.||+.|-.+.                         .=||+|.... ..+.++.-=+.-|+.+
T Consensus       147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~  216 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSL  216 (228)
T ss_pred             eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-CCCccChhhHHHHHHH
Confidence            6999999999999996431                         1199998764 3446776666666544


No 38 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.77  E-value=0.0016  Score=57.44  Aligned_cols=31  Identities=39%  Similarity=0.546  Sum_probs=25.0

Q ss_pred             hhHHHHHHhhhhhhCCCCCCC------CCCccccccc
Q 035681          269 DLETVALHEIGHLLGLHHSSV------EGAIMYPSIS  299 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~------~~siMyp~~~  299 (320)
                      ..-.|++|||||.||+.|-.+      ..-||+|...
T Consensus       130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~  166 (194)
T cd04269         130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPS  166 (194)
T ss_pred             HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCC
Confidence            445799999999999999543      3679999866


No 39 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.68  E-value=0.00035  Score=63.70  Aligned_cols=24  Identities=46%  Similarity=0.800  Sum_probs=20.8

Q ss_pred             chhHHHHHHhhhhhhCCCCCCCCC
Q 035681          268 FDLETVALHEIGHLLGLHHSSVEG  291 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs~~~~  291 (320)
                      +++..|++||+||.|||.|+....
T Consensus       135 ~n~g~t~~HEvGH~lGL~HtF~~~  158 (225)
T cd04275         135 YNLGDTATHEVGHWLGLYHTFQGG  158 (225)
T ss_pred             ccccceeEEeccceeeeeeeecCC
Confidence            577789999999999999987554


No 40 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=96.55  E-value=0.0056  Score=46.10  Aligned_cols=55  Identities=22%  Similarity=0.375  Sum_probs=44.8

Q ss_pred             hhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCC----cccccCHHHHHh
Q 035681           57 IKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLK----STGVLDANTVAK  120 (320)
Q Consensus        57 ~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~----~TG~ld~~T~~~  120 (320)
                      .+-+.++|+-|++.||+..+         .++.+++.+++|+++|+..-||.    ..|.+|...++-
T Consensus        15 ~~~~~evq~~L~~lGyy~g~---------~~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~   73 (74)
T PF08823_consen   15 GDVAREVQEALKRLGYYKGE---------ADGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAY   73 (74)
T ss_pred             HHHHHHHHHHHHHcCCccCC---------CCCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhh
Confidence            34578999999999998843         77889999999999999988874    356688877654


No 41 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=96.50  E-value=0.0054  Score=53.06  Aligned_cols=62  Identities=24%  Similarity=0.310  Sum_probs=53.5

Q ss_pred             CCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcCCCcccccCHHHHHhhc
Q 035681           53 KGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYHLKSTGVLDANTVAKMT  122 (320)
Q Consensus        53 ~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~L~~TG~ld~~T~~~m~  122 (320)
                      .+.....+..++.-+...+|...        ...++.|+..++.||+.||+.++|+++|++...|...+.
T Consensus       123 ~~~~~~~~~~~~~~~~~~~~~~~--------~~~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~  184 (185)
T COG3409         123 LGLGGGDVATLQQPLPLLGYRSG--------IRVDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR  184 (185)
T ss_pred             ccccccchHHHHHHHHhcccccC--------CCCCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence            36677778888888888888774        237899999999999999999999999999999988763


No 42 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.14  E-value=0.009  Score=52.93  Aligned_cols=48  Identities=27%  Similarity=0.514  Sum_probs=28.9

Q ss_pred             chhHHHHHHhhhhhhCCCCCCC--------CCCcccccccCCCCCCCCHHHHHHHH
Q 035681          268 FDLETVALHEIGHLLGLHHSSV--------EGAIMYPSISAGVTKGLHGDDIQGIR  315 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs~~--------~~siMyp~~~~~~~~~L~~dDi~~iq  315 (320)
                      ..+-.+++|||||.||+.|-..        ..-||.|.........++.-=+..++
T Consensus       129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~fS~CS~~~~~  184 (199)
T PF01421_consen  129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKFSNCSRRQFE  184 (199)
T ss_dssp             HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSSTSSS-HHHHHHHH
T ss_pred             HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCCCCCcCHHHHHHHH
Confidence            3556799999999999999655        35789996653333356554444433


No 43 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.14  E-value=0.0075  Score=55.59  Aligned_cols=46  Identities=28%  Similarity=0.416  Sum_probs=31.0

Q ss_pred             hHHHHHHhhhhhhCCCCCCC-----C------CCcccccccCC---CCCCCCHHHHHHHH
Q 035681          270 LETVALHEIGHLLGLHHSSV-----E------GAIMYPSISAG---VTKGLHGDDIQGIR  315 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~-----~------~siMyp~~~~~---~~~~L~~dDi~~iq  315 (320)
                      ...|++|||||.||+.|-..     +      .=||+|....+   ....++.-=++.++
T Consensus       167 ~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~  226 (244)
T cd04270         167 SDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSIS  226 (244)
T ss_pred             HHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHH
Confidence            45799999999999999655     1      34899986432   12356654444443


No 44 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.79  E-value=0.011  Score=53.32  Aligned_cols=47  Identities=26%  Similarity=0.268  Sum_probs=29.8

Q ss_pred             hhHHHHHHhhhhhhCCCCCCC------------------CCCcccccccCCCCCCCCHHHHHHHH
Q 035681          269 DLETVALHEIGHLLGLHHSSV------------------EGAIMYPSISAGVTKGLHGDDIQGIR  315 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~------------------~~siMyp~~~~~~~~~L~~dDi~~iq  315 (320)
                      ....+++|||||.||+.|-.+                  ..-||.|.........++.--+.-++
T Consensus       144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~fS~CS~~~~~  208 (220)
T cd04272         144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNGERQYRFSQCSQRQIR  208 (220)
T ss_pred             ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCCCCCCccCcchHHHHH
Confidence            346899999999999999532                  24689986442122345554444443


No 45 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.07  E-value=0.031  Score=53.50  Aligned_cols=18  Identities=44%  Similarity=0.771  Sum_probs=16.5

Q ss_pred             HHHHHHhhhhhhCCCCCC
Q 035681          271 ETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~  288 (320)
                      ..|+.||+||-|||.|+.
T Consensus       151 ~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  151 WATIMHELGHNLGLWHAG  168 (314)
T ss_pred             HHHHHHHhhhhccccccC
Confidence            469999999999999995


No 46 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=93.93  E-value=0.027  Score=53.51  Aligned_cols=20  Identities=35%  Similarity=0.527  Sum_probs=14.9

Q ss_pred             hhHHHHHHhhhhhhCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      ....+.+||+||++||+|.+
T Consensus       192 s~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  192 SYGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             -SHHHHHHHHHHTTT--SST
T ss_pred             CccceeehhhhhhcCCCCCC
Confidence            45789999999999999944


No 47 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=93.22  E-value=0.052  Score=53.86  Aligned_cols=26  Identities=42%  Similarity=0.583  Sum_probs=20.7

Q ss_pred             HHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681          271 ETVALHEIGHLLGLHHSSVEGAIMYPSI  298 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~~~~siMyp~~  298 (320)
                      ....+|||||++|+.|..+  .||---|
T Consensus       317 iGA~lHEiGH~fg~pH~~~--GIM~Rdy  342 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQED--GIMLRDY  342 (423)
T ss_pred             hHHHHHHHHHhcCCCCCCC--CeeeCCc
Confidence            4578999999999999875  7885333


No 48 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=92.72  E-value=0.11  Score=41.08  Aligned_cols=28  Identities=32%  Similarity=0.435  Sum_probs=18.9

Q ss_pred             CccchhhHHHHHHHHHHHHHhhhhhhhhh
Q 035681            1 MASKAFLSFSIFAVIQIILLSLLSHNYSV   29 (320)
Q Consensus         1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~   29 (320)
                      |++|.+++|.++|++ +||+++-++|+.+
T Consensus         1 MaSK~~llL~l~LA~-lLlisSevaa~~~   28 (95)
T PF07172_consen    1 MASKAFLLLGLLLAA-LLLISSEVAAREL   28 (95)
T ss_pred             CchhHHHHHHHHHHH-HHHHHhhhhhHHh
Confidence            998888777777776 5566555555443


No 49 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=92.61  E-value=0.085  Score=53.93  Aligned_cols=60  Identities=22%  Similarity=0.315  Sum_probs=28.8

Q ss_pred             CCceEEEeeeCCCCCCCCCCCCCCCCcceeeee--------CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhC
Q 035681          212 NADIKISFESGDHGDGNPFDGRGGPEGIIAHAF--------APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLG  283 (320)
Q Consensus       212 ~aDI~I~f~~~~hgd~~~fdG~g~~~g~laha~--------~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLG  283 (320)
                      ++|+.|.......+.           +++|.|.        -|.-|.|.|+...   +... ....+..|++|||.|+||
T Consensus       159 ~~D~vlyV~~~~~~~-----------~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLG  223 (521)
T PF01457_consen  159 NADFVLYVTARPSSS-----------STLAWAAPCQQDSDGRPIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALG  223 (521)
T ss_dssp             S-SEEEEEE----ST-----------T---EEEEEEE-TTS-EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT
T ss_pred             CccEEEEEEEeeccC-----------CceEEEeehhhccCCCCeeEEEEEchhH---ccch-hhhcccceeeeeeeeeee
Confidence            577776655432211           4666554        2455777776532   1110 123567899999999999


Q ss_pred             CCC
Q 035681          284 LHH  286 (320)
Q Consensus       284 L~H  286 (320)
                      +.-
T Consensus       224 Fs~  226 (521)
T PF01457_consen  224 FSS  226 (521)
T ss_dssp             -SH
T ss_pred             ecc
Confidence            975


No 50 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=92.08  E-value=0.04  Score=51.97  Aligned_cols=19  Identities=42%  Similarity=0.464  Sum_probs=16.2

Q ss_pred             HHHHHHhhhhhhCCCCCCC
Q 035681          271 ETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~~  289 (320)
                      ..|++||+||.|||.+--+
T Consensus       166 igv~~HE~gH~lGLPDlY~  184 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYD  184 (286)
T ss_pred             eeeeehhhhcccCCCcccc
Confidence            5799999999999987543


No 51 
>PF11150 DUF2927:  Protein of unknown function (DUF2927);  InterPro: IPR021323  This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. 
Probab=91.19  E-value=4.5  Score=36.59  Aligned_cols=148  Identities=17%  Similarity=0.104  Sum_probs=80.7

Q ss_pred             CCCCCCcccceEeccCCCch---hHHHHHHHHHHHhhhcCcceEEEecCCCCCceEEEeeeCCCCCC---------CCCC
Q 035681          164 PKWPASKFSLTYAFRPGTRS---DAITPVARAFQTWAANTQFKFSKIEDFANADIKISFESGDHGDG---------NPFD  231 (320)
Q Consensus       164 ~kW~~~k~~Lty~~~~~~~~---~~~~~i~~A~~~Ws~v~~l~F~e~~~~~~aDI~I~f~~~~hgd~---------~~fd  231 (320)
                      .||.   ..|+.++....+.   ...+.+..-+++-..++.+.+..++.. +|+|.|.|...+....         ..+.
T Consensus        28 ~Rw~---~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~  103 (213)
T PF11150_consen   28 RRWE---GPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV  103 (213)
T ss_pred             ccCC---CCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence            5897   4799998844333   234566666788888898888888876 8999999987655110         0000


Q ss_pred             CC-----CCCCcceeeeeCCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCC-CCCC-CCCcccccccCCCCC
Q 035681          232 GR-----GGPEGIIAHAFAPTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLH-HSSV-EGAIMYPSISAGVTK  304 (320)
Q Consensus       232 G~-----g~~~g~laha~~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~-~~siMyp~~~~~~~~  304 (320)
                      =+     .....-+..++....+.+.....-. .+.......-+...+..||-.+|||. .|.. +.||-+--   ....
T Consensus       104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFNDd---~~~~  179 (213)
T PF11150_consen  104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFNDD---NEFA  179 (213)
T ss_pred             chhhhcccccceEEEEEEecCCCceEEEEEEE-EEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeCC---Cccc
Confidence            00     0000111222221111111100000 00000112356778888999999995 3332 23332211   1233


Q ss_pred             CCCHHHHHHHHHHhC
Q 035681          305 GLHGDDIQGIRDLYN  319 (320)
Q Consensus       305 ~L~~dDi~~iq~LYg  319 (320)
                      .|+.-|..=|+.||-
T Consensus       180 ~LT~~D~lLLr~LYd  194 (213)
T PF11150_consen  180 LLTGFDELLLRMLYD  194 (213)
T ss_pred             cccHHHHHHHHHHcC
Confidence            789999999999994


No 52 
>PTZ00337 surface protease GP63; Provisional
Probab=90.33  E-value=0.25  Score=50.99  Aligned_cols=36  Identities=28%  Similarity=0.371  Sum_probs=24.8

Q ss_pred             CCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681          246 PTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS  287 (320)
Q Consensus       246 P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs  287 (320)
                      |.-|.|.|+....=      .......|++|||.||||+...
T Consensus       211 Pi~G~in~np~~i~------~~~~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        211 PFAAAVNFDPRQIA------VTNGDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             ceEEEEEECHHHcc------chhHHHHHHHHHHHHHHccCHH
Confidence            56677877763321      1124678999999999999653


No 53 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=89.53  E-value=0.16  Score=41.56  Aligned_cols=18  Identities=44%  Similarity=0.822  Sum_probs=15.4

Q ss_pred             hhHHHHHHhhhhhhCCCC
Q 035681          269 DLETVALHEIGHLLGLHH  286 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~H  286 (320)
                      .+..|++|||||-.||.-
T Consensus       108 ~vthvliHEIgHhFGLsD  125 (136)
T COG3824         108 QVTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             HhhhhhhhhhhhhcCCCh
Confidence            456899999999999964


No 54 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=88.97  E-value=0.37  Score=35.92  Aligned_cols=26  Identities=27%  Similarity=0.376  Sum_probs=20.4

Q ss_pred             HHHHHHHHhcC----CCcccccCHHHHHhh
Q 035681           96 SAIKTYQLNYH----LKSTGVLDANTVAKM  121 (320)
Q Consensus        96 ~Aik~fQ~~~~----L~~TG~ld~~T~~~m  121 (320)
                      .|+|-.|+..|    +.++|++++.|++.+
T Consensus         1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al   30 (72)
T PF09374_consen    1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL   30 (72)
T ss_dssp             HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred             CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence            48899999999    999999999999999


No 55 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=88.76  E-value=0.24  Score=48.87  Aligned_cols=26  Identities=35%  Similarity=0.521  Sum_probs=20.4

Q ss_pred             HHHHHHhhhhhhCCCCCCCCCCcccccc
Q 035681          271 ETVALHEIGHLLGLHHSSVEGAIMYPSI  298 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~~~~siMyp~~  298 (320)
                      ...++|||||.||+.|.+  +.||--.|
T Consensus       302 lGA~~HElGH~lgcpHq~--~GvMlrgy  327 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQS--EGVMLRGY  327 (614)
T ss_pred             HHHHHHHhhhccCCCCCC--CceEecCc
Confidence            457899999999999976  56884443


No 56 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=88.18  E-value=0.73  Score=41.85  Aligned_cols=18  Identities=39%  Similarity=0.689  Sum_probs=13.3

Q ss_pred             hhHHHHHHhhhhhhCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      .-..|++||.|||+  .|..
T Consensus        88 aAvaVAAHEvGHAi--Q~a~  105 (222)
T PF04298_consen   88 AAVAVAAHEVGHAI--QHAE  105 (222)
T ss_pred             HHHHHHHHHHhHHH--hccc
Confidence            34579999999994  5543


No 57 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=87.81  E-value=0.72  Score=47.88  Aligned_cols=38  Identities=21%  Similarity=0.282  Sum_probs=24.8

Q ss_pred             CCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCC
Q 035681          246 PTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHS  287 (320)
Q Consensus       246 P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs  287 (320)
                      |.-|.+.|+....-   . ........|++|||.|+||+.-+
T Consensus       236 Pi~G~iNinp~~i~---s-~~~~~~~rv~~HEi~HALGFS~~  273 (622)
T PTZ00257        236 PAVGVMNIPAANIV---S-RYDQGTTRTVTHEVAHALGFSSV  273 (622)
T ss_pred             ceEEEEeeCHHHCC---C-ccchHHHHHHHHHHHHHhcCCHH
Confidence            56677877753211   0 01124678999999999999543


No 58 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=87.29  E-value=0.41  Score=49.61  Aligned_cols=46  Identities=28%  Similarity=0.419  Sum_probs=34.4

Q ss_pred             HHHHHHhhhhhhCCCCCCC----------CCCcccccccCCCC---CCCCHHHHHHHHH
Q 035681          271 ETVALHEIGHLLGLHHSSV----------EGAIMYPSISAGVT---KGLHGDDIQGIRD  316 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs~~----------~~siMyp~~~~~~~---~~L~~dDi~~iq~  316 (320)
                      .-|++|||||..|=.|-++          -+=|||++...|++   .+++.--+..|..
T Consensus       393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~  451 (764)
T KOG3658|consen  393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISK  451 (764)
T ss_pred             heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHH
Confidence            4589999999999999665          13489999887643   3777776666653


No 59 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=85.81  E-value=1  Score=40.12  Aligned_cols=17  Identities=12%  Similarity=0.173  Sum_probs=13.6

Q ss_pred             hHHHHHHHHHHHhhhcC
Q 035681          184 DAITPVARAFQTWAANT  200 (320)
Q Consensus       184 ~~~~~i~~A~~~Ws~v~  200 (320)
                      ..+.-+..++++|++|.
T Consensus        19 wAQ~kvks~y~KySkV~   35 (226)
T COG2738          19 WAQSKVKSTYKKYSKVR   35 (226)
T ss_pred             HHHHHHHHHHHHHhhcc
Confidence            45667889999999984


No 60 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=85.69  E-value=0.48  Score=37.52  Aligned_cols=16  Identities=31%  Similarity=0.692  Sum_probs=12.8

Q ss_pred             hHHHHHHhhhhhhCCC
Q 035681          270 LETVALHEIGHLLGLH  285 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~  285 (320)
                      +..|++|||||.+|+.
T Consensus        73 I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   73 IRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHcCCC
Confidence            5679999999999985


No 61 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=85.67  E-value=0.44  Score=44.50  Aligned_cols=19  Identities=37%  Similarity=0.746  Sum_probs=12.0

Q ss_pred             hhHHHHHHhhhhhh-CCCCC
Q 035681          269 DLETVALHEIGHLL-GLHHS  287 (320)
Q Consensus       269 ~l~~Va~HEiGHaL-GL~Hs  287 (320)
                      ....|++||+||++ ||+-.
T Consensus       215 ~~~~v~vHE~GHsf~~LaDE  234 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADE  234 (264)
T ss_dssp             THHHHHHHHHHHHTT-----
T ss_pred             cccceeeeeccccccccccc
Confidence            56789999999965 45554


No 62 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=84.05  E-value=0.52  Score=47.16  Aligned_cols=19  Identities=42%  Similarity=0.642  Sum_probs=15.5

Q ss_pred             cchhHHHHHHhhhhhhCCCC
Q 035681          267 AFDLETVALHEIGHLLGLHH  286 (320)
Q Consensus       267 ~~~l~~Va~HEiGHaLGL~H  286 (320)
                      ..++..|++|||||+.| +|
T Consensus       127 esElagViAHEigHv~q-rH  145 (484)
T COG4783         127 ESELAGVIAHEIGHVAQ-RH  145 (484)
T ss_pred             HHHHHHHHHHHHHHHhh-hh
Confidence            34789999999999976 45


No 63 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=83.30  E-value=0.22  Score=52.05  Aligned_cols=17  Identities=47%  Similarity=0.532  Sum_probs=14.7

Q ss_pred             HHHHHHhhhhhhCCCCC
Q 035681          271 ETVALHEIGHLLGLHHS  287 (320)
Q Consensus       271 ~~Va~HEiGHaLGL~Hs  287 (320)
                      ..|++||+||.|||.--
T Consensus       222 iGVfaHEfGH~LGLPDl  238 (645)
T PF05547_consen  222 IGVFAHEFGHDLGLPDL  238 (645)
T ss_pred             eEEEEeeccccCCCCCc
Confidence            57999999999999643


No 64 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=83.29  E-value=0.6  Score=49.50  Aligned_cols=20  Identities=40%  Similarity=0.710  Sum_probs=17.7

Q ss_pred             hhHHHHHHhhhhhhCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      ....|++|||||.||+.|-.
T Consensus       322 ~~a~v~AhelgH~lGm~hD~  341 (716)
T KOG3607|consen  322 AFAVVLAHELGHNLGMIHDE  341 (716)
T ss_pred             hHHHHHHHHHHhhcCccccc
Confidence            45679999999999999976


No 65 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=77.33  E-value=3.4  Score=39.22  Aligned_cols=41  Identities=27%  Similarity=0.490  Sum_probs=24.5

Q ss_pred             eCCCCceeEecCCCCCc-cC---CccccchhHHHHHHhhhhh----hCCC
Q 035681          244 FAPTNGRFHYDADEPWA-VG---AVQGAFDLETVALHEIGHL----LGLH  285 (320)
Q Consensus       244 ~~P~~G~i~~d~~e~w~-~~---~~~~~~~l~~Va~HEiGHa----LGL~  285 (320)
                      |-|.+..|.+|... |. +.   ...+..-...|++||.||.    ||+-
T Consensus       141 YCp~D~tIYlD~~f-~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l~Gil  189 (292)
T PF04228_consen  141 YCPADQTIYLDLSF-FDELQQRFGASGDFAQAYVLAHEYGHHVQNLLGIL  189 (292)
T ss_pred             eCCCCCEEEechHH-HHHHHHHhCCccHHHHHHHHHHHHHHHHHHHhhhH
Confidence            45788899998742 21 10   1112233456999999994    5653


No 66 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=76.75  E-value=1.4  Score=40.13  Aligned_cols=14  Identities=36%  Similarity=0.610  Sum_probs=12.2

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      .+..|+.||+||+|
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            57789999999985


No 67 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=76.28  E-value=1.4  Score=39.28  Aligned_cols=15  Identities=40%  Similarity=0.638  Sum_probs=12.4

Q ss_pred             hhHHHHHHhhhhhhC
Q 035681          269 DLETVALHEIGHLLG  283 (320)
Q Consensus       269 ~l~~Va~HEiGHaLG  283 (320)
                      ....|+.||+||++-
T Consensus        21 ~~l~t~~HE~gHal~   35 (200)
T PF13398_consen   21 RLLVTFVHELGHALA   35 (200)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456799999999874


No 68 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=75.93  E-value=1.3  Score=34.63  Aligned_cols=27  Identities=22%  Similarity=0.313  Sum_probs=17.6

Q ss_pred             CceeEecCCCCCccCCccccchhHHHHHHhhhhhh
Q 035681          248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLL  282 (320)
Q Consensus       248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaL  282 (320)
                      ...|.++...        .......+++|||||.+
T Consensus        28 ~~~I~in~~~--------~~~~~~f~laHELgH~~   54 (122)
T PF06114_consen   28 NPIIFINSNL--------SPERQRFTLAHELGHIL   54 (122)
T ss_dssp             TTEEEEESSS---------HHHHHHHHHHHHHHHH
T ss_pred             CCEEEECCCC--------CHHHHHHHHHHHHHHHH
Confidence            4566666531        12345679999999974


No 69 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=74.37  E-value=1.8  Score=38.88  Aligned_cols=30  Identities=40%  Similarity=0.624  Sum_probs=26.1

Q ss_pred             hHHHHHHhhhhhhCCCCCCCCCCccccccc
Q 035681          270 LETVALHEIGHLLGLHHSSVEGAIMYPSIS  299 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~~siMyp~~~  299 (320)
                      ...+.-|++||++||+|.+.+..||+|.+.
T Consensus       153 tal~~~efl~~~~~~~~lnhr~~i~~~pg~  182 (236)
T COG5549         153 TALIAYEFLGHALGLGHLNHRGDIMYPPGE  182 (236)
T ss_pred             HHHHHHHHhhhccCcccccccccccCCccc
Confidence            345777999999999999999999998765


No 70 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=73.98  E-value=1.5  Score=39.68  Aligned_cols=26  Identities=27%  Similarity=0.372  Sum_probs=16.6

Q ss_pred             hHHHHHHhhhhhhCCCCCCCC-CCccccc
Q 035681          270 LETVALHEIGHLLGLHHSSVE-GAIMYPS  297 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~-~siMyp~  297 (320)
                      -.-|++||+||+  |-|.... ...+++.
T Consensus        72 ~rFtlAHELGH~--llH~~~~~~~~~~~~   98 (213)
T COG2856          72 KRFTLAHELGHA--LLHTDLNTRFDAEPT   98 (213)
T ss_pred             HHHHHHHHHhHH--Hhccccchhhhcccc
Confidence            356999999999  4555433 3334444


No 71 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=71.55  E-value=2.3  Score=37.63  Aligned_cols=20  Identities=35%  Similarity=0.506  Sum_probs=15.9

Q ss_pred             chhHHHHHHhhhhhhCCCCCC
Q 035681          268 FDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      ..+..|+.||+||.. .+|..
T Consensus        87 ~el~aVlaHElgH~~-~~h~~  106 (226)
T PF01435_consen   87 DELAAVLAHELGHIK-HRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHHH-TTHCC
T ss_pred             HHHHHHHHHHHHHHH-cCCcc
Confidence            378899999999997 45544


No 72 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=68.55  E-value=2.7  Score=39.14  Aligned_cols=15  Identities=47%  Similarity=0.736  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+++.|+.||+||..
T Consensus       155 dEl~aVlaHElgHi~  169 (302)
T COG0501         155 DELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            478999999999984


No 73 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=68.38  E-value=4.6  Score=38.74  Aligned_cols=37  Identities=27%  Similarity=0.499  Sum_probs=26.6

Q ss_pred             CCCCceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCC
Q 035681          245 APTNGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       245 ~P~~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~  289 (320)
                      .|=.|+.+.|.+-.        ...+..|++||++|-+|....+.
T Consensus       179 ~PFT~EA~vN~~~p--------~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  179 NPFTGEANVNTDLP--------PYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             cCCcceeecCCCCC--------cccccHHHHHHHHHHhCCCCHHH
Confidence            45566666665322        24667899999999999998753


No 74 
>PRK03982 heat shock protein HtpX; Provisional
Probab=68.32  E-value=2.8  Score=39.50  Aligned_cols=15  Identities=40%  Similarity=0.536  Sum_probs=13.0

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .++..|++|||||.-
T Consensus       123 ~El~AVlAHElgHi~  137 (288)
T PRK03982        123 DELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            368999999999974


No 75 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=68.25  E-value=8.3  Score=37.15  Aligned_cols=15  Identities=33%  Similarity=0.512  Sum_probs=12.4

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+-..++.||+||++
T Consensus       152 ~~~v~tl~HE~GHa~  166 (365)
T cd06258         152 HDDINTLFHEFGHAV  166 (365)
T ss_pred             HHHHHHHHHHHhHHH
Confidence            455678999999998


No 76 
>PRK03001 M48 family peptidase; Provisional
Probab=66.87  E-value=3.1  Score=39.07  Aligned_cols=15  Identities=27%  Similarity=0.483  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .++..|++|||||.-
T Consensus       122 ~El~aVlAHElgHi~  136 (283)
T PRK03001        122 REIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            478999999999974


No 77 
>PRK01345 heat shock protein HtpX; Provisional
Probab=66.49  E-value=3.1  Score=39.83  Aligned_cols=19  Identities=21%  Similarity=0.336  Sum_probs=15.2

Q ss_pred             chhHHHHHHhhhhhhCCCCC
Q 035681          268 FDLETVALHEIGHLLGLHHS  287 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs  287 (320)
                      .++..|++|||||.- -+|.
T Consensus       122 dEL~aVlAHElgHi~-~~d~  140 (317)
T PRK01345        122 EEVAGVMAHELAHVK-NRDT  140 (317)
T ss_pred             HHHHHHHHHHHHHHH-cCCH
Confidence            378999999999986 3553


No 78 
>PRK03072 heat shock protein HtpX; Provisional
Probab=66.31  E-value=3.2  Score=39.17  Aligned_cols=15  Identities=33%  Similarity=0.472  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .++..|++||+||.-
T Consensus       125 ~El~aVlAHElgHi~  139 (288)
T PRK03072        125 RELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            478999999999974


No 79 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=64.51  E-value=3.7  Score=36.78  Aligned_cols=16  Identities=44%  Similarity=0.546  Sum_probs=12.3

Q ss_pred             hhHHHHHHhhhhhhCC
Q 035681          269 DLETVALHEIGHLLGL  284 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL  284 (320)
                      -+..|+.||+||++=-
T Consensus        37 l~~~v~iHElgH~~~A   52 (208)
T cd06161          37 LFLSVLLHELGHALVA   52 (208)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3568999999997533


No 80 
>PRK04897 heat shock protein HtpX; Provisional
Probab=62.75  E-value=4.1  Score=38.63  Aligned_cols=15  Identities=47%  Similarity=0.536  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .++..|++||+||.-
T Consensus       135 ~El~aVlAHElgHi~  149 (298)
T PRK04897        135 EELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            478999999999974


No 81 
>PRK02870 heat shock protein HtpX; Provisional
Probab=61.49  E-value=4.4  Score=39.20  Aligned_cols=14  Identities=36%  Similarity=0.657  Sum_probs=12.8

Q ss_pred             chhHHHHHHhhhhh
Q 035681          268 FDLETVALHEIGHL  281 (320)
Q Consensus       268 ~~l~~Va~HEiGHa  281 (320)
                      .++..|++||+||.
T Consensus       171 dEL~aVlAHELgHi  184 (336)
T PRK02870        171 DELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHHH
Confidence            47899999999998


No 82 
>PRK05457 heat shock protein HtpX; Provisional
Probab=61.05  E-value=4.6  Score=38.07  Aligned_cols=14  Identities=43%  Similarity=0.667  Sum_probs=12.6

Q ss_pred             chhHHHHHHhhhhh
Q 035681          268 FDLETVALHEIGHL  281 (320)
Q Consensus       268 ~~l~~Va~HEiGHa  281 (320)
                      .++..|++||+||.
T Consensus       132 ~El~aVlAHElgHi  145 (284)
T PRK05457        132 DEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHH
Confidence            47899999999996


No 83 
>PRK01265 heat shock protein HtpX; Provisional
Probab=61.00  E-value=4.6  Score=38.90  Aligned_cols=14  Identities=50%  Similarity=0.790  Sum_probs=12.4

Q ss_pred             chhHHHHHHhhhhh
Q 035681          268 FDLETVALHEIGHL  281 (320)
Q Consensus       268 ~~l~~Va~HEiGHa  281 (320)
                      .+++.|++||+||.
T Consensus       138 ~El~aVlAHElgHi  151 (324)
T PRK01265        138 DEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHH
Confidence            47899999999995


No 84 
>PRK02391 heat shock protein HtpX; Provisional
Probab=60.59  E-value=4.7  Score=38.24  Aligned_cols=14  Identities=43%  Similarity=0.667  Sum_probs=12.5

Q ss_pred             chhHHHHHHhhhhh
Q 035681          268 FDLETVALHEIGHL  281 (320)
Q Consensus       268 ~~l~~Va~HEiGHa  281 (320)
                      .++..|++||+||.
T Consensus       131 ~El~aVlaHElgHi  144 (296)
T PRK02391        131 DELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHH
Confidence            47899999999996


No 85 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=59.11  E-value=6  Score=31.09  Aligned_cols=14  Identities=36%  Similarity=0.451  Sum_probs=12.1

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      .+..|++||+.|.+
T Consensus        24 ~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   24 WLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46799999999985


No 86 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=59.08  E-value=5.2  Score=36.41  Aligned_cols=15  Identities=47%  Similarity=0.713  Sum_probs=11.9

Q ss_pred             hHHHHHHhhhhhhCC
Q 035681          270 LETVALHEIGHLLGL  284 (320)
Q Consensus       270 l~~Va~HEiGHaLGL  284 (320)
                      +..|+.||+||++=-
T Consensus        53 ~~~v~iHElgH~~~A   67 (227)
T cd06164          53 FASVLLHELGHSLVA   67 (227)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            467999999998643


No 87 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=58.79  E-value=3  Score=45.28  Aligned_cols=32  Identities=22%  Similarity=0.386  Sum_probs=25.3

Q ss_pred             hhHHHHHHhhhhhhCCCCCCCCC------CcccccccC
Q 035681          269 DLETVALHEIGHLLGLHHSSVEG------AIMYPSISA  300 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~~~------siMyp~~~~  300 (320)
                      ....+++||+||.+|+.|-...+      .||.|....
T Consensus       316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~  353 (845)
T KOG3538|consen  316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG  353 (845)
T ss_pred             ccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence            35579999999999999976533      589888763


No 88 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=56.70  E-value=5  Score=39.34  Aligned_cols=14  Identities=36%  Similarity=0.477  Sum_probs=11.5

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +-..++.||+||++
T Consensus       221 ~~v~tl~HE~GHa~  234 (427)
T cd06459         221 DDVFTLAHELGHAF  234 (427)
T ss_pred             hhHHHHHHHhhHHH
Confidence            44678999999985


No 89 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=55.93  E-value=6.3  Score=37.11  Aligned_cols=13  Identities=38%  Similarity=0.603  Sum_probs=11.0

Q ss_pred             hHHHHHHhhhhhh
Q 035681          270 LETVALHEIGHLL  282 (320)
Q Consensus       270 l~~Va~HEiGHaL  282 (320)
                      +..|++||+||++
T Consensus       135 ~isvvvHElgHal  147 (277)
T cd06162         135 LISGVVHEMGHGV  147 (277)
T ss_pred             HHHHHHHHHHHHH
Confidence            4579999999985


No 90 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=54.99  E-value=13  Score=28.12  Aligned_cols=30  Identities=10%  Similarity=0.214  Sum_probs=19.3

Q ss_pred             CceeEecCCCCCccCCccccchhHHHHHHhhhhhh
Q 035681          248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLL  282 (320)
Q Consensus       248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaL  282 (320)
                      +.+|+|...+. ...    ...-..+++||+.|++
T Consensus        44 G~~I~f~~g~~-~~~----s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen   44 GNDIYFAPGKY-NPD----SPEGRALLAHELAHVV   73 (79)
T ss_pred             CCEEEEcCCCc-CCC----CCCcchhHhHHHHHHH
Confidence            67888865431 111    1233679999999985


No 91 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=54.67  E-value=6.8  Score=36.59  Aligned_cols=16  Identities=25%  Similarity=0.341  Sum_probs=12.1

Q ss_pred             hHHHHHHhhhhhhCCC
Q 035681          270 LETVALHEIGHLLGLH  285 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~  285 (320)
                      ...|++||+||++=-.
T Consensus       118 ~isv~iHElgHa~~Ar  133 (263)
T cd06159         118 VVGVVVHELSHGILAR  133 (263)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3579999999986433


No 92 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=54.64  E-value=7.9  Score=40.06  Aligned_cols=15  Identities=27%  Similarity=0.474  Sum_probs=12.2

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+-..|++||+||++
T Consensus       376 ~~dv~TLaHElGHa~  390 (591)
T TIGR00181       376 LNSVFTLAHELGHSM  390 (591)
T ss_pred             cchHHHHHHHhhhHH
Confidence            345679999999995


No 93 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=54.48  E-value=6.4  Score=38.76  Aligned_cols=15  Identities=47%  Similarity=0.756  Sum_probs=12.1

Q ss_pred             hHHHHHHhhhhhhCC
Q 035681          270 LETVALHEIGHLLGL  284 (320)
Q Consensus       270 l~~Va~HEiGHaLGL  284 (320)
                      +..++.||+||+|-=
T Consensus       131 vi~~vvHElGHalAA  145 (484)
T KOG2921|consen  131 VITVVVHELGHALAA  145 (484)
T ss_pred             HHHHHHHHhhHHHHH
Confidence            457889999999754


No 94 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=54.45  E-value=6.2  Score=39.38  Aligned_cols=24  Identities=13%  Similarity=0.160  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHhhhcCcceEEEecC
Q 035681          186 ITPVARAFQTWAANTQFKFSKIED  209 (320)
Q Consensus       186 ~~~i~~A~~~Ws~v~~l~F~e~~~  209 (320)
                      .+++..-+...+.+=+|+|+++..
T Consensus       137 ~~~~~g~~~l~~~lfgi~~~~~~~  160 (458)
T PF01432_consen  137 EEVLEGLFELAERLFGIRFEEVPD  160 (458)
T ss_dssp             HHHHHHHHHHHHHHHTEEEEECTC
T ss_pred             HHHHHHHHHHHHHHhcEEEEeccc
Confidence            355555566677777888888764


No 95 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=53.88  E-value=5.9  Score=40.76  Aligned_cols=15  Identities=33%  Similarity=0.306  Sum_probs=12.3

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+-..|++||+||++
T Consensus       335 ~~dv~TL~HElGHa~  349 (549)
T TIGR02289       335 SGDIDVLTHEAGHAF  349 (549)
T ss_pred             hhHHHHHHHHhhHHH
Confidence            345679999999997


No 96 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=52.27  E-value=7  Score=39.44  Aligned_cols=14  Identities=36%  Similarity=0.653  Sum_probs=11.6

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +-..|+.||+||+|
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44578999999997


No 97 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=51.62  E-value=8.4  Score=33.95  Aligned_cols=21  Identities=33%  Similarity=0.675  Sum_probs=14.9

Q ss_pred             hHHHHHHhhhhhhCCCCCCCC
Q 035681          270 LETVALHEIGHLLGLHHSSVE  290 (320)
Q Consensus       270 l~~Va~HEiGHaLGL~Hs~~~  290 (320)
                      +..+..||+||++--..-.++
T Consensus        41 ~~~l~iHElgH~~~A~~~G~~   61 (183)
T cd06160          41 LAILGIHEMGHYLAARRHGVK   61 (183)
T ss_pred             HHHHHHHHHHHHHHHHHCCCC
Confidence            456889999998766544443


No 98 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=50.22  E-value=7.3  Score=40.39  Aligned_cols=14  Identities=29%  Similarity=0.399  Sum_probs=11.6

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +-..|++||+||++
T Consensus       374 ~~v~TL~HE~GHa~  387 (587)
T TIGR02290       374 RDVSTLAHELGHAY  387 (587)
T ss_pred             hhHHHHHHHhhHHH
Confidence            34578999999998


No 99 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=49.81  E-value=8.2  Score=32.92  Aligned_cols=12  Identities=33%  Similarity=0.396  Sum_probs=10.1

Q ss_pred             HHHHHHhhhhhh
Q 035681          271 ETVALHEIGHLL  282 (320)
Q Consensus       271 ~~Va~HEiGHaL  282 (320)
                      .-|++|||+|.+
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            459999999974


No 100
>PLN02791 Nudix hydrolase homolog
Probab=48.67  E-value=8.9  Score=41.16  Aligned_cols=21  Identities=24%  Similarity=0.275  Sum_probs=17.4

Q ss_pred             hhHHHHHHhhhhhhCCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~  289 (320)
                      .+.+|++||+||-+|=.+.-.
T Consensus       584 f~~~v~lHElgHGsG~~~~~~  604 (770)
T PLN02791        584 FFTHTICHECCHGIGPHTITL  604 (770)
T ss_pred             HHHHHHHHHhhccccccceec
Confidence            457899999999999977544


No 101
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=48.50  E-value=12  Score=32.77  Aligned_cols=38  Identities=24%  Similarity=0.373  Sum_probs=26.0

Q ss_pred             CceeEecCCCCCccCCccccchhHHHHHHhhhhhhCCCCCCCC
Q 035681          248 NGRFHYDADEPWAVGAVQGAFDLETVALHEIGHLLGLHHSSVE  290 (320)
Q Consensus       248 ~G~i~~d~~e~w~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~  290 (320)
                      .|.|.|+    |.+-..+ ..-+.-|++||+=|..-..||..-
T Consensus       147 ~~~I~ln----~~L~~~P-~~~idYVvvHEL~Hl~~~nHs~~F  184 (205)
T PF01863_consen  147 KGNITLN----WRLVMAP-PEVIDYVVVHELCHLRHPNHSKRF  184 (205)
T ss_pred             CCcEEee----cccccCC-ccHHHHHHHHHHHHhccCCCCHHH
Confidence            5667765    3332111 124678999999999999998753


No 102
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=48.45  E-value=8.7  Score=38.63  Aligned_cols=24  Identities=4%  Similarity=-0.082  Sum_probs=17.3

Q ss_pred             HHHHHHHHHHhhhcCcceEEEecC
Q 035681          186 ITPVARAFQTWAANTQFKFSKIED  209 (320)
Q Consensus       186 ~~~i~~A~~~Ws~v~~l~F~e~~~  209 (320)
                      -.+++--|..-+.+=+|+|++++.
T Consensus       133 ~~vl~gl~~~~~~lfgi~~~~~~~  156 (458)
T cd06457         133 GTVMEGLSRLFSRLYGIRLVPVPL  156 (458)
T ss_pred             HHHHHHHHHHHHHHhCeEEEecCC
Confidence            355566666677788999999763


No 103
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=46.60  E-value=9.7  Score=37.93  Aligned_cols=15  Identities=47%  Similarity=0.649  Sum_probs=12.1

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+-..|+.||+||+|
T Consensus       206 ~~~v~tLfHEfGHal  220 (422)
T cd06456         206 HDEVTTLFHEFGHAL  220 (422)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            344578999999998


No 104
>PF11337 DUF3139:  Protein of unknown function (DUF3139);  InterPro: IPR021486  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=46.07  E-value=42  Score=25.49  Aligned_cols=23  Identities=22%  Similarity=0.338  Sum_probs=18.1

Q ss_pred             CCCCchhcHHHHHHHHHhcCCCC
Q 035681           52 HKGDKIKGIHELKEYMEKFGYLN   74 (320)
Q Consensus        52 ~~g~~~~~v~~l~~yL~~fGYl~   74 (320)
                      ++-++.....++.+||++-||-.
T Consensus        29 ~~~~k~~~~k~i~~yL~e~gy~e   51 (85)
T PF11337_consen   29 NPYQKHKAEKAIDWYLQEQGYKE   51 (85)
T ss_pred             chhhHHHHHHHHHHHHHHcCCcH
Confidence            44466677889999999999864


No 105
>PF01434 Peptidase_M41:  Peptidase family M41 This is family M41 in the peptidase classification. ;  InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=45.39  E-value=12  Score=33.52  Aligned_cols=15  Identities=33%  Similarity=0.543  Sum_probs=12.8

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .+...+|.||-||+|
T Consensus        26 ~~~~~~A~HEAGhAv   40 (213)
T PF01434_consen   26 EEKRRIAYHEAGHAV   40 (213)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            467889999999975


No 106
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=42.57  E-value=11  Score=37.37  Aligned_cols=12  Identities=42%  Similarity=0.844  Sum_probs=10.1

Q ss_pred             HHHHHHhhhhhh
Q 035681          271 ETVALHEIGHLL  282 (320)
Q Consensus       271 ~~Va~HEiGHaL  282 (320)
                      ..|++||+||.|
T Consensus        15 ~~v~~HE~gH~~   26 (420)
T TIGR00054        15 VLIFVHELGHFL   26 (420)
T ss_pred             HHHHHHhHHHHH
Confidence            469999999964


No 107
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=42.37  E-value=12  Score=37.59  Aligned_cols=12  Identities=33%  Similarity=0.744  Sum_probs=9.7

Q ss_pred             HHHHHHhhhhhh
Q 035681          271 ETVALHEIGHLL  282 (320)
Q Consensus       271 ~~Va~HEiGHaL  282 (320)
                      .-|++||+||-|
T Consensus        16 ~li~vHElGHfl   27 (449)
T PRK10779         16 VLITVHEFGHFW   27 (449)
T ss_pred             HHHHHHHHHHHH
Confidence            358899999964


No 108
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=42.14  E-value=12  Score=37.12  Aligned_cols=12  Identities=42%  Similarity=0.606  Sum_probs=9.9

Q ss_pred             HHHHHHhhhhhh
Q 035681          271 ETVALHEIGHLL  282 (320)
Q Consensus       271 ~~Va~HEiGHaL  282 (320)
                      .-+++||+||+|
T Consensus       160 l~t~iHE~GHal  171 (396)
T cd06460         160 LFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHhhHHH
Confidence            457899999985


No 109
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=41.96  E-value=13  Score=36.98  Aligned_cols=19  Identities=42%  Similarity=0.630  Sum_probs=14.5

Q ss_pred             chhHHHHHHhhhhhhCCCCC
Q 035681          268 FDLETVALHEIGHLLGLHHS  287 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~Hs  287 (320)
                      .++..|+.||+||- =++|.
T Consensus       278 eel~AVl~HELGHW-~~~H~  296 (428)
T KOG2719|consen  278 EELVAVLAHELGHW-KLNHV  296 (428)
T ss_pred             HHHHHHHHHHhhHH-HHhhH
Confidence            46889999999996 24454


No 110
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=40.06  E-value=46  Score=31.69  Aligned_cols=61  Identities=20%  Similarity=0.334  Sum_probs=43.6

Q ss_pred             hhhhhhcccCCCCCchhcHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHh
Q 035681           42 FKFIEHLKGCHKGDKIKGIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLN  104 (320)
Q Consensus        42 ~~~~~~~~~~~~g~~~~~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~  104 (320)
                      .+++..-.||=.||.+.+++.+--+|..+||.++.++.|=  +..+..--..++...+.||--
T Consensus       297 l~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~anpPnY--gkpd~~v~~nt~~v~k~f~~~  357 (378)
T KOG3988|consen  297 LEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYANPPNY--GKPDPDVIYNTKRVYKIFNLP  357 (378)
T ss_pred             HHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCCCCCC--CCCCHHHHHHHHHHHHHhccH
Confidence            6777778899999999999999999999999987643221  223332235566666655543


No 111
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=39.53  E-value=13  Score=38.73  Aligned_cols=14  Identities=29%  Similarity=0.332  Sum_probs=11.6

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +-..|++||+||++
T Consensus       379 ~dV~TLaHElGHs~  392 (598)
T COG1164         379 RDVFTLAHELGHSV  392 (598)
T ss_pred             hHHHHHHHHccHHH
Confidence            44679999999985


No 112
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=39.28  E-value=21  Score=34.78  Aligned_cols=15  Identities=33%  Similarity=0.634  Sum_probs=12.9

Q ss_pred             chhHHHHHHhhhhhh
Q 035681          268 FDLETVALHEIGHLL  282 (320)
Q Consensus       268 ~~l~~Va~HEiGHaL  282 (320)
                      .++-.|+.||+||+.
T Consensus       122 sEvAAVl~HEmgHVt  136 (479)
T COG4784         122 SEVAAVLAHEMGHVT  136 (479)
T ss_pred             HHHHHHHHhhhhhee
Confidence            478899999999984


No 113
>PF03272 Enhancin:  Viral enhancin protein;  InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=37.01  E-value=38  Score=36.58  Aligned_cols=42  Identities=10%  Similarity=0.105  Sum_probs=26.6

Q ss_pred             CCCCCcccceEeccCCC------ch----hHH-----------HHHHHHHHHhhhcCcceEEEec
Q 035681          165 KWPASKFSLTYAFRPGT------RS----DAI-----------TPVARAFQTWAANTQFKFSKIE  208 (320)
Q Consensus       165 kW~~~k~~Lty~~~~~~------~~----~~~-----------~~i~~A~~~Ws~v~~l~F~e~~  208 (320)
                      .|.  +..-.|.+....      |.    .++           +--.+.|+...+.+.|.|....
T Consensus       129 ~w~--~~~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~  191 (775)
T PF03272_consen  129 EWD--DSDSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSD  191 (775)
T ss_pred             hhh--hcCCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCc
Confidence            688  778888876531      21    111           2226778888888888886543


No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=36.84  E-value=22  Score=33.27  Aligned_cols=22  Identities=32%  Similarity=0.475  Sum_probs=17.3

Q ss_pred             cchhHHHHHHhhhhhhCCCCCC
Q 035681          267 AFDLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       267 ~~~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      ..+...|+.||+||--|=.+--
T Consensus       200 a~~yyaTl~HElghwtgh~~rl  221 (316)
T COG4227         200 AINYYATLLHELGHWTGHEARL  221 (316)
T ss_pred             hHhHHHHHHHHhccccCchhhh
Confidence            4578899999999998765543


No 115
>PRK10911 oligopeptidase A; Provisional
Probab=34.99  E-value=18  Score=38.32  Aligned_cols=14  Identities=43%  Similarity=0.548  Sum_probs=11.6

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +-..|+.||+||+|
T Consensus       462 ~~v~tlfHEfGHal  475 (680)
T PRK10911        462 DEVITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHHHhHHH
Confidence            34568999999998


No 116
>PF14891 Peptidase_M91:  Effector protein
Probab=34.68  E-value=42  Score=29.03  Aligned_cols=18  Identities=22%  Similarity=0.150  Sum_probs=13.7

Q ss_pred             hhHHHHHHhhhhhhCCCC
Q 035681          269 DLETVALHEIGHLLGLHH  286 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~H  286 (320)
                      +-..|+.|||.||.-..+
T Consensus       102 ~p~v~L~HEL~HA~~~~~  119 (174)
T PF14891_consen  102 PPFVVLYHELIHAYDYMN  119 (174)
T ss_pred             HHHHHHHHHHHHHHHHHC
Confidence            456799999999965543


No 117
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=33.74  E-value=19  Score=37.15  Aligned_cols=16  Identities=44%  Similarity=0.553  Sum_probs=11.7

Q ss_pred             hhHHHHHHhh-hhhhCC
Q 035681          269 DLETVALHEI-GHLLGL  284 (320)
Q Consensus       269 ~l~~Va~HEi-GHaLGL  284 (320)
                      ...+|++||+ ||..|=
T Consensus       292 f~~~v~lHEllGHGsGk  308 (549)
T PF03571_consen  292 FFVQVGLHELLGHGSGK  308 (549)
T ss_dssp             HHHHHHHHHHCCCCS--
T ss_pred             HhhhhhHHhhccCcCcc
Confidence            3467999998 998884


No 118
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=33.55  E-value=19  Score=33.77  Aligned_cols=39  Identities=23%  Similarity=0.342  Sum_probs=22.3

Q ss_pred             eCCCCceeEecCCCCCcc---CCccccchhHHHHHHhhhhhh
Q 035681          244 FAPTNGRFHYDADEPWAV---GAVQGAFDLETVALHEIGHLL  282 (320)
Q Consensus       244 ~~P~~G~i~~d~~e~w~~---~~~~~~~~l~~Va~HEiGHaL  282 (320)
                      |.|.+..+++|....-..   -...+.+-..-|++||.||.+
T Consensus       138 YCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHV  179 (295)
T COG2321         138 YCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHV  179 (295)
T ss_pred             ecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHH
Confidence            456788888887321000   011222344569999999953


No 119
>CHL00132 psaF photosystem I subunit III; Validated
Probab=31.81  E-value=1e+02  Score=27.06  Aligned_cols=38  Identities=5%  Similarity=0.195  Sum_probs=23.5

Q ss_pred             cHHHHHHHHHhcCCCCCCCcCCCCCCCCCchhhHHHHHHHHHHHHhcC
Q 035681           59 GIHELKEYMEKFGYLNYDNYKNQSTHADDDDFDELLESAIKTYQLNYH  106 (320)
Q Consensus        59 ~v~~l~~yL~~fGYl~~~~~~~~~~~~~~~~f~~~~~~Aik~fQ~~~~  106 (320)
                      .|+.|++.|++|   +..       .........+.+.+.++|.++..
T Consensus        45 ~~k~Le~rlk~Y---~~~-------s~p~lal~~qi~~tk~RFe~Y~~   82 (185)
T CHL00132         45 SVKKLENRLAKY---EAN-------SPPALALQQQIDKTKARFDKYGR   82 (185)
T ss_pred             HHHHHHhhhhcc---CCC-------CChHHHHHHHHHHHHHHHHHhcc
Confidence            366777888865   211       11223456777888888888743


No 120
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=28.97  E-value=52  Score=30.10  Aligned_cols=30  Identities=27%  Similarity=0.263  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHhcC----CCcccccCHHHHHhhcc
Q 035681           94 LESAIKTYQLNYH----LKSTGVLDANTVAKMTI  123 (320)
Q Consensus        94 ~~~Aik~fQ~~~~----L~~TG~ld~~T~~~m~~  123 (320)
                      --.|++-.|+..+    .+++|++..+|+..+.+
T Consensus        93 p~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~~  126 (252)
T COG3926          93 PGRAAKWLQKALGPAYTVRVDGVIGAQTLAALKK  126 (252)
T ss_pred             cchHHHHHHHHhccCCcccccCcccHHHHHHHHh
Confidence            3578999999999    99999999999998854


No 121
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=28.61  E-value=31  Score=33.41  Aligned_cols=19  Identities=42%  Similarity=0.574  Sum_probs=14.8

Q ss_pred             hhHHHHHHhhhhhhCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSS  288 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~  288 (320)
                      .+..|+.||++|++ .+|+.
T Consensus       274 glAtvLgHE~aHaV-arH~A  292 (424)
T KOG2661|consen  274 GLATVLGHEIAHAV-ARHAA  292 (424)
T ss_pred             HHHHHHHHHHHHHH-HHHHH
Confidence            47789999999986 45653


No 122
>PF11576 DUF3236:  Protein of unknown function (DUF3236);  InterPro: IPR012019  This family of proteins with unknown function appears to be restricted to Methanobacteria. ; PDB: 3BRC_B.
Probab=27.53  E-value=42  Score=28.47  Aligned_cols=27  Identities=30%  Similarity=0.700  Sum_probs=18.1

Q ss_pred             hhhhcccCCCCCchhcHHHHHHHHHhc
Q 035681           44 FIEHLKGCHKGDKIKGIHELKEYMEKF   70 (320)
Q Consensus        44 ~~~~~~~~~~g~~~~~v~~l~~yL~~f   70 (320)
                      +.+.+.|++.||+-+.|..+|+|...-
T Consensus         9 y~eS~~~~R~GDk~EEv~~Ir~~I~na   35 (154)
T PF11576_consen    9 YLESLEGTRRGDKEEEVEAIREYILNA   35 (154)
T ss_dssp             HHHHHTT---S--HHHHHHHHHHHHH-
T ss_pred             HHHHHhCcccCCcHHHHHHHHHHHhcC
Confidence            455678999999999999999999865


No 123
>PF08434 CLCA_N:  Calcium-activated chloride channel;  InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs. 
Probab=26.17  E-value=38  Score=31.67  Aligned_cols=14  Identities=43%  Similarity=0.665  Sum_probs=10.9

Q ss_pred             HHHHHHhhhhh-hCC
Q 035681          271 ETVALHEIGHL-LGL  284 (320)
Q Consensus       271 ~~Va~HEiGHa-LGL  284 (320)
                      ..|++||-+|. =|+
T Consensus       150 grv~VhEWAhlRWGV  164 (262)
T PF08434_consen  150 GRVFVHEWAHLRWGV  164 (262)
T ss_pred             ceeeeehhhhhcccc
Confidence            46999999995 455


No 124
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=25.51  E-value=32  Score=31.91  Aligned_cols=19  Identities=32%  Similarity=0.476  Sum_probs=13.8

Q ss_pred             chhHHHHHHhhhhhhCCCC
Q 035681          268 FDLETVALHEIGHLLGLHH  286 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLGL~H  286 (320)
                      .|-.+|++||+-|.|=+..
T Consensus       152 ~dg~NVviHEfAH~LD~~~  170 (253)
T PF06167_consen  152 NDGHNVVIHEFAHKLDMED  170 (253)
T ss_dssp             SSS--HHHHHHHHHHHCTT
T ss_pred             CCCcchHHHHHHHHHHhhc
Confidence            3567899999999996654


No 125
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=24.28  E-value=35  Score=36.24  Aligned_cols=31  Identities=23%  Similarity=0.236  Sum_probs=19.2

Q ss_pred             HHH-HHHHHHhhhcCcceEEEecCC--CCCceEE
Q 035681          187 TPV-ARAFQTWAANTQFKFSKIEDF--ANADIKI  217 (320)
Q Consensus       187 ~~i-~~A~~~Ws~v~~l~F~e~~~~--~~aDI~I  217 (320)
                      .++ .-.|..-+..=+|+|+++.+.  =.+|++.
T Consensus       364 ~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~  397 (681)
T PRK10280        364 TVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV  397 (681)
T ss_pred             HHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence            455 356666777888999986531  1356653


No 126
>PF13721 SecD-TM1:  SecD export protein N-terminal TM region
Probab=23.67  E-value=1.5e+02  Score=23.46  Aligned_cols=29  Identities=7%  Similarity=0.185  Sum_probs=22.3

Q ss_pred             cccCCCCCchhcHHHHHHHHHhcCCCCCC
Q 035681           48 LKGCHKGDKIKGIHELKEYMEKFGYLNYD   76 (320)
Q Consensus        48 ~~~~~~g~~~~~v~~l~~yL~~fGYl~~~   76 (320)
                      .+..+.|...+...++++.|++-|--...
T Consensus        36 Is~~~~~~~~~~~~~v~~~L~~~~I~~k~   64 (101)
T PF13721_consen   36 ISASSAGVQLPDAFQVEQALKAAGIAVKS   64 (101)
T ss_pred             EecCCCCccCChHHHHHHHHHHCCCCcce
Confidence            44556677777788999999999987653


No 127
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=23.61  E-value=55  Score=23.32  Aligned_cols=21  Identities=33%  Similarity=0.597  Sum_probs=17.7

Q ss_pred             hhHHHHHHhhhhhhCCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSSV  289 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~  289 (320)
                      ..+.-++||+.+.+||.|.+.
T Consensus        28 ~~eR~~vH~lA~~~gL~s~S~   48 (60)
T cd02641          28 SHDRLLVHELAEELGLRHEST   48 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEee
Confidence            457789999999999998653


No 128
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=22.76  E-value=47  Score=29.11  Aligned_cols=16  Identities=25%  Similarity=0.320  Sum_probs=12.8

Q ss_pred             hhHHHHHHhhhhhhCC
Q 035681          269 DLETVALHEIGHLLGL  284 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL  284 (320)
                      .+-.|++|||+|++.-
T Consensus        35 ~lG~ilahel~hafd~   50 (206)
T PF01431_consen   35 GLGFILAHELMHAFDP   50 (206)
T ss_dssp             THHHHHHHHHHHCTST
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3567999999998754


No 129
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=22.66  E-value=44  Score=33.97  Aligned_cols=16  Identities=25%  Similarity=0.312  Sum_probs=12.7

Q ss_pred             hhHHHHHHhhhhhhCC
Q 035681          269 DLETVALHEIGHLLGL  284 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL  284 (320)
                      +-..|+.||+||+...
T Consensus       246 ~D~~t~~HE~GH~~yy  261 (477)
T cd06461         246 EDFVTVHHEMGHIQYY  261 (477)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4467889999999854


No 130
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=22.45  E-value=47  Score=29.97  Aligned_cols=14  Identities=21%  Similarity=0.218  Sum_probs=11.2

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      -+-.|++||++||-
T Consensus        92 l~gsiLAHE~mHa~  105 (212)
T PF12315_consen   92 LTGSILAHELMHAW  105 (212)
T ss_pred             HHhhHHHHHHHHHH
Confidence            34579999999973


No 131
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=22.08  E-value=47  Score=34.97  Aligned_cols=14  Identities=43%  Similarity=0.698  Sum_probs=12.1

Q ss_pred             hhHHHHHHhhhhhh
Q 035681          269 DLETVALHEIGHLL  282 (320)
Q Consensus       269 ~l~~Va~HEiGHaL  282 (320)
                      +...|+.||.||+|
T Consensus       407 ~~~~~a~he~gha~  420 (644)
T PRK10733        407 QKESTAYHEAGHAI  420 (644)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999986


No 132
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=20.58  E-value=3.3e+02  Score=23.83  Aligned_cols=18  Identities=22%  Similarity=0.301  Sum_probs=15.1

Q ss_pred             hhHHHHHHhhhhhhCCCC
Q 035681          269 DLETVALHEIGHLLGLHH  286 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~H  286 (320)
                      .+..+++||+-|++=+..
T Consensus        64 ~l~~~iaHE~hH~~r~~~   81 (195)
T PF10026_consen   64 ELPALIAHEYHHNCRYEQ   81 (195)
T ss_pred             HHHHHHHHHHHHHHHHhc
Confidence            789999999999975543


No 133
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=20.21  E-value=56  Score=27.33  Aligned_cols=16  Identities=31%  Similarity=0.250  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhhC
Q 035681          268 FDLETVALHEIGHLLG  283 (320)
Q Consensus       268 ~~l~~Va~HEiGHaLG  283 (320)
                      ..+..|++||+=|++=
T Consensus        57 ~~l~~~l~HEm~H~~~   72 (146)
T smart00731       57 DRLRETLLHELCHAAL   72 (146)
T ss_pred             HHHHhhHHHHHHHHHH
Confidence            3677899999999763


No 134
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.03  E-value=67  Score=23.01  Aligned_cols=23  Identities=30%  Similarity=0.332  Sum_probs=19.0

Q ss_pred             hhHHHHHHhhhhhhCCCCCCCCC
Q 035681          269 DLETVALHEIGHLLGLHHSSVEG  291 (320)
Q Consensus       269 ~l~~Va~HEiGHaLGL~Hs~~~~  291 (320)
                      .-+.-++|.+-|-|||.|.++.+
T Consensus        28 ~~eRriih~la~~lGL~~~s~G~   50 (60)
T cd02639          28 PAERRIVHLLASRLGLNHVSDGT   50 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEEeCC
Confidence            44678999999999999987644


Done!