Query         043086
Match_columns 257
No_of_seqs    210 out of 1117
Neff          6.4 
Searched_HMMs 46136
Date          Fri Mar 29 12:43:33 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/043086.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/043086hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1565 Gelatinase A and relat 100.0 9.9E-44 2.2E-48  342.1  16.0  174   57-257    29-229 (469)
  2 cd04278 ZnMc_MMP Zinc-dependen  99.8 1.1E-21 2.4E-26  163.6   6.9   99  155-257     1-125 (157)
  3 PF00413 Peptidase_M10:  Matrix  99.8 2.8E-19 6.1E-24  147.2   5.0   97  155-257     1-123 (154)
  4 PF01471 PG_binding_1:  Putativ  99.5 6.4E-14 1.4E-18   97.8   6.0   56   55-119     2-57  (57)
  5 cd04279 ZnMc_MMP_like_1 Zinc-d  99.4 8.2E-13 1.8E-17  109.9   5.9   68  181-257    49-122 (156)
  6 smart00235 ZnMc Zinc-dependent  99.3 6.3E-12 1.4E-16  102.5   5.8   81  154-256     3-103 (140)
  7 cd04268 ZnMc_MMP_like Zinc-dep  99.0 5.7E-11 1.2E-15   98.8   1.4   71  181-257    42-112 (165)
  8 cd04277 ZnMc_serralysin_like Z  98.9 1.4E-09   3E-14   93.0   5.1   63  181-256    61-130 (186)
  9 TIGR02869 spore_SleB spore cor  98.9 5.9E-09 1.3E-13   91.0   7.1   72   48-129     6-77  (201)
 10 cd04327 ZnMc_MMP_like_3 Zinc-d  98.5 1.2E-07 2.6E-12   82.2   5.9   89  155-256     2-109 (198)
 11 cd00203 ZnMc Zinc-dependent me  98.3 6.6E-07 1.4E-11   74.2   4.7   59  181-256    50-113 (167)
 12 cd04276 ZnMc_MMP_like_2 Zinc-d  98.3 6.8E-07 1.5E-11   77.9   3.5   91  160-256     7-133 (197)
 13 PRK10594 murein L,D-transpepti  98.2 2.3E-06 5.1E-11   85.3   6.6   83   40-122   226-343 (608)
 14 COG3409 Putative peptidoglycan  97.9 1.7E-05 3.6E-10   66.7   4.8   65   49-122    39-105 (185)
 15 PF12388 Peptidase_M57:  Dual-a  97.7 7.3E-05 1.6E-09   65.8   5.4   44  207-256   102-150 (211)
 16 COG2989 Uncharacterized protei  97.5 0.00019 4.1E-09   70.4   5.9   68   47-121   230-298 (561)
 17 PF05572 Peptidase_M43:  Pregna  97.1 0.00025 5.4E-09   59.5   2.1   21  236-256    66-86  (154)
 18 PF13582 Reprolysin_3:  Metallo  97.0 0.00024 5.2E-09   56.2   1.2   18  239-256   107-124 (124)
 19 COG3023 ampD N-acetyl-anhydrom  96.9  0.0016 3.6E-08   58.5   5.0   56   53-119   195-251 (257)
 20 PF13583 Reprolysin_4:  Metallo  96.7 0.00073 1.6E-08   59.1   1.4   19  238-256   136-154 (206)
 21 PF08823 PG_binding_2:  Putativ  96.6  0.0057 1.2E-07   45.3   5.6   57   53-118    13-73  (74)
 22 COG5549 Predicted Zn-dependent  96.6  0.0013 2.7E-08   57.8   2.4   23  235-257   183-206 (236)
 23 cd04273 ZnMc_ADAMTS_like Zinc-  96.6  0.0012 2.5E-08   57.5   2.0   66  181-256    92-157 (207)
 24 cd04275 ZnMc_pappalysin_like Z  96.5 0.00051 1.1E-08   61.1  -0.8   62  181-256    84-154 (225)
 25 PF13688 Reprolysin_5:  Metallo  96.3  0.0018 3.9E-08   55.3   1.5   21  236-256   139-159 (196)
 26 COG3409 Putative peptidoglycan  96.2   0.011 2.3E-07   49.5   5.7   62   51-120   123-184 (185)
 27 cd04280 ZnMc_astacin_like Zinc  96.2  0.0019   4E-08   55.3   1.0   17  240-256    75-91  (180)
 28 PF02031 Peptidase_M7:  Strepto  96.2   0.002 4.4E-08   52.4   1.1   41  207-254    52-92  (132)
 29 COG1913 Predicted Zn-dependent  96.2  0.0021 4.5E-08   54.9   1.1   19  238-256   123-141 (181)
 30 PRK13267 archaemetzincin-like   96.1   0.003 6.5E-08   54.3   1.9   19  238-256   124-142 (179)
 31 PF07998 Peptidase_M54:  Peptid  96.0  0.0034 7.3E-08   54.7   1.6   19  238-256   144-162 (194)
 32 PF13574 Reprolysin_2:  Metallo  96.0   0.003 6.5E-08   53.7   1.2   18  239-256   111-128 (173)
 33 PF01400 Astacin:  Astacin (Pep  95.9   0.003 6.5E-08   54.5   1.2   17  239-255    79-95  (191)
 34 cd04267 ZnMc_ADAM_like Zinc-de  95.9  0.0034 7.3E-08   53.6   1.2   20  237-256   131-150 (192)
 35 cd04283 ZnMc_hatching_enzyme Z  95.8  0.0036 7.8E-08   54.0   1.2   17  240-256    78-94  (182)
 36 cd04269 ZnMc_adamalysin_II_lik  95.8   0.005 1.1E-07   52.7   1.9   19  237-255   129-147 (194)
 37 cd04281 ZnMc_BMP1_TLD Zinc-dep  95.4  0.0067 1.4E-07   53.1   1.2   17  240-256    88-104 (200)
 38 cd04271 ZnMc_ADAM_fungal Zinc-  95.0  0.0083 1.8E-07   53.4   0.7   15  241-255   147-161 (228)
 39 PF01421 Reprolysin:  Reprolysi  95.0    0.01 2.2E-07   51.0   1.2   20  236-255   128-147 (199)
 40 cd04270 ZnMc_TACE_like Zinc-de  94.7   0.017 3.6E-07   51.8   1.9   18  239-256   167-184 (244)
 41 cd04272 ZnMc_salivary_gland_MP  94.6   0.017 3.7E-07   50.6   1.6   18  238-255   144-161 (220)
 42 cd04282 ZnMc_meprin Zinc-depen  94.5   0.016 3.5E-07   51.8   1.2   17  240-256   121-137 (230)
 43 PF11350 DUF3152:  Protein of u  93.5   0.084 1.8E-06   46.2   3.7   73  179-256    78-156 (203)
 44 PF05548 Peptidase_M11:  Gameto  93.3   0.042 9.2E-07   51.2   1.7   20  238-257   149-168 (314)
 45 PF10462 Peptidase_M66:  Peptid  92.8   0.061 1.3E-06   50.0   1.9   20  238-257   192-211 (305)
 46 KOG3714 Meprin A metalloprotea  92.3   0.048   1E-06   52.7   0.5   17  240-256   160-176 (411)
 47 PF09374 PG_binding_3:  Predict  91.2    0.15 3.2E-06   37.4   2.0   26   94-119     1-30  (72)
 48 TIGR03296 M6dom_TIGR03296 M6 f  90.4   0.075 1.6E-06   48.7  -0.2   16  240-255   166-181 (286)
 49 PF04298 Zn_peptidase_2:  Putat  90.2    0.21 4.5E-06   44.5   2.4   16  236-251    86-101 (222)
 50 COG3824 Predicted Zn-dependent  88.8    0.14   3E-06   41.4   0.2   19  236-254   106-124 (136)
 51 PF12044 Metallopep:  Putative   87.0    0.34 7.4E-06   47.0   1.7   17  240-256   317-333 (423)
 52 PTZ00257 Glycoprotein GP63 (le  86.0    0.92   2E-05   46.0   4.3   36  215-254   236-271 (622)
 53 PTZ00337 surface protease GP63  85.1    0.56 1.2E-05   47.2   2.3   35  215-255   211-245 (567)
 54 PF09471 Peptidase_M64:  IgA Pe  85.1    0.49 1.1E-05   43.2   1.7   16  238-253   215-230 (264)
 55 PF06262 DUF1025:  Possibl zinc  84.1    0.62 1.3E-05   36.2   1.6   17  238-254    72-88  (97)
 56 KOG3607 Meltrins, fertilins an  83.2    0.62 1.3E-05   48.1   1.7   20  237-256   321-340 (716)
 57 PF05547 Peptidase_M6:  Immune   81.5    0.28   6E-06   50.1  -1.6   15  240-254   222-236 (645)
 58 PF01457 Peptidase_M8:  Leishma  80.7    0.91   2E-05   45.1   1.7   36  215-254   190-225 (521)
 59 KOG4525 Jacalin-like lectin do  78.2     1.3 2.9E-05   43.1   1.9   18  240-257   302-319 (614)
 60 COG4783 Putative Zn-dependent   75.6     1.4 3.1E-05   43.2   1.4   17  236-252   127-143 (484)
 61 PF07172 GRP:  Glycine rich pro  73.5     2.3 4.9E-05   32.9   1.8    9    9-17      4-12  (95)
 62 PF06114 DUF955:  Domain of unk  72.6     1.7 3.6E-05   32.9   0.8   27  217-251    28-54  (122)
 63 PF14247 DUF4344:  Domain of un  72.5     1.9 4.1E-05   38.3   1.3   14  238-251    91-104 (220)
 64 PF04228 Zn_peptidase:  Putativ  71.3     3.3 7.2E-05   38.3   2.7   39  211-251   139-182 (292)
 65 PF01435 Peptidase_M48:  Peptid  68.4     2.8 6.1E-05   35.9   1.5   15  237-251    87-101 (226)
 66 COG2738 Predicted Zn-dependent  67.1     2.6 5.7E-05   37.0   1.0   16  236-251    89-104 (226)
 67 COG2856 Predicted Zn peptidase  65.9     2.7 5.8E-05   37.2   0.8   26  218-251    59-84  (213)
 68 COG0501 HtpX Zn-dependent prot  64.1     3.7 8.1E-05   36.9   1.4   15  237-251   155-169 (302)
 69 PRK03982 heat shock protein Ht  63.7     3.8 8.3E-05   37.4   1.4   14  238-251   124-137 (288)
 70 PF12725 DUF3810:  Protein of u  62.8     5.6 0.00012   37.2   2.4   33  214-254   179-211 (318)
 71 PRK03001 M48 family peptidase;  61.1     4.5 9.8E-05   36.9   1.5   14  238-251   123-136 (283)
 72 PRK01345 heat shock protein Ht  60.2     4.8  0.0001   37.6   1.4   14  238-251   123-136 (317)
 73 PRK03072 heat shock protein Ht  59.8     4.9 0.00011   36.9   1.5   15  237-251   125-139 (288)
 74 cd06258 Peptidase_M3_like The   59.6      11 0.00024   35.2   3.8   16  236-251   151-166 (365)
 75 PF11337 DUF3139:  Protein of u  56.6      30 0.00065   25.7   5.0   22   53-74     32-53  (85)
 76 PRK04897 heat shock protein Ht  56.0     6.2 0.00014   36.4   1.5   15  237-251   135-149 (298)
 77 PF01863 DUF45:  Protein of unk  55.8     8.1 0.00018   32.8   2.0   38  215-257   145-182 (205)
 78 cd06164 S2P-M50_SpoIVFB_CBS Sp  54.9     6.4 0.00014   34.9   1.3   14  238-251    52-65  (227)
 79 PF13485 Peptidase_MA_2:  Pepti  54.7     6.8 0.00015   29.7   1.3   16  236-251    22-37  (128)
 80 PRK05457 heat shock protein Ht  53.9     7.1 0.00015   35.8   1.4   14  237-250   132-145 (284)
 81 PRK02870 heat shock protein Ht  53.7     7.1 0.00015   36.9   1.4   14  237-250   171-184 (336)
 82 PRK02391 heat shock protein Ht  53.3     7.3 0.00016   36.0   1.4   14  237-250   131-144 (296)
 83 PRK01265 heat shock protein Ht  53.1     7.4 0.00016   36.6   1.4   14  237-250   138-151 (324)
 84 PF01432 Peptidase_M3:  Peptida  52.3     6.9 0.00015   37.8   1.2   16  236-251   239-254 (458)
 85 cd06459 M3B_Oligoendopeptidase  50.7     7.4 0.00016   36.8   1.1   16  236-251   219-234 (427)
 86 KOG3658 Tumor necrosis factor-  50.6     5.6 0.00012   40.8   0.2   16  241-256   394-409 (764)
 87 TIGR00181 pepF oligoendopeptid  50.0      11 0.00024   37.8   2.2   17  235-251   374-390 (591)
 88 cd06162 S2P-M50_PDZ_SREBP Ster  49.3     8.8 0.00019   35.3   1.3   13  239-251   135-147 (277)
 89 cd06455 M3A_TOP Peptidase M3 T  49.0     8.3 0.00018   37.7   1.2   15  237-251   261-275 (472)
 90 TIGR02289 M3_not_pepF oligoend  48.2      17 0.00036   36.3   3.2   17  235-251   333-349 (549)
 91 KOG3538 Disintegrin metallopro  48.1     4.2 9.2E-05   42.8  -1.1   22  235-256   313-334 (845)
 92 PLN02791 Nudix hydrolase homol  47.8     9.8 0.00021   39.9   1.5   19  237-255   583-601 (770)
 93 cd06159 S2P-M50_PDZ_Arch Uncha  47.3     9.9 0.00021   34.7   1.3   13  239-251   118-130 (263)
 94 PF13699 DUF4157:  Domain of un  47.3      11 0.00024   27.9   1.3   30  217-251    44-73  (79)
 95 cd06457 M3A_MIP Peptidase M3 m  46.7     9.5 0.00021   37.2   1.1   16  236-251   245-260 (458)
 96 TIGR02290 M3_fam_3 oligoendope  44.8      10 0.00022   38.0   1.1   15  237-251   373-387 (587)
 97 PF01447 Peptidase_M4:  Thermol  42.7      11 0.00024   31.4   0.8   12  240-251   136-147 (150)
 98 cd06456 M3A_DCP_Oligopeptidase  42.6      13 0.00027   36.1   1.3   16  236-251   205-220 (422)
 99 KOG2921 Intramembrane metallop  41.8      12 0.00027   36.2   1.1   13  239-251   131-143 (484)
100 COG1164 Oligoendopeptidase F [  39.7      24 0.00053   35.8   2.9   16  236-251   377-392 (598)
101 PF03571 Peptidase_M49:  Peptid  38.1      16 0.00035   36.8   1.3   16  237-252   291-307 (549)
102 COG4227 Antirestriction protei  36.3      25 0.00054   32.4   2.1   35  215-254   184-218 (316)
103 COG3926 zliS Lysozyme family p  35.4      27 0.00059   31.4   2.1   29   92-120    93-125 (252)
104 CHL00132 psaF photosystem I su  35.1      88  0.0019   27.1   5.1   39   56-104    44-82  (185)
105 KOG2719 Metalloprotease [Gener  34.3      21 0.00045   34.9   1.3   14  237-250   278-291 (428)
106 PF11045 YbjM:  Putative inner   34.1      38 0.00082   27.6   2.6   25    1-25      1-25  (125)
107 PF04956 TrbC:  TrbC/VIRB2 fami  33.6      73  0.0016   23.9   4.0   25    1-25      1-25  (99)
108 PRK10911 oligopeptidase A; Pro  32.3      22 0.00047   36.8   1.1   15  237-251   461-475 (680)
109 cd06460 M32_Taq Peptidase fami  31.6      23 0.00049   34.3   1.1   14  238-251   158-171 (396)
110 KOG2661 Peptidase family M48 [  29.9      27 0.00058   33.3   1.2   14  238-251   274-287 (424)
111 PF14891 Peptidase_M91:  Effect  29.3      37 0.00081   28.6   1.9   15  237-251   101-115 (174)
112 cd02641 R3H_Smubp-2_like R3H d  28.9      28  0.0006   24.4   0.9   20  238-257    28-47  (60)
113 PLN00081 photosystem I reactio  28.6      17 0.00037   29.5  -0.2   39  216-254    81-127 (141)
114 PF11119 DUF2633:  Protein of u  26.8      90   0.002   22.1   3.1    7    1-7       1-7   (59)
115 TIGR02411 leuko_A4_hydro leuko  26.0      66  0.0014   32.7   3.3   14  237-250   277-290 (601)
116 PRK06287 cobalt transport prot  24.8      88  0.0019   24.6   3.2   28    1-28      1-28  (107)
117 PF07172 GRP:  Glycine rich pro  24.2      91   0.002   24.0   3.1   17    8-24      6-22  (95)
118 cd02639 R3H_RRM R3H domain of   24.0      37  0.0008   24.0   0.8   20  238-257    28-47  (60)
119 PRK14125 cell division suppres  22.8 1.2E+02  0.0025   23.7   3.5   25    1-25      1-25  (103)
120 PF06167 Peptidase_M90:  Glucos  21.2      40 0.00087   30.5   0.6   17  237-253   152-168 (253)
121 PRK10280 dipeptidyl carboxypep  21.1      46   0.001   34.4   1.2   14  238-251   463-476 (681)
122 COG2321 Predicted metalloprote  21.0      44 0.00095   30.9   0.8   40  211-251   136-179 (295)
123 KOG1047 Bifunctional leukotrie  21.0      50  0.0011   33.5   1.3   16  235-250   284-299 (613)
124 KOG3988 Protein-tyrosine sulfo  20.9   2E+02  0.0042   27.1   5.0   36   40-75    297-332 (378)
125 cd02640 R3H_NRF R3H domain of   20.4      52  0.0011   23.2   0.9   21  237-257    27-47  (60)

No 1  
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00  E-value=9.9e-44  Score=342.10  Aligned_cols=174  Identities=44%  Similarity=0.792  Sum_probs=151.7

Q ss_pred             cHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCCccccCCCCCCcccc
Q 043086           57 GINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKPRCGVPDVINGTTRM  136 (257)
Q Consensus        57 ~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~PRCGvpD~~~~~~~~  136 (257)
                      ++..++.||++|||+...+...    .  ...+..+++||+.||++++|++||++|.+|++.|.+|||||||.       
T Consensus        29 ~~~~~~~yl~~~~y~~~~~~~~----~--~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~-------   95 (469)
T KOG1565|consen   29 DKVALQDYLECYGYLPPTDLTA----T--RASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG-------   95 (469)
T ss_pred             chhHHHHHhhhcccCCCccccc----c--ccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------
Confidence            6788999999999998874211    1  11578899999999999999999999999999999999999991       


Q ss_pred             CCCCCccccccccCCCCCCCCCCccceeEeecCCCc-ce----------------eecC------C-CCCccEEEEeecC
Q 043086          137 QGGPAHYHTHYVFYPGRPKWPATKQIISCAFLPGTR-TD----------------VQEP------D-YDAADVKISFQRG  192 (257)
Q Consensus       137 ~~~~~~~~~~~~~~~~~~kW~~~k~~LTy~i~~~~~-~~----------------~v~~------~-~~~aDI~I~f~~~  192 (257)
                               +|+++++++||+  |++|||+|.+++. +.                .|.+      . ...|||+|+|..+
T Consensus        96 ---------~~~~~~~~~kW~--k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~  164 (469)
T KOG1565|consen   96 ---------RYRYFPGKPKWN--KEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPG  164 (469)
T ss_pred             ---------ccccCcccCccc--ccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeecc
Confidence                     577788999999  9999999999853 21                1222      2 3799999999999


Q ss_pred             CCCCCCCCCCCCCCcceeecCCC---CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086          193 DHGDGYPFDGPGPYNLLAHSFPP---TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       193 ~hgd~~~fdG~g~~~~lahaf~P---~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs  257 (257)
                      .|||++||||+|  |+|||||+|   ..|++|||++|.|+++ ...++||+.||+|||||+|||+||+
T Consensus       165 ~h~d~~PFDG~~--g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~  229 (469)
T KOG1565|consen  165 DHGDGFPFDGPG--GVLAHAFFPGPGIGGDLHFDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSS  229 (469)
T ss_pred             CCCCCCcccCCC--CceecccCCCCCCCCccccCcccceecc-CCccchhHHHhhhhcccccccCCCC
Confidence            999999999999  999999999   6789999999999996 3469999999999999999999996


No 2  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.85  E-value=1.1e-21  Score=163.65  Aligned_cols=99  Identities=48%  Similarity=0.924  Sum_probs=82.9

Q ss_pred             CCCCCccceeEeecCCCc---ce--------ee------cC-------CCCCccEEEEeecCCCCCCCCCCCCCCCccee
Q 043086          155 KWPATKQIISCAFLPGTR---TD--------VQ------EP-------DYDAADVKISFQRGDHGDGYPFDGPGPYNLLA  210 (257)
Q Consensus       155 kW~~~k~~LTy~i~~~~~---~~--------~v------~~-------~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~la  210 (257)
                      ||+  |+.|||+|.+...   ..        ++      ..       ...+|||+|.|....|+|+++|||++  |++|
T Consensus         1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~--g~l~   76 (157)
T cd04278           1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPG--GTLA   76 (157)
T ss_pred             CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCc--cccc
Confidence            799  9999999987542   10        11      11       22379999999999999999999999  9999


Q ss_pred             ecCCC--CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086          211 HSFPP--TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       211 haf~P--~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs  257 (257)
                      ||++|  ..|+|+||.++.|+......+.+++.|++|||||+|||.|+.
T Consensus        77 ~a~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~  125 (157)
T cd04278          77 HAFFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSS  125 (157)
T ss_pred             cccCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCC
Confidence            99999  789999999999998643467899999999999999999984


No 3  
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.77  E-value=2.8e-19  Score=147.23  Aligned_cols=97  Identities=43%  Similarity=0.852  Sum_probs=80.6

Q ss_pred             CCCCCccceeEeecCCCc-ce----------ee------cC------CCCCccEEEEeecCCCCCCCCCCCCCCCcceee
Q 043086          155 KWPATKQIISCAFLPGTR-TD----------VQ------EP------DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAH  211 (257)
Q Consensus       155 kW~~~k~~LTy~i~~~~~-~~----------~v------~~------~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lah  211 (257)
                      ||+  +++|||+|.+.++ .+          |+      .+      ...++||+|.|....+++...|++.+  +++++
T Consensus         1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~~adi~i~~~~~~~~~~~~~~~~~--~~~~~   76 (154)
T PF00413_consen    1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDGNADIRISFGSNNHGDGYSFDGSG--GTLAH   76 (154)
T ss_dssp             SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSSSCSEEEEEESSSSSSSS-CSSSS--SESEE
T ss_pred             CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCCCcceeeeeeccccCcccccccce--eeeec
Confidence            799  8999999997652 11          11      11      33569999999999999999999998  99999


Q ss_pred             cCCCC---CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086          212 SFPPT---DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       212 af~P~---~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs  257 (257)
                      ++.|.   .|+|+|+.++.|+...  .+.++..|++|||||+|||.|++
T Consensus        77 ~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~v~~HEiGHaLGL~H~~  123 (154)
T PF00413_consen   77 AYFPNNIVSGDIHFNDDESWTIDD--SGNDLQSVAIHEIGHALGLDHSN  123 (154)
T ss_dssp             EEESSSTTTTEEEEETTSHEESSS--SSEEHHHHHHHHHHHHTTBESSS
T ss_pred             cccccccccccccccccccchhhh--hhhhhhhhhhhccccccCcCcCC
Confidence            99996   7899999999998752  56799999999999999999984


No 4  
>PF01471 PG_binding_1:  Putative peptidoglycan binding domain;  InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are:   Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX [].   Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.48  E-value=6.4e-14  Score=97.75  Aligned_cols=56  Identities=32%  Similarity=0.489  Sum_probs=49.8

Q ss_pred             hhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhcc
Q 043086           55 VKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMM  119 (257)
Q Consensus        55 ~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M  119 (257)
                      +++|..+|.+|.++||++...         ++.|++.+++||+.||+.+||++||++|.+|+++|
T Consensus         2 ~~~v~~lq~~L~~~gy~~~~~---------~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L   57 (57)
T PF01471_consen    2 GPDVKALQQYLNRLGYYPGPV---------DGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL   57 (57)
T ss_dssp             SHHHHHHHHHHHHTTTT-SST---------TSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred             cHHHHHHHHHHHHcCCCCCCC---------CCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence            678999999999999995542         68899999999999999999999999999999886


No 5  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.36  E-value=8.2e-13  Score=109.94  Aligned_cols=68  Identities=37%  Similarity=0.463  Sum_probs=50.9

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCcc------CCCCCCchhHHHHHHhhhhhcCCC
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTV------GAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~------~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      .+|||+|.|....     +|+|.|  +++|||++|..|.  |+....|..      .....+.+++.|++|||||+|||.
T Consensus        49 ~~adi~I~f~~~~-----~~~~~g--~~~a~a~~p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~  119 (156)
T cd04279          49 NDADIVIFFDRPP-----PVGGAG--GGLARAGFPLISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLW  119 (156)
T ss_pred             CCCcEEEEecCCC-----CCCCCC--CceEEecccccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCC
Confidence            3799999998543     799999  9999999997543  333333322      111125789999999999999999


Q ss_pred             CCC
Q 043086          255 HRP  257 (257)
Q Consensus       255 HSs  257 (257)
                      |++
T Consensus       120 H~~  122 (156)
T cd04279         120 HHS  122 (156)
T ss_pred             CCC
Confidence            984


No 6  
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.27  E-value=6.3e-12  Score=102.51  Aligned_cols=81  Identities=35%  Similarity=0.531  Sum_probs=58.6

Q ss_pred             CCCCCCccceeEeec-CC-Cc---ce----e------ecC-----CCCCccEEEEeecCCCCCCCCCCCCCCCcceeecC
Q 043086          154 PKWPATKQIISCAFL-PG-TR---TD----V------QEP-----DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAHSF  213 (257)
Q Consensus       154 ~kW~~~k~~LTy~i~-~~-~~---~~----~------v~~-----~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf  213 (257)
                      .+|+  +..++|.|. +. +.   ..    +      +..     ...++||.|.|...++        .|  ..+||++
T Consensus         3 ~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~~ad~~I~f~~~~~--------~g--~~~a~~g   70 (140)
T smart00235        3 KKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDG--------SG--CTLSHAG   70 (140)
T ss_pred             CcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCCCCCeEEEEEECCC--------CC--cceeeee
Confidence            5899  889999996 32 21   11    1      111     2238999999998765        12  2368998


Q ss_pred             CCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086          214 PPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       214 ~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      .| .|+++++. +.|+.+        ..|++|||||+|||.|+
T Consensus        71 ~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~  103 (140)
T smart00235       71 RP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHE  103 (140)
T ss_pred             cC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcC
Confidence            88 88999987 777653        23999999999999997


No 7  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.04  E-value=5.7e-11  Score=98.85  Aligned_cols=71  Identities=25%  Similarity=0.248  Sum_probs=47.0

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs  257 (257)
                      ..+||+|.+....+..    +|..  +.+.+...|..|+|+++....|.......+.++..|++|||||+|||.|+.
T Consensus        42 ~~~di~i~~~~~~~~~----~~~~--~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~  112 (165)
T cd04268          42 DPADIRYSVIRWIPYN----DGTW--SYGPSQVDPLTGEILLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNF  112 (165)
T ss_pred             CccCceEEEEEeecCC----CCcc--ccCCccCCCCCccEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccC
Confidence            4799999998643211    1222  333343456779999988766542101134578999999999999999973


No 8  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=98.91  E-value=1.4e-09  Score=93.02  Aligned_cols=63  Identities=21%  Similarity=0.360  Sum_probs=48.6

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCC-------CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCC
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPT-------DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGL  253 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~-------~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL  253 (257)
                      .++||+|.+.....         +  ++.+.|++|.       .|+|++|.+..|...  ..+.....|++|||||+|||
T Consensus        61 ~~adI~i~~~~~~~---------~--~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~--~~g~~~~~t~~HEiGHaLGL  127 (186)
T cd04277          61 SGADIRFGNSSDPD---------G--NTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSD--SPGSYGYQTIIHEIGHALGL  127 (186)
T ss_pred             CcceEEEEeccCCC---------C--CccEEEECCCCCccccccceeEEecCcccccC--CCChhhHHHHHHHHHHHhcC
Confidence            47999998864322         3  5678888884       368999998887643  23557899999999999999


Q ss_pred             CCC
Q 043086          254 AHR  256 (257)
Q Consensus       254 ~HS  256 (257)
                      .|+
T Consensus       128 ~H~  130 (186)
T cd04277         128 EHP  130 (186)
T ss_pred             CCC
Confidence            997


No 9  
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=98.85  E-value=5.9e-09  Score=91.01  Aligned_cols=72  Identities=21%  Similarity=0.250  Sum_probs=63.4

Q ss_pred             CCCCCCchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCCccccC
Q 043086           48 GTQKGDTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKPRCGVP  127 (257)
Q Consensus        48 ~~~~g~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~PRCGvp  127 (257)
                      .++.|+.+++|..+|+.|...||+....         ++.|+..|.+||+.||+.+||++||++|..|+..|..| +.++
T Consensus         6 ~l~~G~~g~~V~~LQ~~L~~lG~~~g~i---------dG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~   75 (201)
T TIGR02869         6 TYQRGSTGSDVIEIQRRLKAWGYYNGKV---------DGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQ   75 (201)
T ss_pred             cCCCCCCcHHHHHHHHHHHHcCCCCCCC---------CCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-cccC
Confidence            4788999999999999999999987653         57899999999999999999999999999999999554 5555


Q ss_pred             CC
Q 043086          128 DV  129 (257)
Q Consensus       128 D~  129 (257)
                      .+
T Consensus        76 ~~   77 (201)
T TIGR02869        76 SS   77 (201)
T ss_pred             cc
Confidence            53


No 10 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.55  E-value=1.2e-07  Score=82.24  Aligned_cols=89  Identities=29%  Similarity=0.476  Sum_probs=51.9

Q ss_pred             CCCCCccceeEeecCCCcc---e----ee-----------cC-CCCCccEEEEeecCCCCCCCCCCCCCCCcceeecCCC
Q 043086          155 KWPATKQIISCAFLPGTRT---D----VQ-----------EP-DYDAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPP  215 (257)
Q Consensus       155 kW~~~k~~LTy~i~~~~~~---~----~v-----------~~-~~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P  215 (257)
                      +|+. .+.|+|.+......   .    ++           .. ...++||+|.|..+. || +.+.|..  +...++   
T Consensus         2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~~adi~I~f~~~~-Gc-~S~vG~~--~~~~~~---   73 (198)
T cd04327           2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGD-GY-WSYVGTD--ALLIGA---   73 (198)
T ss_pred             CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCCCCCEEEEEecCC-CC-CCCcCCc--ccccCC---
Confidence            5874 56788888765321   0    11           11 335799999998743 43 4577764  321111   


Q ss_pred             CCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086          216 TDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       216 ~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      ....+.++....++     .+.++..|++|||||+|||.|.
T Consensus        74 ~~~t~~l~~~~~~~-----~~~~~~~~i~HElgHaLG~~HE  109 (198)
T cd04327          74 DAPTMNLGWFTDDT-----PDPEFSRVVLHEFGHALGFIHE  109 (198)
T ss_pred             CCceeeeeeecCCC-----chhhHHHHHHHHHHHHhcCccc
Confidence            12234453211111     2345778999999999999885


No 11 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=98.32  E-value=6.6e-07  Score=74.25  Aligned_cols=59  Identities=25%  Similarity=0.305  Sum_probs=40.7

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCC-----CCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCC
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTD-----GRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAH  255 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~-----g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~H  255 (257)
                      ..+||.|.+...++        .+  +.+++|+.|..     |.++++....|       +..+..|++|||||+|||.|
T Consensus        50 ~~~di~~~~~~~~~--------~~--~~~g~a~~~~~c~~~~~~~~~~~~~~~-------~~~~~~~~~HElGH~LGl~H  112 (167)
T cd00203          50 DKADIAILVTRQDF--------DG--GTGGWAYLGRVCDSLRGVGVLQDNQSG-------TKEGAQTIAHELGHALGFYH  112 (167)
T ss_pred             CcCcEEEEeccccC--------CC--CceEEEecCCccCCCCCcEEEecCCcc-------cccchhhHHHHHHHHhCCCc
Confidence            36899998876433        22  45567777643     45555544332       34688999999999999999


Q ss_pred             C
Q 043086          256 R  256 (257)
Q Consensus       256 S  256 (257)
                      .
T Consensus       113 ~  113 (167)
T cd00203         113 D  113 (167)
T ss_pred             c
Confidence            6


No 12 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.25  E-value=6.8e-07  Score=77.90  Aligned_cols=91  Identities=19%  Similarity=0.122  Sum_probs=52.0

Q ss_pred             ccceeEeecCCCcce-----------------------e--ecC-C--CCCccEEEEeecCCCCCCCCCCCCCCCcceee
Q 043086          160 KQIISCAFLPGTRTD-----------------------V--QEP-D--YDAADVKISFQRGDHGDGYPFDGPGPYNLLAH  211 (257)
Q Consensus       160 k~~LTy~i~~~~~~~-----------------------~--v~~-~--~~~aDI~I~f~~~~hgd~~~fdG~g~~~~lah  211 (257)
                      |+.|||+|.++++..                       +  |+. +  ...+||++..-+--|.+    ++..  +...+
T Consensus         7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~--~~gps   80 (197)
T cd04276           7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGW--AYGPS   80 (197)
T ss_pred             CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcc--eeccc
Confidence            889999999876421                       1  111 1  12479995544444433    2332  33345


Q ss_pred             cCCCCCC-----CCcccCCCCCccCC---CCCCchhHHHHHHhhhhhcCCCCC
Q 043086          212 SFPPTDG-----RFHYDGDENWTVGA---VPGAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       212 af~P~~g-----~ihfd~~e~wt~~~---~~~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      ...|..|     +|+++.........   ...+..+..+++||+||+|||.|.
T Consensus        81 ~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn  133 (197)
T cd04276          81 VVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHN  133 (197)
T ss_pred             ccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCccc
Confidence            5567555     45666543221110   012245788999999999999995


No 13 
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=98.21  E-value=2.3e-06  Score=85.29  Aligned_cols=83  Identities=20%  Similarity=0.211  Sum_probs=63.1

Q ss_pred             hhhhhhccCCCCCCchhcHHHHHHHHHhcCCCCCCCcc----c-------------------------------cccccC
Q 043086           40 LEFFKLFQGTQKGDTVKGINALKKYLHTLGYLSNHNHN----H-------------------------------AAAAAD   84 (257)
Q Consensus        40 ~~~~~~l~~~~~g~~~~~v~~~q~yL~~fGYl~~~~~~----~-------------------------------~~~~~~   84 (257)
                      |..+..-..++.|+..++|..+.+.|...|++......    .                               ......
T Consensus       226 Wp~i~~~~~LrpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~  305 (608)
T PRK10594        226 WPQLTGKATLRPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPAV  305 (608)
T ss_pred             CcccCCCCCCCCCCCCchHHHHHHHHHHcCCccccccccccccccccccccccccccccccccccccccccccccccccc
Confidence            44444344688999999999999999999998532100    0                               000112


Q ss_pred             CCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCCC
Q 043086           85 GDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGKP  122 (257)
Q Consensus        85 ~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~P  122 (257)
                      +..|+.++.+|||+||+.+||.++|+++..|++.|+.|
T Consensus       306 ~~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs  343 (608)
T PRK10594        306 RAAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT  343 (608)
T ss_pred             ccccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence            45699999999999999999999999999999998654


No 14 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=97.88  E-value=1.7e-05  Score=66.67  Aligned_cols=65  Identities=18%  Similarity=0.272  Sum_probs=57.3

Q ss_pred             CCCCCchhcHHHHHHHHHhcCCCC-CCCccccccccCCCcccHHHHHHHHHHHHhcCC-CCCCCCChhhhhccCCC
Q 043086           49 TQKGDTVKGINALKKYLHTLGYLS-NHNHNHAAAAADGDYFDENLESAVKTYQRNFNL-NPTGHMDLKTVSMMGKP  122 (257)
Q Consensus        49 ~~~g~~~~~v~~~q~yL~~fGYl~-~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL-~~TG~lD~~T~~~M~~P  122 (257)
                      ...+..++.|..+|..|...||.. ...         +++|++.+..||+.||+.++| +++|+++++|+..+...
T Consensus        39 ~~~~~~~~~v~~lq~~L~~~g~~~~~~~---------dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~  105 (185)
T COG3409          39 LTLGAEGPSVRILQAALNALGYYPDGVI---------DGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQ  105 (185)
T ss_pred             cccCCCCchHHHHHHHHHhcCCCCCCCc---------cCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHH
Confidence            445567899999999999999998 543         688999999999999999999 89999999999988653


No 15 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=97.66  E-value=7.3e-05  Score=65.76  Aligned_cols=44  Identities=20%  Similarity=0.309  Sum_probs=29.4

Q ss_pred             cceeecCCC-CCC----CCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086          207 NLLAHSFPP-TDG----RFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       207 ~~lahaf~P-~~g----~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      |..+-|.|| .+|    .|.+..     +.. ..-.-..+|.+|||||++||.|+
T Consensus       102 G~ggsAGFP~s~G~P~~~I~I~~-----~~~-~~~~~~~hvi~HEiGH~IGfRHT  150 (211)
T PF12388_consen  102 GAGGSAGFPTSNGNPYKFIQIYG-----LSN-YSVNVIEHVITHEIGHCIGFRHT  150 (211)
T ss_pred             CcceeccCCCCCCCCCceEEEEe-----cCC-CchhHHHHHHHHHhhhhcccccc
Confidence            556788889 545    355511     111 12234778999999999999996


No 16 
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.47  E-value=0.00019  Score=70.42  Aligned_cols=68  Identities=24%  Similarity=0.287  Sum_probs=57.1

Q ss_pred             cCCCCCCchhcHHHHHHHHHhcCC-CCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccCC
Q 043086           47 QGTQKGDTVKGINALKKYLHTLGY-LSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMGK  121 (257)
Q Consensus        47 ~~~~~g~~~~~v~~~q~yL~~fGY-l~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~~  121 (257)
                      ..+++|+...+|..+.+.|..-|- +....       .....|+.++.+|||+||+.+||+.+|++++.|++.|+-
T Consensus       230 ~~LrpG~~~~~v~aL~~~L~~~~~d~~~a~-------~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~  298 (561)
T COG2989         230 ALLRPGVTSPDVPALRARLARSGMDLPSAA-------GSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV  298 (561)
T ss_pred             cccCCCCCchhHHHHHHHHHhcCccchhhc-------cCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence            457899999999999999999884 22221       123479999999999999999999999999999999964


No 17 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.12  E-value=0.00025  Score=59.51  Aligned_cols=21  Identities=33%  Similarity=0.495  Sum_probs=15.1

Q ss_pred             CchhHHHHHHhhhhhcCCCCC
Q 043086          236 AVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       236 ~~~l~~va~HEiGH~LGL~HS  256 (257)
                      ..+...+++||+||-|||.|.
T Consensus        66 ~~~~g~TltHEvGH~LGL~Ht   86 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLGLYHT   86 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT---T
T ss_pred             ccccccchhhhhhhhhccccc
Confidence            457789999999999999996


No 18 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.03  E-value=0.00024  Score=56.22  Aligned_cols=18  Identities=39%  Similarity=0.495  Sum_probs=12.6

Q ss_pred             hHHHHHHhhhhhcCCCCC
Q 043086          239 MQTVALHELRHVLGLAHR  256 (257)
Q Consensus       239 l~~va~HEiGH~LGL~HS  256 (257)
                      ...+.+|||||.|||.|+
T Consensus       107 ~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  107 GVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             STTHHHHHHHHHTT----
T ss_pred             cceEeeehhhHhcCCCCC
Confidence            348999999999999995


No 19 
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=96.85  E-value=0.0016  Score=58.55  Aligned_cols=56  Identities=23%  Similarity=0.414  Sum_probs=48.1

Q ss_pred             CchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCC-CCCCCCChhhhhcc
Q 043086           53 DTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNL-NPTGHMDLKTVSMM  119 (257)
Q Consensus        53 ~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL-~~TG~lD~~T~~~M  119 (257)
                      .+.++|..+|+.|.+|||....           +.|++.++.+|+.||.-|+= ..+|+.|.+|.+.+
T Consensus       195 ~~~~~v~~lq~~L~~YGY~v~~-----------~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l  251 (257)
T COG3023         195 LKGEDVAALQEMLARYGYGVEI-----------GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAIL  251 (257)
T ss_pred             hccCCHHHHHHHHHHhCcCCCc-----------chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHH
Confidence            3448999999999999997653           56899999999999998874 58999999998765


No 20 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=96.67  E-value=0.00073  Score=59.10  Aligned_cols=19  Identities=37%  Similarity=0.499  Sum_probs=16.4

Q ss_pred             hhHHHHHHhhhhhcCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HS  256 (257)
                      +-..+.+|||||+|||.|.
T Consensus       136 ~~~~~~aHEiGH~lGl~H~  154 (206)
T PF13583_consen  136 NGYQTFAHEIGHNLGLRHD  154 (206)
T ss_pred             ccchHHHHHHHHHhcCCCC
Confidence            3457899999999999996


No 21 
>PF08823 PG_binding_2:  Putative peptidoglycan binding domain;  InterPro: IPR014927 This entry may be a peptidoglycan binding domain. 
Probab=96.61  E-value=0.0057  Score=45.28  Aligned_cols=57  Identities=19%  Similarity=0.313  Sum_probs=46.8

Q ss_pred             CchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCC----CCCCCCChhhhhc
Q 043086           53 DTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNL----NPTGHMDLKTVSM  118 (257)
Q Consensus        53 ~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL----~~TG~lD~~T~~~  118 (257)
                      ..++-+.++|.-|.+.||+.++.         ++.+++++.+|++.|+-.-++    ...|++|...++-
T Consensus        13 l~~~~~~evq~~L~~lGyy~g~~---------~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~   73 (74)
T PF08823_consen   13 LDGDVAREVQEALKRLGYYKGEA---------DGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAY   73 (74)
T ss_pred             CcHHHHHHHHHHHHHcCCccCCC---------CCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhh
Confidence            44667789999999999998874         577899999999999998886    4566688877654


No 22 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=96.61  E-value=0.0013  Score=57.77  Aligned_cols=23  Identities=35%  Similarity=0.475  Sum_probs=20.3

Q ss_pred             CCchhHHHHHHhhhhhcCCC-CCC
Q 043086          235 GAVDMQTVALHELRHVLGLA-HRP  257 (257)
Q Consensus       235 ~~~~l~~va~HEiGH~LGL~-HSs  257 (257)
                      ...+|..+|-||+||+||+. ||.
T Consensus       183 ~~e~L~~tarhElGhaLgi~ghsd  206 (236)
T COG5549         183 LRENLNPTARHELGHALGIWGHSD  206 (236)
T ss_pred             chhhhhHHHHHhhcchheeccccc
Confidence            45689999999999999998 884


No 23 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.57  E-value=0.0012  Score=57.48  Aligned_cols=66  Identities=21%  Similarity=0.282  Sum_probs=39.8

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCC
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      ...|+.+-|...+....   ++++  ++++.|+.-   .|. .......+.. ..+.....|++|||||.||+.|-
T Consensus        92 ~~~D~a~llt~~d~~~~---~~~~--~~~G~A~~g---giC-~~~~s~~i~~-~~~~~~a~~~aHElGH~LG~~HD  157 (207)
T cd04273          92 EHHDHAILLTRQDICRS---NGNC--DTLGLAPVG---GMC-SPSRSCSINE-DTGLSSAFTIAHELGHVLGMPHD  157 (207)
T ss_pred             cccceEEEEeeeccccc---CCCC--CceEEeccc---cCC-CCCcceEEEc-CCCceeEEeeeeechhhcCCCCC
Confidence            35788887776544322   4667  778888752   222 1111111211 12455667999999999999995


No 24 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.49  E-value=0.00051  Score=61.12  Aligned_cols=62  Identities=23%  Similarity=0.259  Sum_probs=38.1

Q ss_pred             CCccEEEEeecCCCCCCCCCCCCCCCcceeecCCCCCCC--------CcccCCCCCccC-CCCCCchhHHHHHHhhhhhc
Q 043086          181 DAADVKISFQRGDHGDGYPFDGPGPYNLLAHSFPPTDGR--------FHYDGDENWTVG-AVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       181 ~~aDI~I~f~~~~hgd~~~fdG~g~~~~lahaf~P~~g~--------ihfd~~e~wt~~-~~~~~~~l~~va~HEiGH~L  251 (257)
                      +...|.|-+..        |. .+  ++++.|++|....        |.++..   +++ .....+++..|++||+||.|
T Consensus        84 ~~~~LNIy~~~--------~~-~~--~~lG~a~fP~~~~~~~~~~dGvvi~~~---~~~~~~~~~~n~g~t~~HEvGH~l  149 (225)
T cd04275          84 GYKYLNIYVAN--------FL-GG--GLLGYATFPDSLVSLAFITDGVVINPS---SLPGGSAAPYNLGDTATHEVGHWL  149 (225)
T ss_pred             CcCeEEEEEEC--------CC-CC--CcCEEEECCCcccCCccccceEEEecc---ccCCCCcccccccceeEEecccee
Confidence            35677787653        22 23  6788899984421        111111   111 11124688899999999999


Q ss_pred             CCCCC
Q 043086          252 GLAHR  256 (257)
Q Consensus       252 GL~HS  256 (257)
                      ||-|.
T Consensus       150 GL~Ht  154 (225)
T cd04275         150 GLYHT  154 (225)
T ss_pred             eeeee
Confidence            99996


No 25 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=96.29  E-value=0.0018  Score=55.32  Aligned_cols=21  Identities=29%  Similarity=0.390  Sum_probs=14.9

Q ss_pred             CchhHHHHHHhhhhhcCCCCC
Q 043086          236 AVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       236 ~~~l~~va~HEiGH~LGL~HS  256 (257)
                      ......|++|||||.||+.|.
T Consensus       139 ~~~~~~~~AHEiGH~lGa~HD  159 (196)
T PF13688_consen  139 TYNGAITFAHEIGHNLGAPHD  159 (196)
T ss_dssp             HHHHHHHHHHHHHHHTT----
T ss_pred             CCceehhhHHhHHHhcCCCCC
Confidence            456778999999999999994


No 26 
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=96.19  E-value=0.011  Score=49.50  Aligned_cols=62  Identities=27%  Similarity=0.359  Sum_probs=52.5

Q ss_pred             CCCchhcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcCCCCCCCCChhhhhccC
Q 043086           51 KGDTVKGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFNLNPTGHMDLKTVSMMG  120 (257)
Q Consensus        51 ~g~~~~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~gL~~TG~lD~~T~~~M~  120 (257)
                      .+..+.++..++.-+...+|.....        .+++|+..++.||+.||+.++|+++|++...|+..+.
T Consensus       123 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~  184 (185)
T COG3409         123 LGLGGGDVATLQQPLPLLGYRSGIR--------VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR  184 (185)
T ss_pred             ccccccchHHHHHHHHhcccccCCC--------CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence            3556677778888888888887652        3689999999999999999999999999999998764


No 27 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=96.18  E-value=0.0019  Score=55.34  Aligned_cols=17  Identities=41%  Similarity=0.649  Sum_probs=15.4

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      ..+++|||||+|||.|-
T Consensus        75 ~g~v~HE~~HalG~~HE   91 (180)
T cd04280          75 LGTIVHELMHALGFYHE   91 (180)
T ss_pred             CchhHHHHHHHhcCcch
Confidence            57999999999999984


No 28 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=96.18  E-value=0.002  Score=52.36  Aligned_cols=41  Identities=27%  Similarity=0.299  Sum_probs=24.3

Q ss_pred             cceeecCCCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086          207 NLLAHSFPPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       207 ~~lahaf~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      +-.|.......|.|.+|..  +     ..+++-..|++|||||+|||.
T Consensus        52 ~sya~~~g~G~G~I~l~~~--~-----~qgy~~~RIaaHE~GHiLGLP   92 (132)
T PF02031_consen   52 GSYASTDGLGSGYIFLDYQ--Q-----NQGYNSTRIAAHELGHILGLP   92 (132)
T ss_dssp             --EEEE-SSS-EEEEEEHH--H-----HHHS-HHHHHHHHHHHHHT--
T ss_pred             CcccccCCCCcEEEEechH--H-----hhCCccceeeeehhccccCCC
Confidence            3333333335678888842  2     146788889999999999995


No 29 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=96.15  E-value=0.0021  Score=54.94  Aligned_cols=19  Identities=47%  Similarity=0.737  Sum_probs=16.0

Q ss_pred             hhHHHHHHhhhhhcCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HS  256 (257)
                      ..+.=++||+||+|||.|-
T Consensus       123 Rv~KEv~HElGH~~GL~HC  141 (181)
T COG1913         123 RVVKEVLHELGHLLGLSHC  141 (181)
T ss_pred             HHHHHHHHHhhhhcCcccC
Confidence            4555689999999999995


No 30 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=96.09  E-value=0.003  Score=54.33  Aligned_cols=19  Identities=42%  Similarity=0.646  Sum_probs=16.1

Q ss_pred             hhHHHHHHhhhhhcCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HS  256 (257)
                      .+..+++||+||.|||.|-
T Consensus       124 R~~k~~~HElGH~lGL~HC  142 (179)
T PRK13267        124 RVRKEVTHELGHTLGLEHC  142 (179)
T ss_pred             HHHHHHHHHHHHHcCCccC
Confidence            4555699999999999994


No 31 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=95.98  E-value=0.0034  Score=54.72  Aligned_cols=19  Identities=42%  Similarity=0.663  Sum_probs=13.6

Q ss_pred             hhHHHHHHhhhhhcCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HS  256 (257)
                      .+..+++||+||++||.|-
T Consensus       144 R~~Kea~HElGH~~GL~HC  162 (194)
T PF07998_consen  144 RVCKEAVHELGHLFGLDHC  162 (194)
T ss_dssp             HHHHHHHHHHHHHTT----
T ss_pred             HHHHHHHHHHHHHcCCcCC
Confidence            5667999999999999993


No 32 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=95.96  E-value=0.003  Score=53.68  Aligned_cols=18  Identities=39%  Similarity=0.438  Sum_probs=13.5

Q ss_pred             hHHHHHHhhhhhcCCCCC
Q 043086          239 MQTVALHELRHVLGLAHR  256 (257)
Q Consensus       239 l~~va~HEiGH~LGL~HS  256 (257)
                      ...|++|||||.||+.|.
T Consensus       111 ~~~~~aHElGH~lGa~Hd  128 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHD  128 (173)
T ss_dssp             HHHHHHHHHHHHHT---S
T ss_pred             eeeeehhhhHhhcCCCCC
Confidence            667899999999999996


No 33 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=95.94  E-value=0.003  Score=54.54  Aligned_cols=17  Identities=47%  Similarity=0.645  Sum_probs=13.4

Q ss_pred             hHHHHHHhhhhhcCCCC
Q 043086          239 MQTVALHELRHVLGLAH  255 (257)
Q Consensus       239 l~~va~HEiGH~LGL~H  255 (257)
                      ...|++|||||||||.|
T Consensus        79 ~~~~i~HEl~HaLG~~H   95 (191)
T PF01400_consen   79 SVGTILHELGHALGFWH   95 (191)
T ss_dssp             SHHHHHHHHHHHHTB--
T ss_pred             CccchHHHHHHHHhhhh
Confidence            35699999999999998


No 34 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.86  E-value=0.0034  Score=53.59  Aligned_cols=20  Identities=35%  Similarity=0.423  Sum_probs=17.4

Q ss_pred             chhHHHHHHhhhhhcCCCCC
Q 043086          237 VDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL~HS  256 (257)
                      .....+++||+||.|||.|-
T Consensus       131 ~~~~~~~aHElGH~lG~~HD  150 (192)
T cd04267         131 LLTALTMAHELGHNLGAEHD  150 (192)
T ss_pred             eeehhhhhhhHHhhcCCcCC
Confidence            45667999999999999995


No 35 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=95.85  E-value=0.0036  Score=54.02  Aligned_cols=17  Identities=41%  Similarity=0.694  Sum_probs=15.1

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      ..+++|||||||||-|-
T Consensus        78 ~G~i~HEl~HaLG~~HE   94 (182)
T cd04283          78 KGIIQHELLHALGFYHE   94 (182)
T ss_pred             cchHHHHHHHHhCCccc
Confidence            46999999999999983


No 36 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.78  E-value=0.005  Score=52.67  Aligned_cols=19  Identities=37%  Similarity=0.581  Sum_probs=16.7

Q ss_pred             chhHHHHHHhhhhhcCCCC
Q 043086          237 VDMQTVALHELRHVLGLAH  255 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL~H  255 (257)
                      ..+..+++|||||.||+.|
T Consensus       129 ~~~a~~~AHElGH~lG~~H  147 (194)
T cd04269         129 LLFAVTMAHELGHNLGMEH  147 (194)
T ss_pred             HHHHHHHHHHHHhhcCCCc
Confidence            3556799999999999999


No 37 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=95.36  E-value=0.0067  Score=53.08  Aligned_cols=17  Identities=41%  Similarity=0.735  Sum_probs=14.9

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      ..+++|||||+||+-|-
T Consensus        88 ~Gti~HEl~HaLGf~HE  104 (200)
T cd04281          88 FGIVVHELGHVIGFWHE  104 (200)
T ss_pred             CchHHHHHHHHhcCcch
Confidence            35999999999999883


No 38 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=95.01  E-value=0.0083  Score=53.37  Aligned_cols=15  Identities=40%  Similarity=0.685  Sum_probs=14.4

Q ss_pred             HHHHHhhhhhcCCCC
Q 043086          241 TVALHELRHVLGLAH  255 (257)
Q Consensus       241 ~va~HEiGH~LGL~H  255 (257)
                      .|.+||+||.||+.|
T Consensus       147 ~t~AHElGHnLGm~H  161 (228)
T cd04271         147 QVFAHEIGHTFGAVH  161 (228)
T ss_pred             eehhhhhhhhcCCCC
Confidence            699999999999999


No 39 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=94.98  E-value=0.01  Score=51.00  Aligned_cols=20  Identities=35%  Similarity=0.690  Sum_probs=14.6

Q ss_pred             CchhHHHHHHhhhhhcCCCC
Q 043086          236 AVDMQTVALHELRHVLGLAH  255 (257)
Q Consensus       236 ~~~l~~va~HEiGH~LGL~H  255 (257)
                      ...+-.+++|||||.||+.|
T Consensus       128 ~~~~a~~~AHelGH~lGm~H  147 (199)
T PF01421_consen  128 GLSFAVIIAHELGHNLGMPH  147 (199)
T ss_dssp             HHHHHHHHHHHHHHHTT---
T ss_pred             hHHHHHHHHHHHHHhcCCCC
Confidence            45667799999999999998


No 40 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=94.73  E-value=0.017  Score=51.84  Aligned_cols=18  Identities=39%  Similarity=0.554  Sum_probs=15.7

Q ss_pred             hHHHHHHhhhhhcCCCCC
Q 043086          239 MQTVALHELRHVLGLAHR  256 (257)
Q Consensus       239 l~~va~HEiGH~LGL~HS  256 (257)
                      ...|.+||+||.||+.|=
T Consensus       167 ~a~t~AHElGHnlGm~HD  184 (244)
T cd04270         167 SDLVTAHELGHNFGSPHD  184 (244)
T ss_pred             HHHHHHHHHHHhcCCCCC
Confidence            447999999999999993


No 41 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=94.59  E-value=0.017  Score=50.63  Aligned_cols=18  Identities=39%  Similarity=0.460  Sum_probs=16.4

Q ss_pred             hhHHHHHHhhhhhcCCCC
Q 043086          238 DMQTVALHELRHVLGLAH  255 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~H  255 (257)
                      ....+++||+||.||+.|
T Consensus       144 ~~~~~~AHElGH~lG~~H  161 (220)
T cd04272         144 YGVYTMTHELAHLLGAPH  161 (220)
T ss_pred             ccHHHHHHHHHHHhCCCC
Confidence            457899999999999999


No 42 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=94.48  E-value=0.016  Score=51.79  Aligned_cols=17  Identities=35%  Similarity=0.622  Sum_probs=14.9

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      ..+++|||||+||+-|-
T Consensus       121 ~Gti~HEl~HalGf~HE  137 (230)
T cd04282         121 KATVEHEFLHALGFYHE  137 (230)
T ss_pred             CchHHHHHHHHhCCccc
Confidence            46899999999999883


No 43 
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=93.50  E-value=0.084  Score=46.22  Aligned_cols=73  Identities=30%  Similarity=0.356  Sum_probs=45.0

Q ss_pred             CCCCccEEEEeecCCCCCCC--CCCCCCCCcceeecCCCCCCCCcccCCCCCccCCCC----CCchhHHHHHHhhhhhcC
Q 043086          179 DYDAADVKISFQRGDHGDGY--PFDGPGPYNLLAHSFPPTDGRFHYDGDENWTVGAVP----GAVDMQTVALHELRHVLG  252 (257)
Q Consensus       179 ~~~~aDI~I~f~~~~hgd~~--~fdG~g~~~~lahaf~P~~g~ihfd~~e~wt~~~~~----~~~~l~~va~HEiGH~LG  252 (257)
                      ..+.+|++|...+..--|..  +.+-.   +.+.. +.|..++|.+|.+ .|..+...    -+.+...|+=||.||.||
T Consensus        78 ~~~~~Df~I~Lasp~T~~~lC~g~~~~---~e~SC-~~~~g~~VvIN~~-RWv~Ga~~f~gdl~~YRqYvINHEVGH~LG  152 (203)
T PF11350_consen   78 DSGAPDFRISLASPGTTDRLCAGLDTS---GETSC-RNPAGGRVVINLA-RWVRGAPAFGGDLASYRQYVINHEVGHALG  152 (203)
T ss_pred             CCCCCCEEEEECCcchhhhhccCcCcC---ceeEe-ecCCCCeEEEehH-HhhccCcccCCcHHHHHHHhhhhhhhhhcc
Confidence            34458999999876654321  23322   23322 2344567777753 67765422    233456699999999999


Q ss_pred             CCCC
Q 043086          253 LAHR  256 (257)
Q Consensus       253 L~HS  256 (257)
                      -.|-
T Consensus       153 h~H~  156 (203)
T PF11350_consen  153 HGHE  156 (203)
T ss_pred             cCCC
Confidence            9885


No 44 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=93.29  E-value=0.042  Score=51.24  Aligned_cols=20  Identities=40%  Similarity=0.484  Sum_probs=17.1

Q ss_pred             hhHHHHHHhhhhhcCCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HSs  257 (257)
                      .-..|.+||+||-|||.||.
T Consensus       149 ~~~~~~~HElgHN~GL~Ha~  168 (314)
T PF05548_consen  149 QDWATIMHELGHNLGLWHAG  168 (314)
T ss_pred             ccHHHHHHHhhhhccccccC
Confidence            34569999999999999983


No 45 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=92.76  E-value=0.061  Score=50.01  Aligned_cols=20  Identities=40%  Similarity=0.580  Sum_probs=15.1

Q ss_pred             hhHHHHHHhhhhhcCCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HSs  257 (257)
                      ....+.+||+||++||.|.+
T Consensus       192 s~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  192 SYGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             -SHHHHHHHHHHTTT--SST
T ss_pred             CccceeehhhhhhcCCCCCC
Confidence            56789999999999999953


No 46 
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=92.26  E-value=0.048  Score=52.66  Aligned_cols=17  Identities=41%  Similarity=0.602  Sum_probs=15.4

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      +-|++||++||||+-|-
T Consensus       160 ~G~i~HEl~HaLGf~He  176 (411)
T KOG3714|consen  160 FGTIVHELMHALGFWHE  176 (411)
T ss_pred             CchhHHHHHHHhhhhhc
Confidence            77999999999999873


No 47 
>PF09374 PG_binding_3:  Predicted Peptidoglycan domain;  InterPro: IPR018537  This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=91.16  E-value=0.15  Score=37.38  Aligned_cols=26  Identities=27%  Similarity=0.397  Sum_probs=20.6

Q ss_pred             HHHHHHHHhcC----CCCCCCCChhhhhcc
Q 043086           94 SAVKTYQRNFN----LNPTGHMDLKTVSMM  119 (257)
Q Consensus        94 ~AIk~FQ~~~g----L~~TG~lD~~T~~~M  119 (257)
                      .|||.+|+..|    +.++|++++.|+.++
T Consensus         1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al   30 (72)
T PF09374_consen    1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL   30 (72)
T ss_dssp             HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred             CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence            48999999999    999999999999998


No 48 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=90.40  E-value=0.075  Score=48.73  Aligned_cols=16  Identities=50%  Similarity=0.621  Sum_probs=14.5

Q ss_pred             HHHHHHhhhhhcCCCC
Q 043086          240 QTVALHELRHVLGLAH  255 (257)
Q Consensus       240 ~~va~HEiGH~LGL~H  255 (257)
                      ..|.+||+||.|||.+
T Consensus       166 igv~~HE~gH~lGLPD  181 (286)
T TIGR03296       166 VGVIAHELGHDLGLPD  181 (286)
T ss_pred             eeeeehhhhcccCCCc
Confidence            6799999999999975


No 49 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=90.23  E-value=0.21  Score=44.46  Aligned_cols=16  Identities=31%  Similarity=0.547  Sum_probs=12.5

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ...=..||+||.||++
T Consensus        86 SiaAvaVAAHEvGHAi  101 (222)
T PF04298_consen   86 SIAAVAVAAHEVGHAI  101 (222)
T ss_pred             CHHHHHHHHHHHhHHH
Confidence            3445569999999996


No 50 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=88.76  E-value=0.14  Score=41.41  Aligned_cols=19  Identities=32%  Similarity=0.595  Sum_probs=16.0

Q ss_pred             CchhHHHHHHhhhhhcCCC
Q 043086          236 AVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       236 ~~~l~~va~HEiGH~LGL~  254 (257)
                      +--...|++|||||..||+
T Consensus       106 gd~vthvliHEIgHhFGLs  124 (136)
T COG3824         106 GDQVTHVLIHEIGHHFGLS  124 (136)
T ss_pred             hhHhhhhhhhhhhhhcCCC
Confidence            4456779999999999996


No 51 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=86.98  E-value=0.34  Score=47.03  Aligned_cols=17  Identities=35%  Similarity=0.645  Sum_probs=14.8

Q ss_pred             HHHHHHhhhhhcCCCCC
Q 043086          240 QTVALHELRHVLGLAHR  256 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HS  256 (257)
                      .-..+|||||++|+.|.
T Consensus       317 iGA~lHEiGH~fg~pH~  333 (423)
T PF12044_consen  317 IGAFLHEIGHLFGCPHQ  333 (423)
T ss_pred             hHHHHHHHHHhcCCCCC
Confidence            34789999999999996


No 52 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=86.05  E-value=0.92  Score=46.03  Aligned_cols=36  Identities=19%  Similarity=0.274  Sum_probs=22.8

Q ss_pred             CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086          215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      |..|.|.|+....   .. ........|++|||.|+||..
T Consensus       236 Pi~G~iNinp~~i---~s-~~~~~~~rv~~HEi~HALGFS  271 (622)
T PTZ00257        236 PAVGVMNIPAANI---VS-RYDQGTTRTVTHEVAHALGFS  271 (622)
T ss_pred             ceEEEEeeCHHHC---CC-ccchHHHHHHHHHHHHHhcCC
Confidence            5556666664321   00 011257899999999999985


No 53 
>PTZ00337 surface protease GP63; Provisional
Probab=85.12  E-value=0.56  Score=47.20  Aligned_cols=35  Identities=29%  Similarity=0.389  Sum_probs=23.8

Q ss_pred             CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCC
Q 043086          215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAH  255 (257)
Q Consensus       215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~H  255 (257)
                      |..|.|.|+....   .   .....+.|++|||.|+||+..
T Consensus       211 Pi~G~in~np~~i---~---~~~~~~~v~~HEi~HALGFs~  245 (567)
T PTZ00337        211 PFAAAVNFDPRQI---A---VTNGDVRVAAHELGHALGFVR  245 (567)
T ss_pred             ceEEEEEECHHHc---c---chhHHHHHHHHHHHHHHccCH
Confidence            4456677775321   1   123578899999999999853


No 54 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=85.11  E-value=0.49  Score=43.18  Aligned_cols=16  Identities=31%  Similarity=0.598  Sum_probs=13.7

Q ss_pred             hhHHHHHHhhhhhcCC
Q 043086          238 DMQTVALHELRHVLGL  253 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL  253 (257)
                      .+..|++||+||++|-
T Consensus       215 ~~~~v~vHE~GHsf~~  230 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGG  230 (264)
T ss_dssp             THHHHHHHHHHHHTT-
T ss_pred             cccceeeeeccccccc
Confidence            6889999999998773


No 55 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=84.10  E-value=0.62  Score=36.23  Aligned_cols=17  Identities=24%  Similarity=0.567  Sum_probs=13.0

Q ss_pred             hhHHHHHHhhhhhcCCC
Q 043086          238 DMQTVALHELRHVLGLA  254 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~  254 (257)
                      -...|++|||||.+|++
T Consensus        72 ~I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   72 LIRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHHcCCC
Confidence            35569999999999985


No 56 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=83.24  E-value=0.62  Score=48.14  Aligned_cols=20  Identities=40%  Similarity=0.644  Sum_probs=17.3

Q ss_pred             chhHHHHHHhhhhhcCCCCC
Q 043086          237 VDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL~HS  256 (257)
                      ..+..|.+||+||.||+.|-
T Consensus       321 ~~~a~v~AhelgH~lGm~hD  340 (716)
T KOG3607|consen  321 LAFAVVLAHELGHNLGMIHD  340 (716)
T ss_pred             hhHHHHHHHHHHhhcCcccc
Confidence            35677999999999999983


No 57 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=81.46  E-value=0.28  Score=50.05  Aligned_cols=15  Identities=47%  Similarity=0.567  Sum_probs=13.5

Q ss_pred             HHHHHHhhhhhcCCC
Q 043086          240 QTVALHELRHVLGLA  254 (257)
Q Consensus       240 ~~va~HEiGH~LGL~  254 (257)
                      .-|.+||+||.|||.
T Consensus       222 iGVfaHEfGH~LGLP  236 (645)
T PF05547_consen  222 IGVFAHEFGHDLGLP  236 (645)
T ss_pred             eEEEEeeccccCCCC
Confidence            569999999999995


No 58 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=80.65  E-value=0.91  Score=45.14  Aligned_cols=36  Identities=19%  Similarity=0.391  Sum_probs=20.2

Q ss_pred             CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086          215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      |..|.|.|+...   +... ....+..|++|||.|+||+.
T Consensus       190 P~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLGFs  225 (521)
T PF01457_consen  190 PIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALGFS  225 (521)
T ss_dssp             EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT-S
T ss_pred             CeeEEEEEchhH---ccch-hhhcccceeeeeeeeeeeec
Confidence            344667776531   1110 23467889999999999985


No 59 
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=78.20  E-value=1.3  Score=43.15  Aligned_cols=18  Identities=39%  Similarity=0.591  Sum_probs=15.3

Q ss_pred             HHHHHHhhhhhcCCCCCC
Q 043086          240 QTVALHELRHVLGLAHRP  257 (257)
Q Consensus       240 ~~va~HEiGH~LGL~HSs  257 (257)
                      .-..+|||||.||+.|-+
T Consensus       302 lGA~~HElGH~lgcpHq~  319 (614)
T KOG4525|consen  302 LGAVCHELGHCLGCPHQS  319 (614)
T ss_pred             HHHHHHHhhhccCCCCCC
Confidence            447899999999999964


No 60 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=75.57  E-value=1.4  Score=43.21  Aligned_cols=17  Identities=29%  Similarity=0.456  Sum_probs=14.8

Q ss_pred             CchhHHHHHHhhhhhcC
Q 043086          236 AVDMQTVALHELRHVLG  252 (257)
Q Consensus       236 ~~~l~~va~HEiGH~LG  252 (257)
                      ...|-.|++|||||+.|
T Consensus       127 esElagViAHEigHv~q  143 (484)
T COG4783         127 ESELAGVIAHEIGHVAQ  143 (484)
T ss_pred             HHHHHHHHHHHHHHHhh
Confidence            34899999999999876


No 61 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=73.51  E-value=2.3  Score=32.94  Aligned_cols=9  Identities=44%  Similarity=0.574  Sum_probs=3.4

Q ss_pred             HHHHHHHHH
Q 043086            9 HAFISFFLL   17 (257)
Q Consensus         9 ~~~~~~~~~   17 (257)
                      |+||++.|+
T Consensus         4 K~~llL~l~   12 (95)
T PF07172_consen    4 KAFLLLGLL   12 (95)
T ss_pred             hHHHHHHHH
Confidence            334333333


No 62 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=72.56  E-value=1.7  Score=32.94  Aligned_cols=27  Identities=19%  Similarity=0.261  Sum_probs=18.2

Q ss_pred             CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086          217 DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       217 ~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L  251 (257)
                      ...|..+.+        ........+++|||||.+
T Consensus        28 ~~~I~in~~--------~~~~~~~f~laHELgH~~   54 (122)
T PF06114_consen   28 NPIIFINSN--------LSPERQRFTLAHELGHIL   54 (122)
T ss_dssp             TTEEEEESS--------S-HHHHHHHHHHHHHHHH
T ss_pred             CCEEEECCC--------CCHHHHHHHHHHHHHHHH
Confidence            445666653        133466779999999975


No 63 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=72.48  E-value=1.9  Score=38.34  Aligned_cols=14  Identities=36%  Similarity=0.513  Sum_probs=12.8

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      ++..|+.||+||+|
T Consensus        91 ~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   91 NVLFTLYHELGHAL  104 (220)
T ss_pred             HHHHHHHHHHHHHH
Confidence            68889999999986


No 64 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=71.33  E-value=3.3  Score=38.33  Aligned_cols=39  Identities=26%  Similarity=0.318  Sum_probs=24.1

Q ss_pred             ecCCCCCCCCcccCCCCCc-----cCCCCCCchhHHHHHHhhhhhc
Q 043086          211 HSFPPTDGRFHYDGDENWT-----VGAVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       211 haf~P~~g~ihfd~~e~wt-----~~~~~~~~~l~~va~HEiGH~L  251 (257)
                      .-|-|.+..|++|-.. |.     ++. ....-...|++||+||++
T Consensus       139 pFYCp~D~tIYlD~~f-~~~L~~~~ga-~G~~a~ayVlAHEyGHHV  182 (292)
T PF04228_consen  139 PFYCPADQTIYLDLSF-FDELQQRFGA-SGDFAQAYVLAHEYGHHV  182 (292)
T ss_pred             CEeCCCCCEEEechHH-HHHHHHHhCC-ccHHHHHHHHHHHHHHHH
Confidence            3456778889988641 21     111 123445669999999973


No 65 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=68.42  E-value=2.8  Score=35.86  Aligned_cols=15  Identities=33%  Similarity=0.525  Sum_probs=13.4

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      ..+..|++||+||..
T Consensus        87 ~el~aVlaHElgH~~  101 (226)
T PF01435_consen   87 DELAAVLAHELGHIK  101 (226)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            479999999999986


No 66 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=67.13  E-value=2.6  Score=36.98  Aligned_cols=16  Identities=31%  Similarity=0.538  Sum_probs=13.2

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      .+.-..||+||.||++
T Consensus        89 Sia~~aVAAHEVGHAi  104 (226)
T COG2738          89 SIAAIAVAAHEVGHAI  104 (226)
T ss_pred             cHHHHHHHHHHhhHHH
Confidence            3566779999999997


No 67 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=65.92  E-value=2.7  Score=37.23  Aligned_cols=26  Identities=19%  Similarity=0.201  Sum_probs=17.0

Q ss_pred             CCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086          218 GRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       218 g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L  251 (257)
                      ..|+++.+-        ....-.-|++||+||+|
T Consensus        59 ~~I~iN~n~--------~~~r~rFtlAHELGH~l   84 (213)
T COG2856          59 PVIYINANN--------SLERKRFTLAHELGHAL   84 (213)
T ss_pred             ceEEEeCCC--------CHHHHHHHHHHHHhHHH
Confidence            356776541        22344559999999996


No 68 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=64.09  E-value=3.7  Score=36.92  Aligned_cols=15  Identities=33%  Similarity=0.567  Sum_probs=13.2

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      ..+..|++||+||..
T Consensus       155 dEl~aVlaHElgHi~  169 (302)
T COG0501         155 DELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            379999999999974


No 69 
>PRK03982 heat shock protein HtpX; Provisional
Probab=63.65  E-value=3.8  Score=37.44  Aligned_cols=14  Identities=36%  Similarity=0.665  Sum_probs=12.6

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      .+..|++||+||.-
T Consensus       124 El~AVlAHElgHi~  137 (288)
T PRK03982        124 ELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHH
Confidence            69999999999974


No 70 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=62.76  E-value=5.6  Score=37.18  Aligned_cols=33  Identities=30%  Similarity=0.415  Sum_probs=24.0

Q ss_pred             CCCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086          214 PPTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       214 ~P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      .|=.|+.+.|.+-        -..++..|++||++|-+|..
T Consensus       179 ~PFT~EA~vN~~~--------p~~~~P~T~~HElAHq~G~a  211 (318)
T PF12725_consen  179 NPFTGEANVNTDL--------PPYSLPFTICHELAHQLGFA  211 (318)
T ss_pred             cCCcceeecCCCC--------CcccccHHHHHHHHHHhCCC
Confidence            4544566666542        23578999999999999975


No 71 
>PRK03001 M48 family peptidase; Provisional
Probab=61.14  E-value=4.5  Score=36.86  Aligned_cols=14  Identities=43%  Similarity=0.653  Sum_probs=12.6

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      .+..|++||+||.-
T Consensus       123 El~aVlAHElgHi~  136 (283)
T PRK03001        123 EIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHh
Confidence            69999999999974


No 72 
>PRK01345 heat shock protein HtpX; Provisional
Probab=60.19  E-value=4.8  Score=37.56  Aligned_cols=14  Identities=43%  Similarity=0.591  Sum_probs=12.9

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      .+..|++|||||.-
T Consensus       123 EL~aVlAHElgHi~  136 (317)
T PRK01345        123 EVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHH
Confidence            79999999999975


No 73 
>PRK03072 heat shock protein HtpX; Provisional
Probab=59.79  E-value=4.9  Score=36.87  Aligned_cols=15  Identities=40%  Similarity=0.545  Sum_probs=13.1

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      ..+..|++||+||.-
T Consensus       125 ~El~aVlAHElgHi~  139 (288)
T PRK03072        125 RELRGVLGHELSHVY  139 (288)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            379999999999974


No 74 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=59.56  E-value=11  Score=35.17  Aligned_cols=16  Identities=25%  Similarity=0.378  Sum_probs=13.6

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+-..+++||+||++
T Consensus       151 ~~~~v~tl~HE~GHa~  166 (365)
T cd06258         151 GHDDINTLFHEFGHAV  166 (365)
T ss_pred             CHHHHHHHHHHHhHHH
Confidence            5667789999999997


No 75 
>PF11337 DUF3139:  Protein of unknown function (DUF3139);  InterPro: IPR021486  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=56.57  E-value=30  Score=25.74  Aligned_cols=22  Identities=27%  Similarity=0.427  Sum_probs=15.9

Q ss_pred             CchhcHHHHHHHHHhcCCCCCC
Q 043086           53 DTVKGINALKKYLHTLGYLSNH   74 (257)
Q Consensus        53 ~~~~~v~~~q~yL~~fGYl~~~   74 (257)
                      ++....+.+..||++-||-+..
T Consensus        32 ~k~~~~k~i~~yL~e~gy~e~~   53 (85)
T PF11337_consen   32 QKHKAEKAIDWYLQEQGYKESD   53 (85)
T ss_pred             hHHHHHHHHHHHHHHcCCcHHH
Confidence            3445567788999999987443


No 76 
>PRK04897 heat shock protein HtpX; Provisional
Probab=56.02  E-value=6.2  Score=36.36  Aligned_cols=15  Identities=27%  Similarity=0.479  Sum_probs=13.0

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      ..+..|++||+||.-
T Consensus       135 ~El~aVlAHElgHi~  149 (298)
T PRK04897        135 EELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            379999999999964


No 77 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=55.75  E-value=8.1  Score=32.82  Aligned_cols=38  Identities=21%  Similarity=0.378  Sum_probs=26.7

Q ss_pred             CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCCCCC
Q 043086          215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~HSs  257 (257)
                      +..|.|.|+    |.+-.. -..-+..|++||+-|..-..||.
T Consensus       145 ~~~~~I~ln----~~L~~~-P~~~idYVvvHEL~Hl~~~nHs~  182 (205)
T PF01863_consen  145 SSKGNITLN----WRLVMA-PPEVIDYVVVHELCHLRHPNHSK  182 (205)
T ss_pred             CCCCcEEee----cccccC-CccHHHHHHHHHHHHhccCCCCH
Confidence            445677777    433111 12368889999999999999984


No 78 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=54.91  E-value=6.4  Score=34.93  Aligned_cols=14  Identities=50%  Similarity=0.636  Sum_probs=11.7

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      =+..|.+||+||++
T Consensus        52 l~~~v~iHElgH~~   65 (227)
T cd06164          52 LFASVLLHELGHSL   65 (227)
T ss_pred             HHHHHHHHHHHHHH
Confidence            46679999999985


No 79 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=54.73  E-value=6.8  Score=29.73  Aligned_cols=16  Identities=31%  Similarity=0.277  Sum_probs=13.2

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ...+..|++||+.|.+
T Consensus        22 ~~~~~~~l~HE~~H~~   37 (128)
T PF13485_consen   22 EDWLDRVLAHELAHQW   37 (128)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3467899999999974


No 80 
>PRK05457 heat shock protein HtpX; Provisional
Probab=53.91  E-value=7.1  Score=35.83  Aligned_cols=14  Identities=29%  Similarity=0.619  Sum_probs=12.5

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++||+||.
T Consensus       132 ~El~aVlAHElgHi  145 (284)
T PRK05457        132 DEVEAVLAHEISHI  145 (284)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37999999999996


No 81 
>PRK02870 heat shock protein HtpX; Provisional
Probab=53.70  E-value=7.1  Score=36.85  Aligned_cols=14  Identities=43%  Similarity=0.712  Sum_probs=12.7

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++||+||.
T Consensus       171 dEL~aVlAHELgHi  184 (336)
T PRK02870        171 DELQAVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37999999999997


No 82 
>PRK02391 heat shock protein HtpX; Provisional
Probab=53.35  E-value=7.3  Score=35.98  Aligned_cols=14  Identities=43%  Similarity=0.684  Sum_probs=12.5

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++||+||.
T Consensus       131 ~El~aVlaHElgHi  144 (296)
T PRK02391        131 DELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37999999999996


No 83 
>PRK01265 heat shock protein HtpX; Provisional
Probab=53.09  E-value=7.4  Score=36.57  Aligned_cols=14  Identities=43%  Similarity=0.624  Sum_probs=12.5

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++||+||.
T Consensus       138 ~El~aVlAHElgHi  151 (324)
T PRK01265        138 DEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37999999999995


No 84 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=52.33  E-value=6.9  Score=37.79  Aligned_cols=16  Identities=25%  Similarity=0.407  Sum_probs=12.7

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+=..+++||+||++
T Consensus       239 ~~~~v~tLfHE~GHa~  254 (458)
T PF01432_consen  239 SHDDVETLFHEFGHAM  254 (458)
T ss_dssp             SHHHHHHHHHHHHHHH
T ss_pred             ChhhHHHHHHHHhHHH
Confidence            3455679999999986


No 85 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=50.68  E-value=7.4  Score=36.80  Aligned_cols=16  Identities=31%  Similarity=0.391  Sum_probs=13.0

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+-..+++||+||++
T Consensus       219 ~~~~v~tl~HE~GHa~  234 (427)
T cd06459         219 TLDDVFTLAHELGHAF  234 (427)
T ss_pred             ChhhHHHHHHHhhHHH
Confidence            3566789999999985


No 86 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=50.57  E-value=5.6  Score=40.76  Aligned_cols=16  Identities=31%  Similarity=0.501  Sum_probs=14.3

Q ss_pred             HHHHHhhhhhcCCCCC
Q 043086          241 TVALHELRHVLGLAHR  256 (257)
Q Consensus       241 ~va~HEiGH~LGL~HS  256 (257)
                      .|++|||||.-|-.|-
T Consensus       394 lt~AHEiGHNfGSpHD  409 (764)
T KOG3658|consen  394 LTLAHEIGHNFGSPHD  409 (764)
T ss_pred             eeehhhhccccCCCCC
Confidence            4899999999999984


No 87 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=49.99  E-value=11  Score=37.76  Aligned_cols=17  Identities=29%  Similarity=0.474  Sum_probs=14.0

Q ss_pred             CCchhHHHHHHhhhhhc
Q 043086          235 GAVDMQTVALHELRHVL  251 (257)
Q Consensus       235 ~~~~l~~va~HEiGH~L  251 (257)
                      +..+-..|++||+||++
T Consensus       374 ~~~~dv~TLaHElGHa~  390 (591)
T TIGR00181       374 GTLNSVFTLAHELGHSM  390 (591)
T ss_pred             CCcchHHHHHHHhhhHH
Confidence            34577889999999985


No 88 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=49.29  E-value=8.8  Score=35.34  Aligned_cols=13  Identities=23%  Similarity=0.434  Sum_probs=11.1

Q ss_pred             hHHHHHHhhhhhc
Q 043086          239 MQTVALHELRHVL  251 (257)
Q Consensus       239 l~~va~HEiGH~L  251 (257)
                      +..|++||+||++
T Consensus       135 ~isvvvHElgHal  147 (277)
T cd06162         135 LISGVVHEMGHGV  147 (277)
T ss_pred             HHHHHHHHHHHHH
Confidence            5579999999985


No 89 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=48.96  E-value=8.3  Score=37.73  Aligned_cols=15  Identities=33%  Similarity=0.481  Sum_probs=12.5

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      .+=..+++||+||+|
T Consensus       261 ~~~V~TLfHEfGHal  275 (472)
T cd06455         261 HDEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456678999999997


No 90 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=48.19  E-value=17  Score=36.31  Aligned_cols=17  Identities=29%  Similarity=0.350  Sum_probs=14.2

Q ss_pred             CCchhHHHHHHhhhhhc
Q 043086          235 GAVDMQTVALHELRHVL  251 (257)
Q Consensus       235 ~~~~l~~va~HEiGH~L  251 (257)
                      +..+-..|++||+||++
T Consensus       333 ~t~~dv~TL~HElGHa~  349 (549)
T TIGR02289       333 GTSGDIDVLTHEAGHAF  349 (549)
T ss_pred             CChhHHHHHHHHhhHHH
Confidence            44677889999999986


No 91 
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=48.08  E-value=4.2  Score=42.84  Aligned_cols=22  Identities=27%  Similarity=0.428  Sum_probs=18.4

Q ss_pred             CCchhHHHHHHhhhhhcCCCCC
Q 043086          235 GAVDMQTVALHELRHVLGLAHR  256 (257)
Q Consensus       235 ~~~~l~~va~HEiGH~LGL~HS  256 (257)
                      .+.....+++||+||.+|+.|-
T Consensus       313 ~gl~~aftiahE~GH~~gm~hd  334 (845)
T KOG3538|consen  313 TGLPSAFTIAHELGHNFGMEHD  334 (845)
T ss_pred             CCCccceeeeeccccccCcccC
Confidence            4555778999999999999993


No 92 
>PLN02791 Nudix hydrolase homolog
Probab=47.81  E-value=9.8  Score=39.87  Aligned_cols=19  Identities=21%  Similarity=0.144  Sum_probs=15.8

Q ss_pred             chhHHHHHHhhhhhcCCCC
Q 043086          237 VDMQTVALHELRHVLGLAH  255 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL~H  255 (257)
                      -.+.+|++||+||-+|=.+
T Consensus       583 af~~~v~lHElgHGsG~~~  601 (770)
T PLN02791        583 SFFTHTICHECCHGIGPHT  601 (770)
T ss_pred             cHHHHHHHHHhhccccccc
Confidence            3678999999999999643


No 93 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=47.30  E-value=9.9  Score=34.65  Aligned_cols=13  Identities=38%  Similarity=0.595  Sum_probs=10.7

Q ss_pred             hHHHHHHhhhhhc
Q 043086          239 MQTVALHELRHVL  251 (257)
Q Consensus       239 l~~va~HEiGH~L  251 (257)
                      +..|.+||+||++
T Consensus       118 ~isv~iHElgHa~  130 (263)
T cd06159         118 VVGVVVHELSHGI  130 (263)
T ss_pred             HHHHHHHHHHHHH
Confidence            4568999999985


No 94 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=47.27  E-value=11  Score=27.93  Aligned_cols=30  Identities=20%  Similarity=0.339  Sum_probs=20.2

Q ss_pred             CCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhc
Q 043086          217 DGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       217 ~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~L  251 (257)
                      +.+|+|...+.=     .+...-..+++||+.|++
T Consensus        44 G~~I~f~~g~~~-----~~s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen   44 GNDIYFAPGKYN-----PDSPEGRALLAHELAHVV   73 (79)
T ss_pred             CCEEEEcCCCcC-----CCCCCcchhHhHHHHHHH
Confidence            568999654321     123456789999999985


No 95 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=46.70  E-value=9.5  Score=37.24  Aligned_cols=16  Identities=19%  Similarity=0.360  Sum_probs=12.8

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+=..++.||+||+|
T Consensus       245 ~~~~v~TLfHEfGHal  260 (458)
T cd06457         245 SPHEVETLFHEMGHAM  260 (458)
T ss_pred             CHHHHHHHHHHHhHHH
Confidence            3456679999999986


No 96 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=44.80  E-value=10  Score=38.05  Aligned_cols=15  Identities=27%  Similarity=0.246  Sum_probs=12.7

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      .+=..+++||+||++
T Consensus       373 ~~~v~TL~HE~GHa~  387 (587)
T TIGR02290       373 RRDVSTLAHELGHAY  387 (587)
T ss_pred             chhHHHHHHHhhHHH
Confidence            466778999999997


No 97 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=42.69  E-value=11  Score=31.36  Aligned_cols=12  Identities=42%  Similarity=0.517  Sum_probs=10.0

Q ss_pred             HHHHHHhhhhhc
Q 043086          240 QTVALHELRHVL  251 (257)
Q Consensus       240 ~~va~HEiGH~L  251 (257)
                      .-|++|||+|..
T Consensus       136 lDVvaHEltHGV  147 (150)
T PF01447_consen  136 LDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cceeeecccccc
Confidence            459999999964


No 98 
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=42.55  E-value=13  Score=36.10  Aligned_cols=16  Identities=38%  Similarity=0.497  Sum_probs=13.2

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+-..++.||+||+|
T Consensus       205 ~~~~v~tLfHEfGHal  220 (422)
T cd06456         205 THDEVTTLFHEFGHAL  220 (422)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            4566778999999997


No 99 
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=41.81  E-value=12  Score=36.17  Aligned_cols=13  Identities=54%  Similarity=0.774  Sum_probs=10.7

Q ss_pred             hHHHHHHhhhhhc
Q 043086          239 MQTVALHELRHVL  251 (257)
Q Consensus       239 l~~va~HEiGH~L  251 (257)
                      +..+.+||+||+|
T Consensus       131 vi~~vvHElGHal  143 (484)
T KOG2921|consen  131 VITVVVHELGHAL  143 (484)
T ss_pred             HHHHHHHHhhHHH
Confidence            4457899999997


No 100
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=39.74  E-value=24  Score=35.83  Aligned_cols=16  Identities=25%  Similarity=0.260  Sum_probs=13.3

Q ss_pred             CchhHHHHHHhhhhhc
Q 043086          236 AVDMQTVALHELRHVL  251 (257)
Q Consensus       236 ~~~l~~va~HEiGH~L  251 (257)
                      ..+-..|++||+||+.
T Consensus       377 t~~dV~TLaHElGHs~  392 (598)
T COG1164         377 TLRDVFTLAHELGHSV  392 (598)
T ss_pred             chhHHHHHHHHccHHH
Confidence            4577889999999984


No 101
>PF03571 Peptidase_M49:  Peptidase family M49;  InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=38.10  E-value=16  Score=36.79  Aligned_cols=16  Identities=44%  Similarity=0.488  Sum_probs=12.7

Q ss_pred             chhHHHHHHhh-hhhcC
Q 043086          237 VDMQTVALHEL-RHVLG  252 (257)
Q Consensus       237 ~~l~~va~HEi-GH~LG  252 (257)
                      ....+|++||+ ||-.|
T Consensus       291 af~~~v~lHEllGHGsG  307 (549)
T PF03571_consen  291 AFFVQVGLHELLGHGSG  307 (549)
T ss_dssp             HHHHHHHHHHHCCCCS-
T ss_pred             hHhhhhhHHhhccCcCc
Confidence            35678999997 99888


No 102
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=36.35  E-value=25  Score=32.36  Aligned_cols=35  Identities=26%  Similarity=0.376  Sum_probs=23.8

Q ss_pred             CCCCCCcccCCCCCccCCCCCCchhHHHHHHhhhhhcCCC
Q 043086          215 PTDGRFHYDGDENWTVGAVPGAVDMQTVALHELRHVLGLA  254 (257)
Q Consensus       215 P~~g~ihfd~~e~wt~~~~~~~~~l~~va~HEiGH~LGL~  254 (257)
                      |..-.|++-.-+.+-     +..+...+++||+||--|=+
T Consensus       184 Pa~D~Iq~p~~~aF~-----da~~yyaTl~HElghwtgh~  218 (316)
T COG4227         184 PASDSIQLPPFEAFR-----DAINYYATLLHELGHWTGHE  218 (316)
T ss_pred             CccCeeccCChhhcc-----chHhHHHHHHHHhccccCch
Confidence            433346666554442     45689999999999987743


No 103
>COG3926 zliS Lysozyme family protein [General function prediction only]
Probab=35.39  E-value=27  Score=31.36  Aligned_cols=29  Identities=17%  Similarity=0.239  Sum_probs=26.3

Q ss_pred             HHHHHHHHHHhcC----CCCCCCCChhhhhccC
Q 043086           92 LESAVKTYQRNFN----LNPTGHMDLKTVSMMG  120 (257)
Q Consensus        92 ~~~AIk~FQ~~~g----L~~TG~lD~~T~~~M~  120 (257)
                      -..|++-+||..+    .+++|++...|+..+.
T Consensus        93 p~rAa~~LQkal~~~~~v~~DGvIG~~TLaAl~  125 (252)
T COG3926          93 PGRAAKWLQKALGPAYTVRVDGVIGAQTLAALK  125 (252)
T ss_pred             cchHHHHHHHHhccCCcccccCcccHHHHHHHH
Confidence            3579999999999    9999999999998875


No 104
>CHL00132 psaF photosystem I subunit III; Validated
Probab=35.05  E-value=88  Score=27.07  Aligned_cols=39  Identities=8%  Similarity=0.107  Sum_probs=24.2

Q ss_pred             hcHHHHHHHHHhcCCCCCCCccccccccCCCcccHHHHHHHHHHHHhcC
Q 043086           56 KGINALKKYLHTLGYLSNHNHNHAAAAADGDYFDENLESAVKTYQRNFN  104 (257)
Q Consensus        56 ~~v~~~q~yL~~fGYl~~~~~~~~~~~~~~~~~~~~~~~AIk~FQ~~~g  104 (257)
                      ..|+.+++.|++|-=-.++          .....+....+.++|.++..
T Consensus        44 ~~~k~Le~rlk~Y~~~s~p----------~lal~~qi~~tk~RFe~Y~~   82 (185)
T CHL00132         44 NSVKKLENRLAKYEANSPP----------ALALQQQIDKTKARFDKYGR   82 (185)
T ss_pred             HHHHHHHhhhhccCCCCCh----------HHHHHHHHHHHHHHHHHhcc
Confidence            3566788888877432221          11245677788888888753


No 105
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=34.30  E-value=21  Score=34.87  Aligned_cols=14  Identities=36%  Similarity=0.470  Sum_probs=12.4

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++||+||-
T Consensus       278 eel~AVl~HELGHW  291 (428)
T KOG2719|consen  278 EELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHHhhHH
Confidence            47999999999994


No 106
>PF11045 YbjM:  Putative inner membrane protein of Enterobacteriaceae;  InterPro: IPR020368 This entry contains membrane proteins with no known function.; GO: 0016021 integral to membrane
Probab=34.11  E-value=38  Score=27.59  Aligned_cols=25  Identities=24%  Similarity=0.284  Sum_probs=19.4

Q ss_pred             CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086            1 MKHKRNALHAFISFFLLQFFLLHAL   25 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~l~~~~   25 (257)
                      ||+||+....+.||+|++...++-.
T Consensus         1 M~~~r~W~g~i~cfvLf~vVfl~~~   25 (125)
T PF11045_consen    1 MANKRRWAGVICCFVLFIVVFLFLK   25 (125)
T ss_pred             CCCCccchHHHHHHHHHHHHHHHhh
Confidence            8999999999988877766554443


No 107
>PF04956 TrbC:  TrbC/VIRB2 family;  InterPro: IPR007039 Conjugal transfer protein, TrbC has been identified as a subunit of the pilus precursor in bacteria. The protein undergoes three processing steps before gaining its mature cyclic structure[]. This family also contains several VirB2 type IV secretion proteins. The virB2 gene encodes a putative type IV secretion system and is known to be a pathogenicity factor in Bartonella species [].
Probab=33.63  E-value=73  Score=23.87  Aligned_cols=25  Identities=24%  Similarity=0.230  Sum_probs=11.8

Q ss_pred             CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086            1 MKHKRNALHAFISFFLLQFFLLHAL   25 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~l~~~~   25 (257)
                      ||+..+.....+..+++.+++++..
T Consensus         1 mk~~~~~~~~~~~~~~~~~~~~~~~   25 (99)
T PF04956_consen    1 MKKVQSKKRRKFLLLLLALALLLLA   25 (99)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHhC
Confidence            7755544444444444444444443


No 108
>PRK10911 oligopeptidase A; Provisional
Probab=32.26  E-value=22  Score=36.76  Aligned_cols=15  Identities=33%  Similarity=0.372  Sum_probs=12.4

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      .+=..++.|||||+|
T Consensus       461 ~~~v~tlfHEfGHal  475 (680)
T PRK10911        461 HDEVITLFHEFGHGL  475 (680)
T ss_pred             HHHHHHHHHHHhHHH
Confidence            356679999999997


No 109
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=31.62  E-value=23  Score=34.26  Aligned_cols=14  Identities=29%  Similarity=0.377  Sum_probs=10.9

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      +-...++||+||+|
T Consensus       158 ~~l~t~iHE~GHal  171 (396)
T cd06460         158 SALFSTIHETGHAL  171 (396)
T ss_pred             HHHHHHHHHhhHHH
Confidence            34457899999986


No 110
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=29.93  E-value=27  Score=33.29  Aligned_cols=14  Identities=36%  Similarity=0.541  Sum_probs=12.5

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      .+..|+.|||+|++
T Consensus       274 glAtvLgHE~aHaV  287 (424)
T KOG2661|consen  274 GLATVLGHEIAHAV  287 (424)
T ss_pred             HHHHHHHHHHHHHH
Confidence            58889999999986


No 111
>PF14891 Peptidase_M91:  Effector protein
Probab=29.32  E-value=37  Score=28.60  Aligned_cols=15  Identities=33%  Similarity=0.264  Sum_probs=12.6

Q ss_pred             chhHHHHHHhhhhhc
Q 043086          237 VDMQTVALHELRHVL  251 (257)
Q Consensus       237 ~~l~~va~HEiGH~L  251 (257)
                      .+-+.|+.||+.|+.
T Consensus       101 ~~p~v~L~HEL~HA~  115 (174)
T PF14891_consen  101 RPPFVVLYHELIHAY  115 (174)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            467789999999984


No 112
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=28.88  E-value=28  Score=24.42  Aligned_cols=20  Identities=35%  Similarity=0.577  Sum_probs=16.8

Q ss_pred             hhHHHHHHhhhhhcCCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HSs  257 (257)
                      ...--.+||+.+.+||.|-|
T Consensus        28 ~~eR~~vH~lA~~~gL~s~S   47 (60)
T cd02641          28 SHDRLLVHELAEELGLRHES   47 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEe
Confidence            56668899999999999854


No 113
>PLN00081 photosystem I reaction center subunit V (PsaG); Provisional
Probab=28.62  E-value=17  Score=29.54  Aligned_cols=39  Identities=21%  Similarity=0.362  Sum_probs=28.2

Q ss_pred             CCCCCcccCC-----CCCcc--CCCCCCchhHHHHHHh-hhhhcCCC
Q 043086          216 TDGRFHYDGD-----ENWTV--GAVPGAVDMQTVALHE-LRHVLGLA  254 (257)
Q Consensus       216 ~~g~ihfd~~-----e~wt~--~~~~~~~~l~~va~HE-iGH~LGL~  254 (257)
                      .+|..||++.     |.|.+  ..++.|+++..|++-- |||+.|..
T Consensus        81 QNG~ThF~agd~ra~e~~~~l~tnDPaGF~ivDvlawGsiGH~vg~~  127 (141)
T PLN00081         81 QNGKTHFDAGDERAKEFAALLKSNDPVGFNLVDVLAWGSLGHIVAYY  127 (141)
T ss_pred             cCCcchhhhHHHHHHHHHHhhccCCCCccchhhhhhhhhhhhhhhhe
Confidence            5799999873     34443  2245789999988875 99998853


No 114
>PF11119 DUF2633:  Protein of unknown function (DUF2633);  InterPro: IPR022576  This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known. 
Probab=26.77  E-value=90  Score=22.12  Aligned_cols=7  Identities=43%  Similarity=0.876  Sum_probs=4.2

Q ss_pred             Ccchhhh
Q 043086            1 MKHKRNA    7 (257)
Q Consensus         1 ~~~~~~~    7 (257)
                      ||+|.|.
T Consensus         1 ~r~k~~~    7 (59)
T PF11119_consen    1 MRRKKNS    7 (59)
T ss_pred             CCCcccc
Confidence            5666664


No 115
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=26.02  E-value=66  Score=32.71  Aligned_cols=14  Identities=36%  Similarity=0.340  Sum_probs=11.3

Q ss_pred             chhHHHHHHhhhhh
Q 043086          237 VDMQTVALHELRHV  250 (257)
Q Consensus       237 ~~l~~va~HEiGH~  250 (257)
                      ..+..|++|||.|.
T Consensus       277 ~s~~~viaHElAHq  290 (601)
T TIGR02411       277 RSNVDVIAHELAHS  290 (601)
T ss_pred             hhhhhhHHHHHHhh
Confidence            34678999999994


No 116
>PRK06287 cobalt transport protein CbiN; Validated
Probab=24.83  E-value=88  Score=24.58  Aligned_cols=28  Identities=32%  Similarity=0.197  Sum_probs=14.9

Q ss_pred             CcchhhhHHHHHHHHHHHHHHHHHhhhh
Q 043086            1 MKHKRNALHAFISFFLLQFFLLHALASA   28 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~   28 (257)
                      |.++|..+..++++++++..+++..|++
T Consensus         1 ~~~~~~~~~~~~~~all~a~~~s~~AS~   28 (107)
T PRK06287          1 MMDNKKFLIAGLIVALLIAILAPFLASG   28 (107)
T ss_pred             CCcchhhHHHHHHHHHHHHHHHHhhhcC
Confidence            3455555555555555555555544443


No 117
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=24.19  E-value=91  Score=24.00  Aligned_cols=17  Identities=0%  Similarity=-0.021  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 043086            8 LHAFISFFLLQFFLLHA   24 (257)
Q Consensus         8 ~~~~~~~~~~~~~l~~~   24 (257)
                      +.+|.++|.+++|+...
T Consensus         6 ~llL~l~LA~lLlisSe   22 (95)
T PF07172_consen    6 FLLLGLLLAALLLISSE   22 (95)
T ss_pred             HHHHHHHHHHHHHHHhh
Confidence            44555555555555443


No 118
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=24.01  E-value=37  Score=23.98  Aligned_cols=20  Identities=30%  Similarity=0.371  Sum_probs=16.3

Q ss_pred             hhHHHHHHhhhhhcCCCCCC
Q 043086          238 DMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       238 ~l~~va~HEiGH~LGL~HSs  257 (257)
                      .-.--.+|.+-|-|||.|.|
T Consensus        28 ~~eRriih~la~~lGL~~~s   47 (60)
T cd02639          28 PAERRIVHLLASRLGLNHVS   47 (60)
T ss_pred             HHHHHHHHHHHHHcCCceEE
Confidence            45557899999999999864


No 119
>PRK14125 cell division suppressor protein YneA; Provisional
Probab=22.85  E-value=1.2e+02  Score=23.68  Aligned_cols=25  Identities=32%  Similarity=0.434  Sum_probs=14.0

Q ss_pred             CcchhhhHHHHHHHHHHHHHHHHHh
Q 043086            1 MKHKRNALHAFISFFLLQFFLLHAL   25 (257)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~l~~~~   25 (257)
                      ||-+-|-++..+++++|++++.+..
T Consensus         1 ~~~~~~~~~~~ii~~~l~~~~~~~~   25 (103)
T PRK14125          1 LKLKESKIHVSIFFVLTALVLLIFV   25 (103)
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHH
Confidence            4555566666665555555554443


No 120
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=21.17  E-value=40  Score=30.55  Aligned_cols=17  Identities=35%  Similarity=0.573  Sum_probs=12.3

Q ss_pred             chhHHHHHHhhhhhcCC
Q 043086          237 VDMQTVALHELRHVLGL  253 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL  253 (257)
                      .|-..|++|||.|.|=+
T Consensus       152 ~dg~NVviHEfAH~LD~  168 (253)
T PF06167_consen  152 NDGHNVVIHEFAHKLDM  168 (253)
T ss_dssp             SSS--HHHHHHHHHHHC
T ss_pred             CCCcchHHHHHHHHHHh
Confidence            46678999999999843


No 121
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=21.14  E-value=46  Score=34.43  Aligned_cols=14  Identities=36%  Similarity=0.501  Sum_probs=11.3

Q ss_pred             hhHHHHHHhhhhhc
Q 043086          238 DMQTVALHELRHVL  251 (257)
Q Consensus       238 ~l~~va~HEiGH~L  251 (257)
                      +=..++-|||||+|
T Consensus       463 ~eV~TlFHEfGHal  476 (681)
T PRK10280        463 DDVITLFHEFGHTL  476 (681)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44558999999997


No 122
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=21.01  E-value=44  Score=30.93  Aligned_cols=40  Identities=25%  Similarity=0.246  Sum_probs=23.0

Q ss_pred             ecCCCCCCCCcccCCCCC----ccCCCCCCchhHHHHHHhhhhhc
Q 043086          211 HSFPPTDGRFHYDGDENW----TVGAVPGAVDMQTVALHELRHVL  251 (257)
Q Consensus       211 haf~P~~g~ihfd~~e~w----t~~~~~~~~~l~~va~HEiGH~L  251 (257)
                      +-|-|.+..++||....=    -++. .+.+--..|++||.||.+
T Consensus       136 PFYCP~D~kvYlDlsFf~~m~~~fga-~GdfAqaYViAHEVGHHV  179 (295)
T COG2321         136 PFYCPADTKVYLDLSFFDEMKTKFGA-SGDFAQAYVIAHEVGHHV  179 (295)
T ss_pred             CeecCCCceEEEehhHHHHHHHHhcC-CccHHHHHHHHhhhhHHH
Confidence            335677777777753100    0111 133455669999999974


No 123
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=20.99  E-value=50  Score=33.48  Aligned_cols=16  Identities=31%  Similarity=0.507  Sum_probs=14.3

Q ss_pred             CCchhHHHHHHhhhhh
Q 043086          235 GAVDMQTVALHELRHV  250 (257)
Q Consensus       235 ~~~~l~~va~HEiGH~  250 (257)
                      ++..|..|++|||.|+
T Consensus       284 GDrsl~~vIaHEIAHS  299 (613)
T KOG1047|consen  284 GDRSLVDVIAHEIAHS  299 (613)
T ss_pred             CCcchhhHHHHHhhhh
Confidence            5678999999999997


No 124
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=20.90  E-value=2e+02  Score=27.15  Aligned_cols=36  Identities=28%  Similarity=0.405  Sum_probs=28.4

Q ss_pred             hhhhhhccCCCCCCchhcHHHHHHHHHhcCCCCCCC
Q 043086           40 LEFFKLFQGTQKGDTVKGINALKKYLHTLGYLSNHN   75 (257)
Q Consensus        40 ~~~~~~l~~~~~g~~~~~v~~~q~yL~~fGYl~~~~   75 (257)
                      ..++.+-.|+-.++...+++.+--+|..+||.+..+
T Consensus       297 l~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~an  332 (378)
T KOG3988|consen  297 LEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYAN  332 (378)
T ss_pred             HHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCC
Confidence            344555557778888889999999999999998764


No 125
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=20.41  E-value=52  Score=23.17  Aligned_cols=21  Identities=19%  Similarity=0.401  Sum_probs=17.4

Q ss_pred             chhHHHHHHhhhhhcCCCCCC
Q 043086          237 VDMQTVALHELRHVLGLAHRP  257 (257)
Q Consensus       237 ~~l~~va~HEiGH~LGL~HSs  257 (257)
                      ....--++|++-+.+||.|-|
T Consensus        27 t~~eR~~vH~~a~~~gL~s~S   47 (60)
T cd02640          27 SKEERALIHQIAQKYGLKSRS   47 (60)
T ss_pred             CHHHHHHHHHHHHHcCCceee
Confidence            356778999999999998754


Done!