Query         020771
Match_columns 321
No_of_seqs    234 out of 1572
Neff          4.6 
Searched_HMMs 46136
Date          Fri Mar 29 05:01:52 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/020771.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/020771hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2661 Peptidase family M48 [ 100.0 5.4E-56 1.2E-60  425.5   7.5  244    1-318    44-293 (424)
  2 COG4783 Putative Zn-dependent   99.8 6.6E-20 1.4E-24  183.7  10.8  116  121-317    32-147 (484)
  3 COG4784 Putative Zn-dependent   99.8 7.6E-19 1.6E-23  170.8   9.0  121  117-318    20-142 (479)
  4 COG0501 HtpX Zn-dependent prot  99.5 4.7E-14   1E-18  131.2   5.5   57  261-318   115-175 (302)
  5 PF01435 Peptidase_M48:  Peptid  99.5 8.9E-14 1.9E-18  123.5   6.8   78  235-319    28-108 (226)
  6 PRK03982 heat shock protein Ht  99.3 2.7E-12 5.9E-17  121.8   7.8   72  237-317    67-142 (288)
  7 PRK02870 heat shock protein Ht  99.3 3.9E-12 8.4E-17  124.2   8.0   74  236-317   113-190 (336)
  8 PRK01345 heat shock protein Ht  99.3   4E-12 8.6E-17  122.9   7.6   73  237-318    66-142 (317)
  9 PRK03001 M48 family peptidase;  99.2 1.3E-11 2.9E-16  116.8   7.2   53  264-317    85-141 (283)
 10 PRK03072 heat shock protein Ht  99.2 1.3E-11 2.7E-16  117.8   6.7   73  236-317    68-144 (288)
 11 PRK04897 heat shock protein Ht  99.2 2.1E-11 4.5E-16  116.7   7.5   73  236-317    78-154 (298)
 12 PRK05457 heat shock protein Ht  99.2 4.2E-11 9.2E-16  114.1   7.6   61  254-317    87-151 (284)
 13 PRK02391 heat shock protein Ht  99.2 4.6E-11   1E-15  114.5   7.1   73  236-317    74-150 (296)
 14 PRK01265 heat shock protein Ht  99.1 7.9E-11 1.7E-15  114.6   7.7   72  237-317    82-157 (324)
 15 KOG2719 Metalloprotease [Gener  97.1 0.00041   9E-09   70.2   4.4   54  265-318   236-298 (428)
 16 PF13203 DUF2201_N:  Putative m  96.5  0.0043 9.3E-08   58.8   5.2   51  266-317    27-77  (292)
 17 PF05569 Peptidase_M56:  BlaR1   96.1   0.007 1.5E-07   57.4   4.4   51  266-317   160-213 (299)
 18 PF04228 Zn_peptidase:  Putativ  95.2   0.039 8.5E-07   53.6   5.9   43  273-315   136-185 (292)
 19 PF06114 DUF955:  Domain of unk  94.5   0.079 1.7E-06   41.7   5.0   48  265-317     7-59  (122)
 20 PF12388 Peptidase_M57:  Dual-a  92.7    0.09 1.9E-06   49.1   2.9   34  280-316   116-150 (211)
 21 PF13699 DUF4157:  Domain of un  92.4   0.086 1.9E-06   41.7   2.0   39  273-314    37-75  (79)
 22 PF10263 SprT-like:  SprT-like   91.2    0.56 1.2E-05   40.1   6.0   31  283-314    44-74  (157)
 23 smart00731 SprT SprT homologue  90.3    0.34 7.4E-06   41.9   3.8   32  281-313    41-72  (146)
 24 PRK04351 hypothetical protein;  89.9    0.45 9.8E-06   42.0   4.3   33  279-312    41-73  (149)
 25 PF00413 Peptidase_M10:  Matrix  86.8    0.44 9.6E-06   40.1   2.1   22  295-316   100-122 (154)
 26 COG1451 Predicted metal-depend  86.2    0.55 1.2E-05   43.9   2.6   48  270-319   147-195 (223)
 27 PF10026 DUF2268:  Predicted Zn  85.5     1.4 3.1E-05   39.8   4.9   33  282-316    49-81  (195)
 28 COG4219 MecR1 Antirepressor re  84.8     1.3 2.8E-05   44.1   4.5   50  266-316   154-206 (337)
 29 cd04279 ZnMc_MMP_like_1 Zinc-d  84.6    0.63 1.4E-05   40.0   2.1   20  297-316   101-121 (156)
 30 PF13485 Peptidase_MA_2:  Pepti  84.4     1.1 2.5E-05   35.4   3.3   22  295-316    20-41  (128)
 31 PF01863 DUF45:  Protein of unk  84.4    0.64 1.4E-05   41.3   2.1   54  265-320   131-185 (205)
 32 COG2321 Predicted metalloprote  83.2     1.1 2.3E-05   43.8   3.2   42  273-314   133-181 (295)
 33 cd04276 ZnMc_MMP_like_2 Zinc-d  82.6       1 2.2E-05   41.5   2.7   36  282-317    92-134 (197)
 34 cd04268 ZnMc_MMP_like Zinc-dep  82.6    0.99 2.2E-05   38.5   2.5   21  297-317    91-112 (165)
 35 PF04450 BSP:  Peptidase of pla  82.5     3.2   7E-05   38.4   5.9   45  268-313    57-109 (205)
 36 COG3590 PepO Predicted metallo  81.4    0.43 9.4E-06   50.6  -0.2   46  273-318   450-505 (654)
 37 PF14247 DUF4344:  Domain of un  80.3      11 0.00024   35.5   8.7   50  263-312    38-104 (220)
 38 PRK04860 hypothetical protein;  79.1     2.1 4.5E-05   38.4   3.3   32  282-314    46-77  (160)
 39 PF10463 Peptidase_U49:  Peptid  78.1     1.4   3E-05   41.1   2.0   21  298-318    99-119 (206)
 40 COG2856 Predicted Zn peptidase  76.6     2.3 5.1E-05   39.7   3.1   30  282-316    59-88  (213)
 41 cd04278 ZnMc_MMP Zinc-dependen  74.5     1.7 3.7E-05   37.4   1.5   21  297-317   104-125 (157)
 42 cd04277 ZnMc_serralysin_like Z  74.4     2.3 4.9E-05   37.6   2.3   36  282-317    94-131 (186)
 43 cd00203 ZnMc Zinc-dependent me  73.8     1.5 3.3E-05   37.2   1.0   20  297-316    93-113 (167)
 44 cd04269 ZnMc_adamalysin_II_lik  73.0     2.6 5.6E-05   37.4   2.3   19  297-315   128-147 (194)
 45 cd04270 ZnMc_TACE_like Zinc-de  73.0     2.3 5.1E-05   39.8   2.1   32  284-315   146-183 (244)
 46 PF13582 Reprolysin_3:  Metallo  72.7     1.9 4.1E-05   35.1   1.3   14  302-315   109-123 (124)
 47 PF05572 Peptidase_M43:  Pregna  72.1     2.1 4.6E-05   37.7   1.5   23  296-318    65-88  (154)
 48 PF13688 Reprolysin_5:  Metallo  72.0     2.2 4.8E-05   37.6   1.6   20  296-315   138-158 (196)
 49 KOG3624 M13 family peptidase [  71.3     3.2   7E-05   44.2   2.9   47  273-319   481-537 (687)
 50 PF01421 Reprolysin:  Reprolysi  70.7     3.4 7.3E-05   36.8   2.5   21  295-315   126-147 (199)
 51 PF13583 Reprolysin_4:  Metallo  69.5     2.5 5.3E-05   38.7   1.4   16  302-317   139-155 (206)
 52 cd04267 ZnMc_ADAM_like Zinc-de  68.2     2.1 4.6E-05   37.8   0.7   20  297-316   130-150 (192)
 53 PF01447 Peptidase_M4:  Thermol  67.5     3.6 7.7E-05   36.3   1.9   41  271-315   107-150 (150)
 54 smart00235 ZnMc Zinc-dependent  66.7     3.2   7E-05   34.7   1.5   35  276-316    67-103 (140)
 55 PF14891 Peptidase_M91:  Effect  66.4     3.3 7.2E-05   36.8   1.5   17  296-312    99-115 (174)
 56 PF13574 Reprolysin_2:  Metallo  65.8     3.3 7.2E-05   36.7   1.4   17  300-316   111-128 (173)
 57 cd04272 ZnMc_salivary_gland_MP  63.5       4 8.7E-05   37.2   1.5   14  299-312   144-157 (220)
 58 COG3091 SprT Zn-dependent meta  61.5     8.3 0.00018   34.7   3.1   29  283-312    45-73  (156)
 59 PF01433 Peptidase_M1:  Peptida  60.0      12 0.00025   36.2   4.1   51  264-314   254-309 (390)
 60 PRK09672 phage exclusion prote  59.8       6 0.00013   39.1   2.1   20  298-317   163-182 (305)
 61 PF08325 WLM:  WLM domain;  Int  59.3       4 8.8E-05   37.3   0.8   25  296-320    78-103 (186)
 62 TIGR02411 leuko_A4_hydro leuko  58.7     8.1 0.00018   41.0   3.0   32  276-311   259-290 (601)
 63 PF02031 Peptidase_M7:  Strepto  54.0     6.8 0.00015   34.4   1.3   47  262-312    40-89  (132)
 64 KOG1047 Bifunctional leukotrie  53.8     6.3 0.00014   42.0   1.2   27  283-312   274-300 (613)
 65 cd04273 ZnMc_ADAMTS_like Zinc-  53.7     2.1 4.5E-05   38.7  -2.1   16  300-315   140-156 (207)
 66 PF04298 Zn_peptidase_2:  Putat  52.7     7.6 0.00017   36.8   1.5   16  301-316    90-105 (222)
 67 COG2738 Predicted Zn-dependent  51.5     7.9 0.00017   36.4   1.3   16  301-316    93-108 (226)
 68 cd04271 ZnMc_ADAM_fungal Zinc-  51.0       6 0.00013   36.9   0.5   14  302-315   147-161 (228)
 69 PF01457 Peptidase_M8:  Leishma  50.8      17 0.00036   38.0   3.7   54  258-312   156-222 (521)
 70 PF14521 Aspzincin_M35:  Lysine  50.5      20 0.00043   31.2   3.6   40  272-311    64-107 (148)
 71 PF13402 M60-like:  Peptidase M  47.6      23 0.00049   33.5   3.8   51  264-314   178-233 (307)
 72 cd04327 ZnMc_MMP_like_3 Zinc-d  47.6      13 0.00028   33.5   2.1   15  298-312    90-104 (198)
 73 cd06459 M3B_Oligoendopeptidase  46.3      12 0.00026   36.7   1.8   39  271-313   194-235 (427)
 74 TIGR00181 pepF oligoendopeptid  40.9     9.5 0.00021   39.8   0.2   45  266-314   346-392 (591)
 75 TIGR02412 pepN_strep_liv amino  40.5      29 0.00063   38.3   3.8   15   81-95    120-134 (831)
 76 PTZ00337 surface protease GP63  40.1      56  0.0012   35.0   5.6   52  258-312   178-241 (567)
 77 TIGR02414 pepN_proteo aminopep  38.8      16 0.00035   40.7   1.5   24   72-95     99-122 (863)
 78 COG0308 PepN Aminopeptidase N   38.7      23  0.0005   39.3   2.7   51  262-312   262-319 (859)
 79 PF06861 BALF1:  BALF1 protein;  38.3      21 0.00045   32.8   1.9   25  292-316   124-148 (182)
 80 PF12315 DUF3633:  Protein of u  38.1      19  0.0004   34.0   1.6   26  284-314    82-107 (212)
 81 PF09715 Plasmod_dom_1:  Plasmo  37.1      15 0.00033   28.8   0.7   31    2-32      1-31  (67)
 82 PRK13267 archaemetzincin-like   36.6      39 0.00085   30.7   3.4   17  299-315   124-141 (179)
 83 PRK14015 pepN aminopeptidase N  34.3      26 0.00056   39.2   2.2   24   72-95    112-135 (875)
 84 KOG1565 Gelatinase A and relat  33.8      18  0.0004   37.6   0.9   18  299-316   210-228 (469)
 85 TIGR02289 M3_not_pepF oligoend  33.2      21 0.00046   37.1   1.3   41  270-314   308-351 (549)
 86 COG3824 Predicted Zn-dependent  33.0      31 0.00068   30.2   2.0   23  132-154    11-33  (136)
 87 PF10023 DUF2265:  Predicted am  32.9      25 0.00055   35.2   1.7   17  296-312   161-177 (337)
 88 cd04275 ZnMc_pappalysin_like Z  31.7       8 0.00017   36.3  -1.9   20  299-318   136-156 (225)
 89 TIGR01519 plasmod_dom_1 Plasmo  31.6      22 0.00047   28.1   0.8   31    2-32      1-31  (70)
 90 KOG3658 Tumor necrosis factor-  30.8      13 0.00029   40.5  -0.7   32  281-312   368-404 (764)
 91 PF09768 Peptidase_M76:  Peptid  30.6      39 0.00085   30.7   2.4   29  281-312    55-83  (173)
 92 PF12725 DUF3810:  Protein of u  29.9      27 0.00059   34.3   1.4   15  298-312   194-208 (318)
 93 PTZ00257 Glycoprotein GP63 (le  29.9      99  0.0021   33.6   5.5   55  258-313   202-269 (622)
 94 PF02571 CbiJ:  Precorrin-6x re  29.6      50  0.0011   31.4   3.0   37  142-178    54-91  (249)
 95 KOG3607 Meltrins, fertilins an  27.4      35 0.00075   37.5   1.7   21  296-316   319-340 (716)
 96 KOG1046 Puromycin-sensitive am  26.2      94   0.002   34.8   4.8   47  266-312   283-336 (882)
 97 cd06164 S2P-M50_SpoIVFB_CBS Sp  26.2      40 0.00088   31.4   1.7   14  299-312    52-65  (227)
 98 PF07225 NDUF_B4:  NADH-ubiquin  26.0      38 0.00082   29.4   1.4   47   62-116    46-99  (125)
 99 PF01432 Peptidase_M3:  Peptida  25.5      38 0.00081   34.1   1.5   17  296-313   239-255 (458)
100 COG4324 Predicted aminopeptida  25.2      50  0.0011   32.8   2.2   45  261-311   162-208 (376)
101 cd06160 S2P-M50_like_2 Unchara  24.0      48   0.001   30.1   1.7   16  297-312    38-53  (183)
102 COG3864 Uncharacterized protei  24.0      62  0.0013   32.7   2.6   35  281-316    51-85  (396)
103 TIGR02290 M3_fam_3 oligoendope  23.9      39 0.00084   35.5   1.3   44  266-313   342-388 (587)
104 PF01400 Astacin:  Astacin (Pep  23.8      44 0.00095   30.1   1.4   12  301-312    80-91  (191)
105 PF08014 DUF1704:  Domain of un  23.0 1.1E+02  0.0023   30.8   4.1   47  263-313   132-179 (349)
106 COG1164 Oligoendopeptidase F [  23.0      38 0.00083   36.2   1.0   48  263-314   343-394 (598)
107 cd06455 M3A_TOP Peptidase M3 T  22.6      45 0.00097   34.2   1.4   17  297-314   261-277 (472)
108 cd04281 ZnMc_BMP1_TLD Zinc-dep  22.5      40 0.00086   31.1   0.9   12  301-312    88-99  (200)
109 cd04283 ZnMc_hatching_enzyme Z  21.9      42 0.00091   30.6   0.9   12  301-312    78-89  (182)
110 PF02102 Peptidase_M35:  Deuter  21.5 1.3E+02  0.0028   30.5   4.4   42  272-313   262-310 (359)
111 TIGR03296 M6dom_TIGR03296 M6 f  21.1      24 0.00051   33.9  -0.9   12  301-312   166-177 (286)
112 COG4900 Predicted metallopepti  20.9      85  0.0018   27.3   2.5   26  286-312    67-92  (133)
113 cd06159 S2P-M50_PDZ_Arch Uncha  20.3      62  0.0013   31.2   1.7   33  274-312    96-130 (263)

No 1  
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=100.00  E-value=5.4e-56  Score=425.48  Aligned_cols=244  Identities=27%  Similarity=0.417  Sum_probs=224.1

Q ss_pred             ChhhhhhHHHHHHHhh-cccCCCCCCCcccccc-ccccCC----CCCcccccCCcccceeeccccccccCCCCCCCCcce
Q 020771            1 MAYYRRAKAALDAFRN-LSSKAVPKSPVQESCS-RVYSNG----SANSAKFSSGFYSYSCISQRLRNSYCNPNFNTAKRY   74 (321)
Q Consensus         1 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~   74 (321)
                      |.|||+.|+.+|.+++ ++++|.|++....+.+ .+.++.    .+...+.+.|++++.+.-+.++..-+|||.+.+.+|
T Consensus        44 ~~~~~~~~~~~~~l~~~~~~~i~~~t~~~t~~n~~~~~~~~~~~~t~v~S~~v~l~~~~~s~Rn~Ng~~y~rFqsnp~~y  123 (424)
T KOG2661|consen   44 HFYSTFNNKRTGGLSSTKSKEIWRITSKCTVWNDAFSRQLLIKEVTAVPSLSVLLPLSPASIRNINGHTYPRFQSNPVPY  123 (424)
T ss_pred             eeeeeecchhcccccccCCccceecccccceehhhhhhccchhhhhcccccccccccchhhhhccCccccchhhcCCccH
Confidence            7899999999999999 9999999999877776 333322    445578899999999999999999999999999999


Q ss_pred             eEeecceeeeeccCCCCccccCCchhHHHHHHHHHHHHHHHhhccCccCCccchhccCCCHHHHHHHHHHHHHHHHHHhc
Q 020771           75 YYVDRYHVQHFRPRGPRKWLQNPRTVFIVVVIGSGAFITLYLGNLETVPYTKRTHFVLLSKAVERQLGESQFQQMKAAFK  154 (321)
Q Consensus        75 y~~~~~~~~hf~~rg~r~w~~~~r~~~~~v~vg~g~~~~~y~~~lEtvP~TgR~rfillS~~~E~~Lgeq~y~qil~e~~  154 (321)
                      |+.+++.|+||+|++.-+|++|+|++++++++|+.++++|||+|+|++|+|||+||+++|+++|..||+.+|+++++|++
T Consensus       124 s~~~r~~~~sfkpn~~dk~f~n~rk~f~~lf~G~~~~~vfyfthlE~~Pit~Rsrfillskp~ekllge~~ye~im~eyq  203 (424)
T KOG2661|consen  124 SLMIRKWWQSFKPNKKDKVFENIRKNFWKLFLGLLLFVVFYFTHLEVSPITGRSRFILLSKPQEKLLGELEYEAIMEEYQ  203 (424)
T ss_pred             HHHHHHHHHhcCCCCccchhhhHHHHHHHHHhhccceEEEEEeeeeeeccCCcceeeeccCHHHHHhhHHHHHHHHHHhc
Confidence            99999999999999999999999999999999876677899999999999999999999999999999999999999999


Q ss_pred             CCCCCCCChhHHHHHHHHHHHHHHHHhcccccccccccCCCCccccccccchhhhhHHhhhhcccccccccCcccccccc
Q 020771          155 GKILPAIHPDSVRVRLIAKDIIEALQRGLKHETVWSDMGYASTETDFVNEGRAARDTLRALSENSERGKTEGKWHQEDEI  234 (321)
Q Consensus       155 ~kiLP~~hp~s~rVr~Ia~rIi~al~~~l~~~~~~~~~gy~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~l  234 (321)
                      ++|||+.||++.+|++|+.+||++++.                                                     
T Consensus       204 ~~mLp~~hp~sl~V~~vlk~iIea~~~-----------------------------------------------------  230 (424)
T KOG2661|consen  204 NDMLPEKHPRSLAVKEVLKHIIEANKD-----------------------------------------------------  230 (424)
T ss_pred             CccCCcCCchhhHHHHHHHHHHHHhcc-----------------------------------------------------
Confidence            999999999999999999999998520                                                     


Q ss_pred             cchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          235 LDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       235 l~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                                           ...+++..|+++||+||.+||||+|||+|||+||+|..|++||+||+|||||+||.++|
T Consensus       231 ---------------------~~slsgIkWeihVVndPipNAFvLPgGKvfVFtgiLn~ck~ddglAtvLgHE~aHaVar  289 (424)
T KOG2661|consen  231 ---------------------VPSLSGIKWEIHVVNDPIPNAFVLPGGKVFVFTGILNSCKDDDGLATVLGHEIAHAVAR  289 (424)
T ss_pred             ---------------------CCcccCceeEEEEecCCCCceeeccCCeEEEEechhhcccChHHHHHHHHHHHHHHHHH
Confidence                                 01256789999999999999999999999999999999999999999999999999999


Q ss_pred             cCCC
Q 020771          315 CGLP  318 (321)
Q Consensus       315 H~l~  318 (321)
                      |+.+
T Consensus       290 H~AE  293 (424)
T KOG2661|consen  290 HAAE  293 (424)
T ss_pred             HHHH
Confidence            9853


No 2  
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.81  E-value=6.6e-20  Score=183.66  Aligned_cols=116  Identities=27%  Similarity=0.313  Sum_probs=94.1

Q ss_pred             ccCCccchhccCCCHHHHHHHHHHHHHHHHHHhcCCCCCCCChhHHHHHHHHHHHHHHHHhcccccccccccCCCCcccc
Q 020771          121 TVPYTKRTHFVLLSKAVERQLGESQFQQMKAAFKGKILPAIHPDSVRVRLIAKDIIEALQRGLKHETVWSDMGYASTETD  200 (321)
Q Consensus       121 tvP~TgR~rfillS~~~E~~Lgeq~y~qil~e~~~kiLP~~hp~s~rVr~Ia~rIi~al~~~l~~~~~~~~~gy~~~~~~  200 (321)
                      ..|.+|....-.+|+++|..+|...+.|+..+  .                                             
T Consensus        32 ~lp~ig~~~~s~ls~~qev~~g~~~~~Qlr~~--~---------------------------------------------   64 (484)
T COG4783          32 LLPDIGVSAGSTLSPAQEVALGDPANAQLRGS--V---------------------------------------------   64 (484)
T ss_pred             hcchhhhhhhhcCCHHHHHHhhHHHHHHhccC--C---------------------------------------------
Confidence            46888999999999999999999877766543  1                                             


Q ss_pred             ccccchhhhhHHhhhhcccccccccCcccccccccchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC
Q 020771          201 FVNEGRAARDTLRALSENSERGKTEGKWHQEDEILDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP  280 (321)
Q Consensus       201 ~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ll~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP  280 (321)
                                                      +++.||++++|++.  +..++...+++++.+++|++|+|+.+||||+|
T Consensus        65 --------------------------------~~i~D~el~~yv~~--~g~rL~~~a~~~~~~f~f~lV~d~~iNAFA~~  110 (484)
T COG4783          65 --------------------------------PLIRDPELEEYVNS--LGQRLAAAADLVKTPFTFFLVNDDSINAFATP  110 (484)
T ss_pred             --------------------------------CCcCCHHHHHHHHH--HHHHHHHhcCCCCCCeEEEEecCCccchhhcC
Confidence                                            23444445555542  12244444556788999999999999999999


Q ss_pred             CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      ||.|+|+||||-..+||+|||+||||||||+..+|..
T Consensus       111 Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~a  147 (484)
T COG4783         111 GGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLA  147 (484)
T ss_pred             CceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHH
Confidence            9999999999999999999999999999999999964


No 3  
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=99.77  E-value=7.6e-19  Score=170.81  Aligned_cols=121  Identities=24%  Similarity=0.350  Sum_probs=100.2

Q ss_pred             hccCccCCccchhccCCCHHHHHH-HHHHHHHHHHHHhcCCCCCCCChhH-HHHHHHHHHHHHHHHhcccccccccccCC
Q 020771          117 GNLETVPYTKRTHFVLLSKAVERQ-LGESQFQQMKAAFKGKILPAIHPDS-VRVRLIAKDIIEALQRGLKHETVWSDMGY  194 (321)
Q Consensus       117 ~~lEtvP~TgR~rfillS~~~E~~-Lgeq~y~qil~e~~~kiLP~~hp~s-~rVr~Ia~rIi~al~~~l~~~~~~~~~gy  194 (321)
                      ..|...|-|+-.-+..+++.++.. ||.++-..|+++|++..   +||-. +.|.+|+.++.                  
T Consensus        20 ~~~~~~Ps~~p~~v~~v~~n~~laqmgar~hprIla~yggeY---~D~Kler~Vari~g~lt------------------   78 (479)
T COG4784          20 YESGVQPSDNPQTVDNVQKNDPLAQMGARQHPRILATYGGEY---RDPKLERMVARIVGALT------------------   78 (479)
T ss_pred             cccccCCCCCcchHHHhhccChHhHhhhhhChhHHHhcCCcc---CCHHHHHHHHHHHhHhh------------------
Confidence            367778999999999999988766 99999999999998765   24443 23444443332                  


Q ss_pred             CCccccccccchhhhhHHhhhhcccccccccCcccccccccchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCC
Q 020771          195 ASTETDFVNEGRAARDTLRALSENSERGKTEGKWHQEDEILDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVI  274 (321)
Q Consensus       195 ~~~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ll~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~  274 (321)
                                                                                  ..++-+...+.|.|+|+|.+
T Consensus        79 ------------------------------------------------------------~~S~~p~q~YriTilnSP~I   98 (479)
T COG4784          79 ------------------------------------------------------------AVSENPQQTYRITILNSPNI   98 (479)
T ss_pred             ------------------------------------------------------------hhccCCCceEEEEEecCCCc
Confidence                                                                        22344677899999999999


Q ss_pred             ceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCCC
Q 020771          275 NAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGLP  318 (321)
Q Consensus       275 NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l~  318 (321)
                      |||++|||.|+|+.|||..+.++.|+|+||+||||||..+|+++
T Consensus        99 NAFALPGGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHgi~  142 (479)
T COG4784          99 NAFALPGGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHGIQ  142 (479)
T ss_pred             cccccCCceEEEehhHHHHcCCHHHHHHHHHhhhhheecchhHH
Confidence            99999999999999999999999999999999999999999975


No 4  
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.46  E-value=4.7e-14  Score=131.21  Aligned_cols=57  Identities=35%  Similarity=0.529  Sum_probs=53.1

Q ss_pred             CCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCCC
Q 020771          261 GLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGLP  318 (321)
Q Consensus       261 ~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l~  318 (321)
                      ...|+++|++++.+||||+|+    |.|+|++|||+.+ |+|||++|||||+||+.++|++.
T Consensus       115 ~~~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gLl~~l-~~dEl~aVlaHElgHi~~rd~~~  175 (302)
T COG0501         115 PHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDLL-NDDELEAVLAHELGHIKNRHTLV  175 (302)
T ss_pred             CCCCeeEEecCCCccceecCCCCCCeeEEecHHHHhhC-CHHHHHHHHHHHHHHHhcccHHH
Confidence            457999999999999999998    7999999999977 99999999999999999999863


No 5  
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.46  E-value=8.9e-14  Score=123.55  Aligned_cols=78  Identities=29%  Similarity=0.394  Sum_probs=59.2

Q ss_pred             cchHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCCCe---EEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          235 LDDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPGGK---IVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       235 l~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPgG~---I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      .+++++++.++.      +++..+....+++++|++++.+||||+|+|.   |+|++||+..+ |+|||++|||||+||+
T Consensus        28 ~~~~~L~~~v~~------l~~~~~~~~~~~~v~v~~~~~~NA~~~g~~~~~~I~v~~~ll~~~-~~~el~aVlaHElgH~  100 (226)
T PF01435_consen   28 LEDPELRRIVEE------LARRAGLGIPPPRVYVIDSPSPNAFATGGGPRKRIVVTSGLLESL-SEDELAAVLAHELGHI  100 (226)
T ss_dssp             -HHHHHHHHHHH------HHHHHHCTSS--EEEEE--SSEEEEEETTTC--EEEEEHHHHHHS-SHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHH------HHHHhcCCCCCCeEEEEcCCCCcEEEEccCCCcEEEEeChhhhcc-cHHHHHHHHHHHHHHH
Confidence            345666666653      2233333445689999999999999999999   99999999665 9999999999999999


Q ss_pred             hcccCCCC
Q 020771          312 QRICGLPF  319 (321)
Q Consensus       312 ~~rH~l~~  319 (321)
                      ..+|....
T Consensus       101 ~~~h~~~~  108 (226)
T PF01435_consen  101 KHRHILKS  108 (226)
T ss_dssp             HTTHCCCC
T ss_pred             HcCCcchH
Confidence            99998764


No 6  
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.33  E-value=2.7e-12  Score=121.77  Aligned_cols=72  Identities=18%  Similarity=0.244  Sum_probs=58.1

Q ss_pred             hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      +|++++.++.      ++...+++  ..+++|++++.+|||+++    +|.|+|++||++.+ |+|||++|||||+||+.
T Consensus        67 ~p~L~~~v~~------la~~~g~~--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~l-~~~El~AVlAHElgHi~  137 (288)
T PRK03982         67 APELYRIVER------LAERANIP--KPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLL-NEDELEGVIAHELTHIK  137 (288)
T ss_pred             hHHHHHHHHH------HHHHcCCC--CCeEEEEeCCCcceEEeccCCCCeEEEeehHHHhhC-CHHHHHHHHHHHHHHHH
Confidence            5666666553      23344443  348999999999999994    46789999999999 89999999999999999


Q ss_pred             cccCC
Q 020771          313 RICGL  317 (321)
Q Consensus       313 ~rH~l  317 (321)
                      ++|.+
T Consensus       138 ~~h~~  142 (288)
T PRK03982        138 NRDTL  142 (288)
T ss_pred             cCCHH
Confidence            99974


No 7  
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.31  E-value=3.9e-12  Score=124.19  Aligned_cols=74  Identities=19%  Similarity=0.305  Sum_probs=59.2

Q ss_pred             chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEe----CCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCL----PGGKIVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfal----PgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      +++++.+.++.      ++...+++ ..++++|++++.+||||+    |++.|+|++|||+.+ |+|||++|||||+||+
T Consensus       113 ~~~~L~~~ve~------La~~ag~p-~~p~V~vi~~~~~NAFA~G~~~~~~~Ivvt~GLL~~L-~~dEL~aVlAHELgHi  184 (336)
T PRK02870        113 QERQLYNVVEE------LLVAAGLR-FMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKL-DRDELQAVMAHELSHI  184 (336)
T ss_pred             hhHHHHHHHHH------HHHHcCCC-CCCeEEEEcCCCCceEEecCCCCCcEEEEehHHhhhC-CHHHHHHHHHHHHHHH
Confidence            45566665543      23333432 347999999999999999    478999999999999 8999999999999999


Q ss_pred             hcccCC
Q 020771          312 QRICGL  317 (321)
Q Consensus       312 ~~rH~l  317 (321)
                      .++|..
T Consensus       185 k~~di~  190 (336)
T PRK02870        185 RHGDIR  190 (336)
T ss_pred             HcccHH
Confidence            999863


No 8  
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.30  E-value=4e-12  Score=122.89  Aligned_cols=73  Identities=21%  Similarity=0.283  Sum_probs=58.5

Q ss_pred             hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      +|++.+.++.      ++...+++  ..+++|++++.+|||+++.    +.|+|++|||+.+ |+|||++||||||||+.
T Consensus        66 ~p~L~~~v~~------La~~agi~--~p~v~vid~~~~NAFa~G~~~~~~~V~vt~gLL~~L-~~dEL~aVlAHElgHi~  136 (317)
T PRK01345         66 APELYRMVRD------LARRAGLP--MPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRL-SPEEVAGVMAHELAHVK  136 (317)
T ss_pred             CHHHHHHHHH------HHHHcCCC--CCcEEEEcCCCcceEEecCCCCCeEEEechHHHhhC-CHHHHHHHHHHHHHHHH
Confidence            4556665543      34445554  3589999999999999953    4799999999998 79999999999999999


Q ss_pred             cccCCC
Q 020771          313 RICGLP  318 (321)
Q Consensus       313 ~rH~l~  318 (321)
                      ++|.+.
T Consensus       137 ~~d~~~  142 (317)
T PRK01345        137 NRDTLT  142 (317)
T ss_pred             cCCHHH
Confidence            999753


No 9  
>PRK03001 M48 family peptidase; Provisional
Probab=99.24  E-value=1.3e-11  Score=116.83  Aligned_cols=53  Identities=26%  Similarity=0.335  Sum_probs=48.7

Q ss_pred             eEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          264 WEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       264 w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      .+++|++++.+|||+++.    +.|+|++|||+.+ |+|||++|||||+||+.++|..
T Consensus        85 p~v~v~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHi~~~h~~  141 (283)
T PRK03001         85 PKVYLINEDQPNAFATGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAHVKHRDIL  141 (283)
T ss_pred             CeEEEecCCCcceEEecCCCCCeEEEecHHHHhhC-CHHHHHHHHHHHHHHHhCCChH
Confidence            489999999999999974    3599999999999 8999999999999999999975


No 10 
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.23  E-value=1.3e-11  Score=117.77  Aligned_cols=73  Identities=15%  Similarity=0.177  Sum_probs=59.4

Q ss_pred             chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      +||++.+.++.      ++...+++  ..+++|++++.+|||+++.    +.|+|++||++.+ |+|||++|||||+||+
T Consensus        68 ~~p~L~~~v~~------la~~~g~p--~p~vyv~~~~~~NAFa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi  138 (288)
T PRK03072         68 QAPAMYRIVRE------LSTAARQP--MPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQIL-NERELRGVLGHELSHV  138 (288)
T ss_pred             hhHHHHHHHHH------HHHHcCCC--CCCEEEecCCCCceEEecCCCCCcEEEecHHHHHhC-CHHHHHHHHHHHHHHH
Confidence            56777776653      34445554  4589999999999999952    2499999999999 8999999999999999


Q ss_pred             hcccCC
Q 020771          312 QRICGL  317 (321)
Q Consensus       312 ~~rH~l  317 (321)
                      .++|.+
T Consensus       139 ~~~d~~  144 (288)
T PRK03072        139 YNRDIL  144 (288)
T ss_pred             hcCCHH
Confidence            999975


No 11 
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.22  E-value=2.1e-11  Score=116.66  Aligned_cols=73  Identities=21%  Similarity=0.294  Sum_probs=59.2

Q ss_pred             chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      ++|++.+.++.      ++...+++  ..++++++++.+|||+++    ++.|+|++|||+.+ ++|||++|||||+||+
T Consensus        78 ~~p~L~~~v~~------la~~~gip--~p~v~v~~~~~~NAfa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi  148 (298)
T PRK04897         78 EAPELWHIVED------MAMVAQIP--MPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIM-NREELEGVIGHEISHI  148 (298)
T ss_pred             hhHHHHHHHHH------HHHHcCCC--CCcEEEecCCCCceEEeccCCCCcEEEeehHHHhhC-CHHHHHHHHHHHHHHH
Confidence            45666666653      34445554  358999999999999997    35799999999999 8999999999999999


Q ss_pred             hcccCC
Q 020771          312 QRICGL  317 (321)
Q Consensus       312 ~~rH~l  317 (321)
                      .++|..
T Consensus       149 ~~~d~~  154 (298)
T PRK04897        149 RNYDIR  154 (298)
T ss_pred             hcCCHH
Confidence            999964


No 12 
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.18  E-value=4.2e-11  Score=114.13  Aligned_cols=61  Identities=25%  Similarity=0.347  Sum_probs=51.9

Q ss_pred             ccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          254 SATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       254 ~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      +...+++  ..+++|++++.+|||+++.    +.|+|++|||+.+ |+|||++|||||+||+.++|..
T Consensus        87 a~~~g~p--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlAHElgHi~~~d~~  151 (284)
T PRK05457         87 ARQAGIG--MPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNM-SRDEVEAVLAHEISHIANGDMV  151 (284)
T ss_pred             HHhCCCC--CCCEEEEeCCCceEEEecCCCCCeEEEeehHHhhhC-CHHHHHHHHHHHHHHHHcCCHH
Confidence            3444544  4579999999999999973    3589999999999 8999999999999999999964


No 13 
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.17  E-value=4.6e-11  Score=114.54  Aligned_cols=73  Identities=21%  Similarity=0.232  Sum_probs=58.1

Q ss_pred             chHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeCC----CeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          236 DDKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLPG----GKIVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       236 ~D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      ++|++.+.++.      ++...+.+  ..+++|++++.+|||+++.    +.|+|++||++.+ |+|||.+|||||+||+
T Consensus        74 ~~p~L~~~v~~------la~~~~~~--~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlaHElgHi  144 (296)
T PRK02391         74 EYPELHAMVER------LCALADLP--KPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRL-DPDELEAVLAHELSHV  144 (296)
T ss_pred             hCHHHHHHHHH------HHHHcCCC--CCcEEEEeCCCCceEEecCCCCCcEEEecHHHHhhC-CHHHHHHHHHHHHHHH
Confidence            35666666543      33444544  2489999999999999963    3589999999999 7999999999999999


Q ss_pred             hcccCC
Q 020771          312 QRICGL  317 (321)
Q Consensus       312 ~~rH~l  317 (321)
                      .++|.+
T Consensus       145 ~~~di~  150 (296)
T PRK02391        145 KNRDVA  150 (296)
T ss_pred             HcCCHH
Confidence            999974


No 14 
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.14  E-value=7.9e-11  Score=114.61  Aligned_cols=72  Identities=22%  Similarity=0.280  Sum_probs=58.1

Q ss_pred             hHHHHHHhhhcccccccccccCCCCCCeEEEEEeCCCCceEEeC----CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          237 DKWVQQSRKKGQEKGLQSATSHLDGLNWEVLVVNEPVINAFCLP----GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       237 D~~v~~~~~~G~~~~~l~~~s~l~~~~w~v~Vv~~~~~NAfalP----gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      ++++.+-++.      ++...+++  ..+++|++++.+||||.+    ++.|+|++|||+.+ |+|||.+|||||+||+.
T Consensus        82 ~~~L~~~v~~------la~~~g~~--~p~vyv~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHik  152 (324)
T PRK01265         82 YGWLYSIVAE------VAKYNGIR--VPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKIL-NRDEIKAVAGHELGHLK  152 (324)
T ss_pred             cHHHHHHHHH------HHHHcCCC--CCeEEEecCCCCCeEEeccCCCCCEEEEehHHHhhC-CHHHHHHHHHHHHHHHH
Confidence            4556655543      33444554  458999999999999973    56899999999999 89999999999999999


Q ss_pred             cccCC
Q 020771          313 RICGL  317 (321)
Q Consensus       313 ~rH~l  317 (321)
                      ++|..
T Consensus       153 ~~d~~  157 (324)
T PRK01265        153 HRDVE  157 (324)
T ss_pred             cccHH
Confidence            99974


No 15 
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=97.15  E-value=0.00041  Score=70.24  Aligned_cols=54  Identities=20%  Similarity=0.237  Sum_probs=43.1

Q ss_pred             EEEEEeC----CCCceEEeCC---CeEEEcHhHHh-hc-CCHHHHHHHHHHHHHhhhcccCCC
Q 020771          265 EVLVVNE----PVINAFCLPG---GKIVVFTGLLE-HF-RTDAEIATIIGHEPRGLQRICGLP  318 (321)
Q Consensus       265 ~v~Vv~~----~~~NAfalPg---G~I~V~tGLL~-~~-~~edeLAaVLaHEiaHv~~rH~l~  318 (321)
                      .++|++.    ...||+-.+=   -.|++|.-|+. .. -|+||++||+|||+||-.++|.++
T Consensus       236 k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~~eel~AVl~HELGHW~~~H~~K  298 (428)
T KOG2719|consen  236 KYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLNNEELVAVLAHELGHWKLNHVLK  298 (428)
T ss_pred             EEEEEecCCCCCCCCeeeeeccccceEEEehhhhhhhhccccHHHHHHHHHHhhHHHHhhHHH
Confidence            5677772    3589998864   36999999993 33 389999999999999999999753


No 16 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=96.46  E-value=0.0043  Score=58.78  Aligned_cols=51  Identities=16%  Similarity=0.082  Sum_probs=45.5

Q ss_pred             EEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          266 VLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       266 v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      +.++.++.+..+++-|..|+++-..+..+ +.+|+.+||.||+-|++.+|..
T Consensus        27 l~~~~~~~~~t~~tDg~~l~~nP~~~~~l-~~~~~~~~l~HevlH~~~~H~~   77 (292)
T PF13203_consen   27 LPPVEDDGIPTAATDGRRLYYNPEFLESL-SPEERVGLLLHEVLHCLLRHPW   77 (292)
T ss_pred             CeeeeCCCCceeeEcCcEEEECcHHHhcC-CHHHHHHHHHHHHHHHHccchh
Confidence            44555888899999999999999999888 7999999999999999999974


No 17 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=96.07  E-value=0.007  Score=57.37  Aligned_cols=51  Identities=24%  Similarity=0.240  Sum_probs=40.7

Q ss_pred             EEEEeCCC-CceEEeCC--CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          266 VLVVNEPV-INAFCLPG--GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       266 v~Vv~~~~-~NAfalPg--G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      +.++.++. ..+|+++-  .+|++..++++.+ +++|+..||.||++|+.++|.+
T Consensus       160 ~~i~~s~~i~sP~~~G~~~p~I~lP~~~~~~~-~~~el~~il~HEl~Hikr~D~~  213 (299)
T PF05569_consen  160 IRIRVSSGISSPFVFGFLRPVIVLPESLLEDL-SEEELRAILLHELAHIKRRDLL  213 (299)
T ss_pred             eEEEEcCCCCCCeeecCcceEEEecCcccccc-CHHHHHHHHHHHHHHHHCCChH
Confidence            44444443 46777742  5899999999888 8999999999999999999874


No 18 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=95.19  E-value=0.039  Score=53.65  Aligned_cols=43  Identities=16%  Similarity=0.207  Sum_probs=32.3

Q ss_pred             CCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhccc
Q 020771          273 VINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQRIC  315 (321)
Q Consensus       273 ~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~rH  315 (321)
                      ..|||-+|. .+|++-+.+++.++      .+-..+.|||||.||.+...
T Consensus       136 ~~gpFYCp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l  185 (292)
T PF04228_consen  136 ATGPFYCPADQTIYLDLSFFDELQQRFGASGDFAQAYVLAHEYGHHVQNL  185 (292)
T ss_pred             CCCCEeCCCCCEEEechHHHHHHHHHhCCccHHHHHHHHHHHHHHHHHHH
Confidence            357787775 68999998876653      24456789999999987654


No 19 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=94.48  E-value=0.079  Score=41.74  Aligned_cols=48  Identities=15%  Similarity=0.187  Sum_probs=30.6

Q ss_pred             EEEEEeCCC--CceEEeC---CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccCC
Q 020771          265 EVLVVNEPV--INAFCLP---GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       265 ~v~Vv~~~~--~NAfalP---gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~l  317 (321)
                      +|...+...  ...++.+   .+.|+|.+.     .++.+..++|+||+||+..+|..
T Consensus         7 ~v~~~~~~~~~~~~~~~~~~~~~~I~in~~-----~~~~~~~f~laHELgH~~~~~~~   59 (122)
T PF06114_consen    7 DVVYTDLGPKDLGGFSIPKENNPIIFINSN-----LSPERQRFTLAHELGHILLHHGD   59 (122)
T ss_dssp             EEEEE--TT--EEEETT----TTEEEEESS-----S-HHHHHHHHHHHHHHHHHHH-H
T ss_pred             EEEEEcCCCCCcCEEEEecCcCCEEEECCC-----CCHHHHHHHHHHHHHHHHhhhcc
Confidence            444444433  3444444   567887766     36888999999999999987753


No 20 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=92.71  E-value=0.09  Score=49.14  Aligned_cols=34  Identities=24%  Similarity=0.098  Sum_probs=26.0

Q ss_pred             CCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccC
Q 020771          280 PGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       280 PgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~  316 (321)
                      |++.|.| .|+-.  .+-+.++.||.|||||.+ .||+
T Consensus       116 P~~~I~I-~~~~~--~~~~~~~hvi~HEiGH~IGfRHT  150 (211)
T PF12388_consen  116 PYKFIQI-YGLSN--YSVNVIEHVITHEIGHCIGFRHT  150 (211)
T ss_pred             CCceEEE-EecCC--CchhHHHHHHHHHhhhhcccccc
Confidence            4567888 46422  267889999999999987 8885


No 21 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=92.37  E-value=0.086  Score=41.75  Aligned_cols=39  Identities=21%  Similarity=0.264  Sum_probs=27.9

Q ss_pred             CCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          273 VINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       273 ~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      ...||++ |+.|++-.|-...  +..+=..+|+||++|+...
T Consensus        37 ~A~A~T~-G~~I~f~~g~~~~--~s~~~~~llaHEl~Hv~Qq   75 (79)
T PF13699_consen   37 GARAFTV-GNDIYFAPGKYNP--DSPEGRALLAHELAHVVQQ   75 (79)
T ss_pred             CCeEEEE-CCEEEEcCCCcCC--CCCCcchhHhHHHHHHHhh
Confidence            4678888 7899997775432  2333457899999999753


No 22 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=91.22  E-value=0.56  Score=40.13  Aligned_cols=31  Identities=19%  Similarity=0.233  Sum_probs=27.8

Q ss_pred             eEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      .|.++.-+++. .++++|..+|.|||+|+...
T Consensus        44 ~I~ls~~~~~~-~~~~~~~~tL~HEm~H~~~~   74 (157)
T PF10263_consen   44 EIRLSPKLLDR-NPEEELIDTLLHEMAHAAAY   74 (157)
T ss_pred             EEEECHHHHHh-hHHHHHHHHHHHHHHHHHhh
Confidence            69999999998 47999999999999999763


No 23 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=90.29  E-value=0.34  Score=41.85  Aligned_cols=32  Identities=16%  Similarity=0.062  Sum_probs=28.5

Q ss_pred             CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771          281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~  313 (321)
                      .+.|.++.-|++.. .++.|..||.|||+|+..
T Consensus        41 ~~~I~ln~~l~~~~-~~~~l~~~l~HEm~H~~~   72 (146)
T smart00731       41 SAEIRLNPKLLTEN-GRDRLRETLLHELCHAAL   72 (146)
T ss_pred             CCEEEeCHHHHhhc-cHHHHHhhHHHHHHHHHH
Confidence            68899999988866 689999999999999986


No 24 
>PRK04351 hypothetical protein; Provisional
Probab=89.87  E-value=0.45  Score=42.05  Aligned_cols=33  Identities=27%  Similarity=0.283  Sum_probs=28.8

Q ss_pred             eCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          279 LPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       279 lPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      +....|-++.-+++.. +++.|..||.||++|+.
T Consensus        41 l~~~~I~lnp~ll~~~-~~~~l~~vv~HElcH~~   73 (149)
T PRK04351         41 LKDHHIEFNPKMLEEY-GLEELIGIIKHELCHYH   73 (149)
T ss_pred             cCCCeEEeCHHHHhhc-cHHHHHhhHHHHHHHHH
Confidence            4557899999999876 79999999999999984


No 25 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=86.76  E-value=0.44  Score=40.10  Aligned_cols=22  Identities=9%  Similarity=0.010  Sum_probs=18.0

Q ss_pred             CCHHHHHHHHHHHHHhhh-cccC
Q 020771          295 RTDAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       295 ~~edeLAaVLaHEiaHv~-~rH~  316 (321)
                      .+...+..|+.|||||++ +.|.
T Consensus       100 ~~~~~~~~v~~HEiGHaLGL~H~  122 (154)
T PF00413_consen  100 DSGNDLQSVAIHEIGHALGLDHS  122 (154)
T ss_dssp             SSSEEHHHHHHHHHHHHTTBESS
T ss_pred             hhhhhhhhhhhhccccccCcCcC
Confidence            355678999999999997 7775


No 26 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=86.21  E-value=0.55  Score=43.92  Aligned_cols=48  Identities=21%  Similarity=0.165  Sum_probs=35.8

Q ss_pred             eCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccCCCC
Q 020771          270 NEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICGLPF  319 (321)
Q Consensus       270 ~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~l~~  319 (321)
                      +-...=+-|.+.|.|.+.+-|.. + .++.+..|+.||++|+. ..|+-.|
T Consensus       147 ~~k~~WGScs~~~~i~~~~~l~~-~-p~~~i~YVvvHELaHLke~nHs~~F  195 (223)
T COG1451         147 NMKRRWGSCSKAGEIRFNWRLVM-A-PEEVIDYVVVHELAHLKEKNHSKRF  195 (223)
T ss_pred             eccceeeeecCCCcEEeehhhhc-C-CHHHHHHHHHHHHHHHhhhhccHHH
Confidence            34455677888887777776655 3 69999999999999997 5555444


No 27 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=85.48  E-value=1.4  Score=39.83  Aligned_cols=33  Identities=27%  Similarity=0.273  Sum_probs=25.9

Q ss_pred             CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771          282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      |+|+++-  +..-.+.++|.++||||.-|+.+-..
T Consensus        49 ~~i~l~~--~~~~~~~~~l~~~iaHE~hH~~r~~~   81 (195)
T PF10026_consen   49 GYIFLFL--LPNDYSLEELPALIAHEYHHNCRYEQ   81 (195)
T ss_pred             CEEEEEe--cCCcccHHHHHHHHHHHHHHHHHHhc
Confidence            5788775  45555899999999999999976443


No 28 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=84.83  E-value=1.3  Score=44.14  Aligned_cols=50  Identities=30%  Similarity=0.282  Sum_probs=38.8

Q ss_pred             EEEEeCC---CCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771          266 VLVVNEP---VINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       266 v~Vv~~~---~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      +.+..++   .+-.|.++--.|++-+.+.+.+ +++|+.-|++||.+|+.++..
T Consensus       154 i~ir~s~~i~~P~v~gl~kp~IvlP~d~~~r~-~~ee~~yIilHEl~Hlk~gD~  206 (337)
T COG4219         154 ILIRKSKAIDGPMVFGLVKPCIVLPADFVERL-TDEELKYIILHELSHLKRGDA  206 (337)
T ss_pred             eeEeecccCCCceeeccCcceEEccHHHHhhc-CHHhhhhhHhHHHhhhhcccH
Confidence            4444443   3445555667899999999999 899999999999999987654


No 29 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=84.63  E-value=0.63  Score=40.01  Aligned_cols=20  Identities=15%  Similarity=-0.031  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHHHHhhh-cccC
Q 020771          297 DAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH~  316 (321)
                      ...+..|+.|||||++ +.|.
T Consensus       101 ~~~~~~~~~HEiGHaLGL~H~  121 (156)
T cd04279         101 AENLQAIALHELGHALGLWHH  121 (156)
T ss_pred             chHHHHHHHHHhhhhhcCCCC
Confidence            5679999999999997 6665


No 30 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=84.39  E-value=1.1  Score=35.37  Aligned_cols=22  Identities=9%  Similarity=-0.086  Sum_probs=18.3

Q ss_pred             CCHHHHHHHHHHHHHhhhcccC
Q 020771          295 RTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       295 ~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      .+++.+..||+||++|+.....
T Consensus        20 ~~~~~~~~~l~HE~~H~~~~~~   41 (128)
T PF13485_consen   20 SDEDWLDRVLAHELAHQWFGNY   41 (128)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHH
Confidence            5788899999999999975543


No 31 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=84.37  E-value=0.64  Score=41.26  Aligned_cols=54  Identities=17%  Similarity=0.161  Sum_probs=40.5

Q ss_pred             EEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccCCCCC
Q 020771          265 EVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICGLPFY  320 (321)
Q Consensus       265 ~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~l~~~  320 (321)
                      ++.|-+...-=.-|...|.|-++.-|+..  .++-+..|+.||++|+. .+|+-.||
T Consensus       131 ~i~ir~~ksrWGsc~~~~~I~ln~~L~~~--P~~~idYVvvHEL~Hl~~~nHs~~Fw  185 (205)
T PF01863_consen  131 KIKIRDMKSRWGSCSSKGNITLNWRLVMA--PPEVIDYVVVHELCHLRHPNHSKRFW  185 (205)
T ss_pred             eEEEeehhhccccCCCCCcEEeecccccC--CccHHHHHHHHHHHHhccCCCCHHHH
Confidence            45555555555557557999999988773  68899999999999995 66765554


No 32 
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=83.24  E-value=1.1  Score=43.81  Aligned_cols=42  Identities=19%  Similarity=0.257  Sum_probs=32.9

Q ss_pred             CCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhcc
Q 020771          273 VINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       273 ~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~r  314 (321)
                      ..--|-+|+ .++++...+.+.|+      .|=.-|.|+|||++|.+.+
T Consensus       133 asGPFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHVQn  181 (295)
T COG2321         133 ASGPFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHVQN  181 (295)
T ss_pred             CCCCeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHHHH
Confidence            344567775 68999999999885      4666789999999998753


No 33 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=82.60  E-value=1  Score=41.45  Aligned_cols=36  Identities=14%  Similarity=0.309  Sum_probs=27.7

Q ss_pred             CeEEEcHhHHhhcCC------HHHHHHHHHHHHHhhh-cccCC
Q 020771          282 GKIVVFTGLLEHFRT------DAEIATIIGHEPRGLQ-RICGL  317 (321)
Q Consensus       282 G~I~V~tGLL~~~~~------edeLAaVLaHEiaHv~-~rH~l  317 (321)
                      +.|.++++.+.....      .+-|..+++||+||.+ +.|..
T Consensus        92 a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~  134 (197)
T cd04276          92 ADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNF  134 (197)
T ss_pred             EEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccc
Confidence            678899998866532      3568899999999987 77753


No 34 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=82.59  E-value=0.99  Score=38.48  Aligned_cols=21  Identities=14%  Similarity=0.034  Sum_probs=17.2

Q ss_pred             HHHHHHHHHHHHHhhh-cccCC
Q 020771          297 DAEIATIIGHEPRGLQ-RICGL  317 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH~l  317 (321)
                      .+.+-+|+.|||||++ +.|..
T Consensus        91 ~~~~~~~~~HEiGHaLGL~H~~  112 (165)
T cd04268          91 GARLRNTAEHELGHALGLRHNF  112 (165)
T ss_pred             HHHHHHHHHHHHHHHhcccccC
Confidence            4678999999999998 66654


No 35 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=82.46  E-value=3.2  Score=38.43  Aligned_cols=45  Identities=16%  Similarity=0.184  Sum_probs=33.4

Q ss_pred             EEeCCCCceEEeCC----CeEEEcHhHHhhcCC----HHHHHHHHHHHHHhhhc
Q 020771          268 VVNEPVINAFCLPG----GKIVVFTGLLEHFRT----DAEIATIIGHEPRGLQR  313 (321)
Q Consensus       268 Vv~~~~~NAfalPg----G~I~V~tGLL~~~~~----edeLAaVLaHEiaHv~~  313 (321)
                      ++++-.-=|++. |    ..|.+++..|....+    .+|+..||-|||.|+.+
T Consensus        57 ~~~~~~gVA~t~-gd~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q  109 (205)
T PF04450_consen   57 ILDDMDGVAYTS-GDDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQ  109 (205)
T ss_pred             EEECCCeeEEEe-cCCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhh
Confidence            445433334444 3    689999999998853    46999999999999965


No 36 
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=81.43  E-value=0.43  Score=50.56  Aligned_cols=46  Identities=28%  Similarity=0.358  Sum_probs=32.3

Q ss_pred             CCceEEeCCCe-EEEcHhHHh--hcC-------CHHHHHHHHHHHHHhhhcccCCC
Q 020771          273 VINAFCLPGGK-IVVFTGLLE--HFR-------TDAEIATIIGHEPRGLQRICGLP  318 (321)
Q Consensus       273 ~~NAfalPgG~-I~V~tGLL~--~~~-------~edeLAaVLaHEiaHv~~rH~l~  318 (321)
                      .+|||.-|+-+ ||.-.+||.  .++       |=-.+-+||||||+|.--.-+.+
T Consensus       450 ~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGak  505 (654)
T COG3590         450 TVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGAK  505 (654)
T ss_pred             HhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCccc
Confidence            58999999854 666666664  221       33468899999999986554444


No 37 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=80.31  E-value=11  Score=35.51  Aligned_cols=50  Identities=14%  Similarity=0.100  Sum_probs=36.0

Q ss_pred             CeEEEEEeCCCCceEEeCC-CeEEEcHhHHhhcC--------C--------HHHHHHHHHHHHHhhh
Q 020771          263 NWEVLVVNEPVINAFCLPG-GKIVVFTGLLEHFR--------T--------DAEIATIIGHEPRGLQ  312 (321)
Q Consensus       263 ~w~v~Vv~~~~~NAfalPg-G~I~V~tGLL~~~~--------~--------edeLAaVLaHEiaHv~  312 (321)
                      +..+.+-.-.+.|||--|. ..|.+...+++.+.        +        .+-+..+|-||+||+.
T Consensus        38 ~l~i~~~~CGe~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHAl  104 (220)
T PF14247_consen   38 DLTIRFAECGEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHAL  104 (220)
T ss_pred             CeEEEEeecCCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHHH
Confidence            3445444567899999995 67998888776442        1        2346678999999985


No 38 
>PRK04860 hypothetical protein; Provisional
Probab=79.14  E-value=2.1  Score=38.37  Aligned_cols=32  Identities=25%  Similarity=0.150  Sum_probs=26.0

Q ss_pred             CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      +.|-++--+|... ++++|..||.||++|+...
T Consensus        46 ~~I~~Np~ll~~~-~~~~l~~~v~HEl~H~~~~   77 (160)
T PRK04860         46 NEIRLNPVLLLEN-QQAFIDEVVPHELAHLLVY   77 (160)
T ss_pred             CCeeeCHHHHhhC-cHHHHHhHHHHHHHHHHHH
Confidence            5677777777766 7999999999999998643


No 39 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=78.06  E-value=1.4  Score=41.13  Aligned_cols=21  Identities=24%  Similarity=-0.016  Sum_probs=18.0

Q ss_pred             HHHHHHHHHHHHhhhcccCCC
Q 020771          298 AEIATIIGHEPRGLQRICGLP  318 (321)
Q Consensus       298 deLAaVLaHEiaHv~~rH~l~  318 (321)
                      -.++.+|.||++|+..+|...
T Consensus        99 ~A~~fil~HE~~Hv~~~h~~~  119 (206)
T PF10463_consen   99 CAIAFILLHELAHVVLGHEGD  119 (206)
T ss_pred             HHHHHHHHHHHHHHHHcCccc
Confidence            357899999999999999754


No 40 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=76.61  E-value=2.3  Score=39.71  Aligned_cols=30  Identities=10%  Similarity=-0.065  Sum_probs=22.9

Q ss_pred             CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771          282 GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       282 G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      +.|++++-     .+++.--++||||+||++++-.
T Consensus        59 ~~I~iN~n-----~~~~r~rFtlAHELGH~llH~~   88 (213)
T COG2856          59 PVIYINAN-----NSLERKRFTLAHELGHALLHTD   88 (213)
T ss_pred             ceEEEeCC-----CCHHHHHHHHHHHHhHHHhccc
Confidence            46776544     3788888999999999986543


No 41 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=74.53  E-value=1.7  Score=37.45  Aligned_cols=21  Identities=10%  Similarity=0.007  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHHHHhhh-cccCC
Q 020771          297 DAEIATIIGHEPRGLQ-RICGL  317 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH~l  317 (321)
                      ...+..|+.|||||++ +.|.-
T Consensus       104 ~~~~~~~~~HEiGHaLGL~H~~  125 (157)
T cd04278         104 GTDLFSVAAHEIGHALGLGHSS  125 (157)
T ss_pred             cchHHHHHHHHhccccccCCCC
Confidence            3458899999999997 67753


No 42 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=74.39  E-value=2.3  Score=37.64  Aligned_cols=36  Identities=14%  Similarity=-0.164  Sum_probs=23.2

Q ss_pred             CeEEEcHhHHhhc-CCHHHHHHHHHHHHHhhh-cccCC
Q 020771          282 GKIVVFTGLLEHF-RTDAEIATIIGHEPRGLQ-RICGL  317 (321)
Q Consensus       282 G~I~V~tGLL~~~-~~edeLAaVLaHEiaHv~-~rH~l  317 (321)
                      |.|++........ ......-.++.|||||++ +.|.-
T Consensus        94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~~  131 (186)
T cd04277          94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHPG  131 (186)
T ss_pred             ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCCC
Confidence            5666665433321 123556789999999998 67753


No 43 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=73.80  E-value=1.5  Score=37.23  Aligned_cols=20  Identities=25%  Similarity=0.036  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHhhh-cccC
Q 020771          297 DAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH~  316 (321)
                      ...++.+++|||||++ +.|.
T Consensus        93 ~~~~~~~~~HElGH~LGl~H~  113 (167)
T cd00203          93 TKEGAQTIAHELGHALGFYHD  113 (167)
T ss_pred             cccchhhHHHHHHHHhCCCcc
Confidence            3578999999999997 5554


No 44 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=72.97  E-value=2.6  Score=37.38  Aligned_cols=19  Identities=16%  Similarity=-0.130  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHHHHhhh-ccc
Q 020771          297 DAEIATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH  315 (321)
                      ....|.++||||||.. ..|
T Consensus       128 ~~~~a~~~AHElGH~lG~~H  147 (194)
T cd04269         128 LLLFAVTMAHELGHNLGMEH  147 (194)
T ss_pred             hHHHHHHHHHHHHhhcCCCc
Confidence            5678999999999987 444


No 45 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=72.96  E-value=2.3  Score=39.85  Aligned_cols=32  Identities=19%  Similarity=0.059  Sum_probs=21.1

Q ss_pred             EEEcHhHHhhcC-----CHHHHHHHHHHHHHhhh-ccc
Q 020771          284 IVVFTGLLEHFR-----TDAEIATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       284 I~V~tGLL~~~~-----~edeLAaVLaHEiaHv~-~rH  315 (321)
                      ++..+||.....     .....|.+|||||||.. ..|
T Consensus       146 ~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnlGm~H  183 (244)
T cd04270         146 KYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNFGSPH  183 (244)
T ss_pred             eeeecceEeeeccCCccchhHHHHHHHHHHHHhcCCCC
Confidence            455667664221     24567899999999986 344


No 46 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=72.66  E-value=1.9  Score=35.13  Aligned_cols=14  Identities=14%  Similarity=-0.126  Sum_probs=10.3

Q ss_pred             HHHHHHHHhhh-ccc
Q 020771          302 TIIGHEPRGLQ-RIC  315 (321)
Q Consensus       302 aVLaHEiaHv~-~rH  315 (321)
                      .+++|||||.. +.|
T Consensus       109 ~~~~HEiGH~lGl~H  123 (124)
T PF13582_consen  109 DTFAHEIGHNLGLNH  123 (124)
T ss_dssp             THHHHHHHHHTT---
T ss_pred             eEeeehhhHhcCCCC
Confidence            89999999986 555


No 47 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=72.12  E-value=2.1  Score=37.69  Aligned_cols=23  Identities=9%  Similarity=0.031  Sum_probs=14.8

Q ss_pred             CHHHHHHHHHHHHHhhh-cccCCC
Q 020771          296 TDAEIATIIGHEPRGLQ-RICGLP  318 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~-~rH~l~  318 (321)
                      +.....-+|.||+||.+ +.|+.+
T Consensus        65 ~~~~~g~TltHEvGH~LGL~HtF~   88 (154)
T PF05572_consen   65 SQYNFGKTLTHEVGHWLGLYHTFG   88 (154)
T ss_dssp             TTS-SSHHHHHHHHHHTT---TT-
T ss_pred             Cccccccchhhhhhhhhccccccc
Confidence            45566899999999987 888753


No 48 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=71.98  E-value=2.2  Score=37.62  Aligned_cols=20  Identities=15%  Similarity=-0.122  Sum_probs=14.0

Q ss_pred             CHHHHHHHHHHHHHhhh-ccc
Q 020771          296 TDAEIATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~-~rH  315 (321)
                      ....-+-|+||||||.. +.|
T Consensus       138 ~~~~~~~~~AHEiGH~lGa~H  158 (196)
T PF13688_consen  138 PTYNGAITFAHEIGHNLGAPH  158 (196)
T ss_dssp             -HHHHHHHHHHHHHHHTT---
T ss_pred             CCCceehhhHHhHHHhcCCCC
Confidence            45777889999999986 444


No 49 
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=71.27  E-value=3.2  Score=44.17  Aligned_cols=47  Identities=32%  Similarity=0.390  Sum_probs=36.7

Q ss_pred             CCceEEeCC-CeEEEcHhHHhh----c-----CCHHHHHHHHHHHHHhhhcccCCCC
Q 020771          273 VINAFCLPG-GKIVVFTGLLEH----F-----RTDAEIATIIGHEPRGLQRICGLPF  319 (321)
Q Consensus       273 ~~NAfalPg-G~I~V~tGLL~~----~-----~~edeLAaVLaHEiaHv~~rH~l~~  319 (321)
                      .+|||-.|. ..|+|.-|+|..    .     -+=.-+.+||||||+|.--.++.++
T Consensus       481 ~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~~  537 (687)
T KOG3624|consen  481 QVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQY  537 (687)
T ss_pred             eeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhccccccccc
Confidence            589988775 578999999885    1     1567789999999999887776654


No 50 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=70.70  E-value=3.4  Score=36.85  Aligned_cols=21  Identities=29%  Similarity=0.191  Sum_probs=15.7

Q ss_pred             CCHHHHHHHHHHHHHhhh-ccc
Q 020771          295 RTDAEIATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       295 ~~edeLAaVLaHEiaHv~-~rH  315 (321)
                      .+...+|.+|||||||.. ..|
T Consensus       126 ~~~~~~a~~~AHelGH~lGm~H  147 (199)
T PF01421_consen  126 RSGLSFAVIIAHELGHNLGMPH  147 (199)
T ss_dssp             SSHHHHHHHHHHHHHHHTT---
T ss_pred             chhHHHHHHHHHHHHHhcCCCC
Confidence            467889999999999986 444


No 51 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=69.51  E-value=2.5  Score=38.70  Aligned_cols=16  Identities=13%  Similarity=-0.058  Sum_probs=13.1

Q ss_pred             HHHHHHHHhhh-cccCC
Q 020771          302 TIIGHEPRGLQ-RICGL  317 (321)
Q Consensus       302 aVLaHEiaHv~-~rH~l  317 (321)
                      -+++|||||.. ++|..
T Consensus       139 ~~~aHEiGH~lGl~H~~  155 (206)
T PF13583_consen  139 QTFAHEIGHNLGLRHDF  155 (206)
T ss_pred             hHHHHHHHHHhcCCCCc
Confidence            45999999997 77764


No 52 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=68.17  E-value=2.1  Score=37.77  Aligned_cols=20  Identities=15%  Similarity=-0.085  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHhhh-cccC
Q 020771          297 DAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       297 edeLAaVLaHEiaHv~-~rH~  316 (321)
                      .-..+.++|||+||.. ..|.
T Consensus       130 ~~~~~~~~aHElGH~lG~~HD  150 (192)
T cd04267         130 TLLTALTMAHELGHNLGAEHD  150 (192)
T ss_pred             ceeehhhhhhhHHhhcCCcCC
Confidence            3567889999999987 5665


No 53 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=67.50  E-value=3.6  Score=36.33  Aligned_cols=41  Identities=24%  Similarity=0.234  Sum_probs=23.7

Q ss_pred             CCCCceEEeCCCeEEEcHhHHhhcCCHHHHHH---HHHHHHHhhhccc
Q 020771          271 EPVINAFCLPGGKIVVFTGLLEHFRTDAEIAT---IIGHEPRGLQRIC  315 (321)
Q Consensus       271 ~~~~NAfalPgG~I~V~tGLL~~~~~edeLAa---VLaHEiaHv~~rH  315 (321)
                      ..-.|||.- |..|+.-.|--..+.   .+++   |+||||+|.+-.|
T Consensus       107 ~~~~NAfW~-g~~m~yGdG~~~~f~---~~~~~lDVvaHEltHGVte~  150 (150)
T PF01447_consen  107 KNYNNAFWN-GSQMVYGDGDGQIFK---PFASSLDVVAHELTHGVTEY  150 (150)
T ss_dssp             SSTT-EEE--SSSEEEE---SSSBS----GGG-HHHHHHHHHHHHHHT
T ss_pred             CCccCcccc-CCEEEEECCCCcccc---cCccccceeeeccccccccC
Confidence            346899996 567777766442332   3333   9999999987554


No 54 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=66.69  E-value=3.2  Score=34.73  Aligned_cols=35  Identities=17%  Similarity=0.118  Sum_probs=20.8

Q ss_pred             eEE-eCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh-cccC
Q 020771          276 AFC-LPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       276 Afa-lPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~-~rH~  316 (321)
                      |++ .|+|.+-+..+  .-+.    -..|+.|||||++ +.|.
T Consensus        67 a~~g~~~g~~~~~~~--~~~~----~~~~~~HEigHaLGl~H~  103 (140)
T smart00235       67 SHAGRPGGDQHFSLG--NGCI----NTGVAAHELGHALGLYHE  103 (140)
T ss_pred             eeeecCCCceEEEcc--CCcC----CcccHHHHHHHHhcCCcC
Confidence            444 35776666431  1121    2359999999997 5554


No 55 
>PF14891 Peptidase_M91:  Effector protein
Probab=66.42  E-value=3.3  Score=36.83  Aligned_cols=17  Identities=12%  Similarity=-0.050  Sum_probs=13.6

Q ss_pred             CHHHHHHHHHHHHHhhh
Q 020771          296 TDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~  312 (321)
                      .+..=+.||+|||+|+.
T Consensus        99 ~~~~p~v~L~HEL~HA~  115 (174)
T PF14891_consen   99 SPRPPFVVLYHELIHAY  115 (174)
T ss_pred             hhHHHHHHHHHHHHHHH
Confidence            34556789999999985


No 56 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=65.81  E-value=3.3  Score=36.74  Aligned_cols=17  Identities=12%  Similarity=-0.136  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHhhh-cccC
Q 020771          300 IATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       300 LAaVLaHEiaHv~-~rH~  316 (321)
                      -.-++||||||.. ..|.
T Consensus       111 ~~~~~aHElGH~lGa~Hd  128 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHD  128 (173)
T ss_dssp             HHHHHHHHHHHHHT---S
T ss_pred             eeeeehhhhHhhcCCCCC
Confidence            3456999999987 5664


No 57 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=63.54  E-value=4  Score=37.22  Aligned_cols=14  Identities=21%  Similarity=-0.009  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHhhh
Q 020771          299 EIATIIGHEPRGLQ  312 (321)
Q Consensus       299 eLAaVLaHEiaHv~  312 (321)
                      ..+.++|||+||..
T Consensus       144 ~~~~~~AHElGH~l  157 (220)
T cd04272         144 YGVYTMTHELAHLL  157 (220)
T ss_pred             ccHHHHHHHHHHHh
Confidence            45899999999987


No 58 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=61.52  E-value=8.3  Score=34.70  Aligned_cols=29  Identities=28%  Similarity=0.304  Sum_probs=23.9

Q ss_pred             eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      .|=++-=||... .+|.|..|+.||++|..
T Consensus        45 ~I~lNP~ll~en-~~~f~~~vV~HELaHl~   73 (156)
T COG3091          45 EIRLNPKLLEEN-GEDFIEQVVPHELAHLH   73 (156)
T ss_pred             ccccCHHHHHHc-cHHHHHHHHHHHHHHHH
Confidence            566666677766 79999999999999985


No 59 
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=59.99  E-value=12  Score=36.19  Aligned_cols=51  Identities=20%  Similarity=0.055  Sum_probs=28.0

Q ss_pred             eEEEEEeCCCCceEEeCCCeEEEcHhHHhh--cC---CHHHHHHHHHHHHHhhhcc
Q 020771          264 WEVLVVNEPVINAFCLPGGKIVVFTGLLEH--FR---TDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       264 w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~--~~---~edeLAaVLaHEiaHv~~r  314 (321)
                      ..+.++.+-..+++.-+|-.++--..++-.  ..   ...+++.|||||+||.--+
T Consensus       254 ~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfG  309 (390)
T PF01433_consen  254 LDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFG  309 (390)
T ss_dssp             EEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBT
T ss_pred             eeEEEEeccccccccccccccccccccccCcccccchhhhhhHHHHHHHHHHHHhc
Confidence            445555433356776666444444444422  11   2457899999999997544


No 60 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=59.78  E-value=6  Score=39.08  Aligned_cols=20  Identities=20%  Similarity=0.086  Sum_probs=17.9

Q ss_pred             HHHHHHHHHHHHhhhcccCC
Q 020771          298 AEIATIIGHEPRGLQRICGL  317 (321)
Q Consensus       298 deLAaVLaHEiaHv~~rH~l  317 (321)
                      ..+|.|+-||++|+...|..
T Consensus       163 ~A~a~i~~HEiaHv~~~h~~  182 (305)
T PRK09672        163 CALAWILLHEIAHVEFQHSS  182 (305)
T ss_pred             HHHHHHHHHHHHHHHhcccc
Confidence            35899999999999999986


No 61 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=59.30  E-value=4  Score=37.26  Aligned_cols=25  Identities=16%  Similarity=-0.012  Sum_probs=20.0

Q ss_pred             CHHHHHHHHHHHHHhhh-cccCCCCC
Q 020771          296 TDAEIATIIGHEPRGLQ-RICGLPFY  320 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~-~rH~l~~~  320 (321)
                      +-+.|..||-||++|.. ..|...||
T Consensus        78 ~~~~i~~t~lHELaH~~~~~H~~~F~  103 (186)
T PF08325_consen   78 PYETILGTMLHELAHNVHGPHDDKFW  103 (186)
T ss_pred             eHHHHHHHHHHHHHhcccCCccHHHH
Confidence            56799999999999987 45666555


No 62 
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=58.72  E-value=8.1  Score=41.01  Aligned_cols=32  Identities=19%  Similarity=0.174  Sum_probs=21.8

Q ss_pred             eEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHHhh
Q 020771          276 AFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPRGL  311 (321)
Q Consensus       276 AfalPgG~I~V~tGLL~~~~~edeLAaVLaHEiaHv  311 (321)
                      |+--|| .++....+|.   .+.+++.|++|||||-
T Consensus       259 gMEN~~-ltf~~~~ll~---~d~s~~~viaHElAHq  290 (601)
T TIGR02411       259 GMENPN-LTFATPTLIA---GDRSNVDVIAHELAHS  290 (601)
T ss_pred             cccccc-ceeecccccc---CChhhhhhHHHHHHhh
Confidence            444454 6666666653   3456789999999996


No 63 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=53.96  E-value=6.8  Score=34.36  Aligned_cols=47  Identities=17%  Similarity=0.028  Sum_probs=23.3

Q ss_pred             CCeEEEEEeCCC-CceEEeCCC--eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          262 LNWEVLVVNEPV-INAFCLPGG--KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       262 ~~w~v~Vv~~~~-~NAfalPgG--~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      .+..++...++. .=|-..+.|  .|++-+.--..-   +.+. |.+||+||++
T Consensus        40 a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy---~~~R-IaaHE~GHiL   89 (132)
T PF02031_consen   40 ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGY---NSTR-IAAHELGHIL   89 (132)
T ss_dssp             -SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS----HHH-HHHHHHHHHH
T ss_pred             CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCC---ccce-eeeehhcccc
Confidence            455555544433 335555554  566554432222   2222 8999999986


No 64 
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=53.79  E-value=6.3  Score=41.99  Aligned_cols=27  Identities=26%  Similarity=0.242  Sum_probs=21.2

Q ss_pred             eEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          283 KIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       283 ~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      ..||+--||.   .|.-|+.||||||||.-
T Consensus       274 ltF~TpTlla---GDrsl~~vIaHEIAHSW  300 (613)
T KOG1047|consen  274 LTFVTPTLLA---GDRSLVDVIAHEIAHSW  300 (613)
T ss_pred             eeeecchhhc---CCcchhhHHHHHhhhhh
Confidence            5666666665   67888999999999963


No 65 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=53.72  E-value=2.1  Score=38.74  Aligned_cols=16  Identities=25%  Similarity=0.040  Sum_probs=13.0

Q ss_pred             HHHHHHHHHHhhh-ccc
Q 020771          300 IATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       300 LAaVLaHEiaHv~-~rH  315 (321)
                      .|.++||||||.. ..|
T Consensus       140 ~a~~~aHElGH~LG~~H  156 (207)
T cd04273         140 SAFTIAHELGHVLGMPH  156 (207)
T ss_pred             eEEeeeeechhhcCCCC
Confidence            5789999999987 444


No 66 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=52.75  E-value=7.6  Score=36.75  Aligned_cols=16  Identities=13%  Similarity=0.102  Sum_probs=13.2

Q ss_pred             HHHHHHHHHhhhcccC
Q 020771          301 ATIIGHEPRGLQRICG  316 (321)
Q Consensus       301 AaVLaHEiaHv~~rH~  316 (321)
                      .+|=|||+||+.+++.
T Consensus        90 vaVAAHEvGHAiQ~a~  105 (222)
T PF04298_consen   90 VAVAAHEVGHAIQHAE  105 (222)
T ss_pred             HHHHHHHHhHHHhccc
Confidence            4689999999987763


No 67 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=51.51  E-value=7.9  Score=36.42  Aligned_cols=16  Identities=13%  Similarity=-0.010  Sum_probs=13.0

Q ss_pred             HHHHHHHHHhhhcccC
Q 020771          301 ATIIGHEPRGLQRICG  316 (321)
Q Consensus       301 AaVLaHEiaHv~~rH~  316 (321)
                      .+|-|||+||+.+++.
T Consensus        93 ~aVAAHEVGHAiQd~~  108 (226)
T COG2738          93 IAVAAHEVGHAIQDQE  108 (226)
T ss_pred             HHHHHHHhhHHHhhhc
Confidence            4789999999987653


No 68 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=50.96  E-value=6  Score=36.90  Aligned_cols=14  Identities=14%  Similarity=0.157  Sum_probs=11.4

Q ss_pred             HHHHHHHHhhh-ccc
Q 020771          302 TIIGHEPRGLQ-RIC  315 (321)
Q Consensus       302 aVLaHEiaHv~-~rH  315 (321)
                      .+|||||||.. ..|
T Consensus       147 ~t~AHElGHnLGm~H  161 (228)
T cd04271         147 QVFAHEIGHTFGAVH  161 (228)
T ss_pred             eehhhhhhhhcCCCC
Confidence            69999999987 344


No 69 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=50.78  E-value=17  Score=37.97  Aligned_cols=54  Identities=19%  Similarity=0.227  Sum_probs=29.6

Q ss_pred             CCCCCCeEEEEEeCCC---CceEEeC----------CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          258 HLDGLNWEVLVVNEPV---INAFCLP----------GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       258 ~l~~~~w~v~Vv~~~~---~NAfalP----------gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      ++.+.|.-++|...|.   .-|+|.|          -|.|.+....|... ..+.+.-|+.|||+|++
T Consensus       156 gv~~~D~vlyV~~~~~~~~~~A~A~~C~~~~~~RP~~G~in~~p~~i~~~-~~~~~~~~~~HEi~HaL  222 (521)
T PF01457_consen  156 GVPNADFVLYVTARPSSSSTLAWAAPCQQDSDGRPIAGVININPSYIPSF-YFQEFFRTVIHEIAHAL  222 (521)
T ss_dssp             -EES-SEEEEEE----STT---EEEEEEE-TTS-EEEEEEE--GGG---S---HHHHHHHHHHHHHHT
T ss_pred             CcCCccEEEEEEEeeccCCceEEEeehhhccCCCCeeEEEEEchhHccch-hhhcccceeeeeeeeee
Confidence            3456778888876554   3477764          26888887777655 57778889999999996


No 70 
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=50.45  E-value=20  Score=31.23  Aligned_cols=40  Identities=13%  Similarity=0.113  Sum_probs=28.1

Q ss_pred             CCCceEEeCCC---eEEEcHhHHhhc-CCHHHHHHHHHHHHHhh
Q 020771          272 PVINAFCLPGG---KIVVFTGLLEHF-RTDAEIATIIGHEPRGL  311 (321)
Q Consensus       272 ~~~NAfalPgG---~I~V~tGLL~~~-~~edeLAaVLaHEiaHv  311 (321)
                      +..-|+|.|+.   .|+|.......= ...+--+.+|-||++|.
T Consensus        64 ~~~~a~~~~~~~~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf  107 (148)
T PF14521_consen   64 NNTYAYVYPDSPTYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF  107 (148)
T ss_dssp             SSSSEEE-TTST-TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred             cceEEEEECCCCceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence            35678899985   599988888732 34567799999999993


No 71 
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=47.64  E-value=23  Score=33.46  Aligned_cols=51  Identities=12%  Similarity=-0.046  Sum_probs=29.1

Q ss_pred             eEEEEEeCCCCc-eEEe-CCCeEEEcHhHHhhcCCHHHH---HHHHHHHHHhhhcc
Q 020771          264 WEVLVVNEPVIN-AFCL-PGGKIVVFTGLLEHFRTDAEI---ATIIGHEPRGLQRI  314 (321)
Q Consensus       264 w~v~Vv~~~~~N-Afal-PgG~I~V~tGLL~~~~~edeL---AaVLaHEiaHv~~r  314 (321)
                      ....++.+...+ +++. .|+.|....+..+.+-+.+.+   .=-+.||+||....
T Consensus       178 ~~~r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~~~~~WG~~HE~GH~~Q~  233 (307)
T PF13402_consen  178 PNNRFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPLRKGGWGPWHELGHNHQQ  233 (307)
T ss_dssp             --EEEEEETT----SEEEETTEEEEETT--HHHH-HHHHHHH-HHHHHHHHHHH-B
T ss_pred             ccceEEEeccccccceeecCCcEEeeCcHHhcccCHhHcCCCCeeehhhhhhhcCc
Confidence            344777777655 5665 577777777766655466655   33589999998654


No 72 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=47.58  E-value=13  Score=33.51  Aligned_cols=15  Identities=20%  Similarity=0.184  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHhhh
Q 020771          298 AEIATIIGHEPRGLQ  312 (321)
Q Consensus       298 deLAaVLaHEiaHv~  312 (321)
                      ....+++-|||||++
T Consensus        90 ~~~~~~i~HElgHaL  104 (198)
T cd04327          90 PEFSRVVLHEFGHAL  104 (198)
T ss_pred             hhHHHHHHHHHHHHh
Confidence            345578899999997


No 73 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=46.26  E-value=12  Score=36.65  Aligned_cols=39  Identities=15%  Similarity=0.181  Sum_probs=23.7

Q ss_pred             CCCCceEEeC---CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771          271 EPVINAFCLP---GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       271 ~~~~NAfalP---gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~  313 (321)
                      .+..-|||.+   ++..+|++..   -++-++ ...|+||+||+.+
T Consensus       194 gK~~gaf~~~~~~~~~p~i~~n~---~~~~~~-v~tl~HE~GHa~h  235 (427)
T cd06459         194 GKRSGAYCTGLPPGKHPFILMNF---NGTLDD-VFTLAHELGHAFH  235 (427)
T ss_pred             CCCCCeecCCCCCCCCCeEEecC---CCChhh-HHHHHHHhhHHHH
Confidence            3345688875   3455654332   234444 4568999999764


No 74 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=40.90  E-value=9.5  Score=39.83  Aligned_cols=45  Identities=16%  Similarity=0.036  Sum_probs=24.9

Q ss_pred             EEEEeCCCCceEEeCCC--eEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          266 VLVVNEPVINAFCLPGG--KIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       266 v~Vv~~~~~NAfalPgG--~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      ++--+....-|||.+.+  .-+|   |+..-.+-++ ..+|+||+||..+.
T Consensus       346 ~~~r~gK~~Ga~~~~~~~~~p~i---l~N~~~~~~d-v~TLaHElGHa~H~  392 (591)
T TIGR00181       346 YAENKGKRSGAYSIGGYKVKPYI---LMNWDGTLNS-VFTLAHELGHSMHS  392 (591)
T ss_pred             ecCCCCCCCCcccCCCCCCCCeE---EEecCCCcch-HHHHHHHhhhHHHH
Confidence            33334455679998542  2222   1222223333 35799999999744


No 75 
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=40.53  E-value=29  Score=38.27  Aligned_cols=15  Identities=20%  Similarity=0.437  Sum_probs=13.4

Q ss_pred             eeeeeccCCCCcccc
Q 020771           81 HVQHFRPRGPRKWLQ   95 (321)
Q Consensus        81 ~~~hf~~rg~r~w~~   95 (321)
                      -+-||.|.+-|+||.
T Consensus       120 ~~Tq~ep~~Ar~~fP  134 (831)
T TIGR02412       120 LYTQFEPADARRVFA  134 (831)
T ss_pred             EEECCCCcCceeeEe
Confidence            378999999999997


No 76 
>PTZ00337 surface protease GP63; Provisional
Probab=40.09  E-value=56  Score=34.96  Aligned_cols=52  Identities=12%  Similarity=0.088  Sum_probs=35.6

Q ss_pred             CCCCCCeEEEEEeCCC--CceEEeCC----------CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          258 HLDGLNWEVLVVNEPV--INAFCLPG----------GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       258 ~l~~~~w~v~Vv~~~~--~NAfalPg----------G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      ++.+.|..++|.-.|.  .=|+|.|-          |.|.+....|.   +.++..-|+.|||+|++
T Consensus       178 Gv~naD~vLyV~a~p~~~~~A~A~~C~~~~~~RPi~G~in~np~~i~---~~~~~~~v~~HEi~HAL  241 (567)
T PTZ00337        178 GVAGADMILYANIFPTSGPAAWAIPCFLLDDGRPFAAAVNFDPRQIA---VTNGDVRVAAHELGHAL  241 (567)
T ss_pred             CccccceEEEEEeccCCCcceEEEEcccccCCCceEEEEEECHHHcc---chhHHHHHHHHHHHHHH
Confidence            4456677777765443  33777642          67777765543   46777889999999997


No 77 
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=38.83  E-value=16  Score=40.68  Aligned_cols=24  Identities=13%  Similarity=0.133  Sum_probs=16.4

Q ss_pred             cceeEeecceeeeeccCCCCcccc
Q 020771           72 KRYYYVDRYHVQHFRPRGPRKWLQ   95 (321)
Q Consensus        72 ~r~y~~~~~~~~hf~~rg~r~w~~   95 (321)
                      .-+|.....-+-+|.|.|-|+|+.
T Consensus        99 ~GlY~s~~~~~TQ~Ep~gaR~ifp  122 (863)
T TIGR02414        99 EGLYKSGGNFCTQCEAEGFRRITY  122 (863)
T ss_pred             eEEEEeCCeEEEEecCCCCCcCCC
Confidence            345555555678999999996554


No 78 
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=38.70  E-value=23  Score=39.29  Aligned_cols=51  Identities=16%  Similarity=0.148  Sum_probs=32.9

Q ss_pred             CCeEEEEEeCCCCceEEeCC-CeEEEcHhH-Hhhc---C--CHHHHHHHHHHHHHhhh
Q 020771          262 LNWEVLVVNEPVINAFCLPG-GKIVVFTGL-LEHF---R--TDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       262 ~~w~v~Vv~~~~~NAfalPg-G~I~V~tGL-L~~~---~--~edeLAaVLaHEiaHv~  312 (321)
                      .+|...+|.-|..|+-+|=| |.|.+.+.. |..-   .  +-...+.|++||++|--
T Consensus       262 y~l~~~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqW  319 (859)
T COG0308         262 YALPIDIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQW  319 (859)
T ss_pred             CCCcccEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhc
Confidence            34444677778888888865 566666663 2210   0  12567779999999963


No 79 
>PF06861 BALF1:  BALF1 protein;  InterPro: IPR010677  Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members []. 
Probab=38.29  E-value=21  Score=32.84  Aligned_cols=25  Identities=16%  Similarity=0.171  Sum_probs=21.7

Q ss_pred             hhcCCHHHHHHHHHHHHHhhhcccC
Q 020771          292 EHFRTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       292 ~~~~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      ..++|+.|-++|++|++||...+|-
T Consensus       124 ~~l~~d~e~~s~v~~~lA~Fy~~~r  148 (182)
T PF06861_consen  124 RNLLNDHENASLVSHALAHFYLRYR  148 (182)
T ss_pred             HHhcCchHHHHHHHHHHHHHHHHHH
Confidence            4567999999999999999988873


No 80 
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=38.05  E-value=19  Score=34.03  Aligned_cols=26  Identities=27%  Similarity=0.290  Sum_probs=17.3

Q ss_pred             EEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          284 IVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       284 I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      |.|-.||=..+     ..+|||||+.|+-.+
T Consensus        82 Ilvl~GLPrll-----~gsiLAHE~mHa~Lr  107 (212)
T PF12315_consen   82 ILVLYGLPRLL-----TGSILAHELMHAWLR  107 (212)
T ss_pred             EEEECCCCHHH-----HhhHHHHHHHHHHhc
Confidence            44555543332     478999999998654


No 81 
>PF09715 Plasmod_dom_1:  Plasmodium protein of unknown function (Plasmod_dom_1);  InterPro: IPR006410 These sequences represent an uncharacterised family consisting of a small number of hypothetical proteins of the malaria parasite Plasmodium falciparum (isolate 3D7). 
Probab=37.10  E-value=15  Score=28.79  Aligned_cols=31  Identities=23%  Similarity=0.490  Sum_probs=23.4

Q ss_pred             hhhhhhHHHHHHHhhcccCCCCCCCcccccc
Q 020771            2 AYYRRAKAALDAFRNLSSKAVPKSPVQESCS   32 (321)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   32 (321)
                      ++++|+|+.+|+|-++-+.-+=.++++...|
T Consensus         1 sFFKk~klvm~~fDnifidklId~~i~nk~s   31 (67)
T PF09715_consen    1 SFFKKAKLVMDIFDNIFIDKLIDNNIQNKSS   31 (67)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHccccccCCC
Confidence            4799999999999996554444667766655


No 82 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=36.55  E-value=39  Score=30.69  Aligned_cols=17  Identities=12%  Similarity=-0.035  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHhhh-ccc
Q 020771          299 EIATIIGHEPRGLQ-RIC  315 (321)
Q Consensus       299 eLAaVLaHEiaHv~-~rH  315 (321)
                      -++-++.||+||.. ..|
T Consensus       124 R~~k~~~HElGH~lGL~H  141 (179)
T PRK13267        124 RVRKEVTHELGHTLGLEH  141 (179)
T ss_pred             HHHHHHHHHHHHHcCCcc
Confidence            44555899999995 455


No 83 
>PRK14015 pepN aminopeptidase N; Provisional
Probab=34.31  E-value=26  Score=39.19  Aligned_cols=24  Identities=13%  Similarity=0.153  Sum_probs=18.7

Q ss_pred             cceeEeecceeeeeccCCCCcccc
Q 020771           72 KRYYYVDRYHVQHFRPRGPRKWLQ   95 (321)
Q Consensus        72 ~r~y~~~~~~~~hf~~rg~r~w~~   95 (321)
                      .-+|..+.+-+-+|.|.|-|+|+.
T Consensus       112 ~Gly~s~~~~~TQ~Ep~gAR~~fP  135 (875)
T PRK14015        112 EGLYRSGGMFCTQCEAEGFRRITY  135 (875)
T ss_pred             eeeEEECCEEEEeccccCcCCccc
Confidence            346666666678999999999886


No 84 
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=33.75  E-value=18  Score=37.56  Aligned_cols=18  Identities=11%  Similarity=-0.018  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHhhh-cccC
Q 020771          299 EIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       299 eLAaVLaHEiaHv~-~rH~  316 (321)
                      .|..|.+|||||++ +.|.
T Consensus       210 ~l~~Va~HEiGH~LGL~HS  228 (469)
T KOG1565|consen  210 DLFLVAAHEIGHALGLGHS  228 (469)
T ss_pred             hhHHHhhhhcccccccCCC
Confidence            58899999999987 6665


No 85 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=33.21  E-value=21  Score=37.14  Aligned_cols=41  Identities=15%  Similarity=0.276  Sum_probs=24.7

Q ss_pred             eCCCCceEEeC--C-CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          270 NEPVINAFCLP--G-GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       270 ~~~~~NAfalP--g-G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      +.....|||.+  . +.-+|+   +..-.+-+.+ .+|+||+||+.+.
T Consensus       308 ~gK~~Gayc~~~~~~~~P~I~---~Nf~~t~~dv-~TL~HElGHa~H~  351 (549)
T TIGR02289       308 KGKAAGGYCTYLPKYKAPFIF---SNFNGTSGDI-DVLTHEAGHAFHV  351 (549)
T ss_pred             CCCCCCcccCCCCCCCCcEEE---EeCCCChhHH-HHHHHHhhHHHHH
Confidence            34567899985  2 334442   3333344444 5699999999744


No 86 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=32.97  E-value=31  Score=30.24  Aligned_cols=23  Identities=13%  Similarity=0.199  Sum_probs=18.5

Q ss_pred             CCCHHHHHHHHHHHHHHHHHHhc
Q 020771          132 LLSKAVERQLGESQFQQMKAAFK  154 (321)
Q Consensus       132 llS~~~E~~Lgeq~y~qil~e~~  154 (321)
                      -.|.+...+|+.++|.++=++|.
T Consensus        11 aps~~~fe~La~~A~d~lP~efr   33 (136)
T COG3824          11 APSLERFEELASDALDHLPQEFR   33 (136)
T ss_pred             CCCHHHHHHHHHHHHHhCcHHHH
Confidence            57888888999999987777764


No 87 
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=32.93  E-value=25  Score=35.24  Aligned_cols=17  Identities=35%  Similarity=0.378  Sum_probs=15.8

Q ss_pred             CHHHHHHHHHHHHHhhh
Q 020771          296 TDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~  312 (321)
                      +|.+||.+|=||++|.+
T Consensus       161 ~~~~LA~LIfHELaHq~  177 (337)
T PF10023_consen  161 PDGELARLIFHELAHQT  177 (337)
T ss_pred             CchHHHHHHHHHHhhce
Confidence            79999999999999974


No 88 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=31.70  E-value=8  Score=36.25  Aligned_cols=20  Identities=10%  Similarity=0.024  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHhhh-cccCCC
Q 020771          299 EIATIIGHEPRGLQ-RICGLP  318 (321)
Q Consensus       299 eLAaVLaHEiaHv~-~rH~l~  318 (321)
                      -+..+|.||+||.+ +.|+.+
T Consensus       136 n~g~t~~HEvGH~lGL~HtF~  156 (225)
T cd04275         136 NLGDTATHEVGHWLGLYHTFQ  156 (225)
T ss_pred             cccceeEEeccceeeeeeeec
Confidence            46678999999987 888754


No 89 
>TIGR01519 plasmod_dom_1 Plasmodium falciparum uncharacterized domain. This model represents an uncharacterized domain present in roughly eight hypothetical proteins of the malaria parasite Plasmodium falciparum.
Probab=31.59  E-value=22  Score=28.12  Aligned_cols=31  Identities=19%  Similarity=0.379  Sum_probs=23.9

Q ss_pred             hhhhhhHHHHHHHhhcccCCCCCCCcccccc
Q 020771            2 AYYRRAKAALDAFRNLSSKAVPKSPVQESCS   32 (321)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   32 (321)
                      ++++|+|+.+|+|-++-+.-+=.++++...|
T Consensus         1 sFFKkaklvm~~fDnifidklid~~v~nk~S   31 (70)
T TIGR01519         1 SFFKRALLLMNIFDNIFIDKLIDNNVHNKGS   31 (70)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHccccccCCC
Confidence            4799999999999996555555677776655


No 90 
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=30.78  E-value=13  Score=40.45  Aligned_cols=32  Identities=19%  Similarity=0.106  Sum_probs=25.0

Q ss_pred             CCeEEEcHhHHhhcC-----CHHHHHHHHHHHHHhhh
Q 020771          281 GGKIVVFTGLLEHFR-----TDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       281 gG~I~V~tGLL~~~~-----~edeLAaVLaHEiaHv~  312 (321)
                      |-+..++|||+....     =..+---|+||||||.-
T Consensus       368 G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNf  404 (764)
T KOG3658|consen  368 GKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNF  404 (764)
T ss_pred             cceEEeecceeeeeecCCccCcchhheeehhhhcccc
Confidence            557999999998764     24556678999999974


No 91 
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=30.64  E-value=39  Score=30.72  Aligned_cols=29  Identities=17%  Similarity=0.227  Sum_probs=22.6

Q ss_pred             CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      ...|+|-..   .+.+..+|.-+|+||+-|+-
T Consensus        55 ~~~I~lC~N---~~~~~~~l~~~l~HELIHay   83 (173)
T PF09768_consen   55 KKGIVLCQN---RIRSQGHLEDTLTHELIHAY   83 (173)
T ss_pred             CCCEEEeeC---CCCCHHHHHHHHHHHHHHHH
Confidence            556666543   45799999999999999984


No 92 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=29.90  E-value=27  Score=34.29  Aligned_cols=15  Identities=20%  Similarity=0.056  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHHHhhh
Q 020771          298 AEIATIIGHEPRGLQ  312 (321)
Q Consensus       298 deLAaVLaHEiaHv~  312 (321)
                      -.+-++++||+||..
T Consensus       194 ~~~P~T~~HElAHq~  208 (318)
T PF12725_consen  194 YSLPFTICHELAHQL  208 (318)
T ss_pred             ccccHHHHHHHHHHh
Confidence            346689999999974


No 93 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=29.88  E-value=99  Score=33.59  Aligned_cols=55  Identities=15%  Similarity=0.184  Sum_probs=35.8

Q ss_pred             CCCCCCeEEEEEeCCC---CceEEeC----------CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771          258 HLDGLNWEVLVVNEPV---INAFCLP----------GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       258 ~l~~~~w~v~Vv~~~~---~NAfalP----------gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~  313 (321)
                      +..+.|.-++|.-.|.   .=|+|.+          -|.|.+.---|.. ..+.+..-|+.|||+|++-
T Consensus       202 Gv~naD~vLYVta~P~~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i~s-~~~~~~~rv~~HEi~HALG  269 (622)
T PTZ00257        202 GVSNADFVLYVASVPSEPGVLAWATTCQVFSDDHPAVGVMNIPAANIVS-RYDQGTTRTVTHEVAHALG  269 (622)
T ss_pred             CccccCEEEEEEEecCCCCeeEEEEEeccccCCCceEEEEeeCHHHCCC-ccchHHHHHHHHHHHHHhc
Confidence            3456778888876553   2377764          1567766544432 2345678999999999973


No 94 
>PF02571 CbiJ:  Precorrin-6x reductase CbiJ/CobK;  InterPro: IPR003723 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase []. In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase [].  There are at least two distinct cobalamin biosynthetic pathways in bacteria []:  Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway []; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis []; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii subsp. shermanii.   Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) []. There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobK and CbiJ precorrin-6x reductase (1.3.1.54 from EC). In the aerobic pathway, CobK catalyses the reduction of the macrocycle of precorrin-6X to produce precorrin-6Y; while in the anaerobic pathway CbiJ catalyses the reduction of the macrocycle of cobalt-precorrin-6X into cobalt-precorrin-6Y [, ].; GO: 0016994 precorrin-6A reductase activity, 0009236 cobalamin biosynthetic process, 0055114 oxidation-reduction process
Probab=29.58  E-value=50  Score=31.36  Aligned_cols=37  Identities=14%  Similarity=0.127  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHHhcC-CCCCCCChhHHHHHHHHHHHHHH
Q 020771          142 GESQFQQMKAAFKG-KILPAIHPDSVRVRLIAKDIIEA  178 (321)
Q Consensus       142 geq~y~qil~e~~~-kiLP~~hp~s~rVr~Ia~rIi~a  178 (321)
                      ++.+.++++++.+- .++.++||....|..-+.+..+.
T Consensus        54 ~~~~l~~~l~~~~i~~vIDATHPfA~~is~na~~a~~~   91 (249)
T PF02571_consen   54 DEEGLAEFLRENGIDAVIDATHPFAAEISQNAIEACRE   91 (249)
T ss_pred             CHHHHHHHHHhCCCcEEEECCCchHHHHHHHHHHHHhh
Confidence            44556667776663 48899999999988766666554


No 95 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=27.44  E-value=35  Score=37.46  Aligned_cols=21  Identities=19%  Similarity=0.088  Sum_probs=16.6

Q ss_pred             CHHHHHHHHHHHHHhhh-cccC
Q 020771          296 TDAEIATIIGHEPRGLQ-RICG  316 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~-~rH~  316 (321)
                      +....|.||||||||.+ ..|.
T Consensus       319 ~~~~~a~v~AhelgH~lGm~hD  340 (716)
T KOG3607|consen  319 ILLAFAVVLAHELGHNLGMIHD  340 (716)
T ss_pred             cchhHHHHHHHHHHhhcCcccc
Confidence            56778999999999986 4443


No 96 
>KOG1046 consensus Puromycin-sensitive aminopeptidase and related aminopeptidases [Amino acid transport and metabolism; Posttranslational modification, protein turnover, chaperones]
Probab=26.24  E-value=94  Score=34.77  Aligned_cols=47  Identities=23%  Similarity=0.222  Sum_probs=26.5

Q ss_pred             EEEEeCCCCceEEeCC-CeEEEc-HhHHhhc-----CCHHHHHHHHHHHHHhhh
Q 020771          266 VLVVNEPVINAFCLPG-GKIVVF-TGLLEHF-----RTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       266 v~Vv~~~~~NAfalPg-G~I~V~-tGLL~~~-----~~edeLAaVLaHEiaHv~  312 (321)
                      +-+|--|+..+-||=+ |.|..- +-||-.-     .+...+|.|+|||+||--
T Consensus       283 ~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQW  336 (882)
T KOG1046|consen  283 LDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQW  336 (882)
T ss_pred             ccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHH
Confidence            3344444555444433 444443 3333221     246779999999999963


No 97 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=26.24  E-value=40  Score=31.42  Aligned_cols=14  Identities=21%  Similarity=0.116  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHhhh
Q 020771          299 EIATIIGHEPRGLQ  312 (321)
Q Consensus       299 eLAaVLaHEiaHv~  312 (321)
                      =+.+|+-||+||..
T Consensus        52 l~~~v~iHElgH~~   65 (227)
T cd06164          52 LFASVLLHELGHSL   65 (227)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999974


No 98 
>PF07225 NDUF_B4:  NADH-ubiquinone oxidoreductase B15 subunit (NDUFB4);  InterPro: IPR009866  NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This family contains human NADH-ubiquinone oxidoreductase subunit NDUFB4 and related sequences.; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0005739 mitochondrion
Probab=26.03  E-value=38  Score=29.43  Aligned_cols=47  Identities=30%  Similarity=0.507  Sum_probs=27.9

Q ss_pred             ccCCCCCC-----CC-cce-eEeecceeeeeccCCCCccccCCchhHHHHHHHHHHHHHHHh
Q 020771           62 SYCNPNFN-----TA-KRY-YYVDRYHVQHFRPRGPRKWLQNPRTVFIVVVIGSGAFITLYL  116 (321)
Q Consensus        62 ~~~~~~~~-----~~-~r~-y~~~~~~~~hf~~rg~r~w~~~~r~~~~~v~vg~g~~~~~y~  116 (321)
                      ...||+-.     ++ -|+ |.-.-..|.||.|.        ||+.++.++.+++.++++|+
T Consensus        46 q~~nP~~~gli~DPAL~Rw~~a~~~~~y~~FRpT--------Pktsllg~~~~v~P~i~~~~   99 (125)
T PF07225_consen   46 QYNNPHRKGLIFDPALQRWAYARAVNIYEYFRPT--------PKTSLLGLGFGVVPLIFYYY   99 (125)
T ss_pred             HcCCCCCCCccCChHHHHHHHHHHhCcccccccC--------chHHHHHHHHHHHHHHHHHh
Confidence            45666631     22 467 55444789999887        47777666665554444433


No 99 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=25.54  E-value=38  Score=34.09  Aligned_cols=17  Identities=12%  Similarity=0.122  Sum_probs=13.3

Q ss_pred             CHHHHHHHHHHHHHhhhc
Q 020771          296 TDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       296 ~edeLAaVLaHEiaHv~~  313 (321)
                      +-+++ ..|+||+||+.+
T Consensus       239 ~~~~v-~tLfHE~GHa~H  255 (458)
T PF01432_consen  239 SHDDV-ETLFHEFGHAMH  255 (458)
T ss_dssp             SHHHH-HHHHHHHHHHHH
T ss_pred             ChhhH-HHHHHHHhHHHH
Confidence            55666 679999999864


No 100
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=25.21  E-value=50  Score=32.80  Aligned_cols=45  Identities=24%  Similarity=0.261  Sum_probs=27.9

Q ss_pred             CCCeEEEEEeCCCCceEEeCCCeEEEcHhHHhhc--CCHHHHHHHHHHHHHhh
Q 020771          261 GLNWEVLVVNEPVINAFCLPGGKIVVFTGLLEHF--RTDAEIATIIGHEPRGL  311 (321)
Q Consensus       261 ~~~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~--~~edeLAaVLaHEiaHv  311 (321)
                      ..-|+|+|-   .+-|+++-|   ....-||..+  .+|.+||.+|=||++|.
T Consensus       162 aqG~DVyV~---Gv~AYSTLG---wFdDPlLstmlr~dd~~lA~LIFHELAHQ  208 (376)
T COG4324         162 AQGLDVYVS---GVTAYSTLG---WFDDPLLSTMLRQDDTYLASLIFHELAHQ  208 (376)
T ss_pred             hcCCceeee---cccceeccc---ccccHHHHHHhcCChHHHHHHHHHHHhhh
Confidence            344666663   345555533   2223344322  38999999999999996


No 101
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=24.05  E-value=48  Score=30.09  Aligned_cols=16  Identities=25%  Similarity=-0.056  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHhhh
Q 020771          297 DAEIATIIGHEPRGLQ  312 (321)
Q Consensus       297 edeLAaVLaHEiaHv~  312 (321)
                      -.=+..++.||+||..
T Consensus        38 l~l~~~l~iHElgH~~   53 (183)
T cd06160          38 LALLAILGIHEMGHYL   53 (183)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3456788999999974


No 102
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.96  E-value=62  Score=32.74  Aligned_cols=35  Identities=9%  Similarity=-0.047  Sum_probs=29.4

Q ss_pred             CCeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcccC
Q 020771          281 GGKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRICG  316 (321)
Q Consensus       281 gG~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~rH~  316 (321)
                      +=.++.+--++..| ..+++-+.|-||+-|+...|.
T Consensus        51 ~~~~y~NPei~~~~-p~~~~~aLl~HEV~Hi~l~Hi   85 (396)
T COG3864          51 YFTMYFNPEIFLNC-PISEMKALLKHEVYHIMLNHI   85 (396)
T ss_pred             ceEEEeCHHHHccC-CHHHHHHHHHHHHHHHHHHHH
Confidence            33577777777777 799999999999999999995


No 103
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=23.92  E-value=39  Score=35.46  Aligned_cols=44  Identities=16%  Similarity=0.091  Sum_probs=23.5

Q ss_pred             EEEEeCCCCceEEeC--CCe-EEEcHhHHhhcCCHHHHHHHHHHHHHhhhc
Q 020771          266 VLVVNEPVINAFCLP--GGK-IVVFTGLLEHFRTDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       266 v~Vv~~~~~NAfalP--gG~-I~V~tGLL~~~~~edeLAaVLaHEiaHv~~  313 (321)
                      ++--+.....|||.+  ++. -+|..   ..-.+-+++ .+|+||+||+.+
T Consensus       342 ~~~r~gK~~Ga~~~~~~~~~~p~i~~---N~~~~~~~v-~TL~HE~GHa~H  388 (587)
T TIGR02290       342 AEPRPGKRGGAFCTGFPPSKEPRVLM---NYDGSRRDV-STLAHELGHAYH  388 (587)
T ss_pred             cCCCCCCCCCcccCCCCCCCCCEEEE---ecCCCchhH-HHHHHHhhHHHH
Confidence            333344556788864  222 23221   111233333 569999999975


No 104
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=23.80  E-value=44  Score=30.14  Aligned_cols=12  Identities=25%  Similarity=0.005  Sum_probs=10.4

Q ss_pred             HHHHHHHHHhhh
Q 020771          301 ATIIGHEPRGLQ  312 (321)
Q Consensus       301 AaVLaHEiaHv~  312 (321)
                      .+++.||++|++
T Consensus        80 ~~~i~HEl~HaL   91 (191)
T PF01400_consen   80 VGTILHELGHAL   91 (191)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             ccchHHHHHHHH
Confidence            468999999997


No 105
>PF08014 DUF1704:  Domain of unknown function (DUF1704);  InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=23.01  E-value=1.1e+02  Score=30.81  Aligned_cols=47  Identities=15%  Similarity=0.345  Sum_probs=31.7

Q ss_pred             CeEEEEEeCCCCceEEeCCCeEEEcHhHHhhcCCHHHHHHHHHHHHH-hhhc
Q 020771          263 NWEVLVVNEPVINAFCLPGGKIVVFTGLLEHFRTDAEIATIIGHEPR-GLQR  313 (321)
Q Consensus       263 ~w~v~Vv~~~~~NAfalPgG~I~V~tGLL~~~~~edeLAaVLaHEia-Hv~~  313 (321)
                      ++.|.+-++=..+|. +.++.|.|.++  ..+ ++.++.+++-|||| |+..
T Consensus       132 ~~~V~~sddl~a~A~-v~~~~l~I~~~--~~f-s~~~l~~L~~HEigvH~lt  179 (349)
T PF08014_consen  132 EVKVELSDDLLARAM-VSGDRLKINKN--AMF-SERDLEALLHHEIGVHLLT  179 (349)
T ss_pred             eEEEEEcCCcchhhc-ccCCeeEEcCC--CCc-CHHHHHHHHHHhhhhhhcc
Confidence            444444444345664 55778888876  223 79999999999995 6653


No 106
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=22.99  E-value=38  Score=36.15  Aligned_cols=48  Identities=13%  Similarity=0.097  Sum_probs=28.9

Q ss_pred             Ce-EEEEEeCCCCceEEeCC---CeEEEcHhHHhhcCCHHHHHHHHHHHHHhhhcc
Q 020771          263 NW-EVLVVNEPVINAFCLPG---GKIVVFTGLLEHFRTDAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       263 ~w-~v~Vv~~~~~NAfalPg---G~I~V~tGLL~~~~~edeLAaVLaHEiaHv~~r  314 (321)
                      .| ++..-+.+..-|||.+.   ++.+|.-   ....+... ..+|+||+||..+.
T Consensus       343 ~WiD~~~~~gKrsGaYs~~~~~~~~p~Ilm---N~~gt~~d-V~TLaHElGHs~Hs  394 (598)
T COG1164         343 RWIDVYPRKGKRSGAYSIGFYKGDHPFILM---NYDGTLRD-VFTLAHELGHSVHS  394 (598)
T ss_pred             CCeeccCCCCCCCCcccCCCCCCCCCeEEE---eCCCchhH-HHHHHHHccHHHHH
Confidence            44 33333455688999863   5566643   33333333 36799999998653


No 107
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=22.62  E-value=45  Score=34.15  Aligned_cols=17  Identities=18%  Similarity=0.073  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHhhhcc
Q 020771          297 DAEIATIIGHEPRGLQRI  314 (321)
Q Consensus       297 edeLAaVLaHEiaHv~~r  314 (321)
                      -+++ ..|+||+||+.+.
T Consensus       261 ~~~V-~TLfHEfGHalH~  277 (472)
T cd06455         261 HDEV-ETFFHEFGHVIHH  277 (472)
T ss_pred             HHHH-HHHHHHHHHHHHH
Confidence            3554 5689999999753


No 108
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=22.49  E-value=40  Score=31.14  Aligned_cols=12  Identities=25%  Similarity=0.146  Sum_probs=10.5

Q ss_pred             HHHHHHHHHhhh
Q 020771          301 ATIIGHEPRGLQ  312 (321)
Q Consensus       301 AaVLaHEiaHv~  312 (321)
                      .+++.|||+|++
T Consensus        88 ~Gti~HEl~HaL   99 (200)
T cd04281          88 FGIVVHELGHVI   99 (200)
T ss_pred             CchHHHHHHHHh
Confidence            368999999997


No 109
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=21.90  E-value=42  Score=30.59  Aligned_cols=12  Identities=33%  Similarity=0.180  Sum_probs=10.5

Q ss_pred             HHHHHHHHHhhh
Q 020771          301 ATIIGHEPRGLQ  312 (321)
Q Consensus       301 AaVLaHEiaHv~  312 (321)
                      .+++.||++|++
T Consensus        78 ~G~i~HEl~HaL   89 (182)
T cd04283          78 KGIIQHELLHAL   89 (182)
T ss_pred             cchHHHHHHHHh
Confidence            378999999997


No 110
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=21.54  E-value=1.3e+02  Score=30.49  Aligned_cols=42  Identities=21%  Similarity=0.214  Sum_probs=22.9

Q ss_pred             CCCceEEeCC-CeEEEcHhHHhhcC------CHHHHHHHHHHHHHhhhc
Q 020771          272 PVINAFCLPG-GKIVVFTGLLEHFR------TDAEIATIIGHEPRGLQR  313 (321)
Q Consensus       272 ~~~NAfalPg-G~I~V~tGLL~~~~------~edeLAaVLaHEiaHv~~  313 (321)
                      +.+-|+++|. +.|+..--.+..+.      ....-++++-||+.|...
T Consensus       262 ~~vlAYT~p~~~~I~~Cp~ff~~lp~~~~~C~~qDqatt~LHE~TH~~~  310 (359)
T PF02102_consen  262 SGVLAYTLPSQNQIVNCPIFFSDLPALSNRCHAQDQATTTLHEMTHAPA  310 (359)
T ss_dssp             TT--EEEEGGGTEEEE-HHHHHH--SS--STT---HHHHHHHHHHT-TT
T ss_pred             CCeEEEEEcCCCeEEECchhhccCCCccccccCCCccchhhhhhhcccc
Confidence            4577999986 45555544443221      345578999999999853


No 111
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=21.13  E-value=24  Score=33.93  Aligned_cols=12  Identities=25%  Similarity=0.166  Sum_probs=10.5

Q ss_pred             HHHHHHHHHhhh
Q 020771          301 ATIIGHEPRGLQ  312 (321)
Q Consensus       301 AaVLaHEiaHv~  312 (321)
                      ..|++||++|..
T Consensus       166 igv~~HE~gH~l  177 (286)
T TIGR03296       166 VGVIAHELGHDL  177 (286)
T ss_pred             eeeeehhhhccc
Confidence            589999999964


No 112
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=20.87  E-value=85  Score=27.33  Aligned_cols=26  Identities=19%  Similarity=0.252  Sum_probs=18.3

Q ss_pred             EcHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          286 VFTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       286 V~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      +-+-=++.+ +-+|-.-||.||+.|+-
T Consensus        67 l~sekF~rL-s~~ekvKviiHEllHIP   92 (133)
T COG4900          67 LLSEKFKRL-SCAEKVKVIIHELLHIP   92 (133)
T ss_pred             eehhhcCCC-ChHHHHHHHHHHHhcCc
Confidence            333344555 56777789999999984


No 113
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=20.27  E-value=62  Score=31.15  Aligned_cols=33  Identities=24%  Similarity=0.362  Sum_probs=20.5

Q ss_pred             CceEEeCCCeEEE--cHhHHhhcCCHHHHHHHHHHHHHhhh
Q 020771          274 INAFCLPGGKIVV--FTGLLEHFRTDAEIATIIGHEPRGLQ  312 (321)
Q Consensus       274 ~NAfalPgG~I~V--~tGLL~~~~~edeLAaVLaHEiaHv~  312 (321)
                      .|.-.+||..+++  ..+++      .=+.+|+-||+||..
T Consensus        96 ~~~~~ipGv~~~i~~~~~~i------aL~isv~iHElgHa~  130 (263)
T cd06159          96 RNVLVIPGVNIFIPLPYGII------ALVVGVVVHELSHGI  130 (263)
T ss_pred             ceeeccCCcchHHHHHHHHH------HHHHHHHHHHHHHHH
Confidence            3444678766553  22222      245689999999975


Done!