Query         010667
Match_columns 504
No_of_seqs    317 out of 571
Neff          4.5 
Searched_HMMs 46136
Date          Fri Mar 29 03:11:28 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/010667.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/010667hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF08325 WLM:  WLM domain;  Int 100.0   5E-44 1.1E-48  340.0  11.5  155    2-162    20-186 (186)
  2 KOG4842 Protein involved in si  99.9 4.7E-23   1E-27  202.9   4.3  143    1-158   136-278 (278)
  3 PF09409 PUB:  PUB domain;  Int  99.8 5.7E-19 1.2E-23  148.4   8.1   83  409-498     2-87  (87)
  4 KOG4842 Protein involved in si  99.7 2.7E-18 5.8E-23  169.6   6.6  166    2-173    49-236 (278)
  5 smart00580 PUG domain in prote  99.0   6E-10 1.3E-14   88.5   4.8   49  415-469     1-49  (58)
  6 PF01863 DUF45:  Protein of unk  98.5 9.3E-08   2E-12   90.4   4.7   62   38-104   138-199 (205)
  7 COG1451 Predicted metal-depend  97.9 7.2E-06 1.6E-10   81.0   4.1   62   38-104   149-210 (223)
  8 smart00731 SprT SprT homologue  96.8 0.00097 2.1E-08   61.3   3.2   34   64-97     54-93  (146)
  9 PF10263 SprT-like:  SprT-like   95.6   0.012 2.6E-07   53.7   3.8   59   39-98     30-97  (157)
 10 KOG2699 Predicted ubiquitin re  94.8   0.011 2.4E-07   63.3   1.0   64  416-486   310-373 (407)
 11 PRK04351 hypothetical protein;  91.9    0.16 3.6E-06   47.7   3.7   31   66-96     58-95  (149)
 12 PRK04860 hypothetical protein;  90.4     0.3 6.4E-06   46.5   3.7   31   65-95     59-96  (160)
 13 KOG2699 Predicted ubiquitin re  87.3    0.28 6.1E-06   52.9   1.5   62  398-463   152-214 (407)
 14 COG3091 SprT Zn-dependent meta  85.5    0.94   2E-05   43.1   3.8   30   67-96     59-95  (156)
 15 KOG3854 SPRT-like metalloprote  80.7     1.3 2.8E-05   48.7   3.0   41   46-91    333-380 (505)
 16 PF13485 Peptidase_MA_2:  Pepti  79.8     1.8 3.9E-05   36.5   3.1   22   65-86     21-42  (128)
 17 cd04270 ZnMc_TACE_like Zinc-de  76.5       2 4.2E-05   43.0   2.7   28   62-89    160-187 (244)
 18 PF06114 DUF955:  Domain of unk  74.3     3.1 6.7E-05   34.8   3.0   26   64-89     37-62  (122)
 19 PF04450 BSP:  Peptidase of pla  69.4     3.2 6.9E-05   41.0   2.2   53   49-104    74-131 (205)
 20 COG4900 Predicted metallopepti  68.5     6.3 0.00014   36.2   3.7   73   19-95     26-114 (133)
 21 PF05569 Peptidase_M56:  BlaR1   68.5       5 0.00011   40.6   3.5   23   64-86    191-213 (299)
 22 PRK04897 heat shock protein Ht  68.2     3.7   8E-05   42.3   2.5   66   12-85     88-153 (298)
 23 PRK03072 heat shock protein Ht  67.8     3.8 8.3E-05   42.0   2.5   23   63-85    121-143 (288)
 24 KOG3931 Uncharacterized conser  67.5      11 0.00023   40.7   5.7   51   46-97     83-142 (484)
 25 PRK03982 heat shock protein Ht  66.7     4.2   9E-05   41.5   2.5   66   12-85     76-141 (288)
 26 COG0501 HtpX Zn-dependent prot  65.7     4.1 8.9E-05   40.6   2.2   24   63-86    151-174 (302)
 27 PRK03001 M48 family peptidase;  64.8     4.8  0.0001   41.0   2.5   65   12-84     75-139 (283)
 28 PF10460 Peptidase_M30:  Peptid  64.4     5.9 0.00013   42.5   3.2   41   44-84    114-154 (366)
 29 PRK02391 heat shock protein Ht  62.9     5.4 0.00012   41.3   2.5   21   64-84    128-148 (296)
 30 PF01435 Peptidase_M48:  Peptid  62.7     5.4 0.00012   37.8   2.3   44   40-88     65-108 (226)
 31 PRK02870 heat shock protein Ht  62.4     5.4 0.00012   42.2   2.5   21   64-84    168-188 (336)
 32 PRK05457 heat shock protein Ht  62.2     5.7 0.00012   40.8   2.5   21   64-84    129-149 (284)
 33 cd04271 ZnMc_ADAM_fungal Zinc-  60.8     3.5 7.5E-05   41.0   0.7   18   71-88    147-164 (228)
 34 PRK01345 heat shock protein Ht  60.3     6.4 0.00014   41.1   2.5   21   64-84    119-139 (317)
 35 PRK01265 heat shock protein Ht  58.5     7.1 0.00015   41.2   2.5   65   12-84     91-155 (324)
 36 PF12388 Peptidase_M57:  Dual-a  56.9      10 0.00022   37.9   3.1   40   44-88    113-152 (211)
 37 PF13688 Reprolysin_5:  Metallo  54.8     8.8 0.00019   36.1   2.3   25   64-88    137-161 (196)
 38 cd04272 ZnMc_salivary_gland_MP  53.6     8.3 0.00018   37.5   2.0   20   69-88    145-164 (220)
 39 PF13582 Reprolysin_3:  Metallo  53.0     7.4 0.00016   33.8   1.4   16   71-86    109-124 (124)
 40 PF13058 DUF3920:  Protein of u  52.5     8.3 0.00018   35.3   1.6   20   63-82     70-89  (126)
 41 cd04269 ZnMc_adamalysin_II_lik  50.6      12 0.00025   35.5   2.4   23   66-88    128-150 (194)
 42 PRK14015 pepN aminopeptidase N  49.3      10 0.00022   44.9   2.1   20   65-84    292-311 (875)
 43 TIGR02414 pepN_proteo aminopep  48.9      11 0.00023   44.8   2.1   21   64-84    278-298 (863)
 44 cd04267 ZnMc_ADAM_like Zinc-de  48.6       9  0.0002   36.1   1.3   22   67-88    131-152 (192)
 45 COG0308 PepN Aminopeptidase N   48.5      11 0.00023   44.5   2.1   25   62-86    300-324 (859)
 46 TIGR02412 pepN_strep_liv amino  48.3      11 0.00025   44.0   2.3   18   66-83    284-301 (831)
 47 PF01447 Peptidase_M4:  Thermol  48.0      12 0.00026   35.2   2.0   20   62-81    128-147 (150)
 48 PF14521 Aspzincin_M35:  Lysine  46.6      11 0.00024   35.0   1.5   31   50-82     78-109 (148)
 49 PF13574 Reprolysin_2:  Metallo  46.2      12 0.00027   35.4   1.8   20   69-88    111-130 (173)
 50 TIGR02411 leuko_A4_hydro leuko  45.2      12 0.00026   42.4   1.7   16   69-84    279-294 (601)
 51 PF01433 Peptidase_M1:  Peptida  43.3      17 0.00036   37.5   2.4   19   66-84    292-310 (390)
 52 KOG1046 Puromycin-sensitive am  42.1      16 0.00035   43.2   2.3   21   64-84    319-339 (882)
 53 PF01421 Reprolysin:  Reprolysi  40.1      22 0.00047   33.9   2.4   25   64-88    126-150 (199)
 54 PF13699 DUF4157:  Domain of un  39.3      15 0.00032   31.0   1.1   33   47-83     43-75  (79)
 55 PF06262 DUF1025:  Possibl zinc  39.1      21 0.00046   31.5   2.0   17   66-82     70-86  (97)
 56 PF08919 F_actin_bind:  F-actin  38.6      41  0.0009   30.5   3.8   73  387-467     7-85  (110)
 57 PF02102 Peptidase_M35:  Deuter  37.8      17 0.00037   39.1   1.4   21   68-88    296-318 (359)
 58 PF00413 Peptidase_M10:  Matrix  37.6      22 0.00048   31.8   2.0   24   64-87    100-123 (154)
 59 PF10023 DUF2265:  Predicted am  37.0      40 0.00087   36.0   4.0   43   65-107   161-204 (337)
 60 COG4219 MecR1 Antirepressor re  35.5      23  0.0005   37.7   1.9   21   64-84    185-205 (337)
 61 cd04273 ZnMc_ADAMTS_like Zinc-  35.1       9  0.0002   36.9  -1.0   19   70-88    141-159 (207)
 62 PHA02456 zinc metallopeptidase  34.8      22 0.00048   32.8   1.5   39   40-82     53-92  (141)
 63 cd00203 ZnMc Zinc-dependent me  34.7      20 0.00044   32.4   1.3   22   67-88     94-115 (167)
 64 PF12725 DUF3810:  Protein of u  34.6      17 0.00038   38.0   0.9   35   64-98    191-227 (318)
 65 cd04268 ZnMc_MMP_like Zinc-dep  33.6      31 0.00068   31.3   2.3   22   67-88     92-113 (165)
 66 PF13203 DUF2201_N:  Putative m  33.2      32 0.00069   34.8   2.5   20   65-84     56-75  (292)
 67 KOG2719 Metalloprotease [Gener  33.1      44 0.00096   36.8   3.7   73    9-83    222-294 (428)
 68 PF10463 Peptidase_U49:  Peptid  32.0      38 0.00082   33.8   2.7   35   69-103   101-135 (206)
 69 PF14891 Peptidase_M91:  Effect  31.0      23 0.00049   33.7   1.0   19   70-88    104-122 (174)
 70 PF09768 Peptidase_M76:  Peptid  28.9      43 0.00093   32.5   2.5   20   62-81     64-83  (173)
 71 PF15639 Tox-MPTase3:  Metallop  28.8      24 0.00052   33.0   0.7   77    4-80     19-111 (135)
 72 cd04279 ZnMc_MMP_like_1 Zinc-d  28.2      41  0.0009   30.7   2.2   22   67-88    102-123 (156)
 73 PF07998 Peptidase_M54:  Peptid  26.7      46   0.001   32.9   2.3   23   66-88    142-164 (194)
 74 PRK13267 archaemetzincin-like   26.0      48   0.001   32.1   2.3   24   65-88    121-144 (179)
 75 KOG3607 Meltrins, fertilins an  25.7      39 0.00084   39.5   1.8   22   66-87    320-341 (716)
 76 PRK09672 phage exclusion prote  24.1      69  0.0015   33.9   3.1   36   69-104   165-200 (305)
 77 PF14247 DUF4344:  Domain of un  22.8      48   0.001   33.3   1.6   21   66-86     89-109 (220)
 78 cd04277 ZnMc_serralysin_like Z  22.4      54  0.0012   30.9   1.8   38   49-87     94-131 (186)
 79 cd04278 ZnMc_MMP Zinc-dependen  22.2      49  0.0011   30.3   1.5   23   65-87    103-125 (157)
 80 KOG1047 Bifunctional leukotrie  21.8      43 0.00093   38.2   1.1   18   68-85    287-304 (613)
 81 PF13583 Reprolysin_4:  Metallo  20.4      56  0.0012   31.9   1.5   19   70-88    138-156 (206)

No 1  
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=100.00  E-value=5e-44  Score=339.95  Aligned_cols=155  Identities=43%  Similarity=0.590  Sum_probs=133.7

Q ss_pred             cccccCcc-hhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh
Q 010667            2 HMLAADPG-IIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM   80 (504)
Q Consensus         2 ~~L~~~a~-i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~   80 (504)
                      ++|.++|. |.|||++|+|+|++|+||+|++      ..+||+|+|+|++|+||||+++.++|+||++|+.|||||||||
T Consensus        20 ~lL~rlA~~v~pIM~~~~~~V~~L~E~~P~~------~~llG~N~N~G~~I~lrLR~~~~~~fl~~~~i~~t~lHELaH~   93 (186)
T PF08325_consen   20 ELLERLAADVKPIMRKHGWRVGSLEEFYPNG------ERLLGLNVNKGEKICLRLRTPDDGGFLPYETILGTMLHELAHN   93 (186)
T ss_pred             HHHHHHHHHHHHHHHHcCcccCeeeccCCCC------CCCcceecCCCcEEEEEeCCCCCCCEeeHHHHHHHHHHHHHhc
Confidence            57888874 5599999999999999999996      4599999999999999999965599999999999999999999


Q ss_pred             hcCccchhHHHHHHHHHHHHHHhccc------cCCCeeecCCCCCCcccccccccCCCCCCccccCCCC-----CCCccH
Q 010667           81 VYSEHDANFYGLDKQLNQEAVALDWT------KSRGHTLSGVRHTSHHEDDLFVGDSRSFSQKLGGNIS-----DQLASA  149 (504)
Q Consensus        81 vhg~Hd~~F~~L~~~L~~E~~~l~~~------~s~G~~LgG~~~~~~~~~~~~~~~~~~~g~rLGG~~~-----~~~~s~  149 (504)
                      +|++||.+||++|++|..||++|+|+      ++.|++|||.........+.......++|++|||++.     .++.++
T Consensus        94 ~~~~H~~~F~~l~~~l~~e~~~l~~~G~~~gf~~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~LgG~s~~~~~~~~~~~~  173 (186)
T PF08325_consen   94 VHGPHDDKFWKLLDELRKECEELDAKGYTEGFWSSGRRLGGSSGQPSEERELRGNGLSGGGQRLGGGSSSRPRKAQPKSP  173 (186)
T ss_pred             ccCCccHHHHHHHHHHHHHHHHHHhcCCccccCCCCcccCCCCcccchhhhhhccccCCCCeeCCCCCCCCCCcCCCcCH
Confidence            99999999999999999999999996      7999999987654321111223334668999999975     467899


Q ss_pred             HHHHHHHHHHHHh
Q 010667          150 RASSVAAAYRRLA  162 (504)
Q Consensus       150 Re~aa~AAerR~~  162 (504)
                      |+++|+||+||++
T Consensus       174 Re~~a~AAerR~~  186 (186)
T PF08325_consen  174 REAAAAAAERRLR  186 (186)
T ss_pred             HHHHHHHHHhhcC
Confidence            9999999999975


No 2  
>KOG4842 consensus Protein involved in sister chromatid separation and/or segregation [Cell cycle control, cell division, chromosome partitioning]
Probab=99.87  E-value=4.7e-23  Score=202.95  Aligned_cols=143  Identities=43%  Similarity=0.522  Sum_probs=130.0

Q ss_pred             CcccccCcchhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh
Q 010667            1 MHMLAADPGIIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM   80 (504)
Q Consensus         1 ~~~L~~~a~i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~   80 (504)
                      |++|+.+++|..+|-.|.|.|+.|+|+.|..++++++.++||+|.|.| +|.+|||+++.++||.|+.|-.|++|||||+
T Consensus       136 L~~Lrad~gii~~mg~hrW~vg~l~el~g~~nt~v~~~~tLg~stnqG-~i~lrlrtdrkkgfR~y~tissTl~heLtr~  214 (278)
T KOG4842|consen  136 LDELRADQGIIEQMGLHRWFVGNLQELGGRANTRVNRYPTLGISTNQG-VIVLRLRTDRKKGFRHYETISSTLRHELTRE  214 (278)
T ss_pred             HHHHhhchhHHHHhcccceechhhhhcccccceeecCccceeeccccc-eEEEecccchhcccccCCCchHHHHhhhhhh
Confidence            689999999999999999999999999999999999999999999999 9999999999999999999999999999999


Q ss_pred             hcCccchhHHHHHHHHHHHHHHhccccCCCeeecCCCCCCcccccccccCCCCCCccccCCCCCCCccHHHHHHHHHH
Q 010667           81 VYSEHDANFYGLDKQLNQEAVALDWTKSRGHTLSGVRHTSHHEDDLFVGDSRSFSQKLGGNISDQLASARASSVAAAY  158 (504)
Q Consensus        81 vhg~Hd~~F~~L~~~L~~E~~~l~~~~s~G~~LgG~~~~~~~~~~~~~~~~~~~g~rLGG~~~~~~~s~Re~aa~AAe  158 (504)
                      +|+.|+..||+|..++.+|..+++|..++|++.+....              --.+.+|+.......+.|+...+|||
T Consensus       215 v~~ehde~fyrLdrql~kek~~ad~~~srg~~~sde~~--------------~vdq~de~~p~D~l~~~rdl~~aaae  278 (278)
T KOG4842|consen  215 VAAEHDERFYRLDRQLGKEKNNADQIISRGISSSDEVV--------------IVDQDDEVLPGDTLIEVRDLTYAAAE  278 (278)
T ss_pred             HhhhhhhHHHHHHHHhCcccchhhhhcCCCceeeccee--------------ecccccCCCcccccchhhhhhhhhcC
Confidence            99999999999999999999999999999987653211              12467777766677788998888885


No 3  
>PF09409 PUB:  PUB domain;  InterPro: IPR018997  The PUB (also known as PUG) domain is found in peptide N-glycanase where it functions as a AAA ATPase binding domain []. This domain is also found on other proteins linked to the ubiquitin-proteasome system. ; PDB: 2CM0_A 2CCQ_A 2D5U_A 2HPL_A 2HPJ_A.
Probab=99.77  E-value=5.7e-19  Score=148.43  Aligned_cols=83  Identities=37%  Similarity=0.527  Sum_probs=67.8

Q ss_pred             ChhHHHHHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccccccccCCcccEEEEc---C
Q 010667          409 SPLESTTVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDVVLDEIGKAETYLVLK---R  485 (504)
Q Consensus       409 ~~e~~~~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~~~de~g~ge~~LVL~---~  485 (504)
                      +.++..+|+++|.+||.||+.||.++|||+||++|++|+++|++++||.+||.++||++..       .+.||+|+   .
T Consensus         2 ~~~~~~~al~~L~~il~NI~~~P~~~kyR~Ir~~N~~f~~~i~~~~g~~~~L~~~GF~~~~-------~~~~~vl~~~~~   74 (87)
T PF09409_consen    2 DPEAFQKALETLEKILSNILSNPNEEKYRRIRLSNKTFQEKILPVPGARELLEALGFREVT-------DEEFLVLPEDSE   74 (87)
T ss_dssp             -HHHHHHHHHHHHHHHHHHHHSTT-CGGGEEETTSHHHHHHTTTSTTHHHHHHHHT-EE----------SSEEE----TT
T ss_pred             CHHHHHHHHHHHHHHHHHHccCCCcccceEeecCcchHHHHhcCChhHHHHHHHCCCEEec-------CCCEEEEecCCc
Confidence            4578899999999999999999999999999999999999999999999999999999986       26899999   3


Q ss_pred             CCHHHHHHHHHHh
Q 010667          486 NDLALLWLAKSSL  498 (504)
Q Consensus       486 ~d~~lL~~Aks~L  498 (504)
                      .++..|+.|++.|
T Consensus        75 ~~~~~l~~~~~~l   87 (87)
T PF09409_consen   75 PDLQHLQKALSEL   87 (87)
T ss_dssp             S-HHHHHHHHH--
T ss_pred             CCHHHHHHHHhcC
Confidence            3899999888765


No 4  
>KOG4842 consensus Protein involved in sister chromatid separation and/or segregation [Cell cycle control, cell division, chromosome partitioning]
Probab=99.73  E-value=2.7e-18  Score=169.58  Aligned_cols=166  Identities=22%  Similarity=0.235  Sum_probs=123.8

Q ss_pred             cccccCcchh-hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeec-CCCCCccchHHHHHHHHHHHhh
Q 010667            2 HMLAADPGII-AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRT-DDLKGFRKYESIKKTLLHELAH   79 (504)
Q Consensus         2 ~~L~~~a~i~-pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~-~d~~~Flp~~~V~~tlLHELaH   79 (504)
                      ..|++++..+ |||++|+|.|.++.++++.+      ..|+|+|||+|.+|.|+||. .+..+|+|++.+++|++|||||
T Consensus        49 ~llk~iahl~~p~mkEh~f~vti~~Dk~irn------q~~sg~nvn~gski~lslr~~~~e~~~lp~e~pmgtylhel~h  122 (278)
T KOG4842|consen   49 NLLKEIAHLVSPLMKEHHFKVTILVDKYIRN------QRLSGMNVNHGSKIMLSLRCSTDEFQFLPMECPMGTYLHELTH  122 (278)
T ss_pred             hhhhhhhhhhhhhhccccceeEEeehhHHHh------hhhhccccCCcceEEEEeeccccccccccccccchhhhhhhhh
Confidence            3578899888 99999999999999999986      78999999999999999997 5677899999999999999999


Q ss_pred             hhcCccchhHHHHHHHHHHHHH------HhccccCCCeeecCCCCCCccccc--cc---ccC--CCCCCccccCCCC---
Q 010667           80 MVYSEHDANFYGLDKQLNQEAV------ALDWTKSRGHTLSGVRHTSHHEDD--LF---VGD--SRSFSQKLGGNIS---  143 (504)
Q Consensus        80 ~vhg~Hd~~F~~L~~~L~~E~~------~l~~~~s~G~~LgG~~~~~~~~~~--~~---~~~--~~~~g~rLGG~~~---  143 (504)
                      ++.+||+++|++.++.|+..--      .+.|+.+-++.|+|+..-.....-  .+   .+.  -+.+..|+-|...   
T Consensus       123 ~lqgPhd~rfl~~L~~Lrad~gii~~mg~hrW~vg~l~el~g~~nt~v~~~~tLg~stnqG~i~lrlrtdrkkgfR~y~t  202 (278)
T KOG4842|consen  123 NLQGPHDKRFLNKLDELRADQGIIEQMGLHRWFVGNLQELGGRANTRVNRYPTLGISTNQGVIVLRLRTDRKKGFRHYET  202 (278)
T ss_pred             hhcCCChHHHHHHHHHHhhchhHHHHhcccceechhhhhcccccceeecCccceeeccccceEEEecccchhcccccCCC
Confidence            9999999999999999987542      233566777888875432100000  00   000  0112222222211   


Q ss_pred             ----CCCccHHHHHHHHHHHHHhhhccccCCCCC
Q 010667          144 ----DQLASARASSVAAAYRRLANASANSLGVSE  173 (504)
Q Consensus       144 ----~~~~s~Re~aa~AAerR~~d~~~~~~~~s~  173 (504)
                          .....+|++++++++++.+...+|..+...
T Consensus       203 issTl~heLtr~v~~ehde~fyrLdrql~kek~~  236 (278)
T KOG4842|consen  203 ISSTLRHELTREVAAEHDERFYRLDRQLGKEKNN  236 (278)
T ss_pred             chHHHHhhhhhhHhhhhhhHHHHHHHHhCcccch
Confidence                124678999999999999999998755443


No 5  
>smart00580 PUG domain in protein kinases, N-glycanases and other nuclear proteins.
Probab=98.97  E-value=6e-10  Score=88.50  Aligned_cols=49  Identities=31%  Similarity=0.632  Sum_probs=46.6

Q ss_pred             HHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccc
Q 010667          415 TVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDV  469 (504)
Q Consensus       415 ~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~  469 (504)
                      .|+++|+|+++||++||+|.|      +|++||++|++++||.++|.++||....
T Consensus         1 ~sv~dLLr~irNi~~hp~e~k------~n~~~~~~l~~~pg~~~~l~~~gFp~l~   49 (58)
T smart00580        1 ESVRDLLRALRNILHHPREEK------GNPAIKERLGDVPGGFELYFTVGFPRLL   49 (58)
T ss_pred             CcHHHHHHHHHHHhhCcchhh------cCHHHHHHhcCCCcHHHHHHHcCCCccc
Confidence            378999999999999999999      9999999999999999999999999875


No 6  
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=98.51  E-value=9.3e-08  Score=90.36  Aligned_cols=62  Identities=23%  Similarity=0.287  Sum_probs=53.0

Q ss_pred             CcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667           38 KCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD  104 (504)
Q Consensus        38 ~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~  104 (504)
                      ++.||-. +.--.|.|=+|-    .++|.+.|.+|++|||||..|.||++.||+++.++-+++.+..
T Consensus       138 ksrWGsc-~~~~~I~ln~~L----~~~P~~~idYVvvHEL~Hl~~~nHs~~Fw~~v~~~~Pd~k~~~  199 (205)
T PF01863_consen  138 KSRWGSC-SSKGNITLNWRL----VMAPPEVIDYVVVHELCHLRHPNHSKRFWALVEKYMPDYKERR  199 (205)
T ss_pred             hhccccC-CCCCcEEeeccc----ccCCccHHHHHHHHHHHHhccCCCCHHHHHHHHHHCcCHHHHH
Confidence            5578988 544567777666    4789999999999999999999999999999999999887765


No 7  
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=97.94  E-value=7.2e-06  Score=80.97  Aligned_cols=62  Identities=27%  Similarity=0.336  Sum_probs=50.7

Q ss_pred             CcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667           38 KCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD  104 (504)
Q Consensus        38 ~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~  104 (504)
                      ++.||=..-.| .|.+-+|--    -.|.+.+.++++|||||..+++|+.+||+++..+-+++.++.
T Consensus       149 k~~WGScs~~~-~i~~~~~l~----~~p~~~i~YVvvHELaHLke~nHs~~Fw~lv~~~~P~~~~~~  210 (223)
T COG1451         149 KRRWGSCSKAG-EIRFNWRLV----MAPEEVIDYVVVHELAHLKEKNHSKRFWRLVEKYMPDYRAAK  210 (223)
T ss_pred             cceeeeecCCC-cEEeehhhh----cCCHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHCCChHHHH
Confidence            45677555566 666666653    378899999999999999999999999999999998887765


No 8  
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=96.80  E-value=0.00097  Score=61.34  Aligned_cols=34  Identities=29%  Similarity=0.475  Sum_probs=30.3

Q ss_pred             cchHHHHHHHHHHHhhhhcC------ccchhHHHHHHHHH
Q 010667           64 RKYESIKKTLLHELAHMVYS------EHDANFYGLDKQLN   97 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg------~Hd~~F~~L~~~L~   97 (504)
                      .|.+.|.+|++|||||....      +|++.||+++.++.
T Consensus        54 ~~~~~l~~~l~HEm~H~~~~~~g~~~~Hg~~f~~~~~~~~   93 (146)
T smart00731       54 NGRDRLRETLLHELCHAALYLFGRGYGHGDEWKRWMRQVN   93 (146)
T ss_pred             ccHHHHHhhHHHHHHHHHHHHhCCCCCcCHHHHHHHHHHc
Confidence            46789999999999999988      99999999987664


No 9  
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=95.60  E-value=0.012  Score=53.70  Aligned_cols=59  Identities=24%  Similarity=0.296  Sum_probs=39.7

Q ss_pred             cccccc-cCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhc--------CccchhHHHHHHHHHH
Q 010667           39 CVLGFN-KNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVY--------SEHDANFYGLDKQLNQ   98 (504)
Q Consensus        39 ~lLGlN-~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vh--------g~Hd~~F~~L~~~L~~   98 (504)
                      ..+|.- .+......|+|-.. .-...+.+.++.||+|||||...        .+|+..|.++..++..
T Consensus        30 ~~~G~~~~~~~~~~~I~ls~~-~~~~~~~~~~~~tL~HEm~H~~~~~~~~~~~~~Hg~~fk~~~~~ig~   97 (157)
T PF10263_consen   30 RTAGRCRYKRRSPCEIRLSPK-LLDRNPEEELIDTLLHEMAHAAAYVFGGGRRRGHGKEFKQWARRIGA   97 (157)
T ss_pred             CceEEEEECCCCceEEEECHH-HHHhhHHHHHHHHHHHHHHHHHhhhccCCCCCCCCHHHHHHHHHHCC
Confidence            445652 23332344444441 11126789999999999999887        6999999998877754


No 10 
>KOG2699 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=94.77  E-value=0.011  Score=63.32  Aligned_cols=64  Identities=20%  Similarity=0.218  Sum_probs=59.5

Q ss_pred             HHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHcCCcccccccccCCcccEEEEcCC
Q 010667          416 VLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLVGFNEDVVLDEIGKAETYLVLKRN  486 (504)
Q Consensus       416 al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~alGF~e~~~~de~g~ge~~LVL~~~  486 (504)
                      +..++.+|..|++++|+...|+.|+..|..|...++.+.+|.+++..+||-...       +.++++++..
T Consensus       310 ~~~~~~~~~~~~ak~pd~~l~q~~f~~~~~~~~~~g~~~~a~~~~~l~~el~~~-------~~e~~~~p~~  373 (407)
T KOG2699|consen  310 AFLTLYTPFKNVAKDPDGELLQGIFLPNELLLARYGFVSEALEFMELPGELLQE-------GSEFLSLPSV  373 (407)
T ss_pred             HHHHhhcccchhccCcchhhhhhhhchhHHHHHHHhccccchhhhhhhhHHhhc-------cccccccccc
Confidence            888999999999999999999999999999999999999999999999999885       3678888764


No 11 
>PRK04351 hypothetical protein; Provisional
Probab=91.95  E-value=0.16  Score=47.66  Aligned_cols=31  Identities=35%  Similarity=0.525  Sum_probs=24.8

Q ss_pred             hHHHHHHHHHHHhhh------h-cCccchhHHHHHHHH
Q 010667           66 YESIKKTLLHELAHM------V-YSEHDANFYGLDKQL   96 (504)
Q Consensus        66 ~~~V~~tlLHELaH~------v-hg~Hd~~F~~L~~~L   96 (504)
                      .+.+..|+.|||||.      . +.+|+..|..++.++
T Consensus        58 ~~~l~~vv~HElcH~~~~~~g~g~~h~g~~fk~~~~~v   95 (149)
T PRK04351         58 LEELIGIIKHELCHYHLHLEGKGYQHRDRDFKELLKQV   95 (149)
T ss_pred             HHHHHhhHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHh
Confidence            678999999999997      2 456889998877554


No 12 
>PRK04860 hypothetical protein; Provisional
Probab=90.36  E-value=0.3  Score=46.50  Aligned_cols=31  Identities=26%  Similarity=0.455  Sum_probs=25.3

Q ss_pred             chHHHHHHHHHHHhhhh-c---C---ccchhHHHHHHH
Q 010667           65 KYESIKKTLLHELAHMV-Y---S---EHDANFYGLDKQ   95 (504)
Q Consensus        65 p~~~V~~tlLHELaH~v-h---g---~Hd~~F~~L~~~   95 (504)
                      +.+.+..|+.|||||.+ |   |   +|+..|..++.+
T Consensus        59 ~~~~l~~~v~HEl~H~~~~~~~g~~~~Hg~ewk~lm~~   96 (160)
T PRK04860         59 QQAFIDEVVPHELAHLLVYQLFGRVAPHGKEWQWMMES   96 (160)
T ss_pred             cHHHHHhHHHHHHHHHHHHHHcCCCCCCCHHHHHHHHH
Confidence            46889999999999975 2   3   999998888765


No 13 
>KOG2699 consensus Predicted ubiquitin regulatory protein [Posttranslational modification, protein turnover, chaperones]
Probab=87.34  E-value=0.28  Score=52.86  Aligned_cols=62  Identities=16%  Similarity=0.122  Sum_probs=51.1

Q ss_pred             HHHHHHHHhcCCh-hHHHHHHHHHHHHHHHHhcCCCccccchhcccChhHHHhhcChhhHHHHHHHc
Q 010667          398 QKAIEMLRAEVSP-LESTTVLQTLCKIIRNVIEHPDETKYKRLRKANPIIQRSVANYKAAMEILFLV  463 (504)
Q Consensus       398 ~~Al~~L~~~n~~-e~~~~al~tL~Kil~NIl~~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L~al  463 (504)
                      -.++..+...+.+ +-...+.++..|+..|+-.||..+|||++ +.|..|+++|   .|+.+.|.++
T Consensus       152 ~ss~~~~~~~~k~~e~se~~~d~~~kqe~~~~~~pE~ek~R~r-l~~el~~~~v---~g~he~le~v  214 (407)
T KOG2699|consen  152 PSSLTKIFVNTKAKELSESGRDIRQKQEQNTERNPEREKERKR-LGNELLKEKV---LGCHEPLEAV  214 (407)
T ss_pred             chhhhhccccccchhhhhhhhhhHHHhhhhhccchhhhhhhhc-cccchhhccc---ccccchHHHH
Confidence            3444455555444 44678899999999999999999999988 9999999999   6888888887


No 14 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=85.53  E-value=0.94  Score=43.13  Aligned_cols=30  Identities=33%  Similarity=0.523  Sum_probs=24.8

Q ss_pred             HHHHHHHHHHHhhhh-------cCccchhHHHHHHHH
Q 010667           67 ESIKKTLLHELAHMV-------YSEHDANFYGLDKQL   96 (504)
Q Consensus        67 ~~V~~tlLHELaH~v-------hg~Hd~~F~~L~~~L   96 (504)
                      +.|..|+.|||||..       +-|||..|..++.++
T Consensus        59 ~f~~~vV~HELaHl~ly~~~gr~~phg~ewk~lm~qV   95 (156)
T COG3091          59 DFIEQVVPHELAHLHLYQEFGRYKPHGKEWKLLMQQV   95 (156)
T ss_pred             HHHHHHHHHHHHHHHHHHHcCCCCCCchhHHHHHHHh
Confidence            468899999999974       349999999888755


No 15 
>KOG3854 consensus SPRT-like metalloprotease [Function unknown]
Probab=80.74  E-value=1.3  Score=48.74  Aligned_cols=41  Identities=27%  Similarity=0.331  Sum_probs=28.7

Q ss_pred             CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhh-------hcCccchhHHH
Q 010667           46 NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHM-------VYSEHDANFYG   91 (504)
Q Consensus        46 N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~-------vhg~Hd~~F~~   91 (504)
                      ++-.+|+|-+---     -.-+.|+.||+|||||-       .+..|++.|..
T Consensus       333 kr~A~IeLs~kV~-----tTAERir~TLiHEmCHaAaWv~dr~e~gHGd~wKr  380 (505)
T KOG3854|consen  333 KRYAKIELSDKVC-----TTAERIRDTLIHEMCHAAAWVFDREELGHGDNWKR  380 (505)
T ss_pred             CceeEEEehhhhh-----hHHHHHHHHHHHHHHHHHHhhccccccCcchHHHH
Confidence            3445677755441     12378999999999994       27899988654


No 16 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=79.80  E-value=1.8  Score=36.55  Aligned_cols=22  Identities=27%  Similarity=0.366  Sum_probs=17.5

Q ss_pred             chHHHHHHHHHHHhhhhcCccc
Q 010667           65 KYESIKKTLLHELAHMVYSEHD   86 (504)
Q Consensus        65 p~~~V~~tlLHELaH~vhg~Hd   86 (504)
                      +...+..++.|||+|.+++...
T Consensus        21 ~~~~~~~~l~HE~~H~~~~~~~   42 (128)
T PF13485_consen   21 DEDWLDRVLAHELAHQWFGNYF   42 (128)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHc
Confidence            3466779999999999987544


No 17 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=76.49  E-value=2  Score=43.02  Aligned_cols=28  Identities=25%  Similarity=0.200  Sum_probs=23.5

Q ss_pred             CccchHHHHHHHHHHHhhhhcCccchhH
Q 010667           62 GFRKYESIKKTLLHELAHMVYSEHDANF   89 (504)
Q Consensus        62 ~Flp~~~V~~tlLHELaH~vhg~Hd~~F   89 (504)
                      ...|+..+..||.|||.|+.-++||..-
T Consensus       160 ~~~~~~~~a~t~AHElGHnlGm~HD~~~  187 (244)
T cd04270         160 KRVPTKESDLVTAHELGHNFGSPHDPDI  187 (244)
T ss_pred             CccchhHHHHHHHHHHHHhcCCCCCCCC
Confidence            3456666789999999999999999864


No 18 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=74.31  E-value=3.1  Score=34.82  Aligned_cols=26  Identities=35%  Similarity=0.468  Sum_probs=19.8

Q ss_pred             cchHHHHHHHHHHHhhhhcCccchhH
Q 010667           64 RKYESIKKTLLHELAHMVYSEHDANF   89 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~Hd~~F   89 (504)
                      ++...-+.|++|||+|.....+...+
T Consensus        37 ~~~~~~~f~laHELgH~~~~~~~~~~   62 (122)
T PF06114_consen   37 LSPERQRFTLAHELGHILLHHGDETF   62 (122)
T ss_dssp             S-HHHHHHHHHHHHHHHHHHH-HHHH
T ss_pred             CCHHHHHHHHHHHHHHHHhhhccccc
Confidence            44566788999999999998887654


No 19 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=69.36  E-value=3.2  Score=40.99  Aligned_cols=53  Identities=25%  Similarity=0.298  Sum_probs=40.4

Q ss_pred             cEEEEEeec-CCC---CCccchHHHHHHHHHHHhhhhc-CccchhHHHHHHHHHHHHHHhc
Q 010667           49 EEISLRLRT-DDL---KGFRKYESIKKTLLHELAHMVY-SEHDANFYGLDKQLNQEAVALD  104 (504)
Q Consensus        49 ~~I~LRLR~-~d~---~~Flp~~~V~~tlLHELaH~vh-g~Hd~~F~~L~~~L~~E~~~l~  104 (504)
                      -+|.+-++. ...   +.+.  .+|.++|.|||+||+. .+++..++.|...|. ++..+.
T Consensus        74 ~~I~~S~~~i~~~~~~~~~~--~Ei~Gvl~HE~~H~~Q~~~~~~~P~~liEGIA-DyVRl~  131 (205)
T PF04450_consen   74 KEIHFSARYIAKYPADGDVR--DEIIGVLYHEMVHCWQWDGRGTAPGGLIEGIA-DYVRLK  131 (205)
T ss_pred             cEEEEeHHHHhhcccccchH--HHHHHHHHHHHHHHhhcCCCCCCChhheecHH-HHHHHH
Confidence            788888888 221   2233  7899999999999996 568899999998886 455554


No 20 
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=68.52  E-value=6.3  Score=36.17  Aligned_cols=73  Identities=22%  Similarity=0.226  Sum_probs=46.2

Q ss_pred             eeeceecccCCCCCCCCCCCccccccc--------CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC------c
Q 010667           19 WRVGIMTELAPVGYVGVSPKCVLGFNK--------NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS------E   84 (504)
Q Consensus        19 w~V~~L~Ef~P~~~~~~s~~~lLGlN~--------N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg------~   84 (504)
                      +.+....=+.|.+++.- -.++||.|.        |.+-.|+|- +..  ..-||-++=+.+|+|||.|+-..      +
T Consensus        26 ~~~rv~vVys~gS~~~A-~ARIwg~pki~~e~lglnP~YviEl~-sek--F~rLs~~ekvKviiHEllHIP~tfSGgLRa  101 (133)
T COG4900          26 FQVRVVVVYSPGSHSKA-VARIWGIPKIFQEVLGLNPVYVIELL-SEK--FKRLSCAEKVKVIIHELLHIPATFSGGLRA  101 (133)
T ss_pred             eeeeEEEEECCCCccee-hhhhhcccHHHHHHhCCCCeeeeeee-hhh--cCCCChHHHHHHHHHHHhcCcccccCcccc
Confidence            34444444455543211 145777663        778888774 221  12377888999999999999753      8


Q ss_pred             cc--hhHHHHHHH
Q 010667           85 HD--ANFYGLDKQ   95 (504)
Q Consensus        85 Hd--~~F~~L~~~   95 (504)
                      |+  .+||..++-
T Consensus       102 Hg~~vn~rrv~kL  114 (133)
T COG4900         102 HGPLVNFRRVYKL  114 (133)
T ss_pred             CCcchhHHHHHHH
Confidence            88  788875543


No 21 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=68.47  E-value=5  Score=40.65  Aligned_cols=23  Identities=35%  Similarity=0.423  Sum_probs=19.4

Q ss_pred             cchHHHHHHHHHHHhhhhcCccc
Q 010667           64 RKYESIKKTLLHELAHMVYSEHD   86 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~Hd   86 (504)
                      +.-+++..+++|||+|..++..=
T Consensus       191 ~~~~el~~il~HEl~Hikr~D~~  213 (299)
T PF05569_consen  191 LSEEELRAILLHELAHIKRRDLL  213 (299)
T ss_pred             cCHHHHHHHHHHHHHHHHCCChH
Confidence            45678999999999999988654


No 22 
>PRK04897 heat shock protein HtpX; Provisional
Probab=68.23  E-value=3.7  Score=42.31  Aligned_cols=66  Identities=11%  Similarity=0.113  Sum_probs=37.7

Q ss_pred             hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCcc
Q 010667           12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEH   85 (504)
Q Consensus        12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~H   85 (504)
                      -+.++.|+.++.+..+. ..   ....-..|++.+++ .|.+-  + ..-.-++-+++..||.|||+|.+|+..
T Consensus        88 ~la~~~gip~p~v~v~~-~~---~~NAfa~G~~~~~~-~v~vt--~-gLl~~l~~~El~aVlAHElgHi~~~d~  153 (298)
T PRK04897         88 DMAMVAQIPMPRVFIID-DP---SPNAFATGSSPKNA-AVAVT--T-GLLAIMNREELEGVIGHEISHIRNYDI  153 (298)
T ss_pred             HHHHHcCCCCCcEEEec-CC---CCceEEeccCCCCc-EEEee--h-HHHhhCCHHHHHHHHHHHHHHHhcCCH
Confidence            55666677666665542 11   01134566655433 23332  1 000123569999999999999998764


No 23 
>PRK03072 heat shock protein HtpX; Provisional
Probab=67.78  E-value=3.8  Score=42.04  Aligned_cols=23  Identities=22%  Similarity=0.315  Sum_probs=19.7

Q ss_pred             ccchHHHHHHHHHHHhhhhcCcc
Q 010667           63 FRKYESIKKTLLHELAHMVYSEH   85 (504)
Q Consensus        63 Flp~~~V~~tlLHELaH~vhg~H   85 (504)
                      -++.+++..||.|||+|..++.-
T Consensus       121 ~l~~~El~aVlAHElgHi~~~d~  143 (288)
T PRK03072        121 ILNERELRGVLGHELSHVYNRDI  143 (288)
T ss_pred             hCCHHHHHHHHHHHHHHHhcCCH
Confidence            34679999999999999999853


No 24 
>KOG3931 consensus Uncharacterized conserved protein [Function unknown]
Probab=67.53  E-value=11  Score=40.67  Aligned_cols=51  Identities=29%  Similarity=0.432  Sum_probs=39.5

Q ss_pred             CCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhh---------cCccchhHHHHHHHHH
Q 010667           46 NHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMV---------YSEHDANFYGLDKQLN   97 (504)
Q Consensus        46 N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~v---------hg~Hd~~F~~L~~~L~   97 (504)
                      .+|--..|||-.+ .-..+|..+++.|||||+-|-.         .+.|.+.|.+.+..|+
T Consensus        83 g~gg~csIRLSeP-LLkLRPRkDLVETLLHEMIHAYlFV~n~~~dr~GHGP~F~~hMhrIN  142 (484)
T KOG3931|consen   83 GKGGMCSIRLSEP-LLKLRPRKDLVETLLHEMIHAYLFVTNNDKDREGHGPEFCKHMHRIN  142 (484)
T ss_pred             cccceEEEEecch-hhccCchHHHHHHHHHHHHHHheeEecccccCCCCChHHHHHHHHHh
Confidence            3455577777664 2358899999999999999953         3589999999887764


No 25 
>PRK03982 heat shock protein HtpX; Provisional
Probab=66.71  E-value=4.2  Score=41.53  Aligned_cols=66  Identities=12%  Similarity=0.252  Sum_probs=36.9

Q ss_pred             hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCcc
Q 010667           12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEH   85 (504)
Q Consensus        12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~H   85 (504)
                      .+.++.|+..+.+.-+ +...   ...=.+|++.++|- |.|   ++..-.-++-++|..||.|||+|..|+..
T Consensus        76 ~la~~~g~~~p~v~v~-~~~~---~NAfa~G~~~~~~~-V~v---t~gLl~~l~~~El~AVlAHElgHi~~~h~  141 (288)
T PRK03982         76 RLAERANIPKPKVAIV-PTQT---PNAFATGRDPKHAV-VAV---TEGILNLLNEDELEGVIAHELTHIKNRDT  141 (288)
T ss_pred             HHHHHcCCCCCeEEEE-eCCC---cceEEeccCCCCeE-EEe---ehHHHhhCCHHHHHHHHHHHHHHHHcCCH
Confidence            5666666655544333 1110   11335666554433 322   11111124678999999999999999854


No 26 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=65.73  E-value=4.1  Score=40.60  Aligned_cols=24  Identities=21%  Similarity=0.314  Sum_probs=20.6

Q ss_pred             ccchHHHHHHHHHHHhhhhcCccc
Q 010667           63 FRKYESIKKTLLHELAHMVYSEHD   86 (504)
Q Consensus        63 Flp~~~V~~tlLHELaH~vhg~Hd   86 (504)
                      -+.-++|..||.||++|.+|+..=
T Consensus       151 ~l~~dEl~aVlaHElgHi~~rd~~  174 (302)
T COG0501         151 LLNDDELEAVLAHELGHIKNRHTL  174 (302)
T ss_pred             hCCHHHHHHHHHHHHHHHhcccHH
Confidence            466789999999999999998544


No 27 
>PRK03001 M48 family peptidase; Provisional
Probab=64.78  E-value=4.8  Score=40.98  Aligned_cols=65  Identities=15%  Similarity=0.187  Sum_probs=36.3

Q ss_pred             hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667           12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~   84 (504)
                      -+.++.|+.++.+.-+.-.    ....-..|.+.+++. |.|-  . ..-.-+.-+++..||+|||+|.+|+.
T Consensus        75 ~la~~~g~~~p~v~v~~~~----~~NAfa~G~~~~~~~-Ivvt--~-gLl~~l~~~El~aVlAHElgHi~~~h  139 (283)
T PRK03001         75 ELAQRAGLPMPKVYLINED----QPNAFATGRNPEHAA-VAAT--T-GILRVLSEREIRGVMAHELAHVKHRD  139 (283)
T ss_pred             HHHHHcCCCCCeEEEecCC----CcceEEecCCCCCeE-EEec--H-HHHhhCCHHHHHHHHHHHHHHHhCCC
Confidence            5566667766655443211    011335676555443 2221  0 00011356899999999999999873


No 28 
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=64.41  E-value=5.9  Score=42.54  Aligned_cols=41  Identities=27%  Similarity=0.248  Sum_probs=28.1

Q ss_pred             ccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667           44 NKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        44 N~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~   84 (504)
                      .-|.|+.+.|-.-+-...+.--.+.+..||+||+-|+++-.
T Consensus       114 ~SNe~e~~YiD~~~~~~~~~~~~~~~~sTlAHEfQHmInfy  154 (366)
T PF10460_consen  114 YSNESEYFYIDSETLYLGGNSGPDTVYSTLAHEFQHMINFY  154 (366)
T ss_pred             CCcceeEEEEecHHhhccCCccHHHHHHHHHHHHHHHHHHH
Confidence            34778888887665211112226889999999999998653


No 29 
>PRK02391 heat shock protein HtpX; Provisional
Probab=62.92  E-value=5.4  Score=41.25  Aligned_cols=21  Identities=24%  Similarity=0.417  Sum_probs=18.7

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      +.-+++..++.|||+|..|+.
T Consensus       128 L~~~El~aVlaHElgHi~~~d  148 (296)
T PRK02391        128 LDPDELEAVLAHELSHVKNRD  148 (296)
T ss_pred             CCHHHHHHHHHHHHHHHHcCC
Confidence            456999999999999999985


No 30 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=62.69  E-value=5.4  Score=37.77  Aligned_cols=44  Identities=16%  Similarity=0.073  Sum_probs=28.4

Q ss_pred             ccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchh
Q 010667           40 VLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        40 lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .+|.+.  +..|.|--.-=   .-++-+++..+|.|||+|..++..-..
T Consensus        65 ~~g~~~--~~~I~v~~~ll---~~~~~~el~aVlaHElgH~~~~h~~~~  108 (226)
T PF01435_consen   65 ATGGGP--RKRIVVTSGLL---ESLSEDELAAVLAHELGHIKHRHILKS  108 (226)
T ss_dssp             EETTTC----EEEEEHHHH---HHSSHHHHHHHHHHHHHHHHTTHCCCC
T ss_pred             EEccCC--CcEEEEeChhh---hcccHHHHHHHHHHHHHHHHcCCcchH
Confidence            344433  44555543321   235668999999999999998876655


No 31 
>PRK02870 heat shock protein HtpX; Provisional
Probab=62.42  E-value=5.4  Score=42.19  Aligned_cols=21  Identities=19%  Similarity=0.430  Sum_probs=18.0

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      ++-+++..||.|||+|..|+.
T Consensus       168 L~~dEL~aVlAHELgHik~~d  188 (336)
T PRK02870        168 LDRDELQAVMAHELSHIRHGD  188 (336)
T ss_pred             CCHHHHHHHHHHHHHHHHccc
Confidence            466899999999999998753


No 32 
>PRK05457 heat shock protein HtpX; Provisional
Probab=62.19  E-value=5.7  Score=40.82  Aligned_cols=21  Identities=19%  Similarity=0.477  Sum_probs=18.8

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      ++-+++..++.|||+|..|+.
T Consensus       129 L~~~El~aVlAHElgHi~~~d  149 (284)
T PRK05457        129 MSRDEVEAVLAHEISHIANGD  149 (284)
T ss_pred             CCHHHHHHHHHHHHHHHHcCC
Confidence            566999999999999999974


No 33 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=60.78  E-value=3.5  Score=41.01  Aligned_cols=18  Identities=28%  Similarity=0.512  Sum_probs=16.6

Q ss_pred             HHHHHHHhhhhcCccchh
Q 010667           71 KTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        71 ~tlLHELaH~vhg~Hd~~   88 (504)
                      .||.|||.||.-++||..
T Consensus       147 ~t~AHElGHnLGm~HD~~  164 (228)
T cd04271         147 QVFAHEIGHTFGAVHDCT  164 (228)
T ss_pred             eehhhhhhhhcCCCCCCC
Confidence            699999999999999974


No 34 
>PRK01345 heat shock protein HtpX; Provisional
Probab=60.27  E-value=6.4  Score=41.12  Aligned_cols=21  Identities=29%  Similarity=0.395  Sum_probs=18.3

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      +.-++|..||.|||+|.+|+.
T Consensus       119 L~~dEL~aVlAHElgHi~~~d  139 (317)
T PRK01345        119 LSPEEVAGVMAHELAHVKNRD  139 (317)
T ss_pred             CCHHHHHHHHHHHHHHHHcCC
Confidence            456899999999999999864


No 35 
>PRK01265 heat shock protein HtpX; Provisional
Probab=58.53  E-value=7.1  Score=41.18  Aligned_cols=65  Identities=17%  Similarity=0.129  Sum_probs=36.7

Q ss_pred             hHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCc
Q 010667           12 AIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        12 pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~   84 (504)
                      -+.++.|+.++.+.-+.-.    ....-..|.+... ..|.|----   -.-+.-+++..|+.|||+|..|+.
T Consensus        91 ~la~~~g~~~p~vyv~~~~----~~NAfa~G~~~~~-~~Ivvt~gL---l~~l~~~El~aVlAHElgHik~~d  155 (324)
T PRK01265         91 EVAKYNGIRVPKVYIADVP----FPNAFAYGSPIAG-KRIAITLPL---LKILNRDEIKAVAGHELGHLKHRD  155 (324)
T ss_pred             HHHHHcCCCCCeEEEecCC----CCCeEEeccCCCC-CEEEEehHH---HhhCCHHHHHHHHHHHHHHHHccc
Confidence            4555666666555443211    0113345655433 445442111   112567999999999999998874


No 36 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=56.92  E-value=10  Score=37.95  Aligned_cols=40  Identities=20%  Similarity=0.189  Sum_probs=29.1

Q ss_pred             ccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccchh
Q 010667           44 NKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        44 N~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      |-+.+-.|.|   +.  ..-.....+..||.|||=||+--.|.+-
T Consensus       113 ~G~P~~~I~I---~~--~~~~~~~~~~hvi~HEiGH~IGfRHTD~  152 (211)
T PF12388_consen  113 NGNPYKFIQI---YG--LSNYSVNVIEHVITHEIGHCIGFRHTDY  152 (211)
T ss_pred             CCCCCceEEE---Ee--cCCCchhHHHHHHHHHhhhhccccccCc
Confidence            4456778888   31  1223556789999999999999999763


No 37 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=54.83  E-value=8.8  Score=36.14  Aligned_cols=25  Identities=24%  Similarity=0.263  Sum_probs=16.5

Q ss_pred             cchHHHHHHHHHHHhhhhcCccchh
Q 010667           64 RKYESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .+...-..|+.|||.|+.-.+||..
T Consensus       137 ~~~~~~~~~~AHEiGH~lGa~HD~~  161 (196)
T PF13688_consen  137 PPTYNGAITFAHEIGHNLGAPHDGD  161 (196)
T ss_dssp             --HHHHHHHHHHHHHHHTT-----S
T ss_pred             CCCCceehhhHHhHHHhcCCCCCCC
Confidence            5577888999999999999999876


No 38 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=53.61  E-value=8.3  Score=37.53  Aligned_cols=20  Identities=40%  Similarity=0.517  Sum_probs=18.2

Q ss_pred             HHHHHHHHHhhhhcCccchh
Q 010667           69 IKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        69 V~~tlLHELaH~vhg~Hd~~   88 (504)
                      ...||.|||+|+.-++||..
T Consensus       145 ~~~~~AHElGH~lG~~HD~~  164 (220)
T cd04272         145 GVYTMTHELAHLLGAPHDGS  164 (220)
T ss_pred             cHHHHHHHHHHHhCCCCCCC
Confidence            36999999999999999976


No 39 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=53.02  E-value=7.4  Score=33.82  Aligned_cols=16  Identities=38%  Similarity=0.580  Sum_probs=11.7

Q ss_pred             HHHHHHHhhhhcCccc
Q 010667           71 KTLLHELAHMVYSEHD   86 (504)
Q Consensus        71 ~tlLHELaH~vhg~Hd   86 (504)
                      .|+.|||-|+.-.+||
T Consensus       109 ~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  109 DTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             THHHHHHHHHTT----
T ss_pred             eEeeehhhHhcCCCCC
Confidence            8999999999999996


No 40 
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=52.51  E-value=8.3  Score=35.27  Aligned_cols=20  Identities=45%  Similarity=0.795  Sum_probs=17.6

Q ss_pred             ccchHHHHHHHHHHHhhhhc
Q 010667           63 FRKYESIKKTLLHELAHMVY   82 (504)
Q Consensus        63 Flp~~~V~~tlLHELaH~vh   82 (504)
                      |-.|+.|+.|++||+-|-..
T Consensus        70 we~y~qvlktllhefrh~mQ   89 (126)
T PF13058_consen   70 WEEYEQVLKTLLHEFRHAMQ   89 (126)
T ss_pred             hHHHHHHHHHHHHHHHHHHH
Confidence            66789999999999999765


No 41 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=50.65  E-value=12  Score=35.50  Aligned_cols=23  Identities=35%  Similarity=0.390  Sum_probs=20.2

Q ss_pred             hHHHHHHHHHHHhhhhcCccchh
Q 010667           66 YESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      +.....|+.|||.|+.-++||..
T Consensus       128 ~~~~a~~~AHElGH~lG~~HD~~  150 (194)
T cd04269         128 LLLFAVTMAHELGHNLGMEHDDG  150 (194)
T ss_pred             hHHHHHHHHHHHHhhcCCCcCCC
Confidence            44567899999999999999987


No 42 
>PRK14015 pepN aminopeptidase N; Provisional
Probab=49.31  E-value=10  Score=44.94  Aligned_cols=20  Identities=30%  Similarity=0.443  Sum_probs=17.0

Q ss_pred             chHHHHHHHHHHHhhhhcCc
Q 010667           65 KYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        65 p~~~V~~tlLHELaH~vhg~   84 (504)
                      -+..+..++.|||+|+++||
T Consensus       292 ~~~~i~~vIaHElaHqWFGN  311 (875)
T PRK14015        292 DYERIESVIAHEYFHNWTGN  311 (875)
T ss_pred             HHHHHHHHHHHHHHHHHHhC
Confidence            34568899999999999995


No 43 
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=48.88  E-value=11  Score=44.77  Aligned_cols=21  Identities=29%  Similarity=0.412  Sum_probs=18.0

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      ..+..|..++.||++|+++|+
T Consensus       278 ~~~~~i~~VIaHElaHqWfGN  298 (863)
T TIGR02414       278 ADYERIESVIAHEYFHNWTGN  298 (863)
T ss_pred             HHHHHHHHHHHHHHHHHHhcc
Confidence            346778999999999999995


No 44 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=48.56  E-value=9  Score=36.10  Aligned_cols=22  Identities=36%  Similarity=0.427  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHhhhhcCccchh
Q 010667           67 ESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        67 ~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .....|+.|||.|+.-..||..
T Consensus       131 ~~~~~~~aHElGH~lG~~HD~~  152 (192)
T cd04267         131 LLTALTMAHELGHNLGAEHDGG  152 (192)
T ss_pred             eeehhhhhhhHHhhcCCcCCCC
Confidence            4556799999999999999976


No 45 
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=48.46  E-value=11  Score=44.50  Aligned_cols=25  Identities=28%  Similarity=0.456  Sum_probs=20.5

Q ss_pred             CccchHHHHHHHHHHHhhhhcCccc
Q 010667           62 GFRKYESIKKTLLHELAHMVYSEHD   86 (504)
Q Consensus        62 ~Flp~~~V~~tlLHELaH~vhg~Hd   86 (504)
                      .--.|+.|-.|.+|||||++.|+==
T Consensus       300 t~~~~~~~~~viaHElaHqWfGnlV  324 (859)
T COG0308         300 TDSDYENVEEVIAHELAHQWFGNLV  324 (859)
T ss_pred             hhHHHHHHHHHHHHHHhhhccccee
Confidence            3445688889999999999999754


No 46 
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=48.32  E-value=11  Score=44.03  Aligned_cols=18  Identities=39%  Similarity=0.617  Sum_probs=15.9

Q ss_pred             hHHHHHHHHHHHhhhhcC
Q 010667           66 YESIKKTLLHELAHMVYS   83 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg   83 (504)
                      ...+..+++|||||.++|
T Consensus       284 ~~~~~~viaHElAHqWFG  301 (831)
T TIGR02412       284 KENRAGVILHEMAHMWFG  301 (831)
T ss_pred             HHHHHHHHHHHHHHHHhC
Confidence            456789999999999999


No 47 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=48.02  E-value=12  Score=35.20  Aligned_cols=20  Identities=35%  Similarity=0.487  Sum_probs=16.4

Q ss_pred             CccchHHHHHHHHHHHhhhh
Q 010667           62 GFRKYESIKKTLLHELAHMV   81 (504)
Q Consensus        62 ~Flp~~~V~~tlLHELaH~v   81 (504)
                      .|.||-.-+.|+.|||+|=|
T Consensus       128 ~f~~~~~~lDVvaHEltHGV  147 (150)
T PF01447_consen  128 IFKPFASSLDVVAHELTHGV  147 (150)
T ss_dssp             SBS-GGG-HHHHHHHHHHHH
T ss_pred             ccccCccccceeeecccccc
Confidence            59999999999999999976


No 48 
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=46.58  E-value=11  Score=35.02  Aligned_cols=31  Identities=26%  Similarity=0.227  Sum_probs=17.6

Q ss_pred             EEEEEeec-CCCCCccchHHHHHHHHHHHhhhhc
Q 010667           50 EISLRLRT-DDLKGFRKYESIKKTLLHELAHMVY   82 (504)
Q Consensus        50 ~I~LRLR~-~d~~~Flp~~~V~~tlLHELaH~vh   82 (504)
                      +|.|.-.- ..+.  --...=..||+|||+|..-
T Consensus        78 ~IyLc~~F~~~p~--~g~~Sk~~TLiHE~SHf~~  109 (148)
T PF14521_consen   78 TIYLCPAFFSAPT--TGKDSKEGTLIHEWSHFTA  109 (148)
T ss_dssp             EEEE-HHHHHS-S--SSTT-HHHHHHHHHHHSCC
T ss_pred             EEEEChhhcCCCC--CCCCchHHHHHHhhhhhhh
Confidence            57776554 2221  1223458999999999543


No 49 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=46.17  E-value=12  Score=35.37  Aligned_cols=20  Identities=35%  Similarity=0.423  Sum_probs=15.6

Q ss_pred             HHHHHHHHHhhhhcCccchh
Q 010667           69 IKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        69 V~~tlLHELaH~vhg~Hd~~   88 (504)
                      -..|+.|||.|..-.+|+..
T Consensus       111 ~~~~~aHElGH~lGa~Hd~~  130 (173)
T PF13574_consen  111 GIDTFAHELGHQLGAPHDFD  130 (173)
T ss_dssp             HHHHHHHHHHHHHT---SSS
T ss_pred             eeeeehhhhHhhcCCCCCCC
Confidence            67789999999999999988


No 50 
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=45.19  E-value=12  Score=42.43  Aligned_cols=16  Identities=31%  Similarity=0.314  Sum_probs=14.2

Q ss_pred             HHHHHHHHHhhhhcCc
Q 010667           69 IKKTLLHELAHMVYSE   84 (504)
Q Consensus        69 V~~tlLHELaH~vhg~   84 (504)
                      ...|++|||||.++|+
T Consensus       279 ~~~viaHElAHqWfGN  294 (601)
T TIGR02411       279 NVDVIAHELAHSWSGN  294 (601)
T ss_pred             hhhhHHHHHHhhccCc
Confidence            4689999999999994


No 51 
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=43.27  E-value=17  Score=37.54  Aligned_cols=19  Identities=32%  Similarity=0.503  Sum_probs=17.0

Q ss_pred             hHHHHHHHHHHHhhhhcCc
Q 010667           66 YESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg~   84 (504)
                      ...+..+++|||||.++|+
T Consensus       292 ~~~~~~~iahElahqWfGn  310 (390)
T PF01433_consen  292 KQEIASLIAHELAHQWFGN  310 (390)
T ss_dssp             HHHHHHHHHHHHHTTTBTT
T ss_pred             hhhhHHHHHHHHHHHHhcc
Confidence            4678999999999999995


No 52 
>KOG1046 consensus Puromycin-sensitive aminopeptidase and related aminopeptidases [Amino acid transport and metabolism; Posttranslational modification, protein turnover, chaperones]
Probab=42.11  E-value=16  Score=43.23  Aligned_cols=21  Identities=24%  Similarity=0.482  Sum_probs=18.4

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      .-+..|..+++|||||.++||
T Consensus       319 ~~k~~va~vIaHElAHQWFGN  339 (882)
T KOG1046|consen  319 SNKQRVAEVIAHELAHQWFGN  339 (882)
T ss_pred             HHHHHHHHHHHHHHHHHHhcC
Confidence            336789999999999999996


No 53 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=40.07  E-value=22  Score=33.87  Aligned_cols=25  Identities=32%  Similarity=0.365  Sum_probs=18.5

Q ss_pred             cchHHHHHHHHHHHhhhhcCccchh
Q 010667           64 RKYESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      ..+-..-.+|.|||+|+.-++||..
T Consensus       126 ~~~~~~a~~~AHelGH~lGm~HD~~  150 (199)
T PF01421_consen  126 RSGLSFAVIIAHELGHNLGMPHDGD  150 (199)
T ss_dssp             SSHHHHHHHHHHHHHHHTT---TTT
T ss_pred             chhHHHHHHHHHHHHHhcCCCCCCC
Confidence            3456677899999999999999988


No 54 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=39.29  E-value=15  Score=31.01  Aligned_cols=33  Identities=27%  Similarity=0.315  Sum_probs=19.8

Q ss_pred             CccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC
Q 010667           47 HGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS   83 (504)
Q Consensus        47 ~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg   83 (504)
                      .|..|.++--..++..    ..=...|.|||+|.+.-
T Consensus        43 ~G~~I~f~~g~~~~~s----~~~~~llaHEl~Hv~Qq   75 (79)
T PF13699_consen   43 VGNDIYFAPGKYNPDS----PEGRALLAHELAHVVQQ   75 (79)
T ss_pred             ECCEEEEcCCCcCCCC----CCcchhHhHHHHHHHhh
Confidence            5888988421111100    11367899999998854


No 55 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=39.08  E-value=21  Score=31.51  Aligned_cols=17  Identities=47%  Similarity=0.596  Sum_probs=14.0

Q ss_pred             hHHHHHHHHHHHhhhhc
Q 010667           66 YESIKKTLLHELAHMVY   82 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vh   82 (504)
                      -+.|+.|++||+||..-
T Consensus        70 ~~~I~~tlvhEiah~fG   86 (97)
T PF06262_consen   70 AELIRDTLVHEIAHHFG   86 (97)
T ss_dssp             HHHHHHHHHHHHHHHTT
T ss_pred             HHHHHHHHHHHHHHHcC
Confidence            35799999999999753


No 56 
>PF08919 F_actin_bind:  F-actin binding;  InterPro: IPR015015 The F-actin binding domain forms a compact bundle of four antiparallel alpha-helices, which are arranged in a left-handed topology. Binding of F-actin to the F-actin binding domain may result in cytoplasmic retention and subcellular distribution of the protein, as well as possible inhibition of protein function []. ; GO: 0004715 non-membrane spanning protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1ZZP_A 2KK1_A.
Probab=38.58  E-value=41  Score=30.49  Aligned_cols=73  Identities=12%  Similarity=0.170  Sum_probs=54.3

Q ss_pred             hhhhhhhhHHHHHHHHHHHhc----CChhHHHHHHHHHHHHHHHHhc--CCCccccchhcccChhHHHhhcChhhHHHHH
Q 010667          387 QDTVTVVCSRLQKAIEMLRAE----VSPLESTTVLQTLCKIIRNVIE--HPDETKYKRLRKANPIIQRSVANYKAAMEIL  460 (504)
Q Consensus       387 ~e~v~~~~e~L~~Al~~L~~~----n~~e~~~~al~tL~Kil~NIl~--~P~e~KyRrIR~sN~~Fq~kV~~~~Ga~E~L  460 (504)
                      ++.|...++-|+.+|..+..+    ..--++...+..|+.+|.+.+.  -|--.||        .|.+.|.+++.-..-|
T Consensus         7 k~~Ile~~~~L~~~l~~l~~~~~~~~~~~qL~D~~~~l~~~C~~yaD~~~~p~~KF--------~FREllsrLE~~~rqL   78 (110)
T PF08919_consen    7 KESILELAELLENALSNLKSSSISMAQWSQLSDKLQQLHSSCSGYADSIIQPHAKF--------AFRELLSRLESQSRQL   78 (110)
T ss_dssp             HHHHHHHHHHHHHHCCCTTTC------HHHHHHHHHHHHHHHHHHGGG-S-CCCHH--------HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHhchhhhhHHHHHHHHHHHHHHHHHHHHHHHHcCcCcchhh--------hHHHHHHHHHHHHHHH
Confidence            455666777788888777666    2335667788889999999887  3456788        5999999999999999


Q ss_pred             HHcCCcc
Q 010667          461 FLVGFNE  467 (504)
Q Consensus       461 ~alGF~e  467 (504)
                      ..+|=.-
T Consensus        79 r~~~s~~   85 (110)
T PF08919_consen   79 RSCGSSN   85 (110)
T ss_dssp             CHSSSSS
T ss_pred             HhccCCC
Confidence            9887544


No 57 
>PF02102 Peptidase_M35:  Deuterolysin metalloprotease (M35) family;  InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=37.76  E-value=17  Score=39.06  Aligned_cols=21  Identities=33%  Similarity=0.477  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHhhhh--cCccchh
Q 010667           68 SIKKTLLHELAHMV--YSEHDAN   88 (504)
Q Consensus        68 ~V~~tlLHELaH~v--hg~Hd~~   88 (504)
                      +--.|+||||+|+.  ++|+-.+
T Consensus       296 Dqatt~LHE~TH~~~V~~pgt~D  318 (359)
T PF02102_consen  296 DQATTTLHEMTHAPAVYSPGTDD  318 (359)
T ss_dssp             -HHHHHHHHHHT-TTTSSS--B-
T ss_pred             CccchhhhhhhccccccCCCccc
Confidence            35689999999995  5554443


No 58 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=37.57  E-value=22  Score=31.80  Aligned_cols=24  Identities=17%  Similarity=0.362  Sum_probs=20.1

Q ss_pred             cchHHHHHHHHHHHhhhhcCccch
Q 010667           64 RKYESIKKTLLHELAHMVYSEHDA   87 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~Hd~   87 (504)
                      .+...+..|++|||-|..-..|+.
T Consensus       100 ~~~~~~~~v~~HEiGHaLGL~H~~  123 (154)
T PF00413_consen  100 DSGNDLQSVAIHEIGHALGLDHSN  123 (154)
T ss_dssp             SSSEEHHHHHHHHHHHHTTBESSS
T ss_pred             hhhhhhhhhhhhccccccCcCcCC
Confidence            345578999999999999999873


No 59 
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=36.98  E-value=40  Score=36.04  Aligned_cols=43  Identities=23%  Similarity=0.358  Sum_probs=34.2

Q ss_pred             chHHHHHHHHHHHhhhh-cCccchhHHHHHHHHHHHHHHhcccc
Q 010667           65 KYESIKKTLLHELAHMV-YSEHDANFYGLDKQLNQEAVALDWTK  107 (504)
Q Consensus        65 p~~~V~~tlLHELaH~v-hg~Hd~~F~~L~~~L~~E~~~l~~~~  107 (504)
                      |-..+..+++|||||-+ +-+-|..|-+-+.....+.-...|..
T Consensus       161 ~~~~LA~LIfHELaHq~~Yv~~dt~FNEsfAtfVe~~G~~~wl~  204 (337)
T PF10023_consen  161 PDGELARLIFHELAHQTLYVKGDTAFNESFATFVEREGARRWLA  204 (337)
T ss_pred             CchHHHHHHHHHHhhceeecCCCchhhHHHHHHHHHHHHHHHHH
Confidence            44578999999999987 66888889888888887776777743


No 60 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=35.45  E-value=23  Score=37.72  Aligned_cols=21  Identities=33%  Similarity=0.510  Sum_probs=18.3

Q ss_pred             cchHHHHHHHHHHHhhhhcCc
Q 010667           64 RKYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vhg~   84 (504)
                      ++-+++.++++|||+|..+|.
T Consensus       185 ~~~ee~~yIilHEl~Hlk~gD  205 (337)
T COG4219         185 LTDEELKYIILHELSHLKRGD  205 (337)
T ss_pred             cCHHhhhhhHhHHHhhhhccc
Confidence            566889999999999999884


No 61 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=35.06  E-value=9  Score=36.93  Aligned_cols=19  Identities=37%  Similarity=0.552  Sum_probs=17.6

Q ss_pred             HHHHHHHHhhhhcCccchh
Q 010667           70 KKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        70 ~~tlLHELaH~vhg~Hd~~   88 (504)
                      ..||.|||.|+.-.+||..
T Consensus       141 a~~~aHElGH~LG~~HD~~  159 (207)
T cd04273         141 AFTIAHELGHVLGMPHDGD  159 (207)
T ss_pred             EEeeeeechhhcCCCCCCC
Confidence            5799999999999999987


No 62 
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=34.75  E-value=22  Score=32.76  Aligned_cols=39  Identities=21%  Similarity=0.233  Sum_probs=22.3

Q ss_pred             ccccccC-CccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhc
Q 010667           40 VLGFNKN-HGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVY   82 (504)
Q Consensus        40 lLGlN~N-~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vh   82 (504)
                      |--+|.| .|+-+-...=.+|.    .-.-.+.||.|||+|.+.
T Consensus        53 F~~L~~~LCG~~~~~i~IDP~~----~~KGC~~TL~HEL~H~WQ   92 (141)
T PHA02456         53 FRALPQDLCGQFVGWIEIDPDY----ANKGCRDTLAHELNHAWQ   92 (141)
T ss_pred             hhhcCcchhhcceeEEEECCcc----cccchHHHHHHHHHHHHh
Confidence            3335555 45544333333221    113478999999999874


No 63 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=34.70  E-value=20  Score=32.43  Aligned_cols=22  Identities=32%  Similarity=0.394  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHhhhhcCccchh
Q 010667           67 ESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        67 ~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .....|++|||.|..-..|+..
T Consensus        94 ~~~~~~~~HElGH~LGl~H~~~  115 (167)
T cd00203          94 KEGAQTIAHELGHALGFYHDHD  115 (167)
T ss_pred             ccchhhHHHHHHHHhCCCccCc
Confidence            4578999999999999999865


No 64 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=34.57  E-value=17  Score=38.01  Aligned_cols=35  Identities=37%  Similarity=0.354  Sum_probs=24.0

Q ss_pred             cchHHHHHHHHHHHhhhhc--CccchhHHHHHHHHHH
Q 010667           64 RKYESIKKTLLHELAHMVY--SEHDANFYGLDKQLNQ   98 (504)
Q Consensus        64 lp~~~V~~tlLHELaH~vh--g~Hd~~F~~L~~~L~~   98 (504)
                      +|.-.+=-|+||||||-.-  ..=.++|++.+.-.+.
T Consensus       191 ~p~~~~P~T~~HElAHq~G~a~E~EANFiayLac~~s  227 (318)
T PF12725_consen  191 LPPYSLPFTICHELAHQLGFASEDEANFIAYLACINS  227 (318)
T ss_pred             CCcccccHHHHHHHHHHhCCCCHHHHHHHHHHHHhcC
Confidence            4555566799999999872  2445778877654443


No 65 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=33.63  E-value=31  Score=31.28  Aligned_cols=22  Identities=27%  Similarity=0.391  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHhhhhcCccchh
Q 010667           67 ESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        67 ~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .....|++|||-|..-..|...
T Consensus        92 ~~~~~~~~HEiGHaLGL~H~~~  113 (165)
T cd04268          92 ARLRNTAEHELGHALGLRHNFA  113 (165)
T ss_pred             HHHHHHHHHHHHHHhcccccCc
Confidence            4689999999999999988765


No 66 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=33.18  E-value=32  Score=34.85  Aligned_cols=20  Identities=30%  Similarity=0.328  Sum_probs=17.1

Q ss_pred             chHHHHHHHHHHHhhhhcCc
Q 010667           65 KYESIKKTLLHELAHMVYSE   84 (504)
Q Consensus        65 p~~~V~~tlLHELaH~vhg~   84 (504)
                      +.+.+..+++||+-||+++.
T Consensus        56 ~~~~~~~~l~HevlH~~~~H   75 (292)
T PF13203_consen   56 SPEERVGLLLHEVLHCLLRH   75 (292)
T ss_pred             CHHHHHHHHHHHHHHHHccc
Confidence            45778899999999999874


No 67 
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=33.09  E-value=44  Score=36.79  Aligned_cols=73  Identities=18%  Similarity=0.136  Sum_probs=48.2

Q ss_pred             chhhHhhhcCeeeceecccCCCCCCCCCCCcccccccCCccEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcC
Q 010667            9 GIIAIMNKHRWRVGIMTELAPVGYVGVSPKCVLGFNKNHGEEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYS   83 (504)
Q Consensus         9 ~i~pIM~~~~w~V~~L~Ef~P~~~~~~s~~~lLGlN~N~G~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg   83 (504)
                      .|.-.++..||-...+-+.--.-..+.|.+-+.|+-.|+.=+|.=-|--.  ..-+.=++|..+++|||.|-.|+
T Consensus       222 ~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~--~~~~~~eel~AVl~HELGHW~~~  294 (428)
T KOG2719|consen  222 KIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE--EEHLNNEELVAVLAHELGHWKLN  294 (428)
T ss_pred             HHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhh--hhccccHHHHHHHHHHhhHHHHh
Confidence            45577788888777776666222234566789999888776665333310  00034589999999999998776


No 68 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=32.04  E-value=38  Score=33.81  Aligned_cols=35  Identities=34%  Similarity=0.329  Sum_probs=22.0

Q ss_pred             HHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHh
Q 010667           69 IKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVAL  103 (504)
Q Consensus        69 V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l  103 (504)
                      +.-.|+||++|.+++.=.......-.+.+.+|+..
T Consensus       101 ~~fil~HE~~Hv~~~h~~~~~~~~~~~eE~~AD~~  135 (206)
T PF10463_consen  101 IAFILLHELAHVVLGHEGDSSPSQSIQEEKEADSY  135 (206)
T ss_pred             HHHHHHHHHHHHHHcCccccccchhHHHHHhhhHH
Confidence            34679999999887743332555555555555543


No 69 
>PF14891 Peptidase_M91:  Effector protein
Probab=31.01  E-value=23  Score=33.67  Aligned_cols=19  Identities=26%  Similarity=0.160  Sum_probs=15.2

Q ss_pred             HHHHHHHHhhhhcCccchh
Q 010667           70 KKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        70 ~~tlLHELaH~vhg~Hd~~   88 (504)
                      .-+|.|||+|..|.-++..
T Consensus       104 ~v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  104 FVVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHHCCCC
Confidence            5689999999988766654


No 70 
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=28.94  E-value=43  Score=32.51  Aligned_cols=20  Identities=35%  Similarity=0.481  Sum_probs=16.2

Q ss_pred             CccchHHHHHHHHHHHhhhh
Q 010667           62 GFRKYESIKKTLLHELAHMV   81 (504)
Q Consensus        62 ~Flp~~~V~~tlLHELaH~v   81 (504)
                      .+.....+..||.|||.|..
T Consensus        64 ~~~~~~~l~~~l~HELIHay   83 (173)
T PF09768_consen   64 RIRSQGHLEDTLTHELIHAY   83 (173)
T ss_pred             CCCCHHHHHHHHHHHHHHHH
Confidence            35567889999999999953


No 71 
>PF15639 Tox-MPTase3:  Metallopeptidase toxin 3
Probab=28.79  E-value=24  Score=33.03  Aligned_cols=77  Identities=18%  Similarity=0.204  Sum_probs=36.8

Q ss_pred             cccCcchhhHhhhcC-eeeceecccCCCCCC---CCC---CCccccccc--CCccEEEEEeec-CCCCCcc------chH
Q 010667            4 LAADPGIIAIMNKHR-WRVGIMTELAPVGYV---GVS---PKCVLGFNK--NHGEEISLRLRT-DDLKGFR------KYE   67 (504)
Q Consensus         4 L~~~a~i~pIM~~~~-w~V~~L~Ef~P~~~~---~~s---~~~lLGlN~--N~G~~I~LRLR~-~d~~~Fl------p~~   67 (504)
                      |...|.|+-+|+|.+ .+-.-|.|=+-+++-   .|.   ..+-.|+-.  -+|++|.|+--. .....=.      -+=
T Consensus        19 ~~~~p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~~~A~G~t~p~~~~~~i~I~~~lV~~~Ek~~~~~r~~~~~   98 (135)
T PF15639_consen   19 VLKVPAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVGGTAYGYTTPDDGSQVIRIDGDLVNMFEKGPGERRAGNVY   98 (135)
T ss_pred             hhcChHHHHHHHHHcccchhhhhhccccCCCCEEEEeecccceeeeeecCCCCCcEEEeeHHHhhhhhcCCcccccceEE
Confidence            355677777777732 223334443333320   010   012233322  367888655433 1110000      112


Q ss_pred             HHHHHHHHHHhhh
Q 010667           68 SIKKTLLHELAHM   80 (504)
Q Consensus        68 ~V~~tlLHELaH~   80 (504)
                      .|-.|+||||.|-
T Consensus        99 ~v~~TlLHEliHw  111 (135)
T PF15639_consen   99 LVGSTLLHELIHW  111 (135)
T ss_pred             EeehHHHHHHHHh
Confidence            4678999999994


No 72 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=28.17  E-value=41  Score=30.74  Aligned_cols=22  Identities=32%  Similarity=0.472  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHhhhhcCccchh
Q 010667           67 ESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        67 ~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      ..+..|++|||-|..-..|...
T Consensus       102 ~~~~~~~~HEiGHaLGL~H~~~  123 (156)
T cd04279         102 ENLQAIALHELGHALGLWHHSD  123 (156)
T ss_pred             hHHHHHHHHHhhhhhcCCCCCC
Confidence            5789999999999998888755


No 73 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=26.71  E-value=46  Score=32.89  Aligned_cols=23  Identities=35%  Similarity=0.488  Sum_probs=16.5

Q ss_pred             hHHHHHHHHHHHhhhhcCccchh
Q 010667           66 YESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      .+.+..+++|||.|+.-..|=..
T Consensus       142 ~~R~~Kea~HElGH~~GL~HC~~  164 (194)
T PF07998_consen  142 LERVCKEAVHELGHLFGLDHCEN  164 (194)
T ss_dssp             HHHHHHHHHHHHHHHTT----SS
T ss_pred             HHHHHHHHHHHHHHHcCCcCCCC
Confidence            57889999999999998888544


No 74 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=26.04  E-value=48  Score=32.14  Aligned_cols=24  Identities=38%  Similarity=0.444  Sum_probs=19.6

Q ss_pred             chHHHHHHHHHHHhhhhcCccchh
Q 010667           65 KYESIKKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        65 p~~~V~~tlLHELaH~vhg~Hd~~   88 (504)
                      -.+.+.++++|||.|+.-..|-..
T Consensus       121 ~~~R~~k~~~HElGH~lGL~HC~~  144 (179)
T PRK13267        121 FEERVRKEVTHELGHTLGLEHCDN  144 (179)
T ss_pred             HHHHHHHHHHHHHHHHcCCccCCC
Confidence            346788889999999999999543


No 75 
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=25.72  E-value=39  Score=39.46  Aligned_cols=22  Identities=32%  Similarity=0.339  Sum_probs=20.2

Q ss_pred             hHHHHHHHHHHHhhhhcCccch
Q 010667           66 YESIKKTLLHELAHMVYSEHDA   87 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg~Hd~   87 (504)
                      +..+..||.|||+|++-+.||.
T Consensus       320 ~~~~a~v~AhelgH~lGm~hD~  341 (716)
T KOG3607|consen  320 LLAFAVVLAHELGHNLGMIHDE  341 (716)
T ss_pred             chhHHHHHHHHHHhhcCccccc
Confidence            5667899999999999999998


No 76 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=24.08  E-value=69  Score=33.85  Aligned_cols=36  Identities=25%  Similarity=0.262  Sum_probs=22.1

Q ss_pred             HHHHHHHHHhhhhcCccchhHHHHHHHHHHHHHHhc
Q 010667           69 IKKTLLHELAHMVYSEHDANFYGLDKQLNQEAVALD  104 (504)
Q Consensus        69 V~~tlLHELaH~vhg~Hd~~F~~L~~~L~~E~~~l~  104 (504)
                      +-.+++||++|++++.=.-.=.+.-.+.+.||+..-
T Consensus       165 ~a~i~~HEiaHv~~~h~~~~~~e~s~~eE~eaDs~A  200 (305)
T PRK09672        165 LAWILLHEIAHVEFQHSSLESNEDSIQEEKEADSYA  200 (305)
T ss_pred             HHHHHHHHHHHHHhccccccCchHHHHHHHHHHHHH
Confidence            346789999999987433233334445555665543


No 77 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=22.78  E-value=48  Score=33.30  Aligned_cols=21  Identities=33%  Similarity=0.479  Sum_probs=16.5

Q ss_pred             hHHHHHHHHHHHhhhhcCccc
Q 010667           66 YESIKKTLLHELAHMVYSEHD   86 (504)
Q Consensus        66 ~~~V~~tlLHELaH~vhg~Hd   86 (504)
                      ...++.||+||+.|....-.+
T Consensus        89 ~~~~~~~l~HE~GHAlI~~~~  109 (220)
T PF14247_consen   89 IGNVLFTLYHELGHALIDDLD  109 (220)
T ss_pred             HHHHHHHHHHHHHHHHHHHhc
Confidence            345789999999998876554


No 78 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=22.42  E-value=54  Score=30.89  Aligned_cols=38  Identities=21%  Similarity=0.158  Sum_probs=24.3

Q ss_pred             cEEEEEeecCCCCCccchHHHHHHHHHHHhhhhcCccch
Q 010667           49 EEISLRLRTDDLKGFRKYESIKKTLLHELAHMVYSEHDA   87 (504)
Q Consensus        49 ~~I~LRLR~~d~~~Flp~~~V~~tlLHELaH~vhg~Hd~   87 (504)
                      -.|.|........ ..+=.....|++|||-|..-..|.-
T Consensus        94 g~i~~~~~~~~~~-~~~g~~~~~t~~HEiGHaLGL~H~~  131 (186)
T cd04277          94 GDIWFNSSYDTNS-DSPGSYGYQTIIHEIGHALGLEHPG  131 (186)
T ss_pred             ceeEEecCccccc-CCCChhhHHHHHHHHHHHhcCCCCC
Confidence            3466655442211 1233557899999999999887764


No 79 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=22.20  E-value=49  Score=30.33  Aligned_cols=23  Identities=17%  Similarity=0.193  Sum_probs=19.6

Q ss_pred             chHHHHHHHHHHHhhhhcCccch
Q 010667           65 KYESIKKTLLHELAHMVYSEHDA   87 (504)
Q Consensus        65 p~~~V~~tlLHELaH~vhg~Hd~   87 (504)
                      +...+..|++|||-|..-..|..
T Consensus       103 ~~~~~~~~~~HEiGHaLGL~H~~  125 (157)
T cd04278         103 GGTDLFSVAAHEIGHALGLGHSS  125 (157)
T ss_pred             ccchHHHHHHHHhccccccCCCC
Confidence            34679999999999999888874


No 80 
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=21.84  E-value=43  Score=38.16  Aligned_cols=18  Identities=28%  Similarity=0.366  Sum_probs=15.3

Q ss_pred             HHHHHHHHHHhhhhcCcc
Q 010667           68 SIKKTLLHELAHMVYSEH   85 (504)
Q Consensus        68 ~V~~tlLHELaH~vhg~H   85 (504)
                      ....+++||+||-+.||=
T Consensus       287 sl~~vIaHEIAHSWtGNl  304 (613)
T KOG1047|consen  287 SLVDVIAHEIAHSWTGNL  304 (613)
T ss_pred             chhhHHHHHhhhhhcccc
Confidence            347899999999999973


No 81 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=20.41  E-value=56  Score=31.88  Aligned_cols=19  Identities=32%  Similarity=0.459  Sum_probs=17.4

Q ss_pred             HHHHHHHHhhhhcCccchh
Q 010667           70 KKTLLHELAHMVYSEHDAN   88 (504)
Q Consensus        70 ~~tlLHELaH~vhg~Hd~~   88 (504)
                      ..|+.|||-|+.-..|+..
T Consensus       138 ~~~~aHEiGH~lGl~H~~~  156 (206)
T PF13583_consen  138 YQTFAHEIGHNLGLRHDFD  156 (206)
T ss_pred             chHHHHHHHHHhcCCCCcc
Confidence            3789999999999999987


Done!