Query         psy2264
Match_columns 148
No_of_seqs    140 out of 1049
Neff          8.3 
Searched_HMMs 46136
Date          Fri Aug 16 20:17:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2264.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2264hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1046|consensus              100.0 4.7E-44   1E-48  312.1  10.3  142    1-146   196-345 (882)
  2 TIGR02412 pepN_strep_liv amino 100.0 1.7E-38 3.8E-43  276.4  12.4  141    1-146   161-308 (831)
  3 TIGR02414 pepN_proteo aminopep 100.0   2E-38 4.3E-43  275.4  12.1  137    6-146   156-304 (863)
  4 PF01433 Peptidase_M1:  Peptida 100.0 3.2E-38   7E-43  253.7  11.2  142    1-146   167-316 (390)
  5 PRK14015 pepN aminopeptidase N 100.0 9.7E-38 2.1E-42  271.6  12.8  140    3-146   166-317 (875)
  6 COG0308 PepN Aminopeptidase N  100.0 6.7E-36 1.5E-40  260.7   9.8  141    1-146   177-328 (859)
  7 TIGR02411 leuko_A4_hydro leuko 100.0 1.5E-34 3.2E-39  244.4  10.0  124   12-145   173-299 (601)
  8 KOG1047|consensus               99.9 7.2E-28 1.6E-32  196.6   5.7  127   11-147   181-313 (613)
  9 KOG1932|consensus               99.3 6.4E-12 1.4E-16  110.0   8.5  129   11-143   227-359 (1180)
 10 PF13485 Peptidase_MA_2:  Pepti  95.3  0.0038 8.3E-08   41.8  -0.3   18  126-143    26-43  (128)
 11 COG3975 Predicted protease wit  94.8   0.093   2E-06   44.3   6.3   81   59-142   182-264 (558)
 12 PRK04860 hypothetical protein;  93.5    0.18 3.9E-06   36.4   4.8   67   63-138     5-76  (160)
 13 PF01863 DUF45:  Protein of unk  90.9    0.63 1.4E-05   34.1   5.2   69   62-140   109-179 (205)
 14 PF06114 DUF955:  Domain of unk  88.7    0.27 5.9E-06   32.3   1.6   30  112-141    28-58  (122)
 15 PF10026 DUF2268:  Predicted Zn  86.3     2.2 4.7E-05   31.5   5.4   67   65-137     3-77  (195)
 16 PF01435 Peptidase_M48:  Peptid  83.1    0.95 2.1E-05   33.4   2.2   16  126-141    90-105 (226)
 17 smart00731 SprT SprT homologue  83.0     1.9 4.2E-05   30.2   3.7   61   70-139     6-73  (146)
 18 PF13574 Reprolysin_2:  Metallo  79.5    0.91   2E-05   32.8   1.0   11  127-137   113-123 (173)
 19 PF12725 DUF3810:  Protein of u  78.9     1.6 3.4E-05   34.9   2.3   10  127-136   198-207 (318)
 20 COG0501 HtpX Zn-dependent prot  73.7     4.2 9.1E-05   31.4   3.4   69   70-143   104-175 (302)
 21 PF01447 Peptidase_M4:  Thermol  72.4     7.3 0.00016   27.7   4.0   22   59-80     68-89  (150)
 22 COG4783 Putative Zn-dependent   71.7     2.6 5.7E-05   35.4   1.8   47   88-135    93-140 (484)
 23 PF13699 DUF4157:  Domain of un  67.4     2.6 5.7E-05   26.6   0.8   63   70-137     6-73  (79)
 24 cd04269 ZnMc_adamalysin_II_lik  67.2      22 0.00048   25.7   5.8   12  126-137   132-143 (194)
 25 COG1451 Predicted metal-depend  66.9     9.7 0.00021   28.9   3.9   66   62-140   120-190 (223)
 26 COG2856 Predicted Zn peptidase  65.8     3.8 8.3E-05   30.9   1.5   53   87-141    32-88  (213)
 27 cd04272 ZnMc_salivary_gland_MP  65.8      13 0.00029   27.6   4.5   12  126-137   146-157 (220)
 28 PF10263 SprT-like:  SprT-like   65.7     3.4 7.4E-05   28.9   1.2   12  127-138    62-73  (157)
 29 PF13688 Reprolysin_5:  Metallo  65.2     3.2   7E-05   30.1   1.0   11  127-137   144-154 (196)
 30 PHA02456 zinc metallopeptidase  64.9     4.4 9.5E-05   27.5   1.5   13  125-137    79-91  (141)
 31 PF08325 WLM:  WLM domain;  Int  64.6     3.6 7.8E-05   30.3   1.2   13  128-140    85-97  (186)
 32 PF10460 Peptidase_M30:  Peptid  64.4      15 0.00033   30.0   4.8   37   46-82     20-65  (366)
 33 PRK03982 heat shock protein Ht  61.9      12 0.00027   29.2   3.8   14  127-140   127-140 (288)
 34 PRK01345 heat shock protein Ht  61.2      13 0.00029   29.6   3.9   15  126-140   125-139 (317)
 35 PRK03001 M48 family peptidase;  60.6      14  0.0003   28.8   3.9   12  126-137   125-136 (283)
 36 cd04279 ZnMc_MMP_like_1 Zinc-d  60.2      49  0.0011   23.0   6.4   36   46-82      3-41  (156)
 37 PRK03072 heat shock protein Ht  56.9      18 0.00039   28.4   3.9   14  127-140   129-142 (288)
 38 cd04273 ZnMc_ADAMTS_like Zinc-  56.3     5.9 0.00013   29.3   1.1   10  127-136   142-151 (207)
 39 PRK02870 heat shock protein Ht  54.1      16 0.00034   29.5   3.3   10  127-136   175-184 (336)
 40 PF13582 Reprolysin_3:  Metallo  53.9     6.7 0.00015   26.1   1.0   12  126-137   108-119 (124)
 41 PF13265 DUF4056:  Protein of u  53.6     5.8 0.00012   30.8   0.7   11  129-140   152-162 (270)
 42 PF04450 BSP:  Peptidase of pla  52.4      27 0.00059   26.1   4.1   13  128-140    99-111 (205)
 43 KOG3658|consensus               51.3     4.1   9E-05   35.7  -0.5   10  128-137   395-404 (764)
 44 PF13402 M60-like:  Peptidase M  49.7      22 0.00049   27.6   3.5   78   58-136   143-230 (307)
 45 PF10023 DUF2265:  Predicted am  49.5       8 0.00017   31.2   0.9   42   57-98     45-96  (337)
 46 COG4324 Predicted aminopeptida  49.1     8.8 0.00019   30.2   1.0   53   59-111    79-145 (376)
 47 PF14891 Peptidase_M91:  Effect  48.8     8.8 0.00019   27.7   1.0   11  127-137   105-115 (174)
 48 PRK04351 hypothetical protein;  48.6     7.8 0.00017   27.5   0.7   10  127-136    63-72  (149)
 49 cd04267 ZnMc_ADAM_like Zinc-de  47.8     8.3 0.00018   27.9   0.7   11  127-137   135-145 (192)
 50 COG3091 SprT Zn-dependent meta  47.7     8.4 0.00018   27.5   0.7   22   66-89      6-27  (156)
 51 PF08219 TOM13:  Outer membrane  46.9      11 0.00025   23.6   1.1   12  126-137    53-64  (77)
 52 PF01421 Reprolysin:  Reprolysi  46.5      42 0.00091   24.4   4.3   10  127-136   133-142 (199)
 53 KOG2661|consensus               46.0      13 0.00028   30.2   1.6   13  126-138   276-288 (424)
 54 cd04270 ZnMc_TACE_like Zinc-de  45.8      10 0.00022   29.0   1.0   11  127-137   169-179 (244)
 55 PF05572 Peptidase_M43:  Pregna  45.1      16 0.00035   25.9   1.8   16  126-142    70-85  (154)
 56 PF06262 DUF1025:  Possibl zinc  44.4       7 0.00015   25.7  -0.1   32  105-136    50-84  (97)
 57 PF14521 Aspzincin_M35:  Lysine  42.6      15 0.00032   25.9   1.3   17  124-140    95-112 (148)
 58 PRK02391 heat shock protein Ht  41.7      14  0.0003   29.2   1.2   15  126-140   134-148 (296)
 59 PRK04897 heat shock protein Ht  41.3      14 0.00031   29.1   1.2   15  126-140   138-152 (298)
 60 PRK05457 heat shock protein Ht  40.3      15 0.00033   28.8   1.2   16  126-141   135-150 (284)
 61 PF06167 Peptidase_M90:  Glucos  39.5     7.7 0.00017   30.0  -0.6   10  128-137   157-166 (253)
 62 cd04271 ZnMc_ADAM_fungal Zinc-  38.2      12 0.00025   28.4   0.2    9  128-136   148-156 (228)
 63 cd04277 ZnMc_serralysin_like Z  36.0      20 0.00042   25.9   1.1   20   63-82     35-54  (186)
 64 PRK01265 heat shock protein Ht  36.0      19 0.00042   28.9   1.2   11  126-136   141-151 (324)
 65 PF13583 Reprolysin_4:  Metallo  35.4      18 0.00038   26.9   0.8   11  127-137   139-149 (206)
 66 cd04268 ZnMc_MMP_like Zinc-dep  35.0      21 0.00046   24.9   1.2   21   62-82     15-35  (165)
 67 PF00413 Peptidase_M10:  Matrix  34.0      21 0.00045   24.6   0.9   22   61-82     21-42  (154)
 68 PF05569 Peptidase_M56:  BlaR1   34.0      25 0.00053   27.4   1.5   11  126-136   197-207 (299)
 69 cd00203 ZnMc Zinc-dependent me  33.0      20 0.00044   24.9   0.8   23   60-82     20-42  (167)
 70 PF02031 Peptidase_M7:  Strepto  32.3      25 0.00054   24.5   1.1   10  127-136    79-88  (132)
 71 PF12315 DUF3633:  Protein of u  32.1      23 0.00051   26.7   1.0   12  127-138    95-106 (212)
 72 KOG3607|consensus               31.3 1.3E+02  0.0028   27.0   5.6   32   47-79    243-274 (716)
 73 TIGR02289 M3_not_pepF oligoend  30.7      28 0.00061   29.8   1.4   31  106-136   318-348 (549)
 74 PF12388 Peptidase_M57:  Dual-a  28.5      28  0.0006   26.3   0.9   11  126-136   134-144 (211)
 75 cd06007 R3H_DEXH_helicase R3H   26.7      44 0.00096   19.8   1.4   12  126-137    29-40  (59)
 76 smart00235 ZnMc Zinc-dependent  26.6      30 0.00065   23.5   0.7   20   63-82     24-43  (140)
 77 COG4900 Predicted metallopepti  26.3      47   0.001   22.6   1.6   10  127-136    82-91  (133)
 78 COG4101 Predicted mannose-6-ph  26.0      22 0.00047   24.6  -0.1   10  133-142    77-86  (142)
 79 PF04228 Zn_peptidase:  Putativ  25.6      34 0.00074   27.1   0.9    9  128-136   173-181 (292)
 80 PF10584 Proteasome_A_N:  Prote  24.6      24 0.00053   16.9  -0.0   18   47-65      5-22  (23)
 81 PF01431 Peptidase_M13:  Peptid  24.3      38 0.00082   24.7   0.9   12  127-138    38-49  (206)
 82 cd04278 ZnMc_MMP Zinc-dependen  23.9      35 0.00077   23.8   0.7   22   61-82     21-42  (157)
 83 PF09471 Peptidase_M64:  IgA Pe  23.7      47   0.001   25.9   1.4   13  127-139   218-230 (264)
 84 cd02640 R3H_NRF R3H domain of   23.6      55  0.0012   19.4   1.4   13  125-137    29-41  (60)
 85 cd04280 ZnMc_astacin_like Zinc  22.9 1.8E+02  0.0039   20.9   4.3   21   62-82     15-35  (180)
 86 cd06459 M3B_Oligoendopeptidase  22.6      48   0.001   26.9   1.3   25  112-136   209-233 (427)
 87 PF14831 DUF4484:  Domain of un  21.1      67  0.0014   23.5   1.7   14  126-139   145-158 (176)
 88 PF01400 Astacin:  Astacin (Pep  20.9 1.3E+02  0.0028   22.0   3.2   19   64-82     21-39  (191)
 89 PF09768 Peptidase_M76:  Peptid  20.8 1.8E+02   0.004   21.1   3.9   12  126-137    72-83  (173)
 90 PF10186 Atg14:  UV radiation r  20.8 1.5E+02  0.0032   22.7   3.7   29   57-87    194-222 (302)
 91 cd04281 ZnMc_BMP1_TLD Zinc-dep  20.7 1.8E+02  0.0039   21.6   3.9   19   64-82     28-46  (200)
 92 PF04036 DUF372:  Domain of unk  20.6      49  0.0011   17.9   0.7   13  132-144    13-25  (38)
 93 PRK15410 DgsA anti-repressor M  20.1      45 0.00098   26.0   0.6    9  128-136   145-153 (260)
 94 cd06160 S2P-M50_like_2 Unchara  20.0      53  0.0012   24.0   1.0   11  128-138    44-54  (183)

No 1  
>KOG1046|consensus
Probab=100.00  E-value=4.7e-44  Score=312.11  Aligned_cols=142  Identities=37%  Similarity=0.637  Sum_probs=129.7

Q ss_pred             CCCcceeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeecCC---ceEEEecCccccccHHHHHHHHHHHHHHHHHH
Q psy2264           1 MPVSTSVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKDG---KFRVWGREDVVKTQGQYIFDKGPKILAALSDY   77 (148)
Q Consensus         1 MP~~~~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~~---~i~v~~~~~~~~~~~~~~l~~~~~~l~~~e~~   77 (148)
                      ||+.++..+ ++|+++++|++||+|||||+||+||+|+..+....   ++|+|++|+. ..+++++++.++++|++++++
T Consensus       196 m~v~~~~~~-~~~~~~~~F~~Tp~MstYLvAf~V~~f~~~e~~~~~~v~vrv~a~p~~-~~~~~~al~~~~~~L~~~e~~  273 (882)
T KOG1046|consen  196 MPVIKEEPV-DDGWKTTTFEKTPKMSTYLVAFAVGDFVYVETITKSGVPVRVYARPEK-INQGQFALEVATKVLEFYEDY  273 (882)
T ss_pred             Ccccccccc-cCCeeEEEEEecCCCchhhheeeeeccccceeecCCCceEEEEeChHH-hhHHHHHHHHHHHHHHHHHHH
Confidence            888888888 77799999999999999999999999999887653   8999999999 999999999999999999999


Q ss_pred             hCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccccc-hhhh----hHHHHhhhhhhcCCeeeecc
Q psy2264          78 MGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHLY-LLQH----TSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        78 ~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~~-~~~~----~~iaHElaHqWfGn~Vt~~~  146 (148)
                      ||+||  |++|+|+|++|+|..|||||||||+|+|..+++++..+ ..++    .+||||+|||||||+|||+-
T Consensus       274 f~i~y--PLpK~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLVTm~w  345 (882)
T KOG1046|consen  274 FGIPY--PLPKLDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLVTMKW  345 (882)
T ss_pred             hCCCC--CCccccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcccHhh
Confidence            99999  99999999999999999999999999999999987543 2222    25999999999999999764


No 2  
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=100.00  E-value=1.7e-38  Score=276.43  Aligned_cols=141  Identities=22%  Similarity=0.376  Sum_probs=116.3

Q ss_pred             CCCcceeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeecC-C-ceEEEecCcccccc--HHHHHHHHHHHHHHHHH
Q psy2264           1 MPVSTSVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKD-G-KFRVWGREDVVKTQ--GQYIFDKGPKILAALSD   76 (148)
Q Consensus         1 MP~~~~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~-~-~i~v~~~~~~~~~~--~~~~l~~~~~~l~~~e~   76 (148)
                      ||+.+.+.  .++.++++|++||||||||+||+||+|+..+... + ++++|++++. .+.  ++++++.++++|++|++
T Consensus       161 g~~~~~~~--~~~~~~~~F~~t~pmstYL~a~~vG~f~~~~~~~~gvpi~v~~~~~~-~~~~~~~~al~~~~~~l~~~e~  237 (831)
T TIGR02412       161 SRETDVTP--EPADRRWEFPETPKLSTYLTAVAAGPYHSVQDESRSYPLGIYARRSL-AQYLDADAIFTITRQGLAFFHR  237 (831)
T ss_pred             Cccccccc--cCCCeEEEecCCCCcccceEEEEEeceEEEeecCCCEEEEEEECcch-hhhhhHHHHHHHHHHHHHHHHH
Confidence            45444332  3567789999999999999999999999988643 3 8999999987 554  57899999999999999


Q ss_pred             HhCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccccchh---hhhHHHHhhhhhhcCCeeeecc
Q psy2264          77 YMGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHLYLL---QHTSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        77 ~~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~~~~---~~~~iaHElaHqWfGn~Vt~~~  146 (148)
                      +||+||  |++|+|+|++|+|..|||||||+|+|+|..++.+......   ...+|+||+|||||||+||++.
T Consensus       238 ~fg~pY--P~~k~d~V~vP~f~~GaMEn~Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlVT~~w  308 (831)
T TIGR02412       238 KFGYPY--PFKKYDQIFVPEFNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLVTMRW  308 (831)
T ss_pred             HhCCCC--CcccCCEEEcCCCCCCcccccceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEecccc
Confidence            999999  9999999999999999999999999999965443222111   1246999999999999999763


No 3  
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=100.00  E-value=2e-38  Score=275.43  Aligned_cols=137  Identities=23%  Similarity=0.341  Sum_probs=120.5

Q ss_pred             eeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeec----CC---ceEEEecCccccccHHHHHHHHHHHHHHHHHHh
Q psy2264           6 SVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDK----DG---KFRVWGREDVVKTQGQYIFDKGPKILAALSDYM   78 (148)
Q Consensus         6 ~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~----~~---~i~v~~~~~~~~~~~~~~l~~~~~~l~~~e~~~   78 (148)
                      +... .+|+++++|+.++|||+||+||+||+|+++++.    .|   ++++|++++. .+++.++++.++++++++|++|
T Consensus       156 ~~~~-~~g~~~~~f~~t~pmptYLfA~vaGdf~~~~~~~~t~sg~~v~l~iy~~p~~-~~~~~~al~~~~~~L~~~E~~f  233 (863)
T TIGR02414       156 SGEL-PDGRHWAEWEDPFPKPSYLFALVAGDLDVLEDTFTTKSGREVALRVYVEEGN-KDKCDHAMESLKKAMKWDEEVF  233 (863)
T ss_pred             ceec-CCCeEEEEEeCCCCcChhHheEEEeCCEEEEEEeeccCCCceEEEEEEccCc-HHHHHHHHHHHHHHHHHHHHHh
Confidence            3344 688999999999999999999999999998753    22   7899999998 8889999999999999999999


Q ss_pred             CCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccccc-hhh----hhHHHHhhhhhhcCCeeeecc
Q psy2264          79 GFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHLY-LLQ----HTSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        79 g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~~-~~~----~~~iaHElaHqWfGn~Vt~~~  146 (148)
                      |.||  |++|+|+|++|+|..||||||||++|+++.++.++... ...    ..+||||+|||||||+|||+.
T Consensus       234 G~pY--Pl~k~diVavpdf~~GaMEN~GLi~f~e~~lL~~~~~~td~~~~~i~~VIaHElaHqWfGNlVT~~~  304 (863)
T TIGR02414       234 GLEY--DLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRD  304 (863)
T ss_pred             CCCC--ChhhccEEecCCCCCccccccceeccccceEEeCCCCCCHHHHHHHHHHHHHHHHHHHhcceeeecc
Confidence            9999  99999999999999999999999999999998876542 211    247999999999999999753


No 4  
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=100.00  E-value=3.2e-38  Score=253.69  Aligned_cols=142  Identities=34%  Similarity=0.597  Sum_probs=117.6

Q ss_pred             CCCcceeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeecCC---ceEEEecCccccccHHHHHHHHHHHHHHHHHH
Q psy2264           1 MPVSTSVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKDG---KFRVWGREDVVKTQGQYIFDKGPKILAALSDY   77 (148)
Q Consensus         1 MP~~~~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~~---~i~v~~~~~~~~~~~~~~l~~~~~~l~~~e~~   77 (148)
                      ||+.++... ++|+++++|+.|+|||+|++||+||+|+..+....   ++++|++++. .++++.+++.+++++++|+++
T Consensus       167 g~~~~~~~~-~~~~~~~~f~~t~p~~~yl~a~~vg~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~  244 (390)
T PF01433_consen  167 GPLEEEESN-DDGWKTTTFETTPPMPTYLFAFAVGDFESVEVTTKSGVPVRVYARPGD-EEQLQFALDIAPKALEYYEEY  244 (390)
T ss_dssp             SEEEEEEEE-TTTEEEEEEEEEEEEEGGG--EEEESEEEEEEETTTEEEEEEEEECTC-GGGHHHHHHHHHHHHHHHHHH
T ss_pred             ccccccccc-cccceeEeeecccccCchhhhhhcCcccccccccccccchheeehhhh-HHHHHHHHHhhHHHHHHHHhh
Confidence            455566666 68999999999999999999999999999876553   7999999999 899999999999999999999


Q ss_pred             hCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccccchh-h----hhHHHHhhhhhhcCCeeeecc
Q psy2264          78 MGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHLYLL-Q----HTSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        78 ~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~~~~-~----~~~iaHElaHqWfGn~Vt~~~  146 (148)
                      ||.||  |++|+|+|++|+|..+||||||+|+|+++.++++++.... +    ..+||||+|||||||+||++.
T Consensus       245 ~g~~y--p~~k~~~v~~p~~~~~~me~~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~vt~~~  316 (390)
T PF01433_consen  245 FGIPY--PFKKLDIVAVPDFPFGGMENWGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNLVTPKW  316 (390)
T ss_dssp             HTS----SSSEEEEEEEST-SSSEE--TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTTEEESS
T ss_pred             ccccc--eecceeEEEEeccccccccccccccccccccccCcccccchhhhhhHHHHHHHHHHHHhccCCcccc
Confidence            99999  9999999999999999999999999999999987654322 1    246999999999999999763


No 5  
>PRK14015 pepN aminopeptidase N; Provisional
Probab=100.00  E-value=9.7e-38  Score=271.62  Aligned_cols=140  Identities=21%  Similarity=0.308  Sum_probs=121.3

Q ss_pred             CcceeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeec----CC---ceEEEecCccccccHHHHHHHHHHHHHHHH
Q psy2264           3 VSTSVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDK----DG---KFRVWGREDVVKTQGQYIFDKGPKILAALS   75 (148)
Q Consensus         3 ~~~~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~----~~---~i~v~~~~~~~~~~~~~~l~~~~~~l~~~e   75 (148)
                      ..++... .+|+++++|+.++|||+||+||+||+|+.+++.    .|   ++++|++|+. .+++.++++.++++|+++|
T Consensus       166 l~~~~~~-~~g~~~~~w~~~~PmpsYL~Al~aGdf~~~~d~~~~~~g~~vpl~iy~~p~~-~~~~~~al~~~~~~L~~~E  243 (875)
T PRK14015        166 LVESGEL-PDGRHWATWEDPFPKPSYLFALVAGDLDVLEDTFTTRSGREVALEIYVEPGN-LDKCDHAMDSLKKSMKWDE  243 (875)
T ss_pred             cccceec-cCCeEEEEEEeCCCcccceEEEEEeCCEEEEEEeeccCCCeEEEEEEEeCCc-HHHHHHHHHHHHHHHHHHH
Confidence            3333334 578999999999999999999999999998753    22   7899999998 8889999999999999999


Q ss_pred             HHhCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccccch-h----hhhHHHHhhhhhhcCCeeeecc
Q psy2264          76 DYMGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHLYL-L----QHTSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        76 ~~~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~~~-~----~~~~iaHElaHqWfGn~Vt~~~  146 (148)
                      ++||.||  |++|||+|++|+|..||||||||++|+++.++.++.... .    ...+||||+|||||||+|||+.
T Consensus       244 ~~FG~pY--P~~k~diVavp~f~~GaMEN~Gl~~f~~~~lL~~~~~~t~~~~~~i~~vIaHElaHqWFGNlVT~~~  317 (875)
T PRK14015        244 ERFGLEY--DLDIFMIVAVDDFNMGAMENKGLNIFNSKYVLADPETATDADYERIESVIAHEYFHNWTGNRVTCRD  317 (875)
T ss_pred             HHhCCCC--ChhhhCEEeCCCCCCcccccccccccccceEecCcccCCHHHHHHHHHHHHHHHHHHHHhCcceecc
Confidence            9999999  999999999999999999999999999999988765421 1    2357999999999999999753


No 6  
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=100.00  E-value=6.7e-36  Score=260.73  Aligned_cols=141  Identities=27%  Similarity=0.520  Sum_probs=122.8

Q ss_pred             CCCcceeeeeCCCEEEEEEecCCCCCceEEEEEEecceeeeecC-----C-ceEEEecCccccccHHHHHHHHHHHHHHH
Q psy2264           1 MPVSTSVEIANSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKD-----G-KFRVWGREDVVKTQGQYIFDKGPKILAAL   74 (148)
Q Consensus         1 MP~~~~~~~~~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~-----~-~i~v~~~~~~~~~~~~~~l~~~~~~l~~~   74 (148)
                      ||....... .+|+++++|+.++||||||+|+++|+|+++++..     + ++++|++++. ..+++++++.++++++++
T Consensus       177 ~~~~~~~~~-~~g~~~~~f~~~~~mptYL~al~~G~~~~~~~~~~~~~~~v~l~iy~~~g~-~~~a~~~~~~~~~~~~~~  254 (859)
T COG0308         177 GNLIDGGTL-VDGRKIVKFEDTPPMPTYLFALVAGDLEVFRDKFDTRSRDVPLEIYVPPGV-LDRAKYALDETKRSIEFY  254 (859)
T ss_pred             CCccccccc-cCCcEEEEEcCCCCcchHhhheeeecceeeeeeeccCCCCeeEEEEecCcc-hhhhhhhHHHHHHHhhhH
Confidence            344444444 6789999999999999999999999999888765     3 8899999977 889999999999999999


Q ss_pred             HHHhCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhccccccccc-cchh----hhhHHHHhhhhhhcCCeeeecc
Q psy2264          75 SDYMGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATH-LYLL----QHTSMIRMTNIPWFPAWISWKY  146 (148)
Q Consensus        75 e~~~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~-~~~~----~~~~iaHElaHqWfGn~Vt~~~  146 (148)
                      |++||.||  |++| ++|++|+|+.|||||||+++|++..++.++. ....    ..++|+||+|||||||+|||+.
T Consensus       255 e~~fg~~y--~l~~-~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlVT~~~  328 (859)
T COG0308         255 EEYFGLPY--ALPI-DIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTMKW  328 (859)
T ss_pred             HHhcCCCC--CCcc-cEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhcccceeeccC
Confidence            99999999  9999 9999999999999999999999999887743 3222    2247999999999999999875


No 7  
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=100.00  E-value=1.5e-34  Score=244.35  Aligned_cols=124  Identities=21%  Similarity=0.335  Sum_probs=105.9

Q ss_pred             CCEEEEEEecCCCCCceEEEEEEecceeeeecCC-ceEEEecCccccccHHHHHH-HHHHHHHHHHHHhCCcCCCCCCcc
Q psy2264          12 SDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKDG-KFRVWGREDVVKTQGQYIFD-KGPKILAALSDYMGFDYYKMLPKM   89 (148)
Q Consensus        12 ~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~~-~i~v~~~~~~~~~~~~~~l~-~~~~~l~~~e~~~g~~y~~p~~k~   89 (148)
                      ++..+++|++++|||+||+||+||+|+...  .| ++++|++|+. .+++.+.+. .++++++++|+++| ||  |++||
T Consensus       173 ~~~~~~~F~~t~pmptYLia~avG~~~~~~--~g~~~~v~~~p~~-~~~~~~~~~~~~~~~l~~~e~~~~-pY--p~~k~  246 (601)
T TIGR02411       173 NDPGKYLFKQKVPIPAYLIALASGDLASAP--IGPRSSVYSEPEQ-LEKCQYEFEHDTENFIKTAEDLIF-PY--EWGQY  246 (601)
T ss_pred             CCCceEEEEeCCCcchhhheeeeccceecc--cCCceEEEccchh-HHHHHHHHHHhHHHHHHHHHHhCC-CC--cCccc
Confidence            345578999999999999999999998764  24 7899999998 888888888 99999999999866 99  99999


Q ss_pred             cEEEe-CCCCCCchhhhhhhhhhccccccccccchhhhhHHHHhhhhhhcCCeeeec
Q psy2264          90 DLVAV-PDFDAGAMENWGMNTYRLGLVVEATHLYLLQHTSMIRMTNIPWFPAWISWK  145 (148)
Q Consensus        90 d~v~v-p~~~~~~MEn~Gli~~~e~~ll~~~~~~~~~~~~iaHElaHqWfGn~Vt~~  145 (148)
                      |+|++ |+|++||||||| ++|.++.++..+.+   ...+||||||||||||+||++
T Consensus       247 d~vvlpp~f~~GgMEN~~-ltf~~~~ll~~d~s---~~~viaHElAHqWfGNlVT~~  299 (601)
T TIGR02411       247 DLLVLPPSFPYGGMENPN-LTFATPTLIAGDRS---NVDVIAHELAHSWSGNLVTNC  299 (601)
T ss_pred             eEEEecCccccccccccc-ceeeccccccCChh---hhhhHHHHHHhhccCceeecC
Confidence            99987 789999999999 56777777755332   235799999999999999954


No 8  
>KOG1047|consensus
Probab=99.94  E-value=7.2e-28  Score=196.60  Aligned_cols=127  Identities=24%  Similarity=0.427  Sum_probs=108.4

Q ss_pred             CCCEEEEEEecCCCCCceEEEEEEecceeeeecCC-ceEEEecCccccccHHHHHH-HHHHHHHHHHHHhCCcCCCCCCc
Q psy2264          11 NSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKDG-KFRVWGREDVVKTQGQYIFD-KGPKILAALSDYMGFDYYKMLPK   88 (148)
Q Consensus        11 ~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~~-~i~v~~~~~~~~~~~~~~l~-~~~~~l~~~e~~~g~~y~~p~~k   88 (148)
                      ++|....+|++..|+|+||+||++|++.+.+  -| +.+||+.|.. .+.+.+-+. .++++|+.-|+.+| ||  ++++
T Consensus       181 ~~~~~~f~f~q~~pIP~YLiai~~G~L~s~e--IgpRs~VwaEp~~-~~a~~~ef~~~~e~~L~~Ae~l~G-pY--~Wgr  254 (613)
T KOG1047|consen  181 SNGRAIFRFKQEVPIPSYLIAIAVGDLESRE--IGPRSRVWAEPCL-LDACQEEFAGETEDFLKAAEKLFG-PY--VWGR  254 (613)
T ss_pred             CCCcceEEEEeccCchhhhHHHhhccccccc--cCCccceecchhh-hHHHHHHHHhhhHHHHHHHHHHcC-Cc--cccc
Confidence            5667889999999999999999999998665  35 6899999999 888888887 99999999999999 99  9999


Q ss_pred             ccEEEe-CCCCCCchhhhhhhhhhccccccccccchhhhhHHHHhhhhhhcCCeee---eccc
Q psy2264          89 MDLVAV-PDFDAGAMENWGMNTYRLGLVVEATHLYLLQHTSMIRMTNIPWFPAWIS---WKYY  147 (148)
Q Consensus        89 ~d~v~v-p~~~~~~MEn~Gli~~~e~~ll~~~~~~~~~~~~iaHElaHqWfGn~Vt---~~~~  147 (148)
                      ||+|++ |+|++|||||+ +++|-...|+-.+.+   ...+|||||||.||||+||   |+||
T Consensus       255 yDllvlPpSFP~gGMENP-cltF~TpTllaGDrs---l~~vIaHEIAHSWtGNlVTN~sWehf  313 (613)
T KOG1047|consen  255 YDLLVLPPSFPFGGMENP-CLTFVTPTLLAGDRS---LVDVIAHEIAHSWTGNLVTNASWEHF  313 (613)
T ss_pred             ceEEEecCCCCcccccCc-ceeeecchhhcCCcc---hhhHHHHHhhhhhcccccccCccchh
Confidence            999999 57999999999 555555555544332   2346999999999999998   8876


No 9  
>KOG1932|consensus
Probab=99.31  E-value=6.4e-12  Score=110.01  Aligned_cols=129  Identities=13%  Similarity=0.166  Sum_probs=106.9

Q ss_pred             CCCEEEEEEecCCCCCceEEEEEEecceeeeecCC-ceEEEecCccccccHHHHHHHHHHHHHHHHHHhCCcCCCCCCcc
Q psy2264          11 NSDKVTDTFQQSPPMPTYLVGFAVFEFNSLSDKDG-KFRVWGREDVVKTQGQYIFDKGPKILAALSDYMGFDYYKMLPKM   89 (148)
Q Consensus        11 ~~~~~~~~F~~tppmstYl~a~~vg~f~~~~~~~~-~i~v~~~~~~~~~~~~~~l~~~~~~l~~~e~~~g~~y~~p~~k~   89 (148)
                      +-+.++.+|+.|-|+++.-+|||||.|+....... ++..+|.|+. ....+...-...++++|+|+.++..|  ||+-|
T Consensus       227 D~~Kkt~~ys~tvPvA~~~I~~AiG~F~~~~~P~~~~i~~f~LP~~-~~~v~nt~~~l~k~iefye~~ls~rY--PF~~~  303 (1180)
T KOG1932|consen  227 DLRKKTYHYSLTVPVAPSNIGFAIGPFKSYVEPSMIDITHFCLPGL-EPLVKNTTVYLHKAIEFYEEELSSRY--PFSCY  303 (1180)
T ss_pred             cccccEEEEEEeccCCccccceeeccccccCCCccCcceeEecCcc-hHHhhhHHHHHHHHHHHHHHHhccCC--Cccee
Confidence            44678899999999999999999999998854444 8999999998 88888888899999999999999889  99999


Q ss_pred             cEEEeCCCCCCchhhhhhhhhhcccccccccc-c--hhhhhHHHHhhhhhhcCCeee
Q psy2264          90 DLVAVPDFDAGAMENWGMNTYRLGLVVEATHL-Y--LLQHTSMIRMTNIPWFPAWIS  143 (148)
Q Consensus        90 d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~-~--~~~~~~iaHElaHqWfGn~Vt  143 (148)
                      .+|+||.-..--+...+|.++..+ +|++... +  ...+..+|-.||-||||-++|
T Consensus       304 k~VFvd~~~~~i~~~asl~I~st~-lLy~~~iIDq~~~tr~~La~aLA~Q~fg~yIs  359 (1180)
T KOG1932|consen  304 KTVFVDEAAVEISSYASLSIFSTS-LLYSKNIIDQTFLTRRKLAWALASQWFGVYIS  359 (1180)
T ss_pred             eEEEecCCcceeeecceeeeeecc-ccchHhhhhHHHHHHHHHHHHHHHhhhEEEee
Confidence            999999876666667777777755 5554432 2  223456999999999999998


No 10 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=95.33  E-value=0.0038  Score=41.77  Aligned_cols=18  Identities=11%  Similarity=0.383  Sum_probs=15.6

Q ss_pred             hhHHHHhhhhhhcCCeee
Q psy2264         126 HTSMIRMTNIPWFPAWIS  143 (148)
Q Consensus       126 ~~~iaHElaHqWfGn~Vt  143 (148)
                      ..+++||++|+|+++.+.
T Consensus        26 ~~~l~HE~~H~~~~~~~~   43 (128)
T PF13485_consen   26 DRVLAHELAHQWFGNYFG   43 (128)
T ss_pred             HHHHHHHHHHHHHHHHcC
Confidence            467999999999998874


No 11 
>COG3975 Predicted protease with the C-terminal PDZ domain [General function prediction only]
Probab=94.84  E-value=0.093  Score=44.31  Aligned_cols=81  Identities=12%  Similarity=-0.010  Sum_probs=52.6

Q ss_pred             cHHHHHHHHHHHHHHHHHHhCCcCCCCCCcccEEEe-CCCCCCchhhhhhhhhhccccccccccc-hhhhhHHHHhhhhh
Q psy2264          59 QGQYIFDKGPKILAALSDYMGFDYYKMLPKMDLVAV-PDFDAGAMENWGMNTYRLGLVVEATHLY-LLQHTSMIRMTNIP  136 (148)
Q Consensus        59 ~~~~~l~~~~~~l~~~e~~~g~~y~~p~~k~d~v~v-p~~~~~~MEn~Gli~~~e~~ll~~~~~~-~~~~~~iaHElaHq  136 (148)
                      +.+...+.+++++..=.+.|| +-  |+.||.+++- .+...||||+---.....+.....+... ..-...++||..|-
T Consensus       182 d~~~~~~~~k~ii~~~~~vFg-~~--~~~~Y~Fl~~~s~q~~GGlEH~~St~l~~~r~~~~~~~ky~~~l~llsHEyfH~  258 (558)
T COG3975         182 DKERLASDTKKIIEAEIKVFG-SA--PFDKYVFLLHLSDQIYGGLEHRRSTALIYDRFGFTDQDKYQDLLGLLSHEYFHA  258 (558)
T ss_pred             cHHHHHHHHHHHHHHHHHHhc-CC--CccceEEEEEecCCCCCCceeccccccccccccccchhHHHHHHHHHHHHHHHh
Confidence            556666788889999889999 67  8999987765 4456789998532222222222211111 11134699999999


Q ss_pred             hcCCee
Q psy2264         137 WFPAWI  142 (148)
Q Consensus       137 WfGn~V  142 (148)
                      |-+-.+
T Consensus       259 WNvKrI  264 (558)
T COG3975         259 WNVKRI  264 (558)
T ss_pred             ccceec
Confidence            987654


No 12 
>PRK04860 hypothetical protein; Provisional
Probab=93.48  E-value=0.18  Score=36.35  Aligned_cols=67  Identities=9%  Similarity=-0.006  Sum_probs=35.4

Q ss_pred             HHHHHHHHHHHHHHHhCCcCCCCCCcccEEEeCCCCCCchhhhhhhhhhcccccccccc-----chhhhhHHHHhhhhhh
Q psy2264          63 IFDKGPKILAALSDYMGFDYYKMLPKMDLVAVPDFDAGAMENWGMNTYRLGLVVEATHL-----YLLQHTSMIRMTNIPW  137 (148)
Q Consensus        63 ~l~~~~~~l~~~e~~~g~~y~~p~~k~d~v~vp~~~~~~MEn~Gli~~~e~~ll~~~~~-----~~~~~~~iaHElaHqW  137 (148)
                      +...+...+..-+.+||.|+  +-++..+=  .....+|+     -......+-.++..     ...-..+|+||+||-|
T Consensus         5 ~~~~~~~~~~~a~~~f~~~f--~~p~~~f~--~R~rtaG~-----~~l~~~~I~~Np~ll~~~~~~~l~~~v~HEl~H~~   75 (160)
T PRK04860          5 VMRRLRECLAQANLYFKRTF--PEPKVSYT--QRGTSAGT-----AWLQSNEIRLNPVLLLENQQAFIDEVVPHELAHLL   75 (160)
T ss_pred             HHHHHHHHHHHHHHHhCCCC--CCCEEEEe--ecchhhcc-----hhHhcCCeeeCHHHHhhCcHHHHHhHHHHHHHHHH
Confidence            44566777788888999888  66533211  12222333     22222333333221     1112346899999976


Q ss_pred             c
Q psy2264         138 F  138 (148)
Q Consensus       138 f  138 (148)
                      .
T Consensus        76 ~   76 (160)
T PRK04860         76 V   76 (160)
T ss_pred             H
Confidence            3


No 13 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=90.92  E-value=0.63  Score=34.14  Aligned_cols=69  Identities=14%  Similarity=0.137  Sum_probs=36.6

Q ss_pred             HHHHHHHHHHHHHHHHhCCcCCCCCCcccEEEeCCCC--CCchhhhhhhhhhccccccccccchhhhhHHHHhhhhhhcC
Q psy2264          62 YIFDKGPKILAALSDYMGFDYYKMLPKMDLVAVPDFD--AGAMENWGMNTYRLGLVVEATHLYLLQHTSMIRMTNIPWFP  139 (148)
Q Consensus        62 ~~l~~~~~~l~~~e~~~g~~y~~p~~k~d~v~vp~~~--~~~MEn~Gli~~~e~~ll~~~~~~~~~~~~iaHElaHqWfG  139 (148)
                      .+.+..++.+..+++.+|.++       .-|.+-+..  .|.--.-|.|.++-..+..+..   ..+-+|+|||||--.-
T Consensus       109 ~~~~~l~~~~~~~~~~~~~~~-------~~i~ir~~ksrWGsc~~~~~I~ln~~L~~~P~~---~idYVvvHEL~Hl~~~  178 (205)
T PF01863_consen  109 QAKEYLPERLKKYAKKLGLPP-------PKIKIRDMKSRWGSCSSKGNITLNWRLVMAPPE---VIDYVVVHELCHLRHP  178 (205)
T ss_pred             HHHHHHHHHHHHHHHHcCCCc-------ceEEEeehhhccccCCCCCcEEeecccccCCcc---HHHHHHHHHHHHhccC
Confidence            344555666666667666544       234443321  2333334455555444444332   2344799999997655


Q ss_pred             C
Q psy2264         140 A  140 (148)
Q Consensus       140 n  140 (148)
                      |
T Consensus       179 n  179 (205)
T PF01863_consen  179 N  179 (205)
T ss_pred             C
Confidence            4


No 14 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=88.66  E-value=0.27  Score=32.26  Aligned_cols=30  Identities=7%  Similarity=-0.096  Sum_probs=18.7

Q ss_pred             ccccccccccc-hhhhhHHHHhhhhhhcCCe
Q psy2264         112 LGLVVEATHLY-LLQHTSMIRMTNIPWFPAW  141 (148)
Q Consensus       112 e~~ll~~~~~~-~~~~~~iaHElaHqWfGn~  141 (148)
                      ...+..+...+ ..+..+++|||+|.+++..
T Consensus        28 ~~~I~in~~~~~~~~~f~laHELgH~~~~~~   58 (122)
T PF06114_consen   28 NPIIFINSNLSPERQRFTLAHELGHILLHHG   58 (122)
T ss_dssp             TTEEEEESSS-HHHHHHHHHHHHHHHHHHH-
T ss_pred             CCEEEECCCCCHHHHHHHHHHHHHHHHhhhc
Confidence            44455554433 2345689999999998754


No 15 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=86.29  E-value=2.2  Score=31.49  Aligned_cols=67  Identities=6%  Similarity=-0.066  Sum_probs=36.4

Q ss_pred             HHHHHHHHHHHHHhCCcCCCCCCcccEEEeCCCCCC--chhh---hhhhhhhcccccc--ccccc-hhhhhHHHHhhhhh
Q psy2264          65 DKGPKILAALSDYMGFDYYKMLPKMDLVAVPDFDAG--AMEN---WGMNTYRLGLVVE--ATHLY-LLQHTSMIRMTNIP  136 (148)
Q Consensus        65 ~~~~~~l~~~e~~~g~~y~~p~~k~d~v~vp~~~~~--~MEn---~Gli~~~e~~ll~--~~~~~-~~~~~~iaHElaHq  136 (148)
                      +.+.+.+....+.+      |.+++++.++|.-+.+  -+++   .|-..+....++.  +.... ..-..+||||+.|-
T Consensus         3 ~~i~~~~~~~~~~~------~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~~~~~~~~~~l~~~iaHE~hH~   76 (195)
T PF10026_consen    3 EIIEEALEKSIELL------PGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFLLPNDYSLEELPALIAHEYHHN   76 (195)
T ss_pred             HHHHHHHHHHHHHc------CCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEecCCcccHHHHHHHHHHHHHHH
Confidence            34566666666666      5567888766643322  2222   3444444444333  22222 12234699999998


Q ss_pred             h
Q psy2264         137 W  137 (148)
Q Consensus       137 W  137 (148)
                      +
T Consensus        77 ~   77 (195)
T PF10026_consen   77 C   77 (195)
T ss_pred             H
Confidence            5


No 16 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=83.14  E-value=0.95  Score=33.39  Aligned_cols=16  Identities=0%  Similarity=-0.275  Sum_probs=12.4

Q ss_pred             hhHHHHhhhhhhcCCe
Q psy2264         126 HTSMIRMTNIPWFPAW  141 (148)
Q Consensus       126 ~~~iaHElaHqWfGn~  141 (148)
                      ..+||||++|---++.
T Consensus        90 ~aVlaHElgH~~~~h~  105 (226)
T PF01435_consen   90 AAVLAHELGHIKHRHI  105 (226)
T ss_dssp             HHHHHHHHHHHHTTHC
T ss_pred             HHHHHHHHHHHHcCCc
Confidence            3479999999876654


No 17 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=82.97  E-value=1.9  Score=30.21  Aligned_cols=61  Identities=11%  Similarity=0.010  Sum_probs=32.3

Q ss_pred             HHHHHH-HHhCCcCCCCCCcccEEEeCCCCCCchh-hhhhhhhhccccccccccch-----hhhhHHHHhhhhhhcC
Q psy2264          70 ILAALS-DYMGFDYYKMLPKMDLVAVPDFDAGAME-NWGMNTYRLGLVVEATHLYL-----LQHTSMIRMTNIPWFP  139 (148)
Q Consensus        70 ~l~~~e-~~~g~~y~~p~~k~d~v~vp~~~~~~ME-n~Gli~~~e~~ll~~~~~~~-----~~~~~iaHElaHqWfG  139 (148)
                      .+..+. .+|+.++  |-+   .|....    .|- .||.-..+...|..++....     .-..+|.||++|.+..
T Consensus         6 ~~~~~n~~~F~~~l--~~~---~i~w~~----r~~~~~G~~~~~~~~I~ln~~l~~~~~~~~l~~~l~HEm~H~~~~   73 (146)
T smart00731        6 RLEDASLRVFGRKL--PHP---KVVWNK----RLRKTGGRCLLKSAEIRLNPKLLTENGRDRLRETLLHELCHAALY   73 (146)
T ss_pred             HHHHHHHHHHCCCC--CCC---EEEEeh----hhhhhhHHhhcCCCEEEeCHHHHhhccHHHHHhhHHHHHHHHHHH
Confidence            344444 6888677  544   222211    222 35665554444554443311     1123699999999874


No 18 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=79.54  E-value=0.91  Score=32.85  Aligned_cols=11  Identities=0%  Similarity=-0.091  Sum_probs=9.6

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      ..+||||+||+
T Consensus       113 ~~~aHElGH~l  123 (173)
T PF13574_consen  113 DTFAHELGHQL  123 (173)
T ss_dssp             HHHHHHHHHHH
T ss_pred             eeehhhhHhhc
Confidence            46999999996


No 19 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=78.93  E-value=1.6  Score=34.87  Aligned_cols=10  Identities=0%  Similarity=-0.153  Sum_probs=9.0

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      ..++||+|||
T Consensus       198 ~T~~HElAHq  207 (318)
T PF12725_consen  198 FTICHELAHQ  207 (318)
T ss_pred             HHHHHHHHHH
Confidence            4699999998


No 20 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=73.69  E-value=4.2  Score=31.44  Aligned_cols=69  Identities=14%  Similarity=0.032  Sum_probs=35.8

Q ss_pred             HHHHHHHHhCCcCCCCCCcccEEEeCCCCCCchhhh---hhhhhhccccccccccchhhhhHHHHhhhhhhcCCeee
Q psy2264          70 ILAALSDYMGFDYYKMLPKMDLVAVPDFDAGAMENW---GMNTYRLGLVVEATHLYLLQHTSMIRMTNIPWFPAWIS  143 (148)
Q Consensus        70 ~l~~~e~~~g~~y~~p~~k~d~v~vp~~~~~~MEn~---Gli~~~e~~ll~~~~~~~~~~~~iaHElaHqWfGn~Vt  143 (148)
                      .+..+...-+.++   -+++.++-.|..+.-++.+.   |.|...+..+-  ...++.-+.+++||++|.=-++.++
T Consensus       104 ~v~~~a~~~~~~~---~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gLl~--~l~~dEl~aVlaHElgHi~~rd~~~  175 (302)
T COG0501         104 KVAELARQAGIPH---MPEVYILETPQPNAFALGGGPKNGRVVVTTGLLD--LLNDDELEAVLAHELGHIKNRHTLV  175 (302)
T ss_pred             HHHHHHHHCCCCC---CCeeEEecCCCccceecCCCCCCeeEEecHHHHh--hCCHHHHHHHHHHHHHHHhcccHHH
Confidence            3344444444332   45555555554444455553   66665544332  1111123457999999987666543


No 21 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=72.41  E-value=7.3  Score=27.68  Aligned_cols=22  Identities=9%  Similarity=0.033  Sum_probs=18.0

Q ss_pred             cHHHHHHHHHHHHHHHHHHhCC
Q psy2264          59 QGQYIFDKGPKILAALSDYMGF   80 (148)
Q Consensus        59 ~~~~~l~~~~~~l~~~e~~~g~   80 (148)
                      .+..+..-+.+..++|.+.||.
T Consensus        68 ~~vdA~~~~~~v~d~y~~~~gr   89 (150)
T PF01447_consen   68 AAVDAHYNAGKVYDYYKNVFGR   89 (150)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHSS
T ss_pred             HHHHHHHhHHHHHHHHHHHHCC
Confidence            4566777888999999999994


No 22 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=71.69  E-value=2.6  Score=35.44  Aligned_cols=47  Identities=15%  Similarity=0.054  Sum_probs=25.7

Q ss_pred             cccEEEeCCCCCCchhhhhhhhh-hccccccccccchhhhhHHHHhhhh
Q psy2264          88 KMDLVAVPDFDAGAMENWGMNTY-RLGLVVEATHLYLLQHTSMIRMTNI  135 (148)
Q Consensus        88 k~d~v~vp~~~~~~MEn~Gli~~-~e~~ll~~~~~~~~~~~~iaHElaH  135 (148)
                      -+.+++|-+-..++=-.+|--++ +...++..+... .--.+||||++|
T Consensus        93 ~f~f~lV~d~~iNAFA~~Gg~v~vntGLll~ae~es-ElagViAHEigH  140 (484)
T COG4783          93 PFTFFLVNDDSINAFATPGGYVVVNTGLLLTAENES-ELAGVIAHEIGH  140 (484)
T ss_pred             CeEEEEecCCccchhhcCCceEEEehHHHHhcCCHH-HHHHHHHHHHHH
Confidence            45888886655555555443333 333333333211 122469999999


No 23 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=67.41  E-value=2.6  Score=26.58  Aligned_cols=63  Identities=13%  Similarity=0.014  Sum_probs=29.5

Q ss_pred             HHHHHHHHhCCcCCCCCCcccEEEeCCCC--CCchhhhhhhhhhcccccccccc---chhhhhHHHHhhhhhh
Q psy2264          70 ILAALSDYMGFDYYKMLPKMDLVAVPDFD--AGAMENWGMNTYRLGLVVEATHL---YLLQHTSMIRMTNIPW  137 (148)
Q Consensus        70 ~l~~~e~~~g~~y~~p~~k~d~v~vp~~~--~~~MEn~Gli~~~e~~ll~~~~~---~~~~~~~iaHElaHqW  137 (148)
                      +-..||..||.++    ++..+-.=|.-.  ...+ ++-.++..+...+.....   +.....+++||++|=+
T Consensus         6 ~r~~~e~~~G~dl----~~Vrvh~~~~a~~~~~~~-~A~A~T~G~~I~f~~g~~~~~s~~~~~llaHEl~Hv~   73 (79)
T PF13699_consen    6 IRSRLERAFGADL----SDVRVHTGPAASRAAAAL-GARAFTVGNDIYFAPGKYNPDSPEGRALLAHELAHVV   73 (79)
T ss_pred             HHHHHHHHhCCCc----cceEEEeCCchhhhhhcc-CCeEEEECCEEEEcCCCcCCCCCCcchhHhHHHHHHH
Confidence            4567889998555    332222212111  1122 122344444444422211   1223568999999943


No 24 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=67.24  E-value=22  Score=25.73  Aligned_cols=12  Identities=8%  Similarity=-0.050  Sum_probs=9.8

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      ..++||||+|+.
T Consensus       132 a~~~AHElGH~l  143 (194)
T cd04269         132 AVTMAHELGHNL  143 (194)
T ss_pred             HHHHHHHHHhhc
Confidence            357999999985


No 25 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=66.88  E-value=9.7  Score=28.86  Aligned_cols=66  Identities=9%  Similarity=-0.011  Sum_probs=33.9

Q ss_pred             HHHHHHHHHHHHHHHHhCCcCCCCCCcccEEEeCCCCCCchh-hhhhhhhhcccccccccc-c---hhhhhHHHHhhhhh
Q psy2264          62 YIFDKGPKILAALSDYMGFDYYKMLPKMDLVAVPDFDAGAME-NWGMNTYRLGLVVEATHL-Y---LLQHTSMIRMTNIP  136 (148)
Q Consensus        62 ~~l~~~~~~l~~~e~~~g~~y~~p~~k~d~v~vp~~~~~~ME-n~Gli~~~e~~ll~~~~~-~---~~~~~~iaHElaHq  136 (148)
                      .+.+..+..++.+.+.+|.++       ..+-+-     .|- -||-..... .+..+-.. .   ..-+-+|+||+||=
T Consensus       120 ~~~~~l~~~~~~~~~~l~~~~-------~~~~ik-----~~k~~WGScs~~~-~i~~~~~l~~~p~~~i~YVvvHELaHL  186 (223)
T COG1451         120 ILREILEIRLKEYAKKLGVPP-------RAIKLK-----NMKRRWGSCSKAG-EIRFNWRLVMAPEEVIDYVVVHELAHL  186 (223)
T ss_pred             HHHHHHHHHHHHHHHHhCCCc-------cceeee-----eccceeeeecCCC-cEEeehhhhcCCHHHHHHHHHHHHHHH
Confidence            455666777777778888766       222221     121 255444433 22222111 0   11233699999995


Q ss_pred             hcCC
Q psy2264         137 WFPA  140 (148)
Q Consensus       137 WfGn  140 (148)
                      =..|
T Consensus       187 ke~n  190 (223)
T COG1451         187 KEKN  190 (223)
T ss_pred             hhhh
Confidence            5444


No 26 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=65.84  E-value=3.8  Score=30.88  Aligned_cols=53  Identities=6%  Similarity=-0.098  Sum_probs=28.3

Q ss_pred             CcccEEEeCCCCCCchhhhh-hhhhhcc--ccccccccc-hhhhhHHHHhhhhhhcCCe
Q psy2264          87 PKMDLVAVPDFDAGAMENWG-MNTYRLG--LVVEATHLY-LLQHTSMIRMTNIPWFPAW  141 (148)
Q Consensus        87 ~k~d~v~vp~~~~~~MEn~G-li~~~e~--~ll~~~~~~-~~~~~~iaHElaHqWfGn~  141 (148)
                      .+.++++++.-  +.+.|+. .-.|.+.  .+..+...+ ..++.++||||+|=|++--
T Consensus        32 e~~gv~vf~~~--~~~~~~d~~~~~~~~~~~I~iN~n~~~~r~rFtlAHELGH~llH~~   88 (213)
T COG2856          32 EKAGVLVFELS--FINKNIDAYGLYDEEKPVIYINANNSLERKRFTLAHELGHALLHTD   88 (213)
T ss_pred             hcCCeEEEecc--ccccccccceeeeccCceEEEeCCCCHHHHHHHHHHHHhHHHhccc
Confidence            45566666541  2333332 3333322  333343333 3345689999999998754


No 27 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=65.82  E-value=13  Score=27.63  Aligned_cols=12  Identities=8%  Similarity=-0.188  Sum_probs=9.7

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      ..++||||+|..
T Consensus       146 ~~~~AHElGH~l  157 (220)
T cd04272         146 VYTMTHELAHLL  157 (220)
T ss_pred             HHHHHHHHHHHh
Confidence            457999999964


No 28 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=65.74  E-value=3.4  Score=28.92  Aligned_cols=12  Identities=0%  Similarity=-0.080  Sum_probs=10.7

Q ss_pred             hHHHHhhhhhhc
Q psy2264         127 TSMIRMTNIPWF  138 (148)
Q Consensus       127 ~~iaHElaHqWf  138 (148)
                      .+|+||++|.|.
T Consensus        62 ~tL~HEm~H~~~   73 (157)
T PF10263_consen   62 DTLLHEMAHAAA   73 (157)
T ss_pred             HHHHHHHHHHHh
Confidence            469999999998


No 29 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=65.21  E-value=3.2  Score=30.11  Aligned_cols=11  Identities=0%  Similarity=-0.109  Sum_probs=9.3

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      .++||||+|.+
T Consensus       144 ~~~AHEiGH~l  154 (196)
T PF13688_consen  144 ITFAHEIGHNL  154 (196)
T ss_dssp             HHHHHHHHHHT
T ss_pred             hhhHHhHHHhc
Confidence            46999999975


No 30 
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=64.88  E-value=4.4  Score=27.54  Aligned_cols=13  Identities=15%  Similarity=0.389  Sum_probs=11.0

Q ss_pred             hhhHHHHhhhhhh
Q psy2264         125 QHTSMIRMTNIPW  137 (148)
Q Consensus       125 ~~~~iaHElaHqW  137 (148)
                      .+..++||++|-|
T Consensus        79 C~~TL~HEL~H~W   91 (141)
T PHA02456         79 CRDTLAHELNHAW   91 (141)
T ss_pred             hHHHHHHHHHHHH
Confidence            3457999999999


No 31 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=64.59  E-value=3.6  Score=30.32  Aligned_cols=13  Identities=8%  Similarity=-0.036  Sum_probs=11.8

Q ss_pred             HHHHhhhhhhcCC
Q psy2264         128 SMIRMTNIPWFPA  140 (148)
Q Consensus       128 ~iaHElaHqWfGn  140 (148)
                      ++.|||||-++|+
T Consensus        85 t~lHELaH~~~~~   97 (186)
T PF08325_consen   85 TMLHELAHNVHGP   97 (186)
T ss_pred             HHHHHHHhcccCC
Confidence            5899999999997


No 32 
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=64.44  E-value=15  Score=29.99  Aligned_cols=37  Identities=19%  Similarity=0.344  Sum_probs=20.2

Q ss_pred             ceEEEecCccc----cc--cHHHHHHH-HH--HHHHHHHHHhCCcC
Q psy2264          46 KFRVWGREDVV----KT--QGQYIFDK-GP--KILAALSDYMGFDY   82 (148)
Q Consensus        46 ~i~v~~~~~~~----~~--~~~~~l~~-~~--~~l~~~e~~~g~~y   82 (148)
                      .+.||+..+..    ..  .++...+. ..  ++-....+.||.|+
T Consensus        20 ~vnvWVed~e~~~~~is~~~~~~l~~~F~~~~~iYp~~~~ifG~pw   65 (366)
T PF10460_consen   20 TVNVWVEDGEYGSNKISDAQADSLAQEFDNSGKIYPRLVEIFGEPW   65 (366)
T ss_pred             EEEEEEEcCccCccccCHHHHHHHHHHhccccccchhHHHhcCCCC
Confidence            78899887620    22  12222222 22  46666777888773


No 33 
>PRK03982 heat shock protein HtpX; Provisional
Probab=61.87  E-value=12  Score=29.20  Aligned_cols=14  Identities=0%  Similarity=-0.358  Sum_probs=10.2

Q ss_pred             hHHHHhhhhhhcCC
Q psy2264         127 TSMIRMTNIPWFPA  140 (148)
Q Consensus       127 ~~iaHElaHqWfGn  140 (148)
                      .++|||++|-==|+
T Consensus       127 AVlAHElgHi~~~h  140 (288)
T PRK03982        127 GVIAHELTHIKNRD  140 (288)
T ss_pred             HHHHHHHHHHHcCC
Confidence            46999999964433


No 34 
>PRK01345 heat shock protein HtpX; Provisional
Probab=61.16  E-value=13  Score=29.59  Aligned_cols=15  Identities=7%  Similarity=-0.312  Sum_probs=10.9

Q ss_pred             hhHHHHhhhhhhcCC
Q psy2264         126 HTSMIRMTNIPWFPA  140 (148)
Q Consensus       126 ~~~iaHElaHqWfGn  140 (148)
                      ..++|||++|-==++
T Consensus       125 ~aVlAHElgHi~~~d  139 (317)
T PRK01345        125 AGVMAHELAHVKNRD  139 (317)
T ss_pred             HHHHHHHHHHHHcCC
Confidence            347999999964444


No 35 
>PRK03001 M48 family peptidase; Provisional
Probab=60.64  E-value=14  Score=28.84  Aligned_cols=12  Identities=8%  Similarity=-0.232  Sum_probs=9.1

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      ..++|||++|-=
T Consensus       125 ~aVlAHElgHi~  136 (283)
T PRK03001        125 RGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHh
Confidence            346999999953


No 36 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=60.18  E-value=49  Score=23.02  Aligned_cols=36  Identities=11%  Similarity=0.106  Sum_probs=24.3

Q ss_pred             ceEEEecCcccc---ccHHHHHHHHHHHHHHHHHHhCCcC
Q psy2264          46 KFRVWGREDVVK---TQGQYIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        46 ~i~v~~~~~~~~---~~~~~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ++++|..+.. .   .......+...++++..+++.++.+
T Consensus         3 ~i~~~i~~~~-~~~~~~~~~~~~~v~~A~~~W~~~~~l~F   41 (156)
T cd04279           3 PIRVYIDPTP-APPDSRAQSWLQAVKQAAAEWENVGPLKF   41 (156)
T ss_pred             CeEEEEcCCC-CccccchHHHHHHHHHHHHHHHHhCCeEE
Confidence            5677776653 2   2345567788888888888876554


No 37 
>PRK03072 heat shock protein HtpX; Provisional
Probab=56.89  E-value=18  Score=28.38  Aligned_cols=14  Identities=0%  Similarity=-0.230  Sum_probs=10.0

Q ss_pred             hHHHHhhhhhhcCC
Q psy2264         127 TSMIRMTNIPWFPA  140 (148)
Q Consensus       127 ~~iaHElaHqWfGn  140 (148)
                      .++|||++|-==|+
T Consensus       129 aVlAHElgHi~~~d  142 (288)
T PRK03072        129 GVLGHELSHVYNRD  142 (288)
T ss_pred             HHHHHHHHHHhcCC
Confidence            46999999953333


No 38 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=56.33  E-value=5.9  Score=29.26  Aligned_cols=10  Identities=0%  Similarity=-0.173  Sum_probs=8.4

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .++||||+|.
T Consensus       142 ~~~aHElGH~  151 (207)
T cd04273         142 FTIAHELGHV  151 (207)
T ss_pred             Eeeeeechhh
Confidence            4699999995


No 39 
>PRK02870 heat shock protein HtpX; Provisional
Probab=54.08  E-value=16  Score=29.51  Aligned_cols=10  Identities=10%  Similarity=-0.114  Sum_probs=8.5

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .++|||++|-
T Consensus       175 aVlAHELgHi  184 (336)
T PRK02870        175 AVMAHELSHI  184 (336)
T ss_pred             HHHHHHHHHH
Confidence            4699999995


No 40 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=53.89  E-value=6.7  Score=26.12  Aligned_cols=12  Identities=0%  Similarity=-0.138  Sum_probs=9.6

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      ...++|||+|..
T Consensus       108 ~~~~~HEiGH~l  119 (124)
T PF13582_consen  108 VDTFAHEIGHNL  119 (124)
T ss_dssp             TTHHHHHHHHHT
T ss_pred             ceEeeehhhHhc
Confidence            357999999973


No 41 
>PF13265 DUF4056:  Protein of unknown function (DUF4056)
Probab=53.61  E-value=5.8  Score=30.80  Aligned_cols=11  Identities=9%  Similarity=0.311  Sum_probs=8.8

Q ss_pred             HHHhhhhhhcCC
Q psy2264         129 MIRMTNIPWFPA  140 (148)
Q Consensus       129 iaHElaHqWfGn  140 (148)
                      --|||| ||||=
T Consensus       152 ~WHEIA-QWyG~  162 (270)
T PF13265_consen  152 QWHEIA-QWYGY  162 (270)
T ss_pred             HHHHHH-HHcCc
Confidence            349998 89994


No 42 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=52.37  E-value=27  Score=26.10  Aligned_cols=13  Identities=8%  Similarity=-0.051  Sum_probs=9.9

Q ss_pred             HHHHhhhhhhcCC
Q psy2264         128 SMIRMTNIPWFPA  140 (148)
Q Consensus       128 ~iaHElaHqWfGn  140 (148)
                      +|.||++|-|=.|
T Consensus        99 vl~HE~~H~~Q~~  111 (205)
T PF04450_consen   99 VLYHEMVHCWQWD  111 (205)
T ss_pred             HHHHHHHHHhhcC
Confidence            5899999966543


No 43 
>KOG3658|consensus
Probab=51.35  E-value=4.1  Score=35.73  Aligned_cols=10  Identities=0%  Similarity=0.082  Sum_probs=9.1

Q ss_pred             HHHHhhhhhh
Q psy2264         128 SMIRMTNIPW  137 (148)
Q Consensus       128 ~iaHElaHqW  137 (148)
                      .+|||+.|.|
T Consensus       395 t~AHEiGHNf  404 (764)
T KOG3658|consen  395 TLAHEIGHNF  404 (764)
T ss_pred             eehhhhcccc
Confidence            5899999998


No 44 
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=49.71  E-value=22  Score=27.60  Aligned_cols=78  Identities=13%  Similarity=0.042  Sum_probs=38.3

Q ss_pred             ccHHHHHHHHHHHHHHHHHHhCCcCC------CCCCcccEEEeCCCCCCchh-hhhhhhhhccc---cccccccchhhhh
Q psy2264          58 TQGQYIFDKGPKILAALSDYMGFDYY------KMLPKMDLVAVPDFDAGAME-NWGMNTYRLGL---VVEATHLYLLQHT  127 (148)
Q Consensus        58 ~~~~~~l~~~~~~l~~~e~~~g~~y~------~p~~k~d~v~vp~~~~~~ME-n~Gli~~~e~~---ll~~~~~~~~~~~  127 (148)
                      .+.+..++...++++...++.|.+..      +|..+..+|.-.....+.|- ..+-|.+..+.   ++..... ....-
T Consensus       143 ~d~~~ll~~~D~ii~~~~el~Gl~~~~~~~~~~~~~~~r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~-~~~~W  221 (307)
T PF13402_consen  143 EDPEELLRFWDRIIDAEYELAGLDKSSPGPENNPMPNNRFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPL-RKGGW  221 (307)
T ss_dssp             SSSHHHHHHHHHHHHHHHHHTT-BSS--GGGB--S--EEEEEETT----SEEEETTEEEEETT--HHHH-HHHH-HHH-H
T ss_pred             hhHHHHHHHHHHHHHHHHHHhCCCcccCCccccCcccceEEEeccccccceeecCCcEEeeCcHHhcccCHhHc-CCCCe
Confidence            34667778888899999999997762      12222366666666666776 23333333221   1110000 00112


Q ss_pred             HHHHhhhhh
Q psy2264         128 SMIRMTNIP  136 (148)
Q Consensus       128 ~iaHElaHq  136 (148)
                      -+.||+.|+
T Consensus       222 G~~HE~GH~  230 (307)
T PF13402_consen  222 GPWHELGHN  230 (307)
T ss_dssp             HHHHHHHHH
T ss_pred             eehhhhhhh
Confidence            389999996


No 45 
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=49.47  E-value=8  Score=31.25  Aligned_cols=42  Identities=17%  Similarity=0.237  Sum_probs=25.0

Q ss_pred             cccHHHHHHHHHHHHHHHHHHhCCc-------CC---CCCCcccEEEeCCCC
Q psy2264          57 KTQGQYIFDKGPKILAALSDYMGFD-------YY---KMLPKMDLVAVPDFD   98 (148)
Q Consensus        57 ~~~~~~~l~~~~~~l~~~e~~~g~~-------y~---~p~~k~d~v~vp~~~   98 (148)
                      .+....-|..+.++=+|-.+.+|.|       |.   +|.--..+++.|.+.
T Consensus        45 ~~~lr~rL~~~~~iR~FA~~~L~Lpdn~sY~~YadL~Rp~vvWnV~Aap~~s   96 (337)
T PF10023_consen   45 PPALRARLRLAQQIRRFASEELGLPDNGSYRSYADLDRPYVVWNVFAAPEFS   96 (337)
T ss_pred             CHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhhhcCCCcEEEEEEecCccc
Confidence            4455566677777778877777744       11   122233567777664


No 46 
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=49.11  E-value=8.8  Score=30.23  Aligned_cols=53  Identities=15%  Similarity=0.227  Sum_probs=28.9

Q ss_pred             cHHHHHHHHHHHHHHHHHHhCCc-------CC---CCCCcccEEEeCCCCCCchh----hhhhhhhh
Q psy2264          59 QGQYIFDKGPKILAALSDYMGFD-------YY---KMLPKMDLVAVPDFDAGAME----NWGMNTYR  111 (148)
Q Consensus        59 ~~~~~l~~~~~~l~~~e~~~g~~-------y~---~p~~k~d~v~vp~~~~~~ME----n~Gli~~~  111 (148)
                      ....-+..+.++-.|-++.++.|       |.   +|+-..++.+.|.+.....-    -.||+.|+
T Consensus        79 rLr~rL~~A~aiR~fAseeL~LPDN~SYR~YadIgRp~vvwnVFAaPelSl~p~TwCFpv~GCV~YR  145 (376)
T COG4324          79 RLRARLTSASAIRRFASEELALPDNSSYRSYADIGRPDVVWNVFAAPELSLTPITWCFPVFGCVPYR  145 (376)
T ss_pred             HHHHHHhhHHHHHHHHHHhccCCCCcceeeeeccCCcceeeeeccCcccccceeeeeeeeecceecc
Confidence            34455666777777777777754       11   23334456666776432221    13666664


No 47 
>PF14891 Peptidase_M91:  Effector protein
Probab=48.84  E-value=8.8  Score=27.71  Aligned_cols=11  Identities=0%  Similarity=-0.064  Sum_probs=9.3

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      .+|+|||+|-|
T Consensus       105 v~L~HEL~HA~  115 (174)
T PF14891_consen  105 VVLYHELIHAY  115 (174)
T ss_pred             HHHHHHHHHHH
Confidence            36999999977


No 48 
>PRK04351 hypothetical protein; Provisional
Probab=48.60  E-value=7.8  Score=27.52  Aligned_cols=10  Identities=0%  Similarity=-0.406  Sum_probs=8.2

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .+|+||++|=
T Consensus        63 ~vv~HElcH~   72 (149)
T PRK04351         63 GIIKHELCHY   72 (149)
T ss_pred             hhHHHHHHHH
Confidence            4699999984


No 49 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=47.80  E-value=8.3  Score=27.92  Aligned_cols=11  Identities=9%  Similarity=-0.070  Sum_probs=8.9

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      .++||||+|..
T Consensus       135 ~~~aHElGH~l  145 (192)
T cd04267         135 LTMAHELGHNL  145 (192)
T ss_pred             hhhhhhHHhhc
Confidence            47999999953


No 50 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=47.69  E-value=8.4  Score=27.55  Aligned_cols=22  Identities=18%  Similarity=0.060  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHHhCCcCCCCCCcc
Q psy2264          66 KGPKILAALSDYMGFDYYKMLPKM   89 (148)
Q Consensus        66 ~~~~~l~~~e~~~g~~y~~p~~k~   89 (148)
                      ...+.+.-.+..++..+  |-+|.
T Consensus         6 ~L~~~~~~as~~~~r~~--~~p~~   27 (156)
T COG3091           6 KLQQCVEQASLKFFRKF--FRPKA   27 (156)
T ss_pred             HHHHHHHHHHHHhcCCC--CCcce
Confidence            34455555565555577  65543


No 51 
>PF08219 TOM13:  Outer membrane protein TOM13;  InterPro: IPR013262 The TOM13 family of proteins are mitochondrial outer membrane proteins that mediate the assembly of beta-barrel proteins [].; GO: 0005741 mitochondrial outer membrane
Probab=46.90  E-value=11  Score=23.62  Aligned_cols=12  Identities=0%  Similarity=-0.155  Sum_probs=9.5

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      -+++|||+++.|
T Consensus        53 GEl~AhE~~fr~   64 (77)
T PF08219_consen   53 GELFAHEIAFRL   64 (77)
T ss_pred             HHHHHHHHHHHh
Confidence            357999999775


No 52 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=46.53  E-value=42  Score=24.37  Aligned_cols=10  Identities=0%  Similarity=-0.180  Sum_probs=8.3

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .++||||+|.
T Consensus       133 ~~~AHelGH~  142 (199)
T PF01421_consen  133 VIIAHELGHN  142 (199)
T ss_dssp             HHHHHHHHHH
T ss_pred             HHHHHHHHHh
Confidence            4689999994


No 53 
>KOG2661|consensus
Probab=45.95  E-value=13  Score=30.20  Aligned_cols=13  Identities=8%  Similarity=-0.255  Sum_probs=10.3

Q ss_pred             hhHHHHhhhhhhc
Q psy2264         126 HTSMIRMTNIPWF  138 (148)
Q Consensus       126 ~~~iaHElaHqWf  138 (148)
                      ..+++||+|||--
T Consensus       276 AtvLgHE~aHaVa  288 (424)
T KOG2661|consen  276 ATVLGHEIAHAVA  288 (424)
T ss_pred             HHHHHHHHHHHHH
Confidence            3579999999853


No 54 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=45.78  E-value=10  Score=28.97  Aligned_cols=11  Identities=0%  Similarity=-0.163  Sum_probs=9.4

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      .++||||+|.+
T Consensus       169 ~t~AHElGHnl  179 (244)
T cd04270         169 LVTAHELGHNF  179 (244)
T ss_pred             HHHHHHHHHhc
Confidence            46999999975


No 55 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=45.08  E-value=16  Score=25.94  Aligned_cols=16  Identities=6%  Similarity=0.304  Sum_probs=10.6

Q ss_pred             hhHHHHhhhhhhcCCee
Q psy2264         126 HTSMIRMTNIPWFPAWI  142 (148)
Q Consensus       126 ~~~iaHElaHqWfGn~V  142 (148)
                      ...++||+.| |||=+=
T Consensus        70 g~TltHEvGH-~LGL~H   85 (154)
T PF05572_consen   70 GKTLTHEVGH-WLGLYH   85 (154)
T ss_dssp             SHHHHHHHHH-HTT---
T ss_pred             ccchhhhhhh-hhcccc
Confidence            4579999998 777543


No 56 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=44.41  E-value=7  Score=25.74  Aligned_cols=32  Identities=16%  Similarity=0.026  Sum_probs=18.1

Q ss_pred             hhhhhhhccccccccccchhhh---hHHHHhhhhh
Q psy2264         105 WGMNTYRLGLVVEATHLYLLQH---TSMIRMTNIP  136 (148)
Q Consensus       105 ~Gli~~~e~~ll~~~~~~~~~~---~~iaHElaHq  136 (148)
                      .-+++||......-......++   ..+.||+||.
T Consensus        50 ~rI~lyR~pl~~~~~~~~eL~~~I~~tlvhEiah~   84 (97)
T PF06262_consen   50 DRIVLYRRPLERRARSREELAELIRDTLVHEIAHH   84 (97)
T ss_dssp             EEEEEEHHHHHHT-SSHHHHHHHHHHHHHHHHHHH
T ss_pred             CEEEEehHHHHHHhCCHHHHHHHHHHHHHHHHHHH
Confidence            3477777765554332222222   3589999995


No 57 
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=42.65  E-value=15  Score=25.87  Aligned_cols=17  Identities=6%  Similarity=-0.070  Sum_probs=11.2

Q ss_pred             hhhhHHHHhhhh-hhcCC
Q psy2264         124 LQHTSMIRMTNI-PWFPA  140 (148)
Q Consensus       124 ~~~~~iaHElaH-qWfGn  140 (148)
                      .+..+|+||++| ..+|.
T Consensus        95 Sk~~TLiHE~SHf~~~~~  112 (148)
T PF14521_consen   95 SKEGTLIHEWSHFTAVGG  112 (148)
T ss_dssp             -HHHHHHHHHHHSCCCT-
T ss_pred             chHHHHHHhhhhhhhhcC
Confidence            355689999999 24544


No 58 
>PRK02391 heat shock protein HtpX; Provisional
Probab=41.67  E-value=14  Score=29.18  Aligned_cols=15  Identities=0%  Similarity=-0.254  Sum_probs=11.3

Q ss_pred             hhHHHHhhhhhhcCC
Q psy2264         126 HTSMIRMTNIPWFPA  140 (148)
Q Consensus       126 ~~~iaHElaHqWfGn  140 (148)
                      +.++|||++|--=|+
T Consensus       134 ~aVlaHElgHi~~~d  148 (296)
T PRK02391        134 EAVLAHELSHVKNRD  148 (296)
T ss_pred             HHHHHHHHHHHHcCC
Confidence            457999999965554


No 59 
>PRK04897 heat shock protein HtpX; Provisional
Probab=41.30  E-value=14  Score=29.08  Aligned_cols=15  Identities=0%  Similarity=-0.389  Sum_probs=10.9

Q ss_pred             hhHHHHhhhhhhcCC
Q psy2264         126 HTSMIRMTNIPWFPA  140 (148)
Q Consensus       126 ~~~iaHElaHqWfGn  140 (148)
                      +.++|||++|-=-|+
T Consensus       138 ~aVlAHElgHi~~~d  152 (298)
T PRK04897        138 EGVIGHEISHIRNYD  152 (298)
T ss_pred             HHHHHHHHHHHhcCC
Confidence            347999999954444


No 60 
>PRK05457 heat shock protein HtpX; Provisional
Probab=40.25  E-value=15  Score=28.78  Aligned_cols=16  Identities=0%  Similarity=-0.278  Sum_probs=12.3

Q ss_pred             hhHHHHhhhhhhcCCe
Q psy2264         126 HTSMIRMTNIPWFPAW  141 (148)
Q Consensus       126 ~~~iaHElaHqWfGn~  141 (148)
                      +.++|||++|.=-||.
T Consensus       135 ~aVlAHElgHi~~~d~  150 (284)
T PRK05457        135 EAVLAHEISHIANGDM  150 (284)
T ss_pred             HHHHHHHHHHHHcCCH
Confidence            3469999999866663


No 61 
>PF06167 Peptidase_M90:  Glucose-regulated metallo-peptidase M90;  InterPro: IPR010384  MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=39.49  E-value=7.7  Score=30.01  Aligned_cols=10  Identities=10%  Similarity=-0.090  Sum_probs=8.5

Q ss_pred             HHHHhhhhhh
Q psy2264         128 SMIRMTNIPW  137 (148)
Q Consensus       128 ~iaHElaHqW  137 (148)
                      ++.||.||+=
T Consensus       157 VviHEfAH~L  166 (253)
T PF06167_consen  157 VVIHEFAHKL  166 (253)
T ss_dssp             HHHHHHHHHH
T ss_pred             hHHHHHHHHH
Confidence            6899999973


No 62 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=38.22  E-value=12  Score=28.40  Aligned_cols=9  Identities=0%  Similarity=-0.141  Sum_probs=8.0

Q ss_pred             HHHHhhhhh
Q psy2264         128 SMIRMTNIP  136 (148)
Q Consensus       128 ~iaHElaHq  136 (148)
                      ++||||+|-
T Consensus       148 t~AHElGHn  156 (228)
T cd04271         148 VFAHEIGHT  156 (228)
T ss_pred             ehhhhhhhh
Confidence            699999995


No 63 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=35.98  E-value=20  Score=25.88  Aligned_cols=20  Identities=20%  Similarity=0.232  Sum_probs=14.6

Q ss_pred             HHHHHHHHHHHHHHHhCCcC
Q psy2264          63 IFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        63 ~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ..+.++++++..++..++.+
T Consensus        35 ~~~~i~~A~~~w~~~~~l~F   54 (186)
T cd04277          35 QQAAARDALEAWEDVADIDF   54 (186)
T ss_pred             HHHHHHHHHHHHHhhcCcee
Confidence            34567778888888888666


No 64 
>PRK01265 heat shock protein HtpX; Provisional
Probab=35.97  E-value=19  Score=28.88  Aligned_cols=11  Identities=0%  Similarity=-0.354  Sum_probs=8.8

Q ss_pred             hhHHHHhhhhh
Q psy2264         126 HTSMIRMTNIP  136 (148)
Q Consensus       126 ~~~iaHElaHq  136 (148)
                      +.++|||++|-
T Consensus       141 ~aVlAHElgHi  151 (324)
T PRK01265        141 KAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHH
Confidence            34699999994


No 65 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=35.42  E-value=18  Score=26.89  Aligned_cols=11  Identities=0%  Similarity=-0.091  Sum_probs=8.9

Q ss_pred             hHHHHhhhhhh
Q psy2264         127 TSMIRMTNIPW  137 (148)
Q Consensus       127 ~~iaHElaHqW  137 (148)
                      .+++|||+|..
T Consensus       139 ~~~aHEiGH~l  149 (206)
T PF13583_consen  139 QTFAHEIGHNL  149 (206)
T ss_pred             hHHHHHHHHHh
Confidence            46999999964


No 66 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=35.03  E-value=21  Score=24.86  Aligned_cols=21  Identities=5%  Similarity=0.029  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHHHHhCCcC
Q psy2264          62 YIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        62 ~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ...+.++++++..++.+.+.+
T Consensus        15 ~~r~~i~~A~~~W~~~~~i~F   35 (165)
T cd04268          15 KLRAAILDAIEAWNKAFAIGF   35 (165)
T ss_pred             HHHHHHHHHHHHHHHHhCcCc
Confidence            345677888899999887665


No 67 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=34.01  E-value=21  Score=24.56  Aligned_cols=22  Identities=14%  Similarity=0.108  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHHHHHHhCCcC
Q psy2264          61 QYIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        61 ~~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ....+.+.++++.-++..++.|
T Consensus        21 ~~~~~~i~~A~~~W~~~~~~~F   42 (154)
T PF00413_consen   21 SEQRDAIRQAFQAWNDVAPLNF   42 (154)
T ss_dssp             HHHHHHHHHHHHHHHTTSSEEE
T ss_pred             HHHHHHHHHHHHHHHhcCCceE
Confidence            4456678888888888765443


No 68 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=34.00  E-value=25  Score=27.41  Aligned_cols=11  Identities=0%  Similarity=-0.082  Sum_probs=8.9

Q ss_pred             hhHHHHhhhhh
Q psy2264         126 HTSMIRMTNIP  136 (148)
Q Consensus       126 ~~~iaHElaHq  136 (148)
                      +.+++||++|=
T Consensus       197 ~~il~HEl~Hi  207 (299)
T PF05569_consen  197 RAILLHELAHI  207 (299)
T ss_pred             HHHHHHHHHHH
Confidence            45799999994


No 69 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=32.99  E-value=20  Score=24.92  Aligned_cols=23  Identities=13%  Similarity=0.161  Sum_probs=17.6

Q ss_pred             HHHHHHHHHHHHHHHHHHhCCcC
Q psy2264          60 GQYIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        60 ~~~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      .....+.+..+++.+++..++.+
T Consensus        20 ~~~~~~~v~~a~~~w~~~~~i~f   42 (167)
T cd00203          20 SAQIQSLILIAMQIWRDYLNIRF   42 (167)
T ss_pred             HHHHHHHHHHHHHHHHhhhCceE
Confidence            34566778888999999887766


No 70 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=32.26  E-value=25  Score=24.48  Aligned_cols=10  Identities=0%  Similarity=-0.349  Sum_probs=8.2

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .+.+||+.|-
T Consensus        79 RIaaHE~GHi   88 (132)
T PF02031_consen   79 RIAAHELGHI   88 (132)
T ss_dssp             HHHHHHHHHH
T ss_pred             eeeeehhccc
Confidence            4789999993


No 71 
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=32.07  E-value=23  Score=26.66  Aligned_cols=12  Identities=8%  Similarity=0.310  Sum_probs=10.4

Q ss_pred             hHHHHhhhhhhc
Q psy2264         127 TSMIRMTNIPWF  138 (148)
Q Consensus       127 ~~iaHElaHqWf  138 (148)
                      .++|||+.|-|.
T Consensus        95 siLAHE~mHa~L  106 (212)
T PF12315_consen   95 SILAHELMHAWL  106 (212)
T ss_pred             hHHHHHHHHHHh
Confidence            479999999995


No 72 
>KOG3607|consensus
Probab=31.25  E-value=1.3e+02  Score=27.02  Aligned_cols=32  Identities=9%  Similarity=0.213  Sum_probs=16.5

Q ss_pred             eEEEecCccccccHHHHHHHHHHHHHHHHHHhC
Q psy2264          47 FRVWGREDVVKTQGQYIFDKGPKILAALSDYMG   79 (148)
Q Consensus        47 i~v~~~~~~~~~~~~~~l~~~~~~l~~~e~~~g   79 (148)
                      +.+|...+. ....+.+.+.....+.|=..++.
T Consensus       243 lE~Wt~~dk-i~~~~~~~~tL~~F~~wr~~~l~  274 (716)
T KOG3607|consen  243 LEIWTDGNK-IDVSEDLRETLHNFLKWRKSYLT  274 (716)
T ss_pred             EEecCCCCe-ecccccHHHHHHHHHHHHHhhcc
Confidence            557777766 44444444444444444444443


No 73 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=30.67  E-value=28  Score=29.79  Aligned_cols=31  Identities=3%  Similarity=-0.055  Sum_probs=17.9

Q ss_pred             hhhhhhccccccccccchhhhhHHHHhhhhh
Q psy2264         106 GMNTYRLGLVVEATHLYLLQHTSMIRMTNIP  136 (148)
Q Consensus       106 Gli~~~e~~ll~~~~~~~~~~~~iaHElaHq  136 (148)
                      |+..+..+.++.+-.-+......++||++|-
T Consensus       318 ~~~~~~~P~I~~Nf~~t~~dv~TL~HElGHa  348 (549)
T TIGR02289       318 YLPKYKAPFIFSNFNGTSGDIDVLTHEAGHA  348 (549)
T ss_pred             CCCCCCCcEEEEeCCCChhHHHHHHHHhhHH
Confidence            3444455666654333333445799999864


No 74 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=28.52  E-value=28  Score=26.31  Aligned_cols=11  Identities=0%  Similarity=-0.176  Sum_probs=8.8

Q ss_pred             hhHHHHhhhhh
Q psy2264         126 HTSMIRMTNIP  136 (148)
Q Consensus       126 ~~~iaHElaHq  136 (148)
                      +-+|+|||.|-
T Consensus       134 ~hvi~HEiGH~  144 (211)
T PF12388_consen  134 EHVITHEIGHC  144 (211)
T ss_pred             HHHHHHHhhhh
Confidence            34799999994


No 75 
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=26.68  E-value=44  Score=19.78  Aligned_cols=12  Identities=8%  Similarity=-0.025  Sum_probs=9.8

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      +..++|++|++|
T Consensus        29 eR~~vH~~a~~~   40 (59)
T cd06007          29 ERAVIHRLCRKL   40 (59)
T ss_pred             HHHHHHHHHHHc
Confidence            456999999994


No 76 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=26.59  E-value=30  Score=23.54  Aligned_cols=20  Identities=15%  Similarity=0.228  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHHHHHHhCCcC
Q psy2264          63 IFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        63 ~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ..+.+.++++..++..++.+
T Consensus        24 ~~~~i~~A~~~w~~~t~i~F   43 (140)
T smart00235       24 VREAIARAFAEWSDVTCLRF   43 (140)
T ss_pred             HHHHHHHHHHHHhcCCeeEE
Confidence            35677788888888877655


No 77 
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=26.26  E-value=47  Score=22.62  Aligned_cols=10  Identities=10%  Similarity=-0.160  Sum_probs=8.0

Q ss_pred             hHHHHhhhhh
Q psy2264         127 TSMIRMTNIP  136 (148)
Q Consensus       127 ~~iaHElaHq  136 (148)
                      .+++||+.|-
T Consensus        82 KviiHEllHI   91 (133)
T COG4900          82 KVIIHELLHI   91 (133)
T ss_pred             HHHHHHHhcC
Confidence            3699999983


No 78 
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=25.96  E-value=22  Score=24.57  Aligned_cols=10  Identities=10%  Similarity=0.538  Sum_probs=8.5

Q ss_pred             hhhhhcCCee
Q psy2264         133 TNIPWFPAWI  142 (148)
Q Consensus       133 laHqWfGn~V  142 (148)
                      -||.||||..
T Consensus        77 ~ah~w~G~rL   86 (142)
T COG4101          77 EAHTWYGNRL   86 (142)
T ss_pred             eeeeeeccce
Confidence            5899999975


No 79 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=25.59  E-value=34  Score=27.09  Aligned_cols=9  Identities=0%  Similarity=-0.189  Sum_probs=7.9

Q ss_pred             HHHHhhhhh
Q psy2264         128 SMIRMTNIP  136 (148)
Q Consensus       128 ~iaHElaHq  136 (148)
                      +||||..|+
T Consensus       173 VlAHEyGHH  181 (292)
T PF04228_consen  173 VLAHEYGHH  181 (292)
T ss_pred             HHHHHHHHH
Confidence            599999996


No 80 
>PF10584 Proteasome_A_N:  Proteasome subunit A N-terminal signature;  InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C ....
Probab=24.62  E-value=24  Score=16.90  Aligned_cols=18  Identities=11%  Similarity=0.198  Sum_probs=13.7

Q ss_pred             eEEEecCccccccHHHHHH
Q psy2264          47 FRVWGREDVVKTQGQYIFD   65 (148)
Q Consensus        47 i~v~~~~~~~~~~~~~~l~   65 (148)
                      +.+|+|.+. +-+.+++++
T Consensus         5 ~t~FSp~Gr-l~QVEYA~~   22 (23)
T PF10584_consen    5 ITTFSPDGR-LFQVEYAMK   22 (23)
T ss_dssp             TTSBBTTSS-BHHHHHHHH
T ss_pred             ceeECCCCe-EEeeEeeec
Confidence            457888888 778888764


No 81 
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=24.34  E-value=38  Score=24.70  Aligned_cols=12  Identities=0%  Similarity=-0.266  Sum_probs=9.4

Q ss_pred             hHHHHhhhhhhc
Q psy2264         127 TSMIRMTNIPWF  138 (148)
Q Consensus       127 ~~iaHElaHqWf  138 (148)
                      .+|||||+|---
T Consensus        38 ~ilahel~hafd   49 (206)
T PF01431_consen   38 FILAHELMHAFD   49 (206)
T ss_dssp             HHHHHHHHHCTS
T ss_pred             HHHHHHHHHHHH
Confidence            479999999653


No 82 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=23.88  E-value=35  Score=23.81  Aligned_cols=22  Identities=9%  Similarity=0.101  Sum_probs=15.7

Q ss_pred             HHHHHHHHHHHHHHHHHhCCcC
Q psy2264          61 QYIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        61 ~~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ....+.+.++++..++..++.+
T Consensus        21 ~~~~~~i~~A~~~W~~~~~l~F   42 (157)
T cd04278          21 DDVRRAIARAFRVWSDVTPLTF   42 (157)
T ss_pred             HHHHHHHHHHHHHHHhhcCcee
Confidence            4445677888888888877554


No 83 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=23.73  E-value=47  Score=25.88  Aligned_cols=13  Identities=0%  Similarity=-0.112  Sum_probs=9.8

Q ss_pred             hHHHHhhhhhhcC
Q psy2264         127 TSMIRMTNIPWFP  139 (148)
Q Consensus       127 ~~iaHElaHqWfG  139 (148)
                      .+++||+.|...|
T Consensus       218 ~v~vHE~GHsf~~  230 (264)
T PF09471_consen  218 QVVVHEFGHSFGG  230 (264)
T ss_dssp             HHHHHHHHHHTT-
T ss_pred             ceeeeeccccccc
Confidence            4699999997554


No 84 
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=23.64  E-value=55  Score=19.44  Aligned_cols=13  Identities=8%  Similarity=0.261  Sum_probs=10.1

Q ss_pred             hhhHHHHhhhhhh
Q psy2264         125 QHTSMIRMTNIPW  137 (148)
Q Consensus       125 ~~~~iaHElaHqW  137 (148)
                      .+..++|++|+++
T Consensus        29 ~eR~~vH~~a~~~   41 (60)
T cd02640          29 EERALIHQIAQKY   41 (60)
T ss_pred             HHHHHHHHHHHHc
Confidence            3456999999994


No 85 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=22.89  E-value=1.8e+02  Score=20.90  Aligned_cols=21  Identities=0%  Similarity=-0.122  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHHHHHHhCCcC
Q psy2264          62 YIFDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        62 ~~l~~~~~~l~~~e~~~g~~y   82 (148)
                      ...+.++++++.+++.+.+.+
T Consensus        15 ~~~~~I~~A~~~w~~~TcIrF   35 (180)
T cd04280          15 SDRSLILRAMREIESNTCIRF   35 (180)
T ss_pred             HHHHHHHHHHHHHHhCCcceE
Confidence            445778889999999887665


No 86 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=22.61  E-value=48  Score=26.92  Aligned_cols=25  Identities=4%  Similarity=-0.133  Sum_probs=14.4

Q ss_pred             ccccccccccchhhhhHHHHhhhhh
Q psy2264         112 LGLVVEATHLYLLQHTSMIRMTNIP  136 (148)
Q Consensus       112 e~~ll~~~~~~~~~~~~iaHElaHq  136 (148)
                      ...++.+-..+...-..++||++|-
T Consensus       209 ~p~i~~n~~~~~~~v~tl~HE~GHa  233 (427)
T cd06459         209 HPFILMNFNGTLDDVFTLAHELGHA  233 (427)
T ss_pred             CCeEEecCCCChhhHHHHHHHhhHH
Confidence            4445544332333345699999884


No 87 
>PF14831 DUF4484:  Domain of unknown function (DUF4484)
Probab=21.13  E-value=67  Score=23.54  Aligned_cols=14  Identities=7%  Similarity=0.290  Sum_probs=12.0

Q ss_pred             hhHHHHhhhhhhcC
Q psy2264         126 HTSMIRMTNIPWFP  139 (148)
Q Consensus       126 ~~~iaHElaHqWfG  139 (148)
                      ....++||++.|||
T Consensus       145 D~~FV~el~~~yf~  158 (176)
T PF14831_consen  145 DAAFVKELVAKYFG  158 (176)
T ss_pred             HHHHHHHHHHHHcC
Confidence            34589999999999


No 88 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=20.89  E-value=1.3e+02  Score=21.95  Aligned_cols=19  Identities=5%  Similarity=0.032  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHHhCCcC
Q psy2264          64 FDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        64 l~~~~~~l~~~e~~~g~~y   82 (148)
                      .+.+.++++.+++.+.+.+
T Consensus        21 ~~~I~~Am~~~e~~TcI~F   39 (191)
T PF01400_consen   21 RQRIRKAMDEWEKNTCIRF   39 (191)
T ss_dssp             HHHHHHHHHHHHHHSSEEE
T ss_pred             HHHHHHHHHHHHhCCCEEE
Confidence            3678899999999998766


No 89 
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=20.78  E-value=1.8e+02  Score=21.13  Aligned_cols=12  Identities=0%  Similarity=0.025  Sum_probs=9.8

Q ss_pred             hhHHHHhhhhhh
Q psy2264         126 HTSMIRMTNIPW  137 (148)
Q Consensus       126 ~~~iaHElaHqW  137 (148)
                      +.+|+|||.|-|
T Consensus        72 ~~~l~HELIHay   83 (173)
T PF09768_consen   72 EDTLTHELIHAY   83 (173)
T ss_pred             HHHHHHHHHHHH
Confidence            457999999975


No 90 
>PF10186 Atg14:  UV radiation resistance protein and autophagy-related subunit 14;  InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 [].  The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=20.75  E-value=1.5e+02  Score=22.67  Aligned_cols=29  Identities=17%  Similarity=0.284  Sum_probs=24.8

Q ss_pred             cccHHHHHHHHHHHHHHHHHHhCCcCCCCCC
Q psy2264          57 KTQGQYIFDKGPKILAALSDYMGFDYYKMLP   87 (148)
Q Consensus        57 ~~~~~~~l~~~~~~l~~~e~~~g~~y~~p~~   87 (148)
                      .+...-++..+.+++..+..|+|+|+  |++
T Consensus       194 ~~~isaALgyvahlv~lls~yL~v~L--py~  222 (302)
T PF10186_consen  194 DEEISAALGYVAHLVSLLSRYLGVPL--PYP  222 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhCCCC--CCC
Confidence            34677899999999999999999988  665


No 91 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=20.66  E-value=1.8e+02  Score=21.59  Aligned_cols=19  Identities=0%  Similarity=0.016  Sum_probs=14.5

Q ss_pred             HHHHHHHHHHHHHHhCCcC
Q psy2264          64 FDKGPKILAALSDYMGFDY   82 (148)
Q Consensus        64 l~~~~~~l~~~e~~~g~~y   82 (148)
                      ...+.++++.+++.+-+.+
T Consensus        28 ~~~I~~Am~~~~~~TCirF   46 (200)
T cd04281          28 RAMFKQAMRHWENFTCVTF   46 (200)
T ss_pred             HHHHHHHHHHHHhCCceEE
Confidence            4577889999999876555


No 92 
>PF04036 DUF372:  Domain of unknown function (DUF372);  InterPro: IPR007179 This is a group of proteins of unknown function. It is found N-terminal to another domain of unknown function (IPR007181 from INTERPRO).; PDB: 2I52_B 2IEC_D 2OGF_C.
Probab=20.59  E-value=49  Score=17.87  Aligned_cols=13  Identities=8%  Similarity=-0.197  Sum_probs=9.6

Q ss_pred             hhhhhhcCCeeee
Q psy2264         132 MTNIPWFPAWISW  144 (148)
Q Consensus       132 ElaHqWfGn~Vt~  144 (148)
                      .|-|||.|-=|+-
T Consensus        13 aLyHQF~GtPvs~   25 (38)
T PF04036_consen   13 ALYHQFVGTPVSP   25 (38)
T ss_dssp             HHHHHHTT-EEST
T ss_pred             HHHHHhcCCcCCc
Confidence            4789999988764


No 93 
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=20.09  E-value=45  Score=25.97  Aligned_cols=9  Identities=11%  Similarity=-0.001  Sum_probs=8.1

Q ss_pred             HHHHhhhhh
Q psy2264         128 SMIRMTNIP  136 (148)
Q Consensus       128 ~iaHElaHq  136 (148)
                      ++.||+|||
T Consensus       145 VvIHEFAH~  153 (260)
T PRK15410        145 LIIHEVAHK  153 (260)
T ss_pred             hhHhHHHhH
Confidence            689999997


No 94 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=20.01  E-value=53  Score=24.01  Aligned_cols=11  Identities=9%  Similarity=-0.345  Sum_probs=8.3

Q ss_pred             HHHHhhhhhhc
Q psy2264         128 SMIRMTNIPWF  138 (148)
Q Consensus       128 ~iaHElaHqWf  138 (148)
                      ++.||++|-|.
T Consensus        44 l~iHElgH~~~   54 (183)
T cd06160          44 LGIHEMGHYLA   54 (183)
T ss_pred             HHHHHHHHHHH
Confidence            46899999763


Done!