Query         psy12854
Match_columns 265
No_of_seqs    129 out of 1161
Neff          7.0 
Searched_HMMs 46136
Date          Fri Aug 16 21:30:04 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy12854.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12854hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG0339 Dcp Zn-dependent oligo 100.0 1.6E-72 3.6E-77  545.6  24.5  239    7-265   439-682 (683)
  2 PRK10280 dipeptidyl carboxypep 100.0 5.5E-71 1.2E-75  549.9  25.5  238    8-265   437-680 (681)
  3 PRK10911 oligopeptidase A; Pro 100.0 8.8E-68 1.9E-72  527.9  26.2  240    7-265   435-679 (680)
  4 KOG2089|consensus              100.0 2.1E-68 4.5E-73  511.5  18.6  242    5-264   465-708 (718)
  5 cd06456 M3A_DCP_Oligopeptidase 100.0 1.7E-64 3.6E-69  480.8  26.4  238    7-264   180-422 (422)
  6 cd06457 M3A_MIP Peptidase M3 m 100.0 3.4E-62 7.4E-67  469.7  23.8  231    7-260   220-456 (458)
  7 cd06455 M3A_TOP Peptidase M3 T 100.0 3.3E-60 7.2E-65  457.5  25.1  235    7-262   235-472 (472)
  8 KOG2090|consensus              100.0 6.7E-56 1.4E-60  421.2  21.0  233    6-261   450-688 (704)
  9 PF01432 Peptidase_M3:  Peptida 100.0 5.6E-56 1.2E-60  426.2  20.1  239    6-264   213-458 (458)
 10 cd06258 Peptidase_M3_like The  100.0 2.5E-49 5.5E-54  369.7  25.5  234    9-261   128-364 (365)
 11 TIGR00181 pepF oligoendopeptid 100.0 2.4E-33 5.2E-38  277.3  18.3  208    4-261   354-576 (591)
 12 TIGR02290 M3_fam_3 oligoendope 100.0 4.4E-33 9.6E-38  275.3  19.3  210    4-261   350-572 (587)
 13 cd06459 M3B_Oligoendopeptidase 100.0 1.3E-32 2.8E-37  261.0  19.1  211    4-261   197-421 (427)
 14 TIGR02289 M3_not_pepF oligoend 100.0   2E-29 4.3E-34  247.6  19.5  204    4-254   312-528 (549)
 15 COG1164 Oligoendopeptidase F [  99.9 2.5E-25 5.5E-30  219.9  19.2  212    3-261   354-580 (598)
 16 cd06460 M32_Taq Peptidase fami  99.8 3.4E-17 7.3E-22  154.5  18.8  230   13-262   144-391 (396)
 17 cd06461 M2_ACE Peptidase famil  99.3   1E-10 2.2E-15  113.3  17.1  214   12-260   233-464 (477)
 18 PF02074 Peptidase_M32:  Carbox  98.7 4.9E-07 1.1E-11   88.0  16.6  213   33-262   257-490 (494)
 19 COG2317 Zn-dependent carboxype  98.3 1.7E-05 3.6E-10   75.8  13.8  226   14-262   245-491 (497)
 20 PF01401 Peptidase_M2:  Angiote  98.2 4.6E-05   1E-09   76.0  16.7  211   28-261   342-567 (595)
 21 KOG3690|consensus               97.1   0.059 1.3E-06   53.4  17.9  214   28-261   364-596 (646)
 22 COG2856 Predicted Zn peptidase  92.8   0.079 1.7E-06   46.5   2.6   32    8-49     55-86  (213)
 23 PF14247 DUF4344:  Domain of un  88.9    0.28 6.1E-06   43.2   2.3   18   34-51     91-108 (220)
 24 PF00413 Peptidase_M10:  Matrix  88.8    0.23 4.9E-06   40.2   1.5   13   35-47    105-117 (154)
 25 PF06114 DUF955:  Domain of unk  88.4    0.41 8.9E-06   36.4   2.7   22   32-53     39-60  (122)
 26 cd04268 ZnMc_MMP_like Zinc-dep  85.7    0.45 9.8E-06   39.0   1.6   14   35-48     94-107 (165)
 27 cd04279 ZnMc_MMP_like_1 Zinc-d  85.0    0.48   1E-05   38.8   1.5   15   34-48    103-117 (156)
 28 cd04278 ZnMc_MMP Zinc-dependen  84.8    0.38 8.3E-06   39.6   0.8   16   33-48    105-120 (157)
 29 PF13058 DUF3920:  Protein of u  84.3    0.71 1.5E-05   36.2   2.0   16   34-49     75-90  (126)
 30 cd04277 ZnMc_serralysin_like Z  83.3    0.66 1.4E-05   39.2   1.6   16   33-48    111-126 (186)
 31 PF13688 Reprolysin_5:  Metallo  81.9    0.73 1.6E-05   39.0   1.4   17   32-48    139-155 (196)
 32 PF05572 Peptidase_M43:  Pregna  81.9     1.4 2.9E-05   36.6   2.9   17   32-48     66-82  (154)
 33 PF13582 Reprolysin_3:  Metallo  81.9     0.7 1.5E-05   36.0   1.2   13   35-47    107-119 (124)
 34 PF13583 Reprolysin_4:  Metallo  81.8     1.3 2.8E-05   38.4   2.9   15   33-47    135-149 (206)
 35 smart00235 ZnMc Zinc-dependent  81.6    0.71 1.5E-05   37.0   1.1   11   37-47     88-98  (140)
 36 PF13398 Peptidase_M50B:  Pepti  81.5     1.2 2.6E-05   38.4   2.6   24   33-56     20-43  (200)
 37 PF13574 Reprolysin_2:  Metallo  81.4     0.9 1.9E-05   38.3   1.7   14   35-48    111-124 (173)
 38 cd04327 ZnMc_MMP_like_3 Zinc-d  79.4       1 2.2E-05   38.7   1.4   15   36-50     93-107 (198)
 39 PF04298 Zn_peptidase_2:  Putat  78.9     1.5 3.3E-05   38.6   2.4   19   32-50     86-104 (222)
 40 cd00203 ZnMc Zinc-dependent me  78.7       1 2.2E-05   36.8   1.2   14   34-47     95-108 (167)
 41 cd04267 ZnMc_ADAM_like Zinc-de  78.2     2.1 4.6E-05   36.1   3.1   16   32-47    130-145 (192)
 42 PF14891 Peptidase_M91:  Effect  78.2     1.6 3.4E-05   36.9   2.2   21   34-54    102-122 (174)
 43 PF04228 Zn_peptidase:  Putativ  76.1     1.7 3.6E-05   40.0   1.9   19   35-53    170-188 (292)
 44 cd04272 ZnMc_salivary_gland_MP  75.0     1.8   4E-05   37.6   1.8   16   32-47    142-157 (220)
 45 PF10460 Peptidase_M30:  Peptid  74.4     4.3 9.2E-05   38.5   4.2   46   35-83    139-184 (366)
 46 cd04271 ZnMc_ADAM_fungal Zinc-  72.2     1.2 2.6E-05   39.4  -0.1   14   34-47    144-157 (228)
 47 PF13485 Peptidase_MA_2:  Pepti  72.2     7.3 0.00016   29.5   4.5   35   37-78     27-61  (128)
 48 cd04270 ZnMc_TACE_like Zinc-de  71.2     2.2 4.8E-05   38.0   1.4   12   36-47    168-179 (244)
 49 cd04280 ZnMc_astacin_like Zinc  71.0     2.1 4.5E-05   36.3   1.2   13   36-48     75-87  (180)
 50 PF02031 Peptidase_M7:  Strepto  69.2     3.2 6.9E-05   33.5   1.8   17   32-48     74-90  (132)
 51 PF01400 Astacin:  Astacin (Pep  68.0     3.4 7.4E-05   35.3   1.9   15   35-49     79-93  (191)
 52 PF12388 Peptidase_M57:  Dual-a  67.6     2.5 5.5E-05   37.0   1.0   13   37-49    135-147 (211)
 53 cd04269 ZnMc_adamalysin_II_lik  67.0       3 6.6E-05   35.3   1.4   14   34-47    130-143 (194)
 54 cd04283 ZnMc_hatching_enzyme Z  66.1     3.2   7E-05   35.5   1.3   15   36-50     78-92  (182)
 55 PF13699 DUF4157:  Domain of un  66.0     3.9 8.4E-05   30.1   1.6   19   32-50     58-76  (79)
 56 PF02163 Peptidase_M50:  Peptid  64.0     4.5 9.7E-05   34.1   1.8   24   36-59      8-31  (192)
 57 cd04281 ZnMc_BMP1_TLD Zinc-dep  60.8     4.5 9.8E-05   35.1   1.3   14   36-49     88-101 (200)
 58 TIGR03296 M6dom_TIGR03296 M6 f  59.1     1.9 4.2E-05   39.3  -1.4   14   35-48    165-178 (286)
 59 cd05709 S2P-M50 Site-2 proteas  58.7     6.2 0.00014   32.9   1.8   19   36-54      9-27  (180)
 60 cd04276 ZnMc_MMP_like_2 Zinc-d  57.5     5.9 0.00013   34.3   1.5   12   37-48    118-129 (197)
 61 PF09471 Peptidase_M64:  IgA Pe  56.0       6 0.00013   35.8   1.3   19   35-53    216-234 (264)
 62 cd06163 S2P-M50_PDZ_RseP-like   56.0     7.3 0.00016   33.2   1.8   20   36-55     10-29  (182)
 63 cd06161 S2P-M50_SpoIVFB SpoIVF  53.5     8.5 0.00018   33.3   1.8   17   37-53     40-56  (208)
 64 cd04282 ZnMc_meprin Zinc-depen  52.8     7.1 0.00015   34.6   1.2   16   36-51    121-136 (230)
 65 COG2738 Predicted Zn-dependent  51.8      10 0.00022   33.0   1.9   18   32-49     89-106 (226)
 66 PF01435 Peptidase_M48:  Peptid  50.8     8.3 0.00018   32.9   1.3   21   31-51     84-105 (226)
 67 PF01421 Reprolysin:  Reprolysi  50.6      15 0.00033   31.1   2.9   15   33-47    129-143 (199)
 68 PF07607 DUF1570:  Protein of u  47.7      19  0.0004   29.1   2.8   41   35-81      1-42  (128)
 69 cd06159 S2P-M50_PDZ_Arch Uncha  47.0      12 0.00026   33.8   1.8   16   36-51    119-134 (263)
 70 cd04273 ZnMc_ADAMTS_like Zinc-  46.2       4 8.7E-05   35.1  -1.4   15   33-47    138-152 (207)
 71 cd06164 S2P-M50_SpoIVFB_CBS Sp  44.4      14 0.00031   32.5   1.8   16   36-51     54-69  (227)
 72 PF00427 PBS_linker_poly:  Phyc  43.5      14  0.0003   29.9   1.5   31  226-256    58-89  (131)
 73 COG0501 HtpX Zn-dependent prot  43.1      12 0.00026   33.5   1.2   20   30-49    151-171 (302)
 74 PF05548 Peptidase_M11:  Gameto  42.6      13 0.00027   34.6   1.2   14   34-47    149-162 (314)
 75 PF10462 Peptidase_M66:  Peptid  42.0      14 0.00031   34.2   1.4   18   30-47    188-205 (305)
 76 PRK03982 heat shock protein Ht  40.4      14 0.00031   33.6   1.2   19   31-49    120-139 (288)
 77 cd06162 S2P-M50_PDZ_SREBP Ster  40.3      18 0.00038   33.1   1.8   16   36-51    136-151 (277)
 78 PF13402 M60-like:  Peptidase M  39.5      16 0.00034   33.1   1.3   16   35-50    219-234 (307)
 79 PRK13267 archaemetzincin-like   39.0      17 0.00038   30.9   1.5   17   31-47    121-137 (179)
 80 cd04275 ZnMc_pappalysin_like Z  38.8     4.9 0.00011   35.5  -2.0   17   32-48    134-150 (225)
 81 PRK03001 M48 family peptidase;  37.9      16 0.00035   33.1   1.1   19   31-49    119-138 (283)
 82 cd06160 S2P-M50_like_2 Unchara  37.4      22 0.00047   30.3   1.8   17   37-53     43-59  (183)
 83 KOG3714|consensus               37.4      16 0.00035   35.1   1.1   20   36-55    160-179 (411)
 84 PF01447 Peptidase_M4:  Thermol  36.8      23  0.0005   29.2   1.8   19   29-47    128-147 (150)
 85 TIGR00054 RIP metalloprotease   36.1      21 0.00045   34.3   1.7   17   36-52     15-31  (420)
 86 PF07998 Peptidase_M54:  Peptid  35.8      21 0.00046   30.8   1.5   22   27-48    137-158 (194)
 87 PRK02870 heat shock protein Ht  35.7      19  0.0004   33.8   1.2   20   30-49    167-187 (336)
 88 PF01742 Peptidase_M27:  Clostr  35.0      28 0.00062   33.3   2.3   24   31-54    210-233 (408)
 89 PRK01345 heat shock protein Ht  35.0      19 0.00042   33.3   1.2   20   31-50    119-139 (317)
 90 PRK04897 heat shock protein Ht  34.9      19 0.00042   33.0   1.2   19   31-49    132-151 (298)
 91 PRK05457 heat shock protein Ht  31.4      24 0.00052   32.1   1.2   19   31-49    129-148 (284)
 92 PRK10779 zinc metallopeptidase  31.3      28 0.00062   33.7   1.7   15   36-50     16-30  (449)
 93 PF05547 Peptidase_M6:  Immune   30.8     7.8 0.00017   39.5  -2.3   13   35-47    221-233 (645)
 94 PRK03072 heat shock protein Ht  30.8      25 0.00054   32.1   1.2   19   31-49    122-141 (288)
 95 KOG2921|consensus               29.7      31 0.00068   33.1   1.7   34   22-55    118-151 (484)
 96 COG4227 Antirestriction protei  29.7      24 0.00051   32.1   0.8   11   36-46    204-214 (316)
 97 COG5549 Predicted Zn-dependent  29.3      26 0.00057   30.8   1.0   22   27-48    179-200 (236)
 98 PRK02391 heat shock protein Ht  27.9      30 0.00065   31.7   1.2   19   31-49    128-147 (296)
 99 PF04539 Sigma70_r3:  Sigma-70   27.9      50  0.0011   23.3   2.2   24  242-265    10-33  (78)
100 PRK01265 heat shock protein Ht  27.1      32 0.00069   32.1   1.2   19   31-49    135-154 (324)
101 PHA00527 hypothetical protein   26.9 1.5E+02  0.0033   23.2   4.6   22   36-57     74-95  (129)
102 TIGR00488 putative HD superfam  25.4 3.7E+02  0.0079   21.8   7.8   98   35-133    35-146 (158)
103 PHA02456 zinc metallopeptidase  24.6   1E+02  0.0022   24.5   3.4   13   37-49     81-93  (141)
104 KOG2719|consensus               24.3      47   0.001   32.2   1.8   18   36-53    281-298 (428)
105 PHA00657 crystallin beta/gamma  24.3      50  0.0011   36.4   2.1   24   28-51    784-807 (2052)
106 COG1913 Predicted Zn-dependent  23.2      45 0.00098   28.4   1.3   20   28-47    117-136 (181)
107 KOG1565|consensus               23.0      29 0.00062   34.1   0.1   18   35-52    211-228 (469)
108 PLN02791 Nudix hydrolase homol  22.2      37 0.00079   35.5   0.7   20   34-53    584-603 (770)
109 PF12315 DUF3633:  Protein of u  22.0      51  0.0011   28.9   1.4   13   36-48     94-106 (212)
110 PF13946 DUF4214:  Domain of un  21.9      72  0.0016   22.9   2.0   21  241-261    25-45  (75)
111 KOG2661|consensus               21.8 1.1E+02  0.0023   29.1   3.5   53   32-84    271-329 (424)
112 KOG3607|consensus               21.5      48   0.001   34.4   1.4   16   32-47    320-335 (716)
113 PF01457 Peptidase_M8:  Leishma  21.2      52  0.0011   32.6   1.5   15   34-48    209-223 (521)
114 PF02128 Peptidase_M36:  Fungal  21.1      16 0.00035   34.8  -2.0   40   14-53    148-203 (378)
115 COG2321 Predicted metalloprote  20.6      56  0.0012   29.9   1.4   20   34-53    166-185 (295)
116 PF10263 SprT-like:  SprT-like   20.5      70  0.0015   25.8   1.9   17   36-52     61-77  (157)

No 1  
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=100.00  E-value=1.6e-72  Score=545.64  Aligned_cols=239  Identities=34%  Similarity=0.551  Sum_probs=227.3

Q ss_pred             CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854          7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV   86 (265)
Q Consensus         7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~   86 (265)
                      +.+|+||++++|||.+|++++|+||+|+||+||||||||+||++|++++|+.+|||+ |||||||+||||||||||+|++
T Consensus       439 ~~~q~PV~ylvCNF~~p~~gkPaLls~dEV~TLFHEfGHgLH~mlt~v~~~~vsGt~-v~wDfVElPSQ~mE~w~~~p~v  517 (683)
T COG0339         439 GGGQKPVIYLVCNFTKPVGGKPALLSHDEVTTLFHEFGHGLHHLLTRVKYPGVSGTN-VPWDFVELPSQFMENWCWEPEV  517 (683)
T ss_pred             CCcccceEEEeccCCCCCCCCCceeeHHHHHHHHHHhhhHHHHHhhcCCccccCCCC-CCcchhhccHHHHHHhhcCHHH
Confidence            458999999999999999999999999999999999999999999999999999996 9999999999999999999999


Q ss_pred             HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854         87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN  161 (265)
Q Consensus        87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~  161 (265)
                      |..+++||+||++||++ +++++++++++.|+.++||+.+|+|||.+|...+    ..+.++.+++.++..-.+..++.+
T Consensus       518 L~~~a~Hy~TGe~lP~~ll~k~laaknf~~g~~t~rql~fal~Dm~~H~~~~~~~~~~i~~~~~~~~~~~~v~~~~~~~~  597 (683)
T COG0339         518 LAKYARHYQTGEPLPKELLDKMLAAKNFQAGLFTLRQLEFALFDMRLHTEFDPDANADILEFEAEVLKKVAVLPSIPPRR  597 (683)
T ss_pred             HHHHHHhhccCCcCHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHhhccCCcccccCHHHHHHHHHHHhCCCCCcchhh
Confidence            99999999999999999 9999999999999999999999999999999873    357889999998877665567788


Q ss_pred             cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854        162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS  241 (265)
Q Consensus       162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~  241 (265)
                      |+++|+|||+|||+||||||+||+|+++|.|.+|.+.|.    +|               +++|+|||+.||+.|||++|
T Consensus       598 ~~~~F~HIFagGYsAGYYSY~WaeVLsaDafa~Fee~g~----~~---------------~e~G~rfrd~ILs~GGS~dp  658 (683)
T COG0339         598 RPHSFGHIFAGGYSAGYYSYLWAEVLSADAFAAFEEEGP----FN---------------RETGQRFRDAILSRGGSRDP  658 (683)
T ss_pred             ccccccceecCcccchhHHHHHHHHHhhHHHHHHHhcCC----CC---------------HHHHHHHHHHHHhccCCcCH
Confidence            999999999999999999999999999999999999876    66               99999999999999999999


Q ss_pred             HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854        242 SEVFRRFRGRDPCFKPFLDMFRLN  265 (265)
Q Consensus       242 ~e~l~~flGr~ps~~a~~~~~g~~  265 (265)
                      ++++++|+||+|+.||+|+++|+.
T Consensus       659 ~e~f~~frGrep~~dalLr~~Gl~  682 (683)
T COG0339         659 MELFKAFRGREPSIDALLRHRGLA  682 (683)
T ss_pred             HHHHHHHhcCCCChhHHHHhcCCC
Confidence            999999999999999999999974


No 2  
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=100.00  E-value=5.5e-71  Score=549.94  Aligned_cols=238  Identities=26%  Similarity=0.440  Sum_probs=218.5

Q ss_pred             ccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHHH
Q psy12854          8 TVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERVF   87 (265)
Q Consensus         8 ~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L   87 (265)
                      .+++|+++++|||++|++++|+||+|+||+||||||||+||++|++++|+++|||+ |+|||||+||||||+|||+|++|
T Consensus       437 ~~~~Pv~~lvcNf~~p~~~~p~LL~~~eV~TlFHEfGHalH~lls~~~y~~~sGt~-v~~DfVE~PSq~mE~w~~~~~vL  515 (681)
T PRK10280        437 NETRPVIYNVCNYQKPAAGQPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTN-TPRDFVEFPSQINEHWASHPQVF  515 (681)
T ss_pred             CCCCCeEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhCCCccccCCCC-CCcchhcCcHHHHHHHhcCHHHH
Confidence            46789999999999999999999999999999999999999999999999999985 99999999999999999999999


Q ss_pred             HHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCC-CCCCCCCC
Q psy12854         88 AELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFG-FPIDKYSN  161 (265)
Q Consensus        88 ~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~-~~~~~~~~  161 (265)
                      +.+++||+||+|||++ +++|+++++++.|+.+++|+.+|+|||.+|....    ....+..+++..+... ++..++++
T Consensus       516 ~~~a~Hy~TgepiP~~l~~~l~~ar~~~~g~~~~~ql~~al~D~~lH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  595 (681)
T PRK10280        516 ARYARHYQSGEAMPDELQEKMRNASLFNKGYDMSELLSAALLDMRWHCLEENEAMQDVDDFELRALVAENLDLPAVPPRY  595 (681)
T ss_pred             HHHhhccCCCCCCCHHHHHHHHHhhCcchHHHHHHHHHHHHHhHHHhccCcccccccHHHHHHHHHHHhCCCCCCCCCCC
Confidence            9999999999999999 9999999999999999999999999999997632    1344555555555543 56667788


Q ss_pred             cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854        162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS  241 (265)
Q Consensus       162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~  241 (265)
                      ++++|+|||++||+|+||||+||+|+|+|+|++|++++.    +|               +++|++||++||++|||++|
T Consensus       596 ~~~~F~Hif~ggY~AgYYsYlwaevlaaD~f~~f~~~g~----~n---------------~~~G~~fr~~iL~~GGs~d~  656 (681)
T PRK10280        596 RSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGG----LT---------------RENGQRFREAILSRGNSTDL  656 (681)
T ss_pred             CCCcccccccCCcchhhHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhhcCCCcCH
Confidence            999999999889999999999999999999999988654    45               99999999999999999999


Q ss_pred             HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854        242 SEVFRRFRGRDPCFKPFLDMFRLN  265 (265)
Q Consensus       242 ~e~l~~flGr~ps~~a~~~~~g~~  265 (265)
                      .++|++||||+|+++|||+++|++
T Consensus       657 ~~~~~~FlGR~P~~~alL~~~Gl~  680 (681)
T PRK10280        657 ERLYRQWRGHAPQIMPMLQHRGLN  680 (681)
T ss_pred             HHHHHHhcCCCCChHHHHHhcCCC
Confidence            999999999999999999999985


No 3  
>PRK10911 oligopeptidase A; Provisional
Probab=100.00  E-value=8.8e-68  Score=527.94  Aligned_cols=240  Identities=30%  Similarity=0.541  Sum_probs=226.2

Q ss_pred             CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854          7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV   86 (265)
Q Consensus         7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~   86 (265)
                      |.++.|+++|+|||++|.+++|+||+|+||.||||||||+||++|++++|+.++||++++|||||+|||+||+|+|+|++
T Consensus       435 g~~~~Pv~~l~~Nf~~p~~~~p~LL~~~~v~tlfHEfGHalH~~ls~~~~~~~sGt~~~~~D~vE~pS~~~E~~~~~~~v  514 (680)
T PRK10911        435 GSLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLTRIETAGVSGISGVPWDAVELPSQFMENWCWEPEA  514 (680)
T ss_pred             CceeCCeEEEECCCCCCCCCCCcccCHHHHHHHHHHHhHHHHHHHhCCCcCcCCCcCCCCchHhhccHHHHHHHhcCHHH
Confidence            55688999999999999999999999999999999999999999999999999998889999999999999999999999


Q ss_pred             HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854         87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN  161 (265)
Q Consensus        87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~  161 (265)
                      |+.+++||+||++||++ +++++++++++.++.++||+.+|.||+.+|...+    ..+.++++++++++..++..++++
T Consensus       515 L~~~a~H~~tgeplp~~l~~~l~~~~~~~~~~~~~rql~~a~~D~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  594 (680)
T PRK10911        515 LAFISGHYETGEPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFRLHAEFDPDQGAKILETLAEIKKQVAVVPSPSWGR  594 (680)
T ss_pred             HHHHHHHhcCCCCCCHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHhhccccccccCHHHHHHHHHHHcCCCCCCCCCc
Confidence            99999999999999999 9999999999999999999999999999997532    157889999998887667667889


Q ss_pred             cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854        162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS  241 (265)
Q Consensus       162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~  241 (265)
                      ++++|+|||++||+|+||||+||+++|+|+|++|++++.    +|               +++|++||++||++|||++|
T Consensus       595 ~~~~F~Hif~gGY~AgYYsYlwa~vla~d~~~~f~~~g~----~~---------------~~~g~~~r~~iL~~Ggs~~p  655 (680)
T PRK10911        595 FPHAFSHIFAGGYAAGYYSYLWADVLAADAFSRFEEEGI----FN---------------RETGQSFLDNILSRGGSEEP  655 (680)
T ss_pred             CCccccccCcCCcccchHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHHhCcCCcCH
Confidence            999999999999999999999999999999999988765    56               99999999999999999999


Q ss_pred             HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854        242 SEVFRRFRGRDPCFKPFLDMFRLN  265 (265)
Q Consensus       242 ~e~l~~flGr~ps~~a~~~~~g~~  265 (265)
                      .+++++||||+|+++|||+++|++
T Consensus       656 ~~~~~~F~GR~P~~~all~~~gl~  679 (680)
T PRK10911        656 MELFKRFRGREPQLDAMLEHYGIK  679 (680)
T ss_pred             HHHHHHhcCCCCChHHHHHhcCCC
Confidence            999999999999999999999985


No 4  
>KOG2089|consensus
Probab=100.00  E-value=2.1e-68  Score=511.53  Aligned_cols=242  Identities=37%  Similarity=0.649  Sum_probs=230.3

Q ss_pred             CCCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccH
Q psy12854          5 GTKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEE   84 (265)
Q Consensus         5 g~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~   84 (265)
                      -++.+++||++++|||++|..++|+||+|+||+|+||||||+||++|+++.|+.+||+++++|||||+||||||||+|++
T Consensus       465 ~~ss~~~PVaalv~nfS~p~~~kpsll~~~ev~t~FheFGh~~q~ll~Qa~~~~fsG~~~vewDave~psq~Lenwv~~~  544 (718)
T KOG2089|consen  465 KDSSRRIPVAALVCNFSKPQSDKPSLLGHDEVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDP  544 (718)
T ss_pred             cCCccccchHHHHHhcCCcccCCCCccchHHHHHHHHHHhHHHHHHHhcCccccccCcccCCcchhhchHHHHHHhccCc
Confidence            36788999999999999999999999999999999999999999999999999999998899999999999999999999


Q ss_pred             HHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHccccCCCCCCCCCCcC
Q psy12854         85 RVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKPHWSEISRELYPLHFGFPIDKYSNLP  163 (265)
Q Consensus        85 ~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~  163 (265)
                      ++|..+|.||+||++||++ +++|+.+|..+.|+.++|||++|.||+.+|........+.++++++++.++|..+++++|
T Consensus       545 d~L~~lS~Hy~tge~l~eEl~~kl~~~r~~~~gl~tlrqL~~a~~D~~lht~~d~~~~~~~~~l~~~i~~~p~~~~d~~p  624 (718)
T KOG2089|consen  545 DTLRSLSKHYKTGEPLPEELLKKLILTRTVNAGLFTLRQLVLADFDLELHTKTDADLADTYRQLCQEISIVPATPGDNMP  624 (718)
T ss_pred             hHhhhhcccccCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHhhccccchHHHHHHhchhheecCCCCCCCCC
Confidence            9999999999999999999 999999999999999999999999999999988666678899999999999999999999


Q ss_pred             ccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHH
Q psy12854        164 CRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSS  242 (265)
Q Consensus       164 ~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~  242 (265)
                      |+|+|||++||+|+||+|+||+|+|+|||.. |+++|.    +|              ..++|+|||++||++||+.+++
T Consensus       625 csF~hifa~gy~A~yY~yLWsEv~aaDif~t~fe~~g~----~N--------------~~~~G~ryR~tiLa~GG~~~~~  686 (718)
T KOG2089|consen  625 CSFGHIFAGGYAAGYYSYLWSEVLAADIFSTFFEQEGE----DN--------------IKEVGMRYRNTILAPGGGKDPM  686 (718)
T ss_pred             ccccchhcCchHHHHHHHHHHHHHHHHHHHHHhhhcCC----cc--------------HHHHHHHHHHhhhcCCCCccHH
Confidence            9999999989999999999999999999986 665665    45              2399999999999999999999


Q ss_pred             HHHHHhhCCCCCchHHHHhcCC
Q psy12854        243 EVFRRFRGRDPCFKPFLDMFRL  264 (265)
Q Consensus       243 e~l~~flGr~ps~~a~~~~~g~  264 (265)
                      |+|++|+||+||.+|||+++||
T Consensus       687 e~f~~FlGRePS~~Afl~s~gl  708 (718)
T KOG2089|consen  687 EVFKRFLGREPSQEAFLKSLGL  708 (718)
T ss_pred             HHHHHhhCCCCChhHHHHhhcc
Confidence            9999999999999999999987


No 5  
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=100.00  E-value=1.7e-64  Score=480.83  Aligned_cols=238  Identities=36%  Similarity=0.548  Sum_probs=222.6

Q ss_pred             CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854          7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV   86 (265)
Q Consensus         7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~   86 (265)
                      +.++.|+++++|||++|.+++|+||+++||.|||||||||||++|++++|+.++|+ ++++||||+|||+||+|+|||++
T Consensus       180 ~~~~~P~~~l~~nf~~~~~~~p~lL~~~~v~tLfHEfGHalH~~ls~~~~~~l~~~-~~~~d~~E~pS~~~E~~~~d~~v  258 (422)
T cd06456         180 GLGQKPVAYLVCNFTKPAGGKPALLTHDEVTTLFHEFGHALHHLLTDVEYPSLGGT-NVEWDFVELPSQFMENWAWEPEV  258 (422)
T ss_pred             CCCCCCEEEEECCCCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHhcCCccccCCC-cCchhHhhccHHHHHHHhcCHHH
Confidence            56789999999999999999999999999999999999999999999999999887 49999999999999999999999


Q ss_pred             HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC----CcHHHHHHHHccccCCCCCCCCCC
Q psy12854         87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK----PHWSEISRELYPLHFGFPIDKYSN  161 (265)
Q Consensus        87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~----~~~~~l~~~l~~~~~~~~~~~~~~  161 (265)
                      |+.+++|++|+++||++ +++++++++++.++.+++|+.+|.||+.+|...+    +.+.++++++.+++.+++..++++
T Consensus       259 L~~~s~h~~t~~~lp~~l~~~~~~~~~~~~~~~~~~ql~~a~fD~~lh~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  338 (422)
T cd06456         259 LKLFAKHYETGEPLPDELIDKLLAARNFNSGFATVRQLEFALLDLALHSLTDPEILDVVQFELDALRKEGLVIPPPPPRY  338 (422)
T ss_pred             HHHHHhhcCCCCCCCHHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHhcCCcccccchHHHHHHHHHHhcCCCCCCCCCC
Confidence            99999999999999999 9999999999999999999999999999998753    246778888887777777667889


Q ss_pred             cCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccH
Q psy12854        162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHS  241 (265)
Q Consensus       162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~  241 (265)
                      ++++|+|||+++|+|+||||+||+++|+|+|++|++++.    +|               +++|++||++||++||+++|
T Consensus       339 ~~~~f~Hi~~~gY~A~YYsYlws~vla~di~~~f~~~~~----~~---------------~~~G~~~r~~iL~~Ggs~~~  399 (422)
T cd06456         339 FSNYFSHIFSGGYAAGYYSYKWAEVLDADAFSAFEEEGI----FN---------------RETGRRFRDTILSKGGSRDP  399 (422)
T ss_pred             CCCccCcCCCCCcchhhHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhhcCCCcCH
Confidence            999999999879999999999999999999999888664    55               99999999999999999999


Q ss_pred             HHHHHHhhCCCCCchHHHHhcCC
Q psy12854        242 SEVFRRFRGRDPCFKPFLDMFRL  264 (265)
Q Consensus       242 ~e~l~~flGr~ps~~a~~~~~g~  264 (265)
                      .+++++||||+||++||++.+|+
T Consensus       400 ~e~~~~F~Gr~p~~~a~l~~~gl  422 (422)
T cd06456         400 MELFRAFRGRDPSIEALLRRRGL  422 (422)
T ss_pred             HHHHHHhcCCCCChHHHHHHcCC
Confidence            99999999999999999999986


No 6  
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=100.00  E-value=3.4e-62  Score=469.67  Aligned_cols=231  Identities=27%  Similarity=0.409  Sum_probs=213.8

Q ss_pred             CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854          7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV   86 (265)
Q Consensus         7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~   86 (265)
                      |.+|+|+++|+|||++|++++|+||+|+||.||||||||+||++|++++|+.++||+ +|+||||+||||||+|+|++++
T Consensus       220 g~~q~Pv~~lvcnf~~p~~~~p~lL~~~~v~TLfHEfGHalH~~ls~~~~~~~sgt~-~~~d~vE~pS~~~E~~~~~~~~  298 (458)
T cd06457         220 GTYQLPVVALMCNFPPPSPSGPTLLSPHEVETLFHEMGHAMHSMLGRTEYQHVSGTR-CATDFVEVPSILMEYFASDPRV  298 (458)
T ss_pred             CceeCCeEEEECCCCCCCCCCCCCcCHHHHHHHHHHHhHHHHHHHcCCCccccCCCC-CCcchhhcCHHHHHHHHhhHHH
Confidence            677999999999999999999999999999999999999999999999999999985 9999999999999999999999


Q ss_pred             HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC---cHHHHHHHHccccCCCCCCCC-CC
Q psy12854         87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP---HWSEISRELYPLHFGFPIDKY-SN  161 (265)
Q Consensus        87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~---~~~~l~~~l~~~~~~~~~~~~-~~  161 (265)
                      |+.+++|++|++++|++ +++++++++++.++.++||+.+|.||+.+|...+.   .+.++++++.+++.+++..++ ++
T Consensus       299 L~~~a~h~~t~e~ip~~l~~~l~~~~~~~~~~~~~rq~~~a~fD~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  378 (458)
T cd06457         299 LKLFARHYSTGEPLPEEMLARLLASKNSFAALETQQQIVYALLDQELHGEQPLSPTFTSDVLRDSTEIFYGLPYVPGGTA  378 (458)
T ss_pred             HHHHhcccCCCCcCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCcccccCHHHHHHHHHHHhcCCCCCCCCCC
Confidence            99999999999999999 99999999999999999999999999999976532   467999999999888776665 78


Q ss_pred             cCccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCcc
Q psy12854        162 LPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCH  240 (265)
Q Consensus       162 ~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~  240 (265)
                      ++++|+||+  +|+|+||||+||+++|+|+|++ |++++     +|               +++|++||++||++|||++
T Consensus       379 ~~~~f~Hl~--gy~a~YYsYl~a~vla~di~~~~f~~~~-----~n---------------~~~g~~y~~~iL~~Ggs~~  436 (458)
T cd06457         379 WQLRFGHLV--GYGATYYSYLFDRAIASKIWQKLFAADP-----LS---------------REAGERLREELLKHGGGKD  436 (458)
T ss_pred             Ccccccccc--CccccchHHHHHHHHHHHHHHHHHHhCC-----CC---------------HHHHHHHHHHHccCCCCcC
Confidence            899999998  6999999999999999999975 66654     45               8999999999999999999


Q ss_pred             HHHHHHHhhCCCCCchHHHH
Q psy12854        241 SSEVFRRFRGRDPCFKPFLD  260 (265)
Q Consensus       241 ~~e~l~~flGr~ps~~a~~~  260 (265)
                      |.+++++||||+|+.+....
T Consensus       437 p~e~l~~flGrdp~~~~~~~  456 (458)
T cd06457         437 PWELLAGVLGKPPLVKGGAG  456 (458)
T ss_pred             HHHHHHHHcCCCCCCCcccc
Confidence            99999999999999987643


No 7  
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=100.00  E-value=3.3e-60  Score=457.51  Aligned_cols=235  Identities=34%  Similarity=0.560  Sum_probs=219.0

Q ss_pred             CccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHH
Q psy12854          7 KTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERV   86 (265)
Q Consensus         7 ~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~   86 (265)
                      |.+|+|+++|+|||++|.+++|+||+++||.||||||||+||++|++++++.++|++ +|+||||+|||+||+|+|+|++
T Consensus       235 g~~~~P~~~i~~Nf~~~~~~~p~ll~~~~V~TLfHEfGHalH~~ls~~~~~~~sg~~-~~~d~aE~pS~~~E~~~~~~~~  313 (472)
T cd06455         235 GSRQYPVAALVCNFPKPTADKPSLLRHDEVETFFHEFGHVIHHLLGRTKYARFSGTR-VERDFVEAPSQMLENWCWEPEV  313 (472)
T ss_pred             CCEeCCEEEEECcCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhcCCCccccCCCc-CChhhhhcchHHHHHHhcCHHH
Confidence            456789999999999999999999999999999999999999999999999999985 9999999999999999999999


Q ss_pred             HHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC--cHHHHHHHHccccCCCCCCCCCCcC
Q psy12854         87 FAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP--HWSEISRELYPLHFGFPIDKYSNLP  163 (265)
Q Consensus        87 L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~--~~~~l~~~l~~~~~~~~~~~~~~~~  163 (265)
                      |+.+++|++|+++||++ +++++++++++.++.+++|+.+|.||+.+|...+.  .+.+++.++++++.+.+..+++++|
T Consensus       314 l~~l~~h~~t~e~i~~~li~~~~~~~~~~~~~~~~~q~~~a~fd~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  393 (472)
T cd06455         314 LKRLSKHYKTGEKIPDELIERLIASRHFNRGLFYLRQLFFALFDLALHTGDPADLDTTKLYNDLREEISLIPSTEGTHGP  393 (472)
T ss_pred             HHHHhhccCCCCCCCHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHhCCCCCCCCCCcc
Confidence            99999999999999999 99999999999999999999999999999987653  6899999999999887666667899


Q ss_pred             ccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHH
Q psy12854        164 CRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSE  243 (265)
Q Consensus       164 ~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e  243 (265)
                      ++|+|++ ++|+|+||+|+||+++|+++|++|++++.    +|               +++|++||++||++|||++|.+
T Consensus       394 ~~f~H~~-~~Y~a~yY~Y~~a~~la~~~~~~~~~~~~----~~---------------~~~g~~y~~~~L~~Ggs~~p~e  453 (472)
T cd06455         394 ASFGHLA-GGYDAGYYGYLWSEVFAADMFSSFFKDGL----LN---------------PEVGLRYRDTVLAPGGSKDAAD  453 (472)
T ss_pred             ccccccc-cCcccchHHHHHHHHHHHHHHHHHHhcCC----CC---------------HHHHHHHHHHHhcCcCCcCHHH
Confidence            9999999 46899999999999999999999877664    45               8999999989999999999999


Q ss_pred             HHHHhhCCCCCchHHHHhc
Q psy12854        244 VFRRFRGRDPCFKPFLDMF  262 (265)
Q Consensus       244 ~l~~flGr~ps~~a~~~~~  262 (265)
                      ++++||||+|+.+||++++
T Consensus       454 ll~~flGr~~~~~a~~~~~  472 (472)
T cd06455         454 MLKDFLGREPNNDAFLKSL  472 (472)
T ss_pred             HHHHHhCCCCChhHHhhcC
Confidence            9999999999999999975


No 8  
>KOG2090|consensus
Probab=100.00  E-value=6.7e-56  Score=421.17  Aligned_cols=233  Identities=24%  Similarity=0.366  Sum_probs=219.2

Q ss_pred             CCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHH
Q psy12854          6 TKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEER   85 (265)
Q Consensus         6 ~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~   85 (265)
                      +|.+|+||.+|+|||..+....|++|++.+|+|||||||||||++|++|+|++++||| ||.||||+||.+||.|++|+.
T Consensus       450 Dg~yQlPVi~L~cnf~rss~~s~t~L~~~~vetLFHEmGHAMHSmLGrT~YQhvtGTR-c~tDfaEiPSiLMEyFa~D~r  528 (704)
T KOG2090|consen  450 DGTYQLPVIVLVCNFVRSSQSSPTFLSLSEVETLFHEMGHAMHSMLGRTHYQHVTGTR-CPTDFAEIPSILMEYFANDYR  528 (704)
T ss_pred             CCCeeceeeEEeecccccccCCCcccCHHHHHHHHHHHHHHHHHHhccchhccccCcc-cchhHhhhhHHHHHHHhcchH
Confidence            4677999999999999999999999999999999999999999999999999999997 999999999999999999999


Q ss_pred             HHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCC-C---CcHHHHHHHHccccCCCCCCCCC
Q psy12854         86 VFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGP-K---PHWSEISRELYPLHFGFPIDKYS  160 (265)
Q Consensus        86 ~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~-~---~~~~~l~~~l~~~~~~~~~~~~~  160 (265)
                      +|..+++||.|++++|.+ +.++..+++.+.+.++.+|+++|++|+.+|... +   ....+++.++.+++.+++..+++
T Consensus       529 Vl~~~aRhy~t~e~l~~~mv~~l~~s~n~~Aa~e~q~Qv~ya~~Dq~fhg~~~~~~~~~~~~~~~~v~~k~~~~~~~~~t  608 (704)
T KOG2090|consen  529 VLRFFARHYSTGEPLPEDMVNRLCESRNSFAAQETQRQVFYALLDQEFHGIACPLIAEDTTDLLSEVKRKFSGLLYVPPT  608 (704)
T ss_pred             HHHHHHHHcCCCCCCCHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHhcccccccccchhHHHHHHHHhcCCCCCCCCC
Confidence            999999999999999999 999999999999999999999999999999942 2   24678999999999998888899


Q ss_pred             CcCccccccCCCCccCCchhhHhHHHHHHHHHHH-HHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCc
Q psy12854        161 NLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYA-FKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSC  239 (265)
Q Consensus       161 ~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~  239 (265)
                      .|+.+|+|++  +|+|.||||++++++|.-||++ |.+||+     +               +++|++||++||+.||++
T Consensus       609 aw~~rFsHl~--gYGA~YYSYL~~r~~AS~IWq~~Fe~dPf-----s---------------R~aGek~r~eil~hGG~~  666 (704)
T KOG2090|consen  609 AWQLRFSHLV--GYGATYYSYLFARAIASLIWQQLFENDPF-----S---------------RKAGEKFRKEILKHGGGR  666 (704)
T ss_pred             ccccchhhhh--ccCchHHHHHHHHHHHHHHHHHHHhcCcc-----c---------------hhhhHHHHHHHHHhcCCC
Confidence            9999999998  7999999999999999999986 888885     5               999999999999999999


Q ss_pred             cHHHHHHHhhCCCCCchHHHHh
Q psy12854        240 HSSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       240 ~~~e~l~~flGr~ps~~a~~~~  261 (265)
                      +|+++++++||++|..+.+.+.
T Consensus       667 ~P~~lva~~L~~~~~~~g~~~A  688 (704)
T KOG2090|consen  667 DPAELVADILGKPPLENGGVDA  688 (704)
T ss_pred             ChHHHHHHHhcCCCCccchHHH
Confidence            9999999999999998776653


No 9  
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=100.00  E-value=5.6e-56  Score=426.16  Aligned_cols=239  Identities=33%  Similarity=0.531  Sum_probs=216.5

Q ss_pred             CCccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHH
Q psy12854          6 TKTVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEER   85 (265)
Q Consensus         6 ~~~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~   85 (265)
                      +|++|+|+++|+|||++|.+++|+||+++||.|||||||||||++|++++++.++|+ ++|+|+||+||+|||+|+|+++
T Consensus       213 ~~~~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tLfHE~GHa~H~~ls~~~~~~~sg~-~~~~d~aE~~S~~~E~~~~~~~  291 (458)
T PF01432_consen  213 DGERQLPVPYIFCNFTGPSAGKPSLLSHDDVETLFHEFGHAMHSLLSRTKYQHLSGT-RVPMDFAEFPSQFMENWLWDPL  291 (458)
T ss_dssp             TSTCECEEEEEEEEE-S-BTTC--B-SHHHHHHHHHHHHHHHHHHHCCCSSGGGSTT-SS-CHHCHHHHHHHHHHGGCHH
T ss_pred             cccCCCCceEEEecCCCCCCCCCCccChhhHHHHHHHHhHHHHHHHhccccccccCC-chhHHHHhcchHHHHHhhhchh
Confidence            578999999999999999999999999999999999999999999999999999998 5999999999999999999999


Q ss_pred             HHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC------cHHHHHHHHccccCCCCCCC
Q psy12854         86 VFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP------HWSEISRELYPLHFGFPIDK  158 (265)
Q Consensus        86 ~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~------~~~~l~~~l~~~~~~~~~~~  158 (265)
                      +++.+++|++++++||++ +++++++++.+.+..+.+|+.++.||+.+|...+.      .+.++++++++++.+.+..+
T Consensus       292 ~l~~~~~~~~~~~~i~~~~l~~~~~~~~~~~~~~~~~ql~~a~fd~~~h~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~  371 (458)
T PF01432_consen  292 VLKAFSRHYETGEPIPEELLEDLIASRNFFAAIFLFRQLLFALFDQELHESPEDGEPESEDLNELYRELQKEYYGDPSDP  371 (458)
T ss_dssp             HHHHH-BSTTTHHHHHHHHHHHHHHTTTTTHHHHHHHHHHHHHHHHHHCCHCCCTHHHHHHHHHHHHHHHHHCCTTTCBT
T ss_pred             hhhhhccChhhhhhHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHhccccCCccchhHHHHHHHHHHHhcCcchhc
Confidence            999999999999999999 99999999999999999999999999999998763      36799999999999888778


Q ss_pred             CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCC
Q psy12854        159 YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGS  238 (265)
Q Consensus       159 ~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs  238 (265)
                      +..++++|.|+++++|+++||+|+||+++|+++|++|++++.    +|               +++|++||+++|++|||
T Consensus       372 ~~~~~~~f~hl~~~~Y~a~yY~Y~ya~~~a~~~~~~~~~~~~----~n---------------~~~g~~~~~~~L~~Ggs  432 (458)
T PF01432_consen  372 DDYFPASFSHLFSHFYAAGYYSYLYAEVLAADLFSQFFEEDP----LN---------------RETGRRFYKEFLSPGGS  432 (458)
T ss_dssp             TCCGGGGGGGGHCCCGTTTTTCHHHHHHHHHHHHHHHHHCHT----TC---------------HHHHHHHHHHHCTTTTS
T ss_pred             cccccceehhhccCccCCCCchhHHHHHHHHHHHHHHHhcCC----cc---------------hHHHHHHHHHHhcCCCC
Confidence            888999999997678999999999999999999999877543    45               89999999999999999


Q ss_pred             ccHHHHHHHhhCCCCCchHHHHhcCC
Q psy12854        239 CHSSEVFRRFRGRDPCFKPFLDMFRL  264 (265)
Q Consensus       239 ~~~~e~l~~flGr~ps~~a~~~~~g~  264 (265)
                      ++|.+++++|+||+|+.++|++.+|+
T Consensus       433 ~~~~e~l~~~~g~~~~~~~~~~~~~~  458 (458)
T PF01432_consen  433 KDPLELLKKFLGREPSPDAFLKALGL  458 (458)
T ss_dssp             S-HHHHHHHCCSSTTSSHHHHHHHT-
T ss_pred             CCHHHHHHHhCCCCCChHHHHHHcCC
Confidence            99999999999999999999999986


No 10 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=100.00  E-value=2.5e-49  Score=369.68  Aligned_cols=234  Identities=24%  Similarity=0.329  Sum_probs=211.5

Q ss_pred             cccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhccHHHHH
Q psy12854          9 VITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFA   88 (265)
Q Consensus         9 ~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~   88 (265)
                      ++.|+++|+|||.+|.+++|+|+++++|.|||||||||+|+++++++++.++|+ ++++||+|+||++||+|++++++|+
T Consensus       128 ~~~~~~~i~~n~~~~~~~~~~ll~~~~v~tl~HE~GHa~h~~l~~~~~~~~~g~-~~~~~~~E~~S~~~E~~~~~~~~L~  206 (365)
T cd06258         128 RQDKDVRILANFTSPAAPDPVLLGHDDINTLFHEFGHAVHFLLIQQRYPFQERT-PTSTDFAEAQSMFLESFATDPEWLE  206 (365)
T ss_pred             CCCCeEEEEccCCCCCCCCCCcCCHHHHHHHHHHHhHHHHHHHhcCCCCcCCCC-CCCccHHhccHHHHHHHHCCHHHHH
Confidence            367999999999999999999999999999999999999999999998888887 5999999999999999999999999


Q ss_pred             HHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCCC--cHHHHHHHHccccCCCCCCCCCCcCcc
Q psy12854         89 ELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPKP--HWSEISRELYPLHFGFPIDKYSNLPCR  165 (265)
Q Consensus        89 ~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~~--~~~~l~~~l~~~~~~~~~~~~~~~~~~  165 (265)
                      .++.|++++ ++|.+ ++++++.+....+..+.+|+.++.||+.+|.....  .+.++|.++.+++.+++..+.+.++++
T Consensus       207 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~l~~~~~g~~~~~~~~~~~~  285 (365)
T cd06258         207 RYARHYQGG-VVPDELIEKLIAARLPNTLYETRRILVVAKFEKALYENPDRELELQKLWRDLVKEILGVRPDPSTPDPAA  285 (365)
T ss_pred             HHhhhcCCC-CCcHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHhCCCCccCCHHHHHHHHHHHhcCCCCCCCCCCCCc
Confidence            999999988 89999 99999998888889999999999999999986543  488999999999888766666788889


Q ss_pred             ccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHH
Q psy12854        166 FVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVF  245 (265)
Q Consensus       166 f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l  245 (265)
                      |.|+++ ||+++||+|+||+++|+++|++|+++..                +...++++|++||++||++||++++.|++
T Consensus       286 ~~H~~~-gy~~yyy~Y~~~~v~a~qi~~~~~~~~~----------------~~~~~~~~g~~l~~~il~~G~s~~~~el~  348 (365)
T cd06258         286 FPHLAG-GSPAYYYGYLLAEMLASQLRATFKKKVG----------------YLTDNPEAGPRLREHILRPGNSEPWKELL  348 (365)
T ss_pred             cchhcc-CcccchHHHHHHHHHHHHHHHHHHhcCC----------------CCCCCHHHHHHHHHHHccCcCCcCHHHHH
Confidence            999985 4999999999999999999999877531                00123899999998899999999999999


Q ss_pred             HHhhCCCCCchHHHHh
Q psy12854        246 RRFRGRDPCFKPFLDM  261 (265)
Q Consensus       246 ~~flGr~ps~~a~~~~  261 (265)
                      ++|+||+||.++|++.
T Consensus       349 ~~~~G~~~~~~a~~~~  364 (365)
T cd06258         349 KRATGEDPNADAFLDH  364 (365)
T ss_pred             HHHcCCCCChHHHHhc
Confidence            9999999999999985


No 11 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=100.00  E-value=2.4e-33  Score=277.33  Aligned_cols=208  Identities=16%  Similarity=0.141  Sum_probs=164.6

Q ss_pred             CCCCcccc--ceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHh
Q psy12854          4 FGTKTVIT--PLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFL   81 (265)
Q Consensus         4 ~g~~~~~~--p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~   81 (265)
                      +|++|...  +..+|+|||++         +++||.||||||||+||+++++..++..++  ++|+++||+||++||+|+
T Consensus       354 ~Ga~~~~~~~~~p~il~N~~~---------~~~dv~TLaHElGHa~H~~~~~~~~~~~~~--~~~~~~aE~aS~~~E~l~  422 (591)
T TIGR00181       354 SGAYSIGGYKVKPYILMNWDG---------TLNSVFTLAHELGHSMHSYFSSKHQPYPNS--DYSIFYAEIASTFNELLL  422 (591)
T ss_pred             CCcccCCCCCCCCeEEEecCC---------CcchHHHHHHHhhhHHHHHHHccCCCCccC--CCCchhhhHhHHHHHHHH
Confidence            46665444  56899999999         999999999999999999999866665555  589999999999999999


Q ss_pred             ccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccCC
Q psy12854         82 YEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHFG  153 (265)
Q Consensus        82 ~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~~  153 (265)
                      +++     +..|+ ++++.+.. +++.+.  +++.  .+++|+.++.||+.+|...+       +.++++|.++.+++.|
T Consensus       423 ~~~-----l~~~~-~~~~~k~~~l~~~l~--~~~~--~~~~q~~~~~Fe~~~~~~~~~~~~lt~~~l~~~~~~~~~~~~g  492 (591)
T TIGR00181       423 ADY-----LLKNS-NDPEMKIYILLERIS--NFFG--TFTRQTLFAEFEYEAYELIEEGEPLTAETLNEIYANLLKKYFG  492 (591)
T ss_pred             HHH-----HHhhC-CCHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHcC
Confidence            987     55666 55667777 777776  2332  67899999999999996421       2467899999888754


Q ss_pred             C--CCCCC-CCcCccccccCCCCccCCchhhHhHHHHHHHH--HHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHH
Q psy12854        154 F--PIDKY-SNLPCRFVEVGSGDLAAGYYSFLWSKLVSADI--FYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRL  228 (265)
Q Consensus       154 ~--~~~~~-~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv--~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~y  228 (265)
                      .  ..... ..+|.+++|+    |.+ ||+|.||.++++++  |+++++++                      +.++++|
T Consensus       493 ~~~~~~~~~~~~w~~~~Hf----y~~-fY~Y~Ya~g~~~a~~l~~~~~~~~----------------------~~~~~~Y  545 (591)
T TIGR00181       493 DLVKIDEGAGLTWMRIPHF----YMG-FYVYKYATGQVAATALYEKIKEEG----------------------KGAVEKY  545 (591)
T ss_pred             CccccCCcccceeeecCcc----cCC-CcCHHHHHHHHHHHHHHHHHHHCC----------------------ccHHHHH
Confidence            2  22222 4567789997    665 99999988655555  88898876                      3589999


Q ss_pred             HHHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854        229 RDTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       229 R~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~  261 (265)
                      + ++|++|||++|.++++. +|-+++..++++.
T Consensus       546 ~-~~L~~Ggs~~p~ell~~-~Gvd~~~~~~~~~  576 (591)
T TIGR00181       546 L-KFLKSGGSKYPLETLKI-AGVDLTKPQPWQA  576 (591)
T ss_pred             H-HHHhCcCCCCHHHHHHH-cCcCCCChHHHHH
Confidence            7 99999999999999999 5999999888764


No 12 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=100.00  E-value=4.4e-33  Score=275.33  Aligned_cols=210  Identities=13%  Similarity=0.152  Sum_probs=158.6

Q ss_pred             CCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854          4 FGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF   80 (265)
Q Consensus         4 ~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~   80 (265)
                      +|++|..+|.   .+|+|||++         ++++|.||||||||++|++++++....  ++ ++|+|+||+||+|||+|
T Consensus       350 ~Ga~~~~~~~~~~p~i~~N~~~---------~~~~v~TL~HE~GHa~H~~ls~~~~~~--~~-~~~~~~aE~~S~~~E~l  417 (587)
T TIGR02290       350 GGAFCTGFPPSKEPRVLMNYDG---------SRRDVSTLAHELGHAYHSELAKDQPLL--NA-RYPMTLAETASIFAEML  417 (587)
T ss_pred             CCcccCCCCCCCCCEEEEecCC---------CchhHHHHHHHhhHHHHHHHHccCCcc--cC-CCCchhhhHHHHHHHHH
Confidence            4666655432   579999999         899999999999999999999874432  32 48999999999999999


Q ss_pred             hccHHHHHHHHhhhcCCcccHHHHHHHHHHHhhhhhHH-HHHHHHHHHHHHHHhcCC--C----CcHHHHHHHHccccCC
Q psy12854         81 LYEERVFAELNSHFATGEKLKMTDEQLKALRAHNAGID-VCSELFKANLDLQLHNGP--K----PHWSEISRELYPLHFG  153 (265)
Q Consensus        81 ~~~~~~L~~ls~h~~t~~~lp~~~~~l~~~~~~~~~~~-~~~ql~~a~~D~~lh~~~--~----~~~~~l~~~l~~~~~~  153 (265)
                      +|++ .|+.+    .+    ++....++.. ....++. +++|+.++.||+.+|...  +    ..+.+++.++.+++.|
T Consensus       418 ~~~~-ll~~~----~~----~~~~~~~~~~-~l~~~~~~~~~q~~~~~fE~~l~~~~~~~~lt~~~l~~~~~~~~~~~~g  487 (587)
T TIGR02290       418 LFDA-LLKEA----KT----DEEKLSLLAE-KLEDAIATLVRIHARFLFERRFHEARKEGELSADDICDLMLEAQKESYG  487 (587)
T ss_pred             HHHH-HHhhC----CC----HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHhC
Confidence            9997 34432    21    1113334433 4455553 689999999999999642  1    2367888888877644


Q ss_pred             --CCCCC-CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHH
Q psy12854        154 --FPIDK-YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRD  230 (265)
Q Consensus       154 --~~~~~-~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~  230 (265)
                        ++..+ ...+|.+++|++..+|  .||.|+||.++|.++|.+|++++                      +.+.++|+ 
T Consensus       488 ~~~~~~~~~~~~w~~~~H~y~~~f--Y~y~Y~~g~~~a~~l~~~~~~~~----------------------~~~~~~y~-  542 (587)
T TIGR02290       488 DALDESELHPYMWAYKPHFYHAPF--YNYPYTFGYLFVLGLYAKYREEG----------------------ESFVPKYI-  542 (587)
T ss_pred             cccccCCCCCceeecCCeecCCCC--cChhhHHHHHHHHHHHHHHHHCC----------------------hhHHHHHH-
Confidence              23222 3567788999953222  23688899999999999998876                      35778895 


Q ss_pred             HHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854        231 TFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       231 ~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~  261 (265)
                      +||+.||+++|.+++++ +|++|+.++|++.
T Consensus       543 ~~L~~Ggs~~p~ell~~-~G~d~~~~~~~~~  572 (587)
T TIGR02290       543 ALLRDTGSMTPEELVKK-FGFDLTSPDFWQK  572 (587)
T ss_pred             HHHHCcCCCCHHHHHHH-hCcCCCChHHHHH
Confidence            99999999999999999 8999999999974


No 13 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=100.00  E-value=1.3e-32  Score=261.03  Aligned_cols=211  Identities=18%  Similarity=0.193  Sum_probs=162.4

Q ss_pred             CCCCccccc---eEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854          4 FGTKTVITP---LNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF   80 (265)
Q Consensus         4 ~g~~~~~~p---~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~   80 (265)
                      +|++|...+   .++|+|||.+         ++++|.|||||||||+|.++++..++.+.+  ++|+|++|+||++||+|
T Consensus       197 ~gaf~~~~~~~~~p~i~~n~~~---------~~~~v~tl~HE~GHa~h~~~~~~~~~~~~~--~~~~~~~E~~S~~~E~~  265 (427)
T cd06459         197 SGAYCTGLPPGKHPFILMNFNG---------TLDDVFTLAHELGHAFHSYLSRDNQPYLYS--DYPIFLAEIASTFNELL  265 (427)
T ss_pred             CCeecCCCCCCCCCeEEecCCC---------ChhhHHHHHHHhhHHHHHHHHccCCCcccC--CCCchhhHHHHHHHHHH
Confidence            355555443   4689999998         899999999999999999999876553332  48999999999999999


Q ss_pred             hccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccC
Q psy12854         81 LYEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHF  152 (265)
Q Consensus        81 ~~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~  152 (265)
                      ++++     +..|+++.+ .+.. +..++..   .. ..+.+|+.++.||+.+|...+       +...++|+++.+++.
T Consensus       266 ~~~~-----l~~~~~~~~-~~~~~~~~~~~~---~~-~~~~~~~~~~~fe~~l~~~~~~~~~~~~~~~~~~~~~~~~~y~  335 (427)
T cd06459         266 LFDY-----LLKFAKDPE-EKLYLLEHLLED---IR-ATLPRQTMFAEFEHEVYENPEEGEPLTAEELNEIYRELEKKYG  335 (427)
T ss_pred             HHHH-----HHHhCCCHH-HHHHHHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhc
Confidence            9985     456666531 2223 3333321   11 267899999999999997421       246788999988887


Q ss_pred             CCCC---CCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHH
Q psy12854        153 GFPI---DKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLR  229 (265)
Q Consensus       153 ~~~~---~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR  229 (265)
                      +...   ......|.+|.|++.+++  +||+|+||.++|.++|++|++++                      +.++++|+
T Consensus       336 g~~~~~~~~~~~~w~~~~H~~~~~f--Yyy~Y~~g~~~a~~l~~~~~~~~----------------------~~~~~~y~  391 (427)
T cd06459         336 GDLVEIDEEHGYEWARIPHFYYVPF--YVYPYAFGQLAALALYAKYKEDG----------------------EGAVEKYL  391 (427)
T ss_pred             CCccccCCccCeeeeecCeecCCCC--cChHHHHHHHHHHHHHHHHHHcC----------------------ccHHHHHH
Confidence            7532   233445668999975322  47899999999999999999876                      46899995


Q ss_pred             HHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854        230 DTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       230 ~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~  261 (265)
                       ++|+.||+++|.++++.+ |.+++...+++.
T Consensus       392 -~~L~~Ggs~~~~ell~~~-g~d~~~~~~~~~  421 (427)
T cd06459         392 -ELLKAGGSKSPLELLKKA-GVDLTSPDFWEE  421 (427)
T ss_pred             -HHHHccCCCCHHHHHHHc-CcCCCChHHHHH
Confidence             999999999999999975 999999988875


No 14 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=99.97  E-value=2e-29  Score=247.62  Aligned_cols=204  Identities=12%  Similarity=0.077  Sum_probs=153.8

Q ss_pred             CCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHH
Q psy12854          4 FGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHF   80 (265)
Q Consensus         4 ~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~   80 (265)
                      +|++|..+|.   .+|.|||++         +++||.||+||||||+|+++++....  ...+..|+|+||+||++||+|
T Consensus       312 ~Gayc~~~~~~~~P~I~~Nf~~---------t~~dv~TL~HElGHa~H~~~s~~~~~--~~~~~~~~~~aE~aS~~~E~l  380 (549)
T TIGR02289       312 AGGYCTYLPKYKAPFIFSNFNG---------TSGDIDVLTHEAGHAFHVYESRKDLL--PEYRWPTYEAAELASMSMELL  380 (549)
T ss_pred             CCcccCCCCCCCCcEEEEeCCC---------ChhHHHHHHHHhhHHHHHHHhcCCcc--cccccCcchhhhhhHHHHHHH
Confidence            5677766553   699999999         99999999999999999999986322  222346899999999999999


Q ss_pred             hccHHHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC---CcHHHHHHHHccccCC-CC
Q psy12854         81 LYEERVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK---PHWSEISRELYPLHFG-FP  155 (265)
Q Consensus        81 ~~~~~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~---~~~~~l~~~l~~~~~~-~~  155 (265)
                      +|++     +..+++++++.+.. ++.|.....     .+.+|+....|...+|...+   +...++|.++.+++.+ ..
T Consensus       381 ~~~~-----l~~~~~~~e~~~~~~~~~L~~~~~-----~~~~~~~~d~fe~~vy~~~~lt~~e~~~~~~~l~~~y~~~~~  450 (549)
T TIGR02289       381 SMPW-----WDLFYTDEKDLKKAKISHLKGALS-----FLPYGVIVDHFQHWVYENPNHTPEERDEKYAELEKRYQPSTV  450 (549)
T ss_pred             HHHH-----HHHHcCCcchHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHccCCCCCHHHHHHHHHHHHHHhCCCcc
Confidence            9985     66778877777777 766654422     12466677777777776532   2467888888887653 11


Q ss_pred             --C---CCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHH
Q psy12854        156 --I---DKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRD  230 (265)
Q Consensus       156 --~---~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~  230 (265)
                        .   .....+|.+++|++.  +...||+|.||+++|.++|.++++++                      +.+.++| .
T Consensus       451 ~~~~~~~~~~~~W~~~~H~~~--~pFYyy~Y~~a~~~a~~l~~~~~~~~----------------------~~~~~~Y-~  505 (549)
T TIGR02289       451 YIAGLELEIGTFWLRQLHIFE--VPFYYIEYTIAQIGALQIYKIYKEDP----------------------EKALKDY-K  505 (549)
T ss_pred             cCCccccccCCcceeeceeec--CCCcchhhHHHHHHHHHHHHHHHhCH----------------------HHHHHHH-H
Confidence              1   122345678999973  22235799999999999999998876                      4788999 6


Q ss_pred             HHhcCCCCccHHHHHHHhhCCCCC
Q psy12854        231 TFLTFGGSCHSSEVFRRFRGRDPC  254 (265)
Q Consensus       231 ~iL~~Ggs~~~~e~l~~flGr~ps  254 (265)
                      ++|+.|||++|.++++.+ |-+..
T Consensus       506 ~~L~~Ggs~~~~ell~~a-Gid~~  528 (549)
T TIGR02289       506 KLCSAGGSQSFLELYETA-GLTFP  528 (549)
T ss_pred             HHHhccCCcCHHHHHHHh-CCCCC
Confidence            999999999999999995 75544


No 15 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=99.93  E-value=2.5e-25  Score=219.86  Aligned_cols=212  Identities=14%  Similarity=0.155  Sum_probs=157.5

Q ss_pred             cCCCCccccce---EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHH
Q psy12854          3 IFGTKTVITPL---NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTH   79 (265)
Q Consensus         3 ~~g~~~~~~p~---~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~   79 (265)
                      -+|++|...+.   .+|.+||.+         +.+||.||+||+||++|+.+++.+.+.+..  ..|+.+||+||++||.
T Consensus       354 rsGaYs~~~~~~~~p~IlmN~~g---------t~~dV~TLaHElGHs~Hs~~s~~~qp~~~~--~~~i~~AEiAS~fnE~  422 (598)
T COG1164         354 RSGAYSIGFYKGDHPFILMNYDG---------TLRDVFTLAHELGHSVHSYFSRKHQPYLYA--DYSIFLAEIASTFNEM  422 (598)
T ss_pred             CCCcccCCCCCCCCCeEEEeCCC---------chhHHHHHHHHccHHHHHHHHhccCCcccc--CCchHHHHHHHHHHHH
Confidence            47888766555   799999999         999999999999999999999987764554  4789999999999999


Q ss_pred             HhccHHHHHHHHhhhcCCcccHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-------CcHHHHHHHHccccC
Q psy12854         80 FLYEERVFAELNSHFATGEKLKMTDEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-------PHWSEISRELYPLHF  152 (265)
Q Consensus        80 ~~~~~~~L~~ls~h~~t~~~lp~~~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-------~~~~~l~~~l~~~~~  152 (265)
                      ++++.     +...+++.    ++...++....-..--.+.||+.++.|++.+|...+       +.++++|.++.+++.
T Consensus       423 l~~~~-----ll~~~~~~----~~~~~il~~~l~~~~~t~~rq~~f~~FE~~~h~~~~~~~~lt~~~l~~~~~~l~~~y~  493 (598)
T COG1164         423 LLFDY-----LLERFKDP----EERLAILEEKLEGFFATLFRQTLFAEFEHRVHELIEEGEELTAEELNELYLELQKEYY  493 (598)
T ss_pred             HHHHH-----HHHHcCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHhc
Confidence            99985     33333331    222222222111111346799999999999999843       237889999999987


Q ss_pred             CC--CCCCC-CCcCccccccCCCCccCCchhhHhHH--HHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHH
Q psy12854        153 GF--PIDKY-SNLPCRFVEVGSGDLAAGYYSFLWSK--LVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSR  227 (265)
Q Consensus       153 ~~--~~~~~-~~~~~~f~Hl~s~~Y~a~YYsYl~s~--~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~  227 (265)
                      |-  ..++. ..+|.+++|+    |.++||+|.|+.  +.|..++..+++++.                      +.-++
T Consensus       494 gd~~~~d~~~~~~W~~ipHf----y~~pFYvy~Ya~G~~~a~~l~~~~~~~~~----------------------~~~~~  547 (598)
T COG1164         494 GDAVKLDELSGLEWARIPHF----YHSPFYVYQYATGQLAALALYAKILTNDA----------------------EAFEK  547 (598)
T ss_pred             CCccccCcccccchhhcCee----eecCceehHhHHHHHHHHHHHHHHHhccH----------------------HHHHH
Confidence            63  34443 4578889999    455666666555  677777888887753                      34444


Q ss_pred             HHHHHhcCCCCccHHHHHHHhhCCCCCchHHHHh
Q psy12854        228 LRDTFLTFGGSCHSSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       228 yR~~iL~~Ggs~~~~e~l~~flGr~ps~~a~~~~  261 (265)
                      |..++|+.|||.+|.|+++ -+|-+.+..++++.
T Consensus       548 ~y~~~Lk~GgS~~p~ell~-~~GiD~~~~~~~~~  580 (598)
T COG1164         548 YYIAFLKSGGSKSPLELLK-IAGIDLTTPDPWEE  580 (598)
T ss_pred             HHHHHHhCCCCCCHHHHHH-HcCCCCCCchHHHH
Confidence            4468999999999999999 99999888777763


No 16 
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=99.76  E-value=3.4e-17  Score=154.51  Aligned_cols=230  Identities=12%  Similarity=0.090  Sum_probs=151.7

Q ss_pred             eEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHH-hhcC-CCCCCcccCCccchhHhhchHHHHHHhcc-HH----
Q psy12854         13 LNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRS-LTAT-HYSDVSGVNNVEWDSVYIINYFLTHFLYE-ER----   85 (265)
Q Consensus        13 ~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~l-ls~~-~~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~----   85 (265)
                      -+.|.+||....       -.++|.|+.||+|||+|.. ++.. ..+.+..  +.++-+.|.+|.|+|+.+.+ +.    
T Consensus       144 dvRItt~y~~~d-------~~~~l~t~iHE~GHalye~~l~~~~~~~pl~~--~~sm~ihESqS~f~E~~vgrs~~F~~~  214 (396)
T cd06460         144 DVRITTRYDEND-------FRSALFSTIHETGHALYEQGLPPELRGTPLGG--GASMGIHESQSRLWENQVGRSRAFWEF  214 (396)
T ss_pred             CceEEeeeCCcc-------hHHHHHHHHHHhhHHHHHhcCCHhHccCCccc--CCCchhHHHHHHHHHHHHhcCHHHHHH
Confidence            467888888721       2788999999999999999 6531 1122322  36899999999999998866 22    


Q ss_pred             HHHHHHhhhcCC-cccHHH-HHHHHHHHh--hh-----hhHHHHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCCC
Q psy12854         86 VFAELNSHFATG-EKLKMT-DEQLKALRA--HN-----AGIDVCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFGF  154 (265)
Q Consensus        86 ~L~~ls~h~~t~-~~lp~~-~~~l~~~~~--~~-----~~~~~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~~  154 (265)
                      .+..+..++... .....+ +-.+.+...  +.     .-...+++++...|+..+....-  +.+.++|++..+++.|+
T Consensus       215 l~~~l~~~f~~~~~~~~~e~~~~~~n~v~ps~iRveADEvty~lHiilR~e~Ek~l~~G~l~v~dLp~~W~~~~~eylG~  294 (396)
T cd06460         215 LYPKLKKTFPEQLKDVSLENFYRAVNRVQPSLIRVEADEVTYNLHIMLRYELEKALIEGDLEVADLPEAWNEKMKEYLGI  294 (396)
T ss_pred             HHHHHHHHcccccccCCHHHHHHHHhhcCccceeeecchhhHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHcCC
Confidence            333444444221 112222 333332211  01     00245789999999998887642  34778999999999886


Q ss_pred             CCCCCCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhc
Q psy12854        155 PIDKYSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLT  234 (265)
Q Consensus       155 ~~~~~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~  234 (265)
                      ........+.+-.|...+.+ +.||+|..+.++|+++|++++++...   ++   . .-.+|.   -....+=+|++|++
T Consensus       295 ~~~~d~~g~lQd~HW~~g~f-GYfptYalG~l~Aaql~~~~~~~~~~---~~---~-~i~~g~---f~~~~~wL~e~i~~  363 (396)
T cd06460         295 RPPNDAEGCLQDIHWSGGSF-GYFPTYTLGNLYAAQLFAAAKKDLPD---ID---E-QIERGD---FSPLLEWLRENIHQ  363 (396)
T ss_pred             CCCCCccccccccccccCcC-CcCchhHHHHHHHHHHHHHHHHhCcc---HH---H-HHhccC---hHHHHHHHHHHHHh
Confidence            54333334445677754322 24689999999999999987765320   11   0 000111   12255667788999


Q ss_pred             CCCCccHHHHHHHhhCCCCCchHHHHhc
Q psy12854        235 FGGSCHSSEVFRRFRGRDPCFKPFLDMF  262 (265)
Q Consensus       235 ~Ggs~~~~e~l~~flGr~ps~~a~~~~~  262 (265)
                      .|++.+|.|++++..|.+.+.++|++.|
T Consensus       364 ~G~~~~p~eLl~~atG~~l~~~~f~~yL  391 (396)
T cd06460         364 HGSRYSPDELLKKATGEPLNPEYFLEYL  391 (396)
T ss_pred             ccCCCCHHHHHHHHhCCCCChHHHHHHH
Confidence            9999999999999999999999999876


No 17 
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=99.30  E-value=1e-10  Score=113.29  Aligned_cols=214  Identities=13%  Similarity=0.040  Sum_probs=133.0

Q ss_pred             ceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhh-c-CCCCCCcccCCccc--hhHhhchHHH---HHHhccH
Q psy12854         12 PLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLT-A-THYSDVSGVNNVEW--DSVYIINYFL---THFLYEE   84 (265)
Q Consensus        12 p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls-~-~~~~~~sg~~~~~~--d~~E~pS~~~---E~~~~~~   84 (265)
                      ....|.+|+..         +..||.|+.||+||+. -.++ + .++-...|. +.+.  .++|+.|+.|   |++..-+
T Consensus       233 ~d~rI~~c~~~---------t~~D~~t~~HE~GH~~-yy~~y~~~p~~~r~~a-np~fheav~e~~smS~~tpe~L~~~~  301 (477)
T cd06461         233 KDFRIKMCTKV---------NMEDFVTVHHEMGHIQ-YYLQYKDQPVLFREGA-NPGFHEAVGDAIALSVSTPKHLHKIG  301 (477)
T ss_pred             CCcceeeCCCC---------CHHHHHHHHHHHHHHH-HHHHhccCCHHHhCCC-CCChHHHHHHHHHHhcCCHHHHhhcc
Confidence            35778888887         9999999999999999 4434 3 332223443 3677  8999999999   8887322


Q ss_pred             HHHHHHHhhhcCCcccHHH-HHHHHHHHhhhhhHHHHHHHHHHHHHHHHhcCCC-----CcHHHHHHHHccccCCC-CCC
Q psy12854         85 RVFAELNSHFATGEKLKMT-DEQLKALRAHNAGIDVCSELFKANLDLQLHNGPK-----PHWSEISRELYPLHFGF-PID  157 (265)
Q Consensus        85 ~~L~~ls~h~~t~~~lp~~-~~~l~~~~~~~~~~~~~~ql~~a~~D~~lh~~~~-----~~~~~l~~~l~~~~~~~-~~~  157 (265)
                       .|...  .+.....| .. +..++..        +....+...+|..-|....     +.+++.|.+|.++|.++ |+.
T Consensus       302 -ll~~~--~~~~~~~i-~~l~~~al~~--------i~fLP~~~~vD~fr~~vy~~~~~~~e~n~~WweL~~~yqgv~pP~  369 (477)
T cd06461         302 -LLDSE--VDDEEADI-NFLLKMALDK--------IAFLPFGYLLDKWRWDVFDGEIPKDDYNKAWWELREKYQGVVPPV  369 (477)
T ss_pred             -ccccc--CCCcHHHH-HHHHHHHHHH--------HHHhhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhcCCCCCC
Confidence             22111  01111111 11 2222211        2233466777887665543     34788899998888775 222


Q ss_pred             C---CCCcCccccccCCCCccCCchhhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHHhc
Q psy12854        158 K---YSNLPCRFVEVGSGDLAAGYYSFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLT  234 (265)
Q Consensus       158 ~---~~~~~~~f~Hl~s~~Y~a~YYsYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~  234 (265)
                      +   ....+...-|+-.   ...|..|..|.+++.++|+.+.+.....--++.        =+-..++++|++|+ ++|+
T Consensus       370 ~r~~~~fd~gak~h~~~---~~PYi~YflA~ilqfQf~~~lc~~a~~~~plh~--------C~~~~s~~Ag~~l~-~~l~  437 (477)
T cd06461         370 PRSEEDFDPGAKYHVPA---NTPYIRYFLSFILQFQFHKALCKAAGHTGPLHK--------CDIYGSKEAGKKLR-AMLS  437 (477)
T ss_pred             CCCccccCccceeeccC---CCcHHHHHHHHHHHHHHHHHHHHhccCCCCccc--------ccccChHHHHHHHH-HHHh
Confidence            2   1223444556642   444789999999999999886543110000000        00012378999995 8999


Q ss_pred             CCCCccHHHHHHHhhCC-CCCchHHHH
Q psy12854        235 FGGSCHSSEVFRRFRGR-DPCFKPFLD  260 (265)
Q Consensus       235 ~Ggs~~~~e~l~~flGr-~ps~~a~~~  260 (265)
                      .|||++-.|+++...|. +-+.+++++
T Consensus       438 lG~S~~w~e~le~~~g~~~~s~~~l~e  464 (477)
T cd06461         438 LGSSKPWPEALEALTGEREMDASALLE  464 (477)
T ss_pred             CcCCCCHHHHHHHhcCCCCCChHHHHH
Confidence            99999999999999985 566666665


No 18 
>PF02074 Peptidase_M32:  Carboxypeptidase Taq (M32) metallopeptidase;  InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=98.73  E-value=4.9e-07  Score=88.03  Aligned_cols=213  Identities=15%  Similarity=0.181  Sum_probs=119.8

Q ss_pred             cccHHHHHhHHhHHHhHHhhcCC--CCCCcccCCccchhHhhchHHHHHHhcc-HHHHHHHH----hhhcCCcccHHHHH
Q psy12854         33 FKQVENLFFHFGSLLQRSLTATH--YSDVSGVNNVEWDSVYIINYFLTHFLYE-ERVFAELN----SHFATGEKLKMTDE  105 (265)
Q Consensus        33 ~~~v~tLfHEfGHalH~lls~~~--~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~~L~~ls----~h~~t~~~lp~~~~  105 (265)
                      .+.+....||.||+|=-.-....  .+.+.+  ++++-+=|-.|-|+||.+.. .+..+.+.    .++.....++.+  
T Consensus       257 ~~~l~s~iHE~GHalYEq~~~~~~~~tpl~~--~~smgiHESQSrf~En~vgrS~~Fw~~~~p~l~~~f~~~~~v~~e--  332 (494)
T PF02074_consen  257 LSALFSTIHETGHALYEQGLPPEYYGTPLGE--GVSMGIHESQSRFWENMVGRSRAFWEFLYPKLQEHFPQLKDVSLE--  332 (494)
T ss_dssp             HHHHHHHHHHHHHHHHHHTS-GGGTTSTTSS---S-HHHHHHHHHHHHTTTTTSHHHHHHHHHHHHHHCGGGTT-SHH--
T ss_pred             HHHHHHHHHHHHHHHHHcCCChhHccCcccc--ccccchHHHHHHHHHHHHccCHHHHHHHHHHHHHHhCcccCCCHH--
Confidence            35677788999999976544332  223333  47788899999999996543 33333332    233221222222  


Q ss_pred             HHHHHHh-----hhhh-----HHHHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCCCCCCCCCCcCccccccCCCC
Q psy12854        106 QLKALRA-----HNAG-----IDVCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFGFPIDKYSNLPCRFVEVGSGD  173 (265)
Q Consensus       106 ~l~~~~~-----~~~~-----~~~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~~~~~~~~~~~~~f~Hl~s~~  173 (265)
                      .+.++.+     +.+.     ..-+.-++.-.++..+-...-  ..+-++|++-+++|.|+.+.........=-|=.+|.
T Consensus       333 ~~y~~~N~V~ps~IR~eADElTY~lHIilRyEiEk~li~g~l~v~dLP~~Wn~km~eyLGi~p~~d~eG~LQDvHWs~G~  412 (494)
T PF02074_consen  333 NFYRAINRVKPSLIRVEADELTYPLHIILRYEIEKALINGELSVDDLPEAWNEKMEEYLGITPPNDAEGVLQDVHWSSGS  412 (494)
T ss_dssp             HHHHHHT-----S-GGG--TTTHHHHHHHHHHHHHHHHTTSS-GGGHHHHHHHHHHHHHS---SSCTTTTTS-STTTTT-
T ss_pred             HHHHHHccCCCcccchhhHHHHHHHHHHHHHHHHHHHHcCCCChhhhHHHHHHHHHHHcCCCCCCCCccccccccccCCc
Confidence            2222211     1110     123455566666666665442  356788888888888863321111111111322221


Q ss_pred             ccCCch-hhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCC-CCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhhCC
Q psy12854        174 LAAGYY-SFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGV-SGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFRGR  251 (265)
Q Consensus       174 Y~a~YY-sYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~flGr  251 (265)
                      +  ||+ +|..+.++|+++|.+.+++-. -  ++     +.. .|.   ......=.|++|.+.|+..+|.|++++..|.
T Consensus       413 f--GYFPtY~LG~~~AaQl~~~~~~~~p-d--~~-----~~i~~G~---f~~i~~WL~e~Ih~~G~~y~~~eLl~~aTGe  479 (494)
T PF02074_consen  413 F--GYFPTYALGNIYAAQLFAAMKKDIP-D--LD-----EQIAKGD---FSPIRDWLRENIHQYGSRYTPDELLKKATGE  479 (494)
T ss_dssp             T--S-THHHHHHHHHHHHHHHHHHHHHT-T--HH-----HHHHTT-----HHHHHHHHHHTGGGTTSS-HHHHHHHHHSS
T ss_pred             c--ccchhhHHHHHHHHHHHHHHHHHcc-C--HH-----HHHHcCC---hHHHHHHHHHHHHHhccCCCHHHHHHHHhCC
Confidence            2  344 999999999999987655321 0  10     000 111   1456677889999999999999999999999


Q ss_pred             CCCchHHHHhc
Q psy12854        252 DPCFKPFLDMF  262 (265)
Q Consensus       252 ~ps~~a~~~~~  262 (265)
                      ++|++.|++.|
T Consensus       480 ~l~~~~~~~YL  490 (494)
T PF02074_consen  480 PLNPEYFIDYL  490 (494)
T ss_dssp             S--THHHHHHH
T ss_pred             CCCHHHHHHHH
Confidence            99999999875


No 19 
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=98.28  E-value=1.7e-05  Score=75.83  Aligned_cols=226  Identities=13%  Similarity=0.176  Sum_probs=125.9

Q ss_pred             EEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCC--CCCCcccCCccchhHhhchHHHHHHhcc-HHHHHHH
Q psy12854         14 NTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATH--YSDVSGVNNVEWDSVYIINYFLTHFLYE-ERVFAEL   90 (265)
Q Consensus        14 ~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~--~~~~sg~~~~~~d~~E~pS~~~E~~~~~-~~~L~~l   90 (265)
                      +-|..++....       -.+.+....||+|||+=-.-....  ...+.+  +.++-+=|-.|-|+||.+-. +...+.+
T Consensus       245 VRITTRy~~~d-------f~~aL~g~iHE~GHAlYEqn~~~~l~gtPlg~--g~smgiHESQSrF~En~VGRs~aFw~~~  315 (497)
T COG2317         245 VRITTRYNEQD-------FRSALFGTIHETGHALYEQNLDEALLGTPLGE--GRSMGIHESQSRFWENQVGRSRAFWEAI  315 (497)
T ss_pred             eeEEeecCCcc-------HHHHHHHHHHhhhhHHHHcCCCHHHcCCcccC--CcccccccHHHHHHHHHhccCHHHHHHH
Confidence            45566666521       235577889999999865433221  222322  46788899999999999764 4343333


Q ss_pred             Hh----hhcCCcccHHH-HHHHHHHHhhh-hhH-----H----HHHHHHHHHHHHHHhcCCC--CcHHHHHHHHccccCC
Q psy12854         91 NS----HFATGEKLKMT-DEQLKALRAHN-AGI-----D----VCSELFKANLDLQLHNGPK--PHWSEISRELYPLHFG  153 (265)
Q Consensus        91 s~----h~~t~~~lp~~-~~~l~~~~~~~-~~~-----~----~~~ql~~a~~D~~lh~~~~--~~~~~l~~~l~~~~~~  153 (265)
                      ..    |+.  +.+++. .+.+....+.. .++     +    -+.-++.-.++..+-+..-  .++-++|++-+.+|.|
T Consensus       316 ~~~~~~~~~--~~~~~~s~dd~y~~vn~v~~~lIRveADElTY~lHIilRyEiEk~li~G~l~v~DlP~lWn~kme~yLG  393 (497)
T COG2317         316 YPLLRKHFP--EQFDKYSLDDFYRAVNRVEPSLIRVEADELTYPLHIILRYEIEKELISGELKVDDLPELWNDKMEEYLG  393 (497)
T ss_pred             HHHHHHhhh--hhcccCCHHHHHHHHhcccCcceeeeccccceeeeeehhHHHHHHHHcCCcchhhhHHHHHHHHHHHcC
Confidence            22    221  112222 33332221110 000     0    0111222333333333221  2355778888888888


Q ss_pred             CCCCCCCCcCccccccCCCCccCCch-hhHhHHHHHHHHHHHHHHcCCCCcccccCCCCCCCCCCCCCchHHHHHHHHHH
Q psy12854        154 FPIDKYSNLPCRFVEVGSGDLAAGYY-SFLWSKLVSADIFYAFKEDGEGRLEDESGRSDSGVSGGERTEPNVGSRLRDTF  232 (265)
Q Consensus       154 ~~~~~~~~~~~~f~Hl~s~~Y~a~YY-sYl~s~~~A~dv~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~i  232 (265)
                      +.+.........=-|-..|.+  ||+ +|..+.++|+++|.+.+++-.        .-|+...-|  +-.....=+|++|
T Consensus       394 irp~~d~eG~LQDIHWs~Gsf--GYFPsYtlG~v~AAQ~~~a~~~~~p--------~~d~~i~~g--d~~~i~~WL~e~i  461 (497)
T COG2317         394 IRPKNDAEGVLQDIHWSHGSF--GYFPTYTLGNVYAAQLYAAMKKDIP--------DVDALIAKG--DFSPIKNWLRENI  461 (497)
T ss_pred             CCCCCccccccccccccCCCc--CcCchhhhHHHHHHHHHHHHHhhcc--------cHHHHHHcC--CcHHHHHHHHHHH
Confidence            644321111110112111222  344 999999999999987665311        011111111  0145667788999


Q ss_pred             hcCCCCccHHHHHHHhhCCCCCchHHHHhc
Q psy12854        233 LTFGGSCHSSEVFRRFRGRDPCFKPFLDMF  262 (265)
Q Consensus       233 L~~Ggs~~~~e~l~~flGr~ps~~a~~~~~  262 (265)
                      -+.|...+|.|++++-.|.+-|++.|++++
T Consensus       462 h~~Gs~~~p~eLi~~atGE~lnp~y~i~yL  491 (497)
T COG2317         462 HRHGSRYPPKELLKRATGEALNPEYFIDYL  491 (497)
T ss_pred             HHccCcCCcHHHHHHhhCCcCCHHHHHHHH
Confidence            999999999999999999999999999975


No 20 
>PF01401 Peptidase_M2:  Angiotensin-converting enzyme This Prosite motif covers only the active site.;  InterPro: IPR001548 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M2 (clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The catalytic residues and zinc ligands have been identified, the zinc ion being ligated to two His residues within the motif HEXXH, showing that the enzyme belongs to the E sub-group of metalloproteases [].   Pepetidyl-dipeptidase A (angiotensin-converting enzyme) is a mammalian enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II []. The enzyme exists in two differentially transcribed forms, the most common of which is from lung endothelium; this contains two homologous domains that have arisen by gene duplication []. The testis-specific form contains only the C-terminal domain, arising from a duplicated promoter region present in intron 12 of the gene []. Both enzymatic forms are membrane proteins that are anchored by means of a C-terminal transmembrane domain. Both domains of the endothelial enzyme are active, but have differing kinetic constants []. ]. A number of insect enzymes have been shown to be similar to peptidyl-dipeptidase A, these containing a single catalytic domain.; GO: 0008237 metallopeptidase activity, 0008241 peptidyl-dipeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 2YDM_A 3BKL_A 2C6N_B 1UZE_A 3BKK_A 2C6F_B 2IUX_A 2IUL_A 2XYD_B 3NXQ_B ....
Probab=98.25  E-value=4.6e-05  Score=76.03  Aligned_cols=211  Identities=16%  Similarity=0.164  Sum_probs=130.6

Q ss_pred             CCccCcccHHHHHhHHhHHHhHHhhcC-CCCCCcccCCccchhHhhchHHHHHHhccHHHHHHHHhhhcCCcccHHH-HH
Q psy12854         28 PCLLTFKQVENLFFHFGSLLQRSLTAT-HYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFAELNSHFATGEKLKMT-DE  105 (265)
Q Consensus        28 p~ll~~~~v~tLfHEfGHalH~lls~~-~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~~ls~h~~t~~~lp~~-~~  105 (265)
                      ++-++.+|+.|+.||+||....+.-+. +...-.|   .---|-|..+-++-.-+..|.-|+++.---.. ..-.+. ++
T Consensus       342 Ct~v~~~df~t~hhemghi~Y~~~y~~qp~~fr~g---an~gfhEAigd~ials~~tp~hL~~~gLl~~~-~~~~~~~in  417 (595)
T PF01401_consen  342 CTEVTMEDFLTAHHEMGHIQYYMQYKDQPVLFREG---ANPGFHEAIGDTIALSVSTPKHLKRIGLLDNV-TDDEESDIN  417 (595)
T ss_dssp             ---SSHHHHHHHHHHHHHHHHHHHTTTS-GGGSS----SSHHHHHHHHHHHHHHHTSHHHHHHTTSSSTT-TTSHHHHHH
T ss_pred             cccCCHHHHHHHHHHHHHHHHHHHHhhCChhhhcC---CCchHHHHHHHHHHHHcCCHHHHHHcCCCCCC-CcchHHHHH
Confidence            566799999999999999998766543 2222223   23568899999999899899888887542111 111233 55


Q ss_pred             HHHHHHhhhhhHHHH-HHHHHHHHHHH---HhcC--CCCcHHHHHHHHccccCCC-CCCC---CCCcCccccccCCCCcc
Q psy12854        106 QLKALRAHNAGIDVC-SELFKANLDLQ---LHNG--PKPHWSEISRELYPLHFGF-PIDK---YSNLPCRFVEVGSGDLA  175 (265)
Q Consensus       106 ~l~~~~~~~~~~~~~-~ql~~a~~D~~---lh~~--~~~~~~~l~~~l~~~~~~~-~~~~---~~~~~~~f~Hl~s~~Y~  175 (265)
                      .|..     .++..+ ...+.-..|+-   +...  .++.+++-|-++..+|.|+ |+.+   .+.-|...-|+.+   .
T Consensus       418 ~L~~-----~AL~ki~~lPf~~~~d~wRw~vf~g~i~~~~~n~~wW~lr~~yqGi~pP~~r~~~~fD~~ak~hi~~---~  489 (595)
T PF01401_consen  418 FLLK-----QALDKIAFLPFGYLMDKWRWDVFSGSIPPDQYNEAWWELREKYQGIVPPVERSESDFDPGAKYHIPA---N  489 (595)
T ss_dssp             HHHH-----HHHHHCCHHHHHHHHHHHHHHHHTTSS-CCGHHHHHHHHHHHHHSEE-SS-EETTS-GGGGSHHHHT---T
T ss_pred             HHHH-----HhhhHHHhccHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHcCCccCCCCCCcccCCcchhhhhcc---c
Confidence            4432     233322 22333344443   2222  1256899999999999885 3332   2222455668853   4


Q ss_pred             CCchhhHhHHHHHHHHHHHHHHc-CC-CCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhhC-CC
Q psy12854        176 AGYYSFLWSKLVSADIFYAFKED-GE-GRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFRG-RD  252 (265)
Q Consensus       176 a~YYsYl~s~~~A~dv~~~f~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~flG-r~  252 (265)
                      ..|..|-.|.++--++++.+.+. |. +-  +.  +-|  -.|    ++++|++++ ++|+.|+|++=.++++.+.| ++
T Consensus       490 ~py~ryfls~ilqFQ~~~~lC~~ag~~~p--Lh--~Cd--iy~----s~~aG~~l~-~ml~~G~S~~W~~~l~~~tg~~~  558 (595)
T PF01401_consen  490 TPYIRYFLSTILQFQFYKALCEAAGHEGP--LH--RCD--IYG----SKEAGRKLK-KMLSLGSSKPWPEALEELTGERE  558 (595)
T ss_dssp             --SHHHHHHHHHHHHHHHHHHHHTTTTSS--GG--G-----TT-----HHHHHHHH-HHHTTTTSS-HHHHHHHHHSSSS
T ss_pred             cccHHHHHHHHHHHHHHHHHHHhcCCCCC--cc--ccc--ccc----chhhHHHHH-HHhcCCCCCCHHHHHHHHhCCCC
Confidence            56899999999999999885543 21 00  00  000  012    389999995 89999999999999999999 68


Q ss_pred             CCchHHHHh
Q psy12854        253 PCFKPFLDM  261 (265)
Q Consensus       253 ps~~a~~~~  261 (265)
                      -|.+|+++.
T Consensus       559 ~~a~~ll~Y  567 (595)
T PF01401_consen  559 MSASALLEY  567 (595)
T ss_dssp             --SHHHHHH
T ss_pred             cChHHHHHH
Confidence            999999875


No 21 
>KOG3690|consensus
Probab=97.05  E-value=0.059  Score=53.35  Aligned_cols=214  Identities=15%  Similarity=0.112  Sum_probs=127.8

Q ss_pred             CCccCcccHHHHHhHHhHHHhHHhhc-CCCCCCcccCCccchhHhhchHHHHHHhccHHHHHHHHhhhcCCcc-cHHHHH
Q psy12854         28 PCLLTFKQVENLFFHFGSLLQRSLTA-THYSDVSGVNNVEWDSVYIINYFLTHFLYEERVFAELNSHFATGEK-LKMTDE  105 (265)
Q Consensus        28 p~ll~~~~v~tLfHEfGHalH~lls~-~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~~~~L~~ls~h~~t~~~-lp~~~~  105 (265)
                      ++.+...+..+..||+||.==.+.-+ .+.-.-.|   .---|-|...-.+-.-+..|.-|+++.---.++.. .+..|+
T Consensus       364 Ct~vd~~~f~~~Hhem~hiQY~l~y~~qP~llre~---anPgFheAIgd~~als~stPrhL~slgLL~~~~~~d~~~~IN  440 (646)
T KOG3690|consen  364 CTKVDEEDFEQAHHEMGHIQYYLQYKQQPFLLREG---ANPGFHEAIGDAFALSVSTPRHLHSLGLLSEDLDEDDEVRIN  440 (646)
T ss_pred             cCcCCHHHHHHHHHHHHHHHHHHHHHhccHHhhCC---CCCcHHHHHHHHHHHhcCCHHHHHHhcccccccccchHHHHH
Confidence            67789999999999999965555443 23222222   22345666666777777888777776431111111 111255


Q ss_pred             HHHHHHhhhhhHHHH-HHHHHHHHHHHHhcCC----C-CcHHHHHHHHccccCCC-CCCCCCC---cCccccccCCCCcc
Q psy12854        106 QLKALRAHNAGIDVC-SELFKANLDLQLHNGP----K-PHWSEISRELYPLHFGF-PIDKYSN---LPCRFVEVGSGDLA  175 (265)
Q Consensus       106 ~l~~~~~~~~~~~~~-~ql~~a~~D~~lh~~~----~-~~~~~l~~~l~~~~~~~-~~~~~~~---~~~~f~Hl~s~~Y~  175 (265)
                      .|.+     .++..+ ...+.-..|..-+..-    + ..+++-|-+|..+|.|+ |+.+.+.   -+..--|+ .  =+
T Consensus       441 ~L~k-----~aL~~v~~LPf~y~~DkwR~~Vf~G~i~k~~~N~~~W~lr~ky~Gv~~P~~r~~~dfD~~akf~~-~--~d  512 (646)
T KOG3690|consen  441 RLFK-----MALDKVAFLPFTYALDKWRYEVFDGEIPKDQYNCAYWELRNKYEGVRPPVPRTNKDFDPPAKFHI-Q--VD  512 (646)
T ss_pred             HHHH-----HHHHHHhhhhhhhhhhhhHHHHhcCCCCcchhhHHHHHHHHHhcCcCCCCcCccccCCchhhhcc-C--cc
Confidence            4432     233322 2223333443322211    1 35899999999999885 3333332   12223354 2  23


Q ss_pred             CCchhhHhHHHHHHHHHHHHHH-cCC-----CCcccccCCCCCCCCCCCCCchHHHHHHHHHHhcCCCCccHHHHHHHhh
Q psy12854        176 AGYYSFLWSKLVSADIFYAFKE-DGE-----GRLEDESGRSDSGVSGGERTEPNVGSRLRDTFLTFGGSCHSSEVFRRFR  249 (265)
Q Consensus       176 a~YYsYl~s~~~A~dv~~~f~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~yR~~iL~~Ggs~~~~e~l~~fl  249 (265)
                      ..|-.|..|.++--++++.|.. .|.     +|..++  .-      +--.++++|+++| +.++.|.|++=.|+++.++
T Consensus       513 ~~~~ryf~s~vlqFQf~kalC~~agq~~pg~pr~pLh--~C------Di~~sk~aG~kL~-~~M~lG~S~~W~evLE~~t  583 (646)
T KOG3690|consen  513 VSYTRYFLSTVLQFQFLKALCQAAGQYGPGDPRKPLH--NC------DIYDSKEAGEKLR-TMMSLGSSRHWREVLEEIT  583 (646)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCCCch--hc------cccCCHHHHHHHH-HHHHccCCccHHHHHHHHc
Confidence            3456888899998888877543 221     122221  00      1112379999995 8999999999999999999


Q ss_pred             C-CCCCchHHHHh
Q psy12854        250 G-RDPCFKPFLDM  261 (265)
Q Consensus       250 G-r~ps~~a~~~~  261 (265)
                      | ++.|.+++|+.
T Consensus       584 Ge~els~~~LLeY  596 (646)
T KOG3690|consen  584 GEGELSGRALLEY  596 (646)
T ss_pred             CCCcCChHHHHHH
Confidence            9 56899988875


No 22 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=92.85  E-value=0.079  Score=46.46  Aligned_cols=32  Identities=16%  Similarity=0.076  Sum_probs=23.3

Q ss_pred             ccccceEEEEeccCCCCCCCCCccCcccHHHHHhHHhHHHhH
Q psy12854          8 TVITPLNTLILNLTEPQYGKPCLLTFKQVENLFFHFGSLLQR   49 (265)
Q Consensus         8 ~~~~p~~~l~~nf~~p~~~~p~ll~~~~v~tLfHEfGHalH~   49 (265)
                      ..+.|+.+++.|-+          ......||+||+||++=|
T Consensus        55 ~~~~~~I~iN~n~~----------~~r~rFtlAHELGH~llH   86 (213)
T COG2856          55 DEEKPVIYINANNS----------LERKRFTLAHELGHALLH   86 (213)
T ss_pred             eccCceEEEeCCCC----------HHHHHHHHHHHHhHHHhc
Confidence            33457777777774          356789999999997643


No 23 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=88.91  E-value=0.28  Score=43.21  Aligned_cols=18  Identities=22%  Similarity=0.383  Sum_probs=15.5

Q ss_pred             ccHHHHHhHHhHHHhHHh
Q psy12854         34 KQVENLFFHFGSLLQRSL   51 (265)
Q Consensus        34 ~~v~tLfHEfGHalH~ll   51 (265)
                      .-+.||+||+|||+-..+
T Consensus        91 ~~~~~l~HE~GHAlI~~~  108 (220)
T PF14247_consen   91 NVLFTLYHELGHALIDDL  108 (220)
T ss_pred             HHHHHHHHHHHHHHHHHh
Confidence            346899999999999887


No 24 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=88.78  E-value=0.23  Score=40.24  Aligned_cols=13  Identities=15%  Similarity=0.135  Sum_probs=11.1

Q ss_pred             cHHHHHhHHhHHH
Q psy12854         35 QVENLFFHFGSLL   47 (265)
Q Consensus        35 ~v~tLfHEfGHal   47 (265)
                      -..++.||+||+|
T Consensus       105 ~~~v~~HEiGHaL  117 (154)
T PF00413_consen  105 LQSVAIHEIGHAL  117 (154)
T ss_dssp             HHHHHHHHHHHHT
T ss_pred             hhhhhhhcccccc
Confidence            3478999999997


No 25 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=88.35  E-value=0.41  Score=36.35  Aligned_cols=22  Identities=18%  Similarity=0.120  Sum_probs=16.6

Q ss_pred             CcccHHHHHhHHhHHHhHHhhc
Q psy12854         32 TFKQVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH~lls~   53 (265)
                      ......|++||+||.+.+--..
T Consensus        39 ~~~~~f~laHELgH~~~~~~~~   60 (122)
T PF06114_consen   39 PERQRFTLAHELGHILLHHGDE   60 (122)
T ss_dssp             HHHHHHHHHHHHHHHHHHH-HH
T ss_pred             HHHHHHHHHHHHHHHHhhhccc
Confidence            4566899999999988766543


No 26 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=85.68  E-value=0.45  Score=39.00  Aligned_cols=14  Identities=14%  Similarity=0.078  Sum_probs=11.8

Q ss_pred             cHHHHHhHHhHHHh
Q psy12854         35 QVENLFFHFGSLLQ   48 (265)
Q Consensus        35 ~v~tLfHEfGHalH   48 (265)
                      -..|+.||+||+|=
T Consensus        94 ~~~~~~HEiGHaLG  107 (165)
T cd04268          94 LRNTAEHELGHALG  107 (165)
T ss_pred             HHHHHHHHHHHHhc
Confidence            46899999999874


No 27 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=85.04  E-value=0.48  Score=38.85  Aligned_cols=15  Identities=13%  Similarity=0.140  Sum_probs=12.0

Q ss_pred             ccHHHHHhHHhHHHh
Q psy12854         34 KQVENLFFHFGSLLQ   48 (265)
Q Consensus        34 ~~v~tLfHEfGHalH   48 (265)
                      +-..|+.||+||+|=
T Consensus       103 ~~~~~~~HEiGHaLG  117 (156)
T cd04279         103 NLQAIALHELGHALG  117 (156)
T ss_pred             HHHHHHHHHhhhhhc
Confidence            345789999999984


No 28 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=84.81  E-value=0.38  Score=39.55  Aligned_cols=16  Identities=13%  Similarity=0.019  Sum_probs=12.9

Q ss_pred             cccHHHHHhHHhHHHh
Q psy12854         33 FKQVENLFFHFGSLLQ   48 (265)
Q Consensus        33 ~~~v~tLfHEfGHalH   48 (265)
                      .+-..|+.||+||+|=
T Consensus       105 ~~~~~~~~HEiGHaLG  120 (157)
T cd04278         105 TDLFSVAAHEIGHALG  120 (157)
T ss_pred             chHHHHHHHHhccccc
Confidence            4457799999999974


No 29 
>PF13058 DUF3920:  Protein of unknown function (DUF3920)
Probab=84.29  E-value=0.71  Score=36.22  Aligned_cols=16  Identities=19%  Similarity=0.428  Sum_probs=13.8

Q ss_pred             ccHHHHHhHHhHHHhH
Q psy12854         34 KQVENLFFHFGSLLQR   49 (265)
Q Consensus        34 ~~v~tLfHEfGHalH~   49 (265)
                      .-+.||+|||-|+||+
T Consensus        75 qvlktllhefrh~mQh   90 (126)
T PF13058_consen   75 QVLKTLLHEFRHAMQH   90 (126)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            4478999999999993


No 30 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=83.28  E-value=0.66  Score=39.20  Aligned_cols=16  Identities=13%  Similarity=0.179  Sum_probs=13.4

Q ss_pred             cccHHHHHhHHhHHHh
Q psy12854         33 FKQVENLFFHFGSLLQ   48 (265)
Q Consensus        33 ~~~v~tLfHEfGHalH   48 (265)
                      -....|+.||+||+|=
T Consensus       111 ~~~~~t~~HEiGHaLG  126 (186)
T cd04277         111 SYGYQTIIHEIGHALG  126 (186)
T ss_pred             hhhHHHHHHHHHHHhc
Confidence            4568899999999974


No 31 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=81.95  E-value=0.73  Score=39.03  Aligned_cols=17  Identities=18%  Similarity=0.251  Sum_probs=13.9

Q ss_pred             CcccHHHHHhHHhHHHh
Q psy12854         32 TFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH   48 (265)
                      +...+.+++||+||.|=
T Consensus       139 ~~~~~~~~AHEiGH~lG  155 (196)
T PF13688_consen  139 TYNGAITFAHEIGHNLG  155 (196)
T ss_dssp             HHHHHHHHHHHHHHHTT
T ss_pred             CCceehhhHHhHHHhcC
Confidence            46778999999999653


No 32 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=81.89  E-value=1.4  Score=36.64  Aligned_cols=17  Identities=18%  Similarity=0.249  Sum_probs=12.5

Q ss_pred             CcccHHHHHhHHhHHHh
Q psy12854         32 TFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH   48 (265)
                      ....-.||.||+||-|-
T Consensus        66 ~~~~g~TltHEvGH~LG   82 (154)
T PF05572_consen   66 QYNFGKTLTHEVGHWLG   82 (154)
T ss_dssp             TS-SSHHHHHHHHHHTT
T ss_pred             ccccccchhhhhhhhhc
Confidence            44556899999999653


No 33 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=81.88  E-value=0.7  Score=36.04  Aligned_cols=13  Identities=23%  Similarity=0.360  Sum_probs=11.2

Q ss_pred             cHHHHHhHHhHHH
Q psy12854         35 QVENLFFHFGSLL   47 (265)
Q Consensus        35 ~v~tLfHEfGHal   47 (265)
                      .+.|+.||+||.|
T Consensus       107 ~~~~~~HEiGH~l  119 (124)
T PF13582_consen  107 GVDTFAHEIGHNL  119 (124)
T ss_dssp             STTHHHHHHHHHT
T ss_pred             cceEeeehhhHhc
Confidence            4689999999976


No 34 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=81.81  E-value=1.3  Score=38.40  Aligned_cols=15  Identities=13%  Similarity=0.157  Sum_probs=12.1

Q ss_pred             cccHHHHHhHHhHHH
Q psy12854         33 FKQVENLFFHFGSLL   47 (265)
Q Consensus        33 ~~~v~tLfHEfGHal   47 (265)
                      -....+++||+||.+
T Consensus       135 ~~~~~~~aHEiGH~l  149 (206)
T PF13583_consen  135 ANGYQTFAHEIGHNL  149 (206)
T ss_pred             cccchHHHHHHHHHh
Confidence            346688999999976


No 35 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=81.58  E-value=0.71  Score=37.00  Aligned_cols=11  Identities=18%  Similarity=0.153  Sum_probs=10.0

Q ss_pred             HHHHhHHhHHH
Q psy12854         37 ENLFFHFGSLL   47 (265)
Q Consensus        37 ~tLfHEfGHal   47 (265)
                      .++.||+||+|
T Consensus        88 ~~~~HEigHaL   98 (140)
T smart00235       88 GVAAHELGHAL   98 (140)
T ss_pred             ccHHHHHHHHh
Confidence            48999999998


No 36 
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=81.50  E-value=1.2  Score=38.43  Aligned_cols=24  Identities=21%  Similarity=0.343  Sum_probs=19.4

Q ss_pred             cccHHHHHhHHhHHHhHHhhcCCC
Q psy12854         33 FKQVENLFFHFGSLLQRSLTATHY   56 (265)
Q Consensus        33 ~~~v~tLfHEfGHalH~lls~~~~   56 (265)
                      ..-+.|++||+||++=.+++-.+.
T Consensus        20 ~~~l~t~~HE~gHal~a~l~G~~v   43 (200)
T PF13398_consen   20 FRLLVTFVHELGHALAALLTGGRV   43 (200)
T ss_pred             HHHHHHHHHHHHHHHHHHHhCCCc
Confidence            345679999999999999986543


No 37 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=81.37  E-value=0.9  Score=38.25  Aligned_cols=14  Identities=14%  Similarity=0.361  Sum_probs=12.2

Q ss_pred             cHHHHHhHHhHHHh
Q psy12854         35 QVENLFFHFGSLLQ   48 (265)
Q Consensus        35 ~v~tLfHEfGHalH   48 (265)
                      ++.+++||+||.+=
T Consensus       111 ~~~~~aHElGH~lG  124 (173)
T PF13574_consen  111 GIDTFAHELGHQLG  124 (173)
T ss_dssp             HHHHHHHHHHHHHT
T ss_pred             eeeeehhhhHhhcC
Confidence            78999999999663


No 38 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=79.36  E-value=1  Score=38.72  Aligned_cols=15  Identities=20%  Similarity=0.257  Sum_probs=12.2

Q ss_pred             HHHHHhHHhHHHhHH
Q psy12854         36 VENLFFHFGSLLQRS   50 (265)
Q Consensus        36 v~tLfHEfGHalH~l   50 (265)
                      ..|+.|||||||=..
T Consensus        93 ~~~i~HElgHaLG~~  107 (198)
T cd04327          93 SRVVLHEFGHALGFI  107 (198)
T ss_pred             HHHHHHHHHHHhcCc
Confidence            468999999998544


No 39 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=78.88  E-value=1.5  Score=38.61  Aligned_cols=19  Identities=16%  Similarity=0.219  Sum_probs=15.9

Q ss_pred             CcccHHHHHhHHhHHHhHH
Q psy12854         32 TFKQVENLFFHFGSLLQRS   50 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH~l   50 (265)
                      +...|-.=+||.|||+|+-
T Consensus        86 SiaAvaVAAHEvGHAiQ~a  104 (222)
T PF04298_consen   86 SIAAVAVAAHEVGHAIQHA  104 (222)
T ss_pred             CHHHHHHHHHHHhHHHhcc
Confidence            5667888899999999964


No 40 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=78.73  E-value=1  Score=36.79  Aligned_cols=14  Identities=14%  Similarity=0.347  Sum_probs=12.2

Q ss_pred             ccHHHHHhHHhHHH
Q psy12854         34 KQVENLFFHFGSLL   47 (265)
Q Consensus        34 ~~v~tLfHEfGHal   47 (265)
                      ....++.||+||+|
T Consensus        95 ~~~~~~~HElGH~L  108 (167)
T cd00203          95 EGAQTIAHELGHAL  108 (167)
T ss_pred             cchhhHHHHHHHHh
Confidence            45789999999998


No 41 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=78.23  E-value=2.1  Score=36.13  Aligned_cols=16  Identities=19%  Similarity=0.218  Sum_probs=13.4

Q ss_pred             CcccHHHHHhHHhHHH
Q psy12854         32 TFKQVENLFFHFGSLL   47 (265)
Q Consensus        32 ~~~~v~tLfHEfGHal   47 (265)
                      ......+++||+||.+
T Consensus       130 ~~~~~~~~aHElGH~l  145 (192)
T cd04267         130 TLLTALTMAHELGHNL  145 (192)
T ss_pred             ceeehhhhhhhHHhhc
Confidence            3566789999999987


No 42 
>PF14891 Peptidase_M91:  Effector protein
Probab=78.16  E-value=1.6  Score=36.86  Aligned_cols=21  Identities=5%  Similarity=-0.088  Sum_probs=17.8

Q ss_pred             ccHHHHHhHHhHHHhHHhhcC
Q psy12854         34 KQVENLFFHFGSLLQRSLTAT   54 (265)
Q Consensus        34 ~~v~tLfHEfGHalH~lls~~   54 (265)
                      .-+..|+|||.||.|.+-...
T Consensus       102 ~p~v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  102 PPFVVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHHHHCCCC
Confidence            447899999999999887765


No 43 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=76.12  E-value=1.7  Score=39.99  Aligned_cols=19  Identities=26%  Similarity=0.252  Sum_probs=15.6

Q ss_pred             cHHHHHhHHhHHHhHHhhc
Q psy12854         35 QVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        35 ~v~tLfHEfGHalH~lls~   53 (265)
                      ..-.|+||+||.++.++.-
T Consensus       170 ~ayVlAHEyGHHVQ~l~Gi  188 (292)
T PF04228_consen  170 QAYVLAHEYGHHVQNLLGI  188 (292)
T ss_pred             HHHHHHHHHHHHHHHHhhh
Confidence            3446999999999998854


No 44 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=74.99  E-value=1.8  Score=37.59  Aligned_cols=16  Identities=19%  Similarity=0.332  Sum_probs=13.9

Q ss_pred             CcccHHHHHhHHhHHH
Q psy12854         32 TFKQVENLFFHFGSLL   47 (265)
Q Consensus        32 ~~~~v~tLfHEfGHal   47 (265)
                      ++..+.|++||+||.|
T Consensus       142 ~~~~~~~~AHElGH~l  157 (220)
T cd04272         142 SYYGVYTMTHELAHLL  157 (220)
T ss_pred             CcccHHHHHHHHHHHh
Confidence            4567999999999987


No 45 
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=74.37  E-value=4.3  Score=38.52  Aligned_cols=46  Identities=4%  Similarity=-0.065  Sum_probs=31.0

Q ss_pred             cHHHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHHHHhcc
Q psy12854         35 QVENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLTHFLYE   83 (265)
Q Consensus        35 ~v~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E~~~~~   83 (265)
                      -+.||+|||=|.+|..-.......-   ...+.-.=|.-|+++|.++..
T Consensus       139 ~~sTlAHEfQHmInfy~~~v~~g~~---~~~dtWLnE~lS~~aEdl~s~  184 (366)
T PF10460_consen  139 VYSTLAHEFQHMINFYQRGVLHGKQ---YAMDTWLNEMLSMSAEDLYSS  184 (366)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhcCCC---cccccHHHHHHHHHHHHHHhc
Confidence            3679999999999986443221101   124566778899999988654


No 46 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=72.21  E-value=1.2  Score=39.37  Aligned_cols=14  Identities=7%  Similarity=0.180  Sum_probs=11.6

Q ss_pred             ccHHHHHhHHhHHH
Q psy12854         34 KQVENLFFHFGSLL   47 (265)
Q Consensus        34 ~~v~tLfHEfGHal   47 (265)
                      ..+.|++||+||.|
T Consensus       144 ~~~~t~AHElGHnL  157 (228)
T cd04271         144 NEWQVFAHEIGHTF  157 (228)
T ss_pred             ccceehhhhhhhhc
Confidence            34579999999987


No 47 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=72.15  E-value=7.3  Score=29.52  Aligned_cols=35  Identities=6%  Similarity=-0.107  Sum_probs=22.6

Q ss_pred             HHHHhHHhHHHhHHhhcCCCCCCcccCCccchhHhhchHHHH
Q psy12854         37 ENLFFHFGSLLQRSLTATHYSDVSGVNNVEWDSVYIINYFLT   78 (265)
Q Consensus        37 ~tLfHEfGHalH~lls~~~~~~~sg~~~~~~d~~E~pS~~~E   78 (265)
                      .+|.||++|.+|......     .  ...|.-+.|=-+..+|
T Consensus        27 ~~l~HE~~H~~~~~~~~~-----~--~~~~~W~~EG~A~y~~   61 (128)
T PF13485_consen   27 RVLAHELAHQWFGNYFGG-----D--DNAPRWFNEGLAEYVE   61 (128)
T ss_pred             HHHHHHHHHHHHHHHcCC-----C--ccCchHHHHHHHHHHh
Confidence            889999999999877543     1  1245555555444444


No 48 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=71.15  E-value=2.2  Score=37.96  Aligned_cols=12  Identities=8%  Similarity=-0.103  Sum_probs=10.3

Q ss_pred             HHHHHhHHhHHH
Q psy12854         36 VENLFFHFGSLL   47 (265)
Q Consensus        36 v~tLfHEfGHal   47 (265)
                      ..|++||+||.+
T Consensus       168 a~t~AHElGHnl  179 (244)
T cd04270         168 DLVTAHELGHNF  179 (244)
T ss_pred             HHHHHHHHHHhc
Confidence            379999999975


No 49 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=71.01  E-value=2.1  Score=36.31  Aligned_cols=13  Identities=8%  Similarity=0.125  Sum_probs=11.2

Q ss_pred             HHHHHhHHhHHHh
Q psy12854         36 VENLFFHFGSLLQ   48 (265)
Q Consensus        36 v~tLfHEfGHalH   48 (265)
                      +.|+.|||||||=
T Consensus        75 ~g~v~HE~~HalG   87 (180)
T cd04280          75 LGTIVHELMHALG   87 (180)
T ss_pred             CchhHHHHHHHhc
Confidence            5789999999874


No 50 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=69.17  E-value=3.2  Score=33.51  Aligned_cols=17  Identities=12%  Similarity=0.268  Sum_probs=13.5

Q ss_pred             CcccHHHHHhHHhHHHh
Q psy12854         32 TFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH   48 (265)
                      .++.+....||+||.|=
T Consensus        74 gy~~~RIaaHE~GHiLG   90 (132)
T PF02031_consen   74 GYNSTRIAAHELGHILG   90 (132)
T ss_dssp             HS-HHHHHHHHHHHHHT
T ss_pred             CCccceeeeehhccccC
Confidence            46678899999999875


No 51 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=67.97  E-value=3.4  Score=35.32  Aligned_cols=15  Identities=20%  Similarity=0.343  Sum_probs=12.4

Q ss_pred             cHHHHHhHHhHHHhH
Q psy12854         35 QVENLFFHFGSLLQR   49 (265)
Q Consensus        35 ~v~tLfHEfGHalH~   49 (265)
                      .+.|+.|||||||=.
T Consensus        79 ~~~~i~HEl~HaLG~   93 (191)
T PF01400_consen   79 SVGTILHELGHALGF   93 (191)
T ss_dssp             SHHHHHHHHHHHHTB
T ss_pred             CccchHHHHHHHHhh
Confidence            567999999999843


No 52 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=67.56  E-value=2.5  Score=36.97  Aligned_cols=13  Identities=8%  Similarity=0.094  Sum_probs=10.7

Q ss_pred             HHHHhHHhHHHhH
Q psy12854         37 ENLFFHFGSLLQR   49 (265)
Q Consensus        37 ~tLfHEfGHalH~   49 (265)
                      .++.||+||+|=.
T Consensus       135 hvi~HEiGH~IGf  147 (211)
T PF12388_consen  135 HVITHEIGHCIGF  147 (211)
T ss_pred             HHHHHHhhhhccc
Confidence            4789999999853


No 53 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=66.97  E-value=3  Score=35.26  Aligned_cols=14  Identities=14%  Similarity=0.102  Sum_probs=12.1

Q ss_pred             ccHHHHHhHHhHHH
Q psy12854         34 KQVENLFFHFGSLL   47 (265)
Q Consensus        34 ~~v~tLfHEfGHal   47 (265)
                      .-+.+++||+||.+
T Consensus       130 ~~a~~~AHElGH~l  143 (194)
T cd04269         130 LFAVTMAHELGHNL  143 (194)
T ss_pred             HHHHHHHHHHHhhc
Confidence            44789999999987


No 54 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=66.13  E-value=3.2  Score=35.47  Aligned_cols=15  Identities=7%  Similarity=-0.235  Sum_probs=12.0

Q ss_pred             HHHHHhHHhHHHhHH
Q psy12854         36 VENLFFHFGSLLQRS   50 (265)
Q Consensus        36 v~tLfHEfGHalH~l   50 (265)
                      +-|+.|||||||=..
T Consensus        78 ~G~i~HEl~HaLG~~   92 (182)
T cd04283          78 KGIIQHELLHALGFY   92 (182)
T ss_pred             cchHHHHHHHHhCCc
Confidence            568999999998543


No 55 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=66.01  E-value=3.9  Score=30.06  Aligned_cols=19  Identities=11%  Similarity=0.086  Sum_probs=15.6

Q ss_pred             CcccHHHHHhHHhHHHhHH
Q psy12854         32 TFKQVENLFFHFGSLLQRS   50 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH~l   50 (265)
                      +..+...|+||+.|.++.-
T Consensus        58 s~~~~~llaHEl~Hv~Qq~   76 (79)
T PF13699_consen   58 SPEGRALLAHELAHVVQQR   76 (79)
T ss_pred             CCCcchhHhHHHHHHHhhc
Confidence            5567788999999999853


No 56 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=63.98  E-value=4.5  Score=34.07  Aligned_cols=24  Identities=13%  Similarity=0.239  Sum_probs=15.6

Q ss_pred             HHHHHhHHhHHHhHHhhcCCCCCC
Q psy12854         36 VENLFFHFGSLLQRSLTATHYSDV   59 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~~~~~~~   59 (265)
                      +..+.||+||++=......+....
T Consensus         8 i~i~~HE~gH~~~a~~~G~~~~~~   31 (192)
T PF02163_consen    8 ISIVLHELGHALAARLYGDKVPRF   31 (192)
T ss_dssp             HHHHHHHHHHHHHHHTTT--B--E
T ss_pred             cccccccccccccccccccccccc
Confidence            457899999999877766554444


No 57 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=60.81  E-value=4.5  Score=35.08  Aligned_cols=14  Identities=7%  Similarity=0.180  Sum_probs=11.6

Q ss_pred             HHHHHhHHhHHHhH
Q psy12854         36 VENLFFHFGSLLQR   49 (265)
Q Consensus        36 v~tLfHEfGHalH~   49 (265)
                      +-|+.||+||||=.
T Consensus        88 ~Gti~HEl~HaLGf  101 (200)
T cd04281          88 FGIVVHELGHVIGF  101 (200)
T ss_pred             CchHHHHHHHHhcC
Confidence            56899999998853


No 58 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=59.14  E-value=1.9  Score=39.28  Aligned_cols=14  Identities=21%  Similarity=0.183  Sum_probs=11.6

Q ss_pred             cHHHHHhHHhHHHh
Q psy12854         35 QVENLFFHFGSLLQ   48 (265)
Q Consensus        35 ~v~tLfHEfGHalH   48 (265)
                      .+-|++|||||+|-
T Consensus       165 ~igv~~HE~gH~lG  178 (286)
T TIGR03296       165 GVGVIAHELGHDLG  178 (286)
T ss_pred             ceeeeehhhhcccC
Confidence            37899999999763


No 59 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=58.69  E-value=6.2  Score=32.94  Aligned_cols=19  Identities=11%  Similarity=0.109  Sum_probs=14.2

Q ss_pred             HHHHHhHHhHHHhHHhhcC
Q psy12854         36 VENLFFHFGSLLQRSLTAT   54 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~~   54 (265)
                      +..+.||+||++=...-..
T Consensus         9 i~i~iHE~gH~~~A~~~G~   27 (180)
T cd05709           9 ISVTVHELGHALVARRLGV   27 (180)
T ss_pred             HHHHHHHHHHHHHHHHcCC
Confidence            4568999999986665544


No 60 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=57.49  E-value=5.9  Score=34.27  Aligned_cols=12  Identities=25%  Similarity=0.127  Sum_probs=10.3

Q ss_pred             HHHHhHHhHHHh
Q psy12854         37 ENLFFHFGSLLQ   48 (265)
Q Consensus        37 ~tLfHEfGHalH   48 (265)
                      .++.||+||+|=
T Consensus       118 ~~~~he~gh~lG  129 (197)
T cd04276         118 YLLAHEVGHTLG  129 (197)
T ss_pred             HHHHHHHHHHhc
Confidence            589999999874


No 61 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=56.04  E-value=6  Score=35.83  Aligned_cols=19  Identities=11%  Similarity=0.097  Sum_probs=12.3

Q ss_pred             cHHHHHhHHhHHHhHHhhc
Q psy12854         35 QVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        35 ~v~tLfHEfGHalH~lls~   53 (265)
                      -..++.|||||++-.|--+
T Consensus       216 ~~~v~vHE~GHsf~~LaDE  234 (264)
T PF09471_consen  216 FKQVVVHEFGHSFGGLADE  234 (264)
T ss_dssp             HHHHHHHHHHHHTT-----
T ss_pred             ccceeeeeccccccccccc
Confidence            3678999999999887654


No 62 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=56.04  E-value=7.3  Score=33.20  Aligned_cols=20  Identities=15%  Similarity=0.255  Sum_probs=14.9

Q ss_pred             HHHHHhHHhHHHhHHhhcCC
Q psy12854         36 VENLFFHFGSLLQRSLTATH   55 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~~~   55 (265)
                      +..+.||+||.+=.-....+
T Consensus        10 ~~v~iHElGH~~~Ar~~Gv~   29 (182)
T cd06163          10 ILIFVHELGHFLVAKLFGVK   29 (182)
T ss_pred             HHHHHHHHHHHHHHHHcCCe
Confidence            45789999998877665443


No 63 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=53.46  E-value=8.5  Score=33.33  Aligned_cols=17  Identities=18%  Similarity=0.190  Sum_probs=12.6

Q ss_pred             HHHHhHHhHHHhHHhhc
Q psy12854         37 ENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        37 ~tLfHEfGHalH~lls~   53 (265)
                      ..++||+||++=...-.
T Consensus        40 ~v~iHElgH~~~A~~~G   56 (208)
T cd06161          40 SVLLHELGHALVARRYG   56 (208)
T ss_pred             HHHHHHHHHHHHHHHcC
Confidence            56899999987655443


No 64 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=52.77  E-value=7.1  Score=34.64  Aligned_cols=16  Identities=13%  Similarity=-0.118  Sum_probs=12.4

Q ss_pred             HHHHHhHHhHHHhHHh
Q psy12854         36 VENLFFHFGSLLQRSL   51 (265)
Q Consensus        36 v~tLfHEfGHalH~ll   51 (265)
                      +-|+.||+||||=..-
T Consensus       121 ~Gti~HEl~HalGf~H  136 (230)
T cd04282         121 KATVEHEFLHALGFYH  136 (230)
T ss_pred             CchHHHHHHHHhCCcc
Confidence            4689999999985433


No 65 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=51.84  E-value=10  Score=32.97  Aligned_cols=18  Identities=11%  Similarity=0.189  Sum_probs=14.1

Q ss_pred             CcccHHHHHhHHhHHHhH
Q psy12854         32 TFKQVENLFFHFGSLLQR   49 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH~   49 (265)
                      +...+..-+||.|||+|+
T Consensus        89 Sia~~aVAAHEVGHAiQd  106 (226)
T COG2738          89 SIAAIAVAAHEVGHAIQD  106 (226)
T ss_pred             cHHHHHHHHHHhhHHHhh
Confidence            345566779999999996


No 66 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=50.76  E-value=8.3  Score=32.87  Aligned_cols=21  Identities=19%  Similarity=0.366  Sum_probs=14.8

Q ss_pred             cCcccHH-HHHhHHhHHHhHHh
Q psy12854         31 LTFKQVE-NLFFHFGSLLQRSL   51 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~ll   51 (265)
                      ++-+|+. .|+||+||..|.=.
T Consensus        84 ~~~~el~aVlaHElgH~~~~h~  105 (226)
T PF01435_consen   84 LSEDELAAVLAHELGHIKHRHI  105 (226)
T ss_dssp             SSHHHHHHHHHHHHHHHHTTHC
T ss_pred             ccHHHHHHHHHHHHHHHHcCCc
Confidence            3445654 78999999987533


No 67 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=50.63  E-value=15  Score=31.12  Aligned_cols=15  Identities=13%  Similarity=0.115  Sum_probs=11.7

Q ss_pred             cccHHHHHhHHhHHH
Q psy12854         33 FKQVENLFFHFGSLL   47 (265)
Q Consensus        33 ~~~v~tLfHEfGHal   47 (265)
                      ..-..+++||+||.+
T Consensus       129 ~~~a~~~AHelGH~l  143 (199)
T PF01421_consen  129 LSFAVIIAHELGHNL  143 (199)
T ss_dssp             HHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHhc
Confidence            345678999999965


No 68 
>PF07607 DUF1570:  Protein of unknown function (DUF1570);  InterPro: IPR011464 This entry represents hypothetical proteins confined to bacteria.
Probab=47.69  E-value=19  Score=29.06  Aligned_cols=41  Identities=2%  Similarity=-0.112  Sum_probs=27.8

Q ss_pred             cHHHHHhHHhHHHhHHhhcCCC-CCCcccCCccchhHhhchHHHHHHh
Q psy12854         35 QVENLFFHFGSLLQRSLTATHY-SDVSGVNNVEWDSVYIINYFLTHFL   81 (265)
Q Consensus        35 ~v~tLfHEfGHalH~lls~~~~-~~~sg~~~~~~d~~E~pS~~~E~~~   81 (265)
                      ++.|++||   +.|+++-++.. +.++   .+|+-+.|=-.+++|..-
T Consensus         1 ~~~T~~HE---a~HQl~~N~Gl~~r~~---~~P~Wv~EGlA~yFE~~~   42 (128)
T PF07607_consen    1 NIATIAHE---ATHQLAFNTGLHPRLA---DWPRWVSEGLATYFETPG   42 (128)
T ss_pred             CchHHHHH---HHHHHHHHccccccCC---CCchHHHHhHHHHcCCCc
Confidence            57899999   67777766542 3333   478888887777777553


No 69 
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=46.98  E-value=12  Score=33.83  Aligned_cols=16  Identities=6%  Similarity=0.056  Sum_probs=12.0

Q ss_pred             HHHHHhHHhHHHhHHh
Q psy12854         36 VENLFFHFGSLLQRSL   51 (265)
Q Consensus        36 v~tLfHEfGHalH~ll   51 (265)
                      +..+.||+||++=...
T Consensus       119 isv~iHElgHa~~Ar~  134 (263)
T cd06159         119 VGVVVHELSHGILARV  134 (263)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3467899999986544


No 70 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=46.20  E-value=4  Score=35.09  Aligned_cols=15  Identities=13%  Similarity=0.312  Sum_probs=12.4

Q ss_pred             cccHHHHHhHHhHHH
Q psy12854         33 FKQVENLFFHFGSLL   47 (265)
Q Consensus        33 ~~~v~tLfHEfGHal   47 (265)
                      +..+.+++||+||.|
T Consensus       138 ~~~a~~~aHElGH~L  152 (207)
T cd04273         138 LSSAFTIAHELGHVL  152 (207)
T ss_pred             ceeEEeeeeechhhc
Confidence            445788999999987


No 71 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=44.41  E-value=14  Score=32.51  Aligned_cols=16  Identities=19%  Similarity=0.214  Sum_probs=11.7

Q ss_pred             HHHHHhHHhHHHhHHh
Q psy12854         36 VENLFFHFGSLLQRSL   51 (265)
Q Consensus        36 v~tLfHEfGHalH~ll   51 (265)
                      +..+.||+||++=...
T Consensus        54 ~~v~iHElgH~~~A~~   69 (227)
T cd06164          54 ASVLLHELGHSLVARR   69 (227)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4557999999876443


No 72 
>PF00427 PBS_linker_poly:  Phycobilisome Linker polypeptide;  InterPro: IPR001297 Phycobilisomes (PBSs) are the major light-harvesting systems in cyanobacteria and red algae. PBS is a supercomplex that is composed of a core complex and multiple peripheral rod complexes. Typically, the core consists of two or five cylinders lying on the membrane with, in most cases, multiple rods radiating from the core to form a hemidiscoidal structure. The building units of the core cylinders and the peripheral rods are trimeric and hexameric discs, in which a monomer consists of a pair of related phycobiliproteins (PBPs), such as phycorerythrins, phycoerythrocyanins, phycocyanins, and allophycocyanins. The discs are connected to each other via specific linker polypeptides to form peripheral rods or core cylinders. Linker polypeptides share a conserved domain of ~180 residues, which can be present in one or multiple copies [, , , , ].; GO: 0015979 photosynthesis, 0030089 phycobilisome; PDB: 2L8V_A 2KY4_A 3OSJ_D 2L06_A 3NPH_B 2L3W_A 3PRU_C 3OHW_A.
Probab=43.49  E-value=14  Score=29.94  Aligned_cols=31  Identities=19%  Similarity=0.292  Sum_probs=20.6

Q ss_pred             HHHHHHHhcC-CCCccHHHHHHHhhCCCCCch
Q psy12854        226 SRLRDTFLTF-GGSCHSSEVFRRFRGRDPCFK  256 (265)
Q Consensus       226 ~~yR~~iL~~-Ggs~~~~e~l~~flGr~ps~~  256 (265)
                      +.||+.++.+ ...+-..-.++.+|||+|...
T Consensus        58 ~~yr~~f~~~~~~~R~iEl~~khlLGR~p~~~   89 (131)
T PF00427_consen   58 ELYRKRFFEPNSNYRFIELAFKHLLGRAPYNQ   89 (131)
T ss_dssp             HHHHHHHTTTS-HHHHHHHHHHHHCSS--SSH
T ss_pred             HHHHHHHcccccchHHHHHHHHHHhCCCCCCH
Confidence            5688888874 344455668999999999843


No 73 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=43.13  E-value=12  Score=33.54  Aligned_cols=20  Identities=30%  Similarity=0.433  Sum_probs=15.2

Q ss_pred             ccCcccHH-HHHhHHhHHHhH
Q psy12854         30 LLTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        30 ll~~~~v~-tLfHEfGHalH~   49 (265)
                      .|+-+|+. .|+||+||..|.
T Consensus       151 ~l~~dEl~aVlaHElgHi~~r  171 (302)
T COG0501         151 LLNDDELEAVLAHELGHIKNR  171 (302)
T ss_pred             hCCHHHHHHHHHHHHHHHhcc
Confidence            45667765 489999998874


No 74 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=42.63  E-value=13  Score=34.61  Aligned_cols=14  Identities=14%  Similarity=0.344  Sum_probs=11.2

Q ss_pred             ccHHHHHhHHhHHH
Q psy12854         34 KQVENLFFHFGSLL   47 (265)
Q Consensus        34 ~~v~tLfHEfGHal   47 (265)
                      .+..++|||+||-+
T Consensus       149 ~~~~~~~HElgHN~  162 (314)
T PF05548_consen  149 QDWATIMHELGHNL  162 (314)
T ss_pred             ccHHHHHHHhhhhc
Confidence            45679999999954


No 75 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=41.95  E-value=14  Score=34.18  Aligned_cols=18  Identities=11%  Similarity=0.216  Sum_probs=13.4

Q ss_pred             ccCcccHHHHHhHHhHHH
Q psy12854         30 LLTFKQVENLFFHFGSLL   47 (265)
Q Consensus        30 ll~~~~v~tLfHEfGHal   47 (265)
                      .+....-.++.||+||++
T Consensus       188 t~~~s~~~~f~HE~GH~~  205 (305)
T PF10462_consen  188 TLDYSYGNEFSHELGHNF  205 (305)
T ss_dssp             E-SS-SHHHHHHHHHHTT
T ss_pred             ecccCccceeehhhhhhc
Confidence            346667889999999986


No 76 
>PRK03982 heat shock protein HtpX; Provisional
Probab=40.37  E-value=14  Score=33.56  Aligned_cols=19  Identities=11%  Similarity=0.163  Sum_probs=14.0

Q ss_pred             cCcccH-HHHHhHHhHHHhH
Q psy12854         31 LTFKQV-ENLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v-~tLfHEfGHalH~   49 (265)
                      ++-+|+ .-++||+||.-|.
T Consensus       120 l~~~El~AVlAHElgHi~~~  139 (288)
T PRK03982        120 LNEDELEGVIAHELTHIKNR  139 (288)
T ss_pred             CCHHHHHHHHHHHHHHHHcC
Confidence            455664 4589999998874


No 77 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=40.27  E-value=18  Score=33.07  Aligned_cols=16  Identities=6%  Similarity=0.154  Sum_probs=12.1

Q ss_pred             HHHHHhHHhHHHhHHh
Q psy12854         36 VENLFFHFGSLLQRSL   51 (265)
Q Consensus        36 v~tLfHEfGHalH~ll   51 (265)
                      +.-+.||+||++=...
T Consensus       136 isvvvHElgHal~A~~  151 (277)
T cd06162         136 ISGVVHEMGHGVAAVR  151 (277)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3567899999986544


No 78 
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=39.49  E-value=16  Score=33.12  Aligned_cols=16  Identities=13%  Similarity=0.034  Sum_probs=11.0

Q ss_pred             cHHHHHhHHhHHHhHH
Q psy12854         35 QVENLFFHFGSLLQRS   50 (265)
Q Consensus        35 ~v~tLfHEfGHalH~l   50 (265)
                      +..=++||+||.++.-
T Consensus       219 ~~WG~~HE~GH~~Q~~  234 (307)
T PF13402_consen  219 GGWGPWHELGHNHQQG  234 (307)
T ss_dssp             H-HHHHHHHHHHH-BG
T ss_pred             CCeeehhhhhhhcCcc
Confidence            3456899999999843


No 79 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=39.05  E-value=17  Score=30.88  Aligned_cols=17  Identities=12%  Similarity=0.204  Sum_probs=11.9

Q ss_pred             cCcccHHHHHhHHhHHH
Q psy12854         31 LTFKQVENLFFHFGSLL   47 (265)
Q Consensus        31 l~~~~v~tLfHEfGHal   47 (265)
                      +.-.-+.+..||+||.+
T Consensus       121 ~~~R~~k~~~HElGH~l  137 (179)
T PRK13267        121 FEERVRKEVTHELGHTL  137 (179)
T ss_pred             HHHHHHHHHHHHHHHHc
Confidence            33344566899999975


No 80 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=38.78  E-value=4.9  Score=35.48  Aligned_cols=17  Identities=12%  Similarity=0.198  Sum_probs=13.2

Q ss_pred             CcccHHHHHhHHhHHHh
Q psy12854         32 TFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        32 ~~~~v~tLfHEfGHalH   48 (265)
                      ..+.-.|+.||+||.|-
T Consensus       134 ~~n~g~t~~HEvGH~lG  150 (225)
T cd04275         134 PYNLGDTATHEVGHWLG  150 (225)
T ss_pred             cccccceeEEeccceee
Confidence            34556799999999764


No 81 
>PRK03001 M48 family peptidase; Provisional
Probab=37.89  E-value=16  Score=33.10  Aligned_cols=19  Identities=11%  Similarity=0.282  Sum_probs=14.4

Q ss_pred             cCcccHH-HHHhHHhHHHhH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~   49 (265)
                      ++-+|+. -++||+||.-|.
T Consensus       119 l~~~El~aVlAHElgHi~~~  138 (283)
T PRK03001        119 LSEREIRGVMAHELAHVKHR  138 (283)
T ss_pred             CCHHHHHHHHHHHHHHHhCC
Confidence            4666644 589999998875


No 82 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=37.43  E-value=22  Score=30.35  Aligned_cols=17  Identities=12%  Similarity=0.006  Sum_probs=11.9

Q ss_pred             HHHHhHHhHHHhHHhhc
Q psy12854         37 ENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        37 ~tLfHEfGHalH~lls~   53 (265)
                      ..+.||+||++=...-.
T Consensus        43 ~l~iHElgH~~~A~~~G   59 (183)
T cd06160          43 ILGIHEMGHYLAARRHG   59 (183)
T ss_pred             HHHHHHHHHHHHHHHCC
Confidence            34689999987654443


No 83 
>KOG3714|consensus
Probab=37.35  E-value=16  Score=35.12  Aligned_cols=20  Identities=5%  Similarity=-0.066  Sum_probs=15.9

Q ss_pred             HHHHHhHHhHHHhHHhhcCC
Q psy12854         36 VENLFFHFGSLLQRSLTATH   55 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~~~   55 (265)
                      .-|..||+||||=..-.+++
T Consensus       160 ~G~i~HEl~HaLGf~HehsR  179 (411)
T KOG3714|consen  160 FGTIVHELMHALGFWHEHSR  179 (411)
T ss_pred             CchhHHHHHHHhhhhhccCc
Confidence            66889999999977666554


No 84 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=36.83  E-value=23  Score=29.20  Aligned_cols=19  Identities=5%  Similarity=0.105  Sum_probs=14.1

Q ss_pred             CccCccc-HHHHHhHHhHHH
Q psy12854         29 CLLTFKQ-VENLFFHFGSLL   47 (265)
Q Consensus        29 ~ll~~~~-v~tLfHEfGHal   47 (265)
                      ..-.+.. +-.++|||+|++
T Consensus       128 ~f~~~~~~lDVvaHEltHGV  147 (150)
T PF01447_consen  128 IFKPFASSLDVVAHELTHGV  147 (150)
T ss_dssp             SBS-GGG-HHHHHHHHHHHH
T ss_pred             ccccCccccceeeecccccc
Confidence            4445554 889999999987


No 85 
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=36.09  E-value=21  Score=34.33  Aligned_cols=17  Identities=24%  Similarity=0.337  Sum_probs=12.9

Q ss_pred             HHHHHhHHhHHHhHHhh
Q psy12854         36 VENLFFHFGSLLQRSLT   52 (265)
Q Consensus        36 v~tLfHEfGHalH~lls   52 (265)
                      +..+.||+||.+=.-.-
T Consensus        15 ~~v~~HE~gH~~~a~~~   31 (420)
T TIGR00054        15 VLIFVHELGHFLAARLC   31 (420)
T ss_pred             HHHHHHhHHHHHHHHHc
Confidence            56789999998766543


No 86 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=35.80  E-value=21  Score=30.81  Aligned_cols=22  Identities=14%  Similarity=0.209  Sum_probs=16.5

Q ss_pred             CCCccCcccHHHHHhHHhHHHh
Q psy12854         27 KPCLLTFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        27 ~p~ll~~~~v~tLfHEfGHalH   48 (265)
                      .+.++.-+-+.+..||+||.+-
T Consensus       137 ~~~l~~~R~~Kea~HElGH~~G  158 (194)
T PF07998_consen  137 DEELFLERVCKEAVHELGHLFG  158 (194)
T ss_dssp             SHHHHHHHHHHHHHHHHHHHTT
T ss_pred             cHHHHHHHHHHHHHHHHHHHcC
Confidence            3455556667899999999764


No 87 
>PRK02870 heat shock protein HtpX; Provisional
Probab=35.72  E-value=19  Score=33.83  Aligned_cols=20  Identities=5%  Similarity=0.102  Sum_probs=14.7

Q ss_pred             ccCcccHH-HHHhHHhHHHhH
Q psy12854         30 LLTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        30 ll~~~~v~-tLfHEfGHalH~   49 (265)
                      .|+-+|+. -++||+||.-|.
T Consensus       167 ~L~~dEL~aVlAHELgHik~~  187 (336)
T PRK02870        167 KLDRDELQAVMAHELSHIRHG  187 (336)
T ss_pred             hCCHHHHHHHHHHHHHHHHcc
Confidence            34666654 579999999875


No 88 
>PF01742 Peptidase_M27:  Clostridial neurotoxin zinc protease This family is a subset of the Prosite family;  InterPro: IPR000395 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M27 (clan MA(E)). A number of the proteins have been classified as non-peptidase homologues as they have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the family. There are seven antigenically distinct forms of botulinum neurotoxin, designated A, B, C1, D, E, F and G. The seven neurotoxins are potent protein toxins that inhibit neurotransmitter release from peripheral cholinergic synapses []. On binding to the neuronal synapses, the molecules are internalised and move by retrograde transport up the axon into the spinal cord, where they can move between post- and presynaptic neurons. The toxin inhibits neurotransmitter release by acting as a zinc endopeptidase that cleaves synaptic proteins such as synaptobrevins, syntaxin and SNAP-25 []. The protein toxins exist as disulphide-linked heterodimers of light and heavy chains. The light chain has the pharmacological activity, while the N- and C-termini of the heavy chain mediate channel formation and toxin binding []. The light chain exhibits a high level of sequence similarity to tetanus toxin (TeTx). Alignment of all characterised neurotoxin sequences reveals the presence of highly conserved amino acid domains interspersed with amino acid tracts with little overall similarity. The most divergent region corresponds to the C-terminal extremity of each toxin, which may reflect differences in specificity of binding to neurone acceptor sites []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0009405 pathogenesis; PDB: 2QN0_A 3D3X_A 3NF3_A 2ISG_A 2ISE_B 2IMC_B 2IMA_B 2ILP_A 2IMB_A 2ISH_A ....
Probab=35.03  E-value=28  Score=33.26  Aligned_cols=24  Identities=8%  Similarity=-0.029  Sum_probs=18.3

Q ss_pred             cCcccHHHHHhHHhHHHhHHhhcC
Q psy12854         31 LTFKQVENLFFHFGSLLQRSLTAT   54 (265)
Q Consensus        31 l~~~~v~tLfHEfGHalH~lls~~   54 (265)
                      .-.|=..+|+||+=|+||.|..-.
T Consensus       210 F~~DPAl~LmheLIh~Lh~LYGik  233 (408)
T PF01742_consen  210 FYADPALELMHELIHSLHGLYGIK  233 (408)
T ss_dssp             EE--HHHHHHHHHHHHHHHHTTTH
T ss_pred             eecCHHHHHHHHHHHHHHhhcccc
Confidence            334557899999999999998753


No 89 
>PRK01345 heat shock protein HtpX; Provisional
Probab=34.97  E-value=19  Score=33.32  Aligned_cols=20  Identities=15%  Similarity=0.243  Sum_probs=14.1

Q ss_pred             cCcccHH-HHHhHHhHHHhHH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQRS   50 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~l   50 (265)
                      ++-+|+. .++||+||..|.=
T Consensus       119 L~~dEL~aVlAHElgHi~~~d  139 (317)
T PRK01345        119 LSPEEVAGVMAHELAHVKNRD  139 (317)
T ss_pred             CCHHHHHHHHHHHHHHHHcCC
Confidence            3456644 5899999988753


No 90 
>PRK04897 heat shock protein HtpX; Provisional
Probab=34.92  E-value=19  Score=32.96  Aligned_cols=19  Identities=5%  Similarity=0.086  Sum_probs=13.7

Q ss_pred             cCcccHH-HHHhHHhHHHhH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~   49 (265)
                      ++-+|+. -++||+||.-|.
T Consensus       132 l~~~El~aVlAHElgHi~~~  151 (298)
T PRK04897        132 MNREELEGVIGHEISHIRNY  151 (298)
T ss_pred             CCHHHHHHHHHHHHHHHhcC
Confidence            3556644 589999998864


No 91 
>PRK05457 heat shock protein HtpX; Provisional
Probab=31.38  E-value=24  Score=32.14  Aligned_cols=19  Identities=16%  Similarity=0.223  Sum_probs=14.0

Q ss_pred             cCcccHH-HHHhHHhHHHhH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~   49 (265)
                      |+.+|+. -++||+||.-|.
T Consensus       129 L~~~El~aVlAHElgHi~~~  148 (284)
T PRK05457        129 MSRDEVEAVLAHEISHIANG  148 (284)
T ss_pred             CCHHHHHHHHHHHHHHHHcC
Confidence            4556654 579999998774


No 92 
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=31.29  E-value=28  Score=33.69  Aligned_cols=15  Identities=20%  Similarity=0.208  Sum_probs=11.3

Q ss_pred             HHHHHhHHhHHHhHH
Q psy12854         36 VENLFFHFGSLLQRS   50 (265)
Q Consensus        36 v~tLfHEfGHalH~l   50 (265)
                      +..++||+||-+=.-
T Consensus        16 ~li~vHElGHfl~Ar   30 (449)
T PRK10779         16 VLITVHEFGHFWVAR   30 (449)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456799999987553


No 93 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=30.82  E-value=7.8  Score=39.51  Aligned_cols=13  Identities=23%  Similarity=0.322  Sum_probs=11.1

Q ss_pred             cHHHHHhHHhHHH
Q psy12854         35 QVENLFFHFGSLL   47 (265)
Q Consensus        35 ~v~tLfHEfGHal   47 (265)
                      -|-+++|||||+|
T Consensus       221 giGVfaHEfGH~L  233 (645)
T PF05547_consen  221 GIGVFAHEFGHDL  233 (645)
T ss_pred             ceEEEEeeccccC
Confidence            3678999999987


No 94 
>PRK03072 heat shock protein HtpX; Provisional
Probab=30.80  E-value=25  Score=32.10  Aligned_cols=19  Identities=16%  Similarity=0.238  Sum_probs=13.9

Q ss_pred             cCcccHH-HHHhHHhHHHhH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~   49 (265)
                      ++.+|+. -++||+||.-|.
T Consensus       122 l~~~El~aVlAHElgHi~~~  141 (288)
T PRK03072        122 LNERELRGVLGHELSHVYNR  141 (288)
T ss_pred             CCHHHHHHHHHHHHHHHhcC
Confidence            4556644 589999998874


No 95 
>KOG2921|consensus
Probab=29.74  E-value=31  Score=33.14  Aligned_cols=34  Identities=18%  Similarity=0.214  Sum_probs=23.3

Q ss_pred             CCCCCCCCccCcccHHHHHhHHhHHHhHHhhcCC
Q psy12854         22 EPQYGKPCLLTFKQVENLFFHFGSLLQRSLTATH   55 (265)
Q Consensus        22 ~p~~~~p~ll~~~~v~tLfHEfGHalH~lls~~~   55 (265)
                      -|-.+-+-.|+--=+.-.+||+|||+-...-..+
T Consensus       118 Lpl~~I~yf~t~lvi~~vvHElGHalAA~segV~  151 (484)
T KOG2921|consen  118 LPLSGIAYFLTSLVITVVVHELGHALAAASEGVQ  151 (484)
T ss_pred             cccccchhhhhhHHHHHHHHHhhHHHHHHhcCce
Confidence            3334445556666677789999999987765544


No 96 
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=29.72  E-value=24  Score=32.12  Aligned_cols=11  Identities=18%  Similarity=0.407  Sum_probs=9.7

Q ss_pred             HHHHHhHHhHH
Q psy12854         36 VENLFFHFGSL   46 (265)
Q Consensus        36 v~tLfHEfGHa   46 (265)
                      ..|++||+||.
T Consensus       204 yaTl~HElghw  214 (316)
T COG4227         204 YATLLHELGHW  214 (316)
T ss_pred             HHHHHHHhccc
Confidence            68999999984


No 97 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=29.28  E-value=26  Score=30.76  Aligned_cols=22  Identities=18%  Similarity=0.081  Sum_probs=17.0

Q ss_pred             CCCccCcccHHHHHhHHhHHHh
Q psy12854         27 KPCLLTFKQVENLFFHFGSLLQ   48 (265)
Q Consensus        27 ~p~ll~~~~v~tLfHEfGHalH   48 (265)
                      .|..+.-+=+.|.-||+|||+=
T Consensus       179 ~pg~~~e~L~~tarhElGhaLg  200 (236)
T COG5549         179 PPGELRENLNPTARHELGHALG  200 (236)
T ss_pred             CcccchhhhhHHHHHhhcchhe
Confidence            3555566668899999999984


No 98 
>PRK02391 heat shock protein HtpX; Provisional
Probab=27.89  E-value=30  Score=31.74  Aligned_cols=19  Identities=16%  Similarity=0.200  Sum_probs=13.5

Q ss_pred             cCcccH-HHHHhHHhHHHhH
Q psy12854         31 LTFKQV-ENLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v-~tLfHEfGHalH~   49 (265)
                      |+-+|+ .-++||+||.-|.
T Consensus       128 L~~~El~aVlaHElgHi~~~  147 (296)
T PRK02391        128 LDPDELEAVLAHELSHVKNR  147 (296)
T ss_pred             CCHHHHHHHHHHHHHHHHcC
Confidence            344554 4589999998775


No 99 
>PF04539 Sigma70_r3:  Sigma-70 region 3;  InterPro: IPR007624 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.  With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ].  Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but in some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme [].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1L0O_C 1KU2_B 1RP3_C 1SC5_A 1TTY_A 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P ....
Probab=27.88  E-value=50  Score=23.28  Aligned_cols=24  Identities=13%  Similarity=0.260  Sum_probs=17.5

Q ss_pred             HHHHHHhhCCCCCchHHHHhcCCC
Q psy12854        242 SEVFRRFRGRDPCFKPFLDMFRLN  265 (265)
Q Consensus       242 ~e~l~~flGr~ps~~a~~~~~g~~  265 (265)
                      ..-|..=+||+|+.+-..+.+|+.
T Consensus        10 ~~~L~~~lgr~Pt~eEiA~~lgis   33 (78)
T PF04539_consen   10 RRELEQELGREPTDEEIAEELGIS   33 (78)
T ss_dssp             HHHHHHHHSS--BHHHHHHHHTS-
T ss_pred             HHHHHHHhCCCCCHHHHHHHHccc
Confidence            456777889999999999998874


No 100
>PRK01265 heat shock protein HtpX; Provisional
Probab=27.05  E-value=32  Score=32.12  Aligned_cols=19  Identities=21%  Similarity=0.277  Sum_probs=14.2

Q ss_pred             cCcccHH-HHHhHHhHHHhH
Q psy12854         31 LTFKQVE-NLFFHFGSLLQR   49 (265)
Q Consensus        31 l~~~~v~-tLfHEfGHalH~   49 (265)
                      ++.+|+. .++||+||.-|.
T Consensus       135 l~~~El~aVlAHElgHik~~  154 (324)
T PRK01265        135 LNRDEIKAVAGHELGHLKHR  154 (324)
T ss_pred             CCHHHHHHHHHHHHHHHHcc
Confidence            4567754 589999998775


No 101
>PHA00527 hypothetical protein
Probab=26.89  E-value=1.5e+02  Score=23.18  Aligned_cols=22  Identities=5%  Similarity=-0.047  Sum_probs=18.8

Q ss_pred             HHHHHhHHhHHHhHHhhcCCCC
Q psy12854         36 VENLFFHFGSLLQRSLTATHYS   57 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~~~~~   57 (265)
                      +.||.||--|.--.++++..-.
T Consensus        74 ~~T~~HECAH~AF~vC~~VGV~   95 (129)
T PHA00527         74 AATLVHECAHVAFYVCRDVGVT   95 (129)
T ss_pred             HHHHHHHHHHHHHHHHHhcCcc
Confidence            5799999999999999986543


No 102
>TIGR00488 putative HD superfamily hydrolase of NAD metabolism. The function of this protein family is unknown. Members of this family of uncharacterized proteins from the Mycoplasmas are longer at the amino end, fused to a region of nicotinamide nucleotide adenylyltransferase, an NAD salvage biosynthesis enzyme. Members are putative metal-dependent phosphohydrolases for NAD metabolism.
Probab=25.35  E-value=3.7e+02  Score=21.76  Aligned_cols=98  Identities=13%  Similarity=0.216  Sum_probs=52.5

Q ss_pred             cHHHHHhHHhHHHhH--Hh---hcCCCC-CCcccCCccchhHhhchHHHHHHh-c-cHHHHHHHHhhhcCCcccHH--H-
Q psy12854         35 QVENLFFHFGSLLQR--SL---TATHYS-DVSGVNNVEWDSVYIINYFLTHFL-Y-EERVFAELNSHFATGEKLKM--T-  103 (265)
Q Consensus        35 ~v~tLfHEfGHalH~--ll---s~~~~~-~~sg~~~~~~d~~E~pS~~~E~~~-~-~~~~L~~ls~h~~t~~~lp~--~-  103 (265)
                      -+.-|.|.+|..+-.  ++   .+...+ .... .....+-.|+-..+++.|. . +++++..+..|....+..+.  . 
T Consensus        35 ~~AGLLHDIGk~~~~~~~~~~~~~~~~~~~~~~-~~~~~~H~~vGa~ll~~w~~~~~~~i~~aI~~H~~~~~~~~~l~~i  113 (158)
T TIGR00488        35 EIAGAYHDLAKFLPKEQLKQIAKREKMPAHLLY-PSPKLLHAYVGAYILKREFGVQDEDILDAIRNHTSGPPGMSLLDMI  113 (158)
T ss_pred             HHHHHHHHHhccCCHHHHHHHHHHcCCCchhhc-ccccccHHHHHHHHHHHHhCCCcHHHHHHHHHhCCCCCCCCHHHHH
Confidence            356789999984321  11   111100 0100 1134568899999999996 5 66788888777543332221  1 


Q ss_pred             --HH-HHHHHHhhhhhHHHHHHHHHHHHHHHHh
Q psy12854        104 --DE-QLKALRAHNAGIDVCSELFKANLDLQLH  133 (265)
Q Consensus       104 --~~-~l~~~~~~~~~~~~~~ql~~a~~D~~lh  133 (265)
                        ++ .+...+....+.+-+|.+.+-.+|..+.
T Consensus       114 V~lAD~i~~~~~~~~~~~~~~~~~~~~l~~~~~  146 (158)
T TIGR00488       114 IYVADKLEPNRGAGIEIDELRKLAKTDLKQAYI  146 (158)
T ss_pred             HHhHHHHhhcccccccHHHHHHHHHHHHHHHHH
Confidence              12 2222222344556667776666666554


No 103
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=24.60  E-value=1e+02  Score=24.49  Aligned_cols=13  Identities=15%  Similarity=0.130  Sum_probs=11.4

Q ss_pred             HHHHhHHhHHHhH
Q psy12854         37 ENLFFHFGSLLQR   49 (265)
Q Consensus        37 ~tLfHEfGHalH~   49 (265)
                      .||.||+-|+-|.
T Consensus        81 ~TL~HEL~H~WQ~   93 (141)
T PHA02456         81 DTLAHELNHAWQF   93 (141)
T ss_pred             HHHHHHHHHHHhh
Confidence            5899999999884


No 104
>KOG2719|consensus
Probab=24.34  E-value=47  Score=32.23  Aligned_cols=18  Identities=17%  Similarity=0.139  Sum_probs=13.5

Q ss_pred             HHHHHhHHhHHHhHHhhc
Q psy12854         36 VENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        36 v~tLfHEfGHalH~lls~   53 (265)
                      +..++||+||--|.-..+
T Consensus       281 ~AVl~HELGHW~~~H~~K  298 (428)
T KOG2719|consen  281 VAVLAHELGHWKLNHVLK  298 (428)
T ss_pred             HHHHHHHhhHHHHhhHHH
Confidence            557899999988764443


No 105
>PHA00657 crystallin beta/gamma motif-containing protein
Probab=24.30  E-value=50  Score=36.42  Aligned_cols=24  Identities=13%  Similarity=0.202  Sum_probs=19.9

Q ss_pred             CCccCcccHHHHHhHHhHHHhHHh
Q psy12854         28 PCLLTFKQVENLFFHFGSLLQRSL   51 (265)
Q Consensus        28 p~ll~~~~v~tLfHEfGHalH~ll   51 (265)
                      -+|+.-.|+.|+.||+||.+--++
T Consensus       784 I~l~KnAD~STFvHE~gH~fLE~~  807 (2052)
T PHA00657        784 VSLLKNADLSTFLHESGHFFLEVQ  807 (2052)
T ss_pred             EEEeccccHHHHHHHHHHHHHHHH
Confidence            467778999999999999966555


No 106
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=23.23  E-value=45  Score=28.43  Aligned_cols=20  Identities=25%  Similarity=0.426  Sum_probs=15.2

Q ss_pred             CCccCcccHHHHHhHHhHHH
Q psy12854         28 PCLLTFKQVENLFFHFGSLL   47 (265)
Q Consensus        28 p~ll~~~~v~tLfHEfGHal   47 (265)
                      +-|+.-+-+....||+||.+
T Consensus       117 ~~lf~ERv~KEv~HElGH~~  136 (181)
T COG1913         117 RELFKERVVKEVLHELGHLL  136 (181)
T ss_pred             hHHHHHHHHHHHHHHhhhhc
Confidence            34555666888999999976


No 107
>KOG1565|consensus
Probab=23.02  E-value=29  Score=34.13  Aligned_cols=18  Identities=11%  Similarity=-0.047  Sum_probs=12.7

Q ss_pred             cHHHHHhHHhHHHhHHhh
Q psy12854         35 QVENLFFHFGSLLQRSLT   52 (265)
Q Consensus        35 ~v~tLfHEfGHalH~lls   52 (265)
                      =..+.+||+||+|=..-|
T Consensus       211 l~~Va~HEiGH~LGL~HS  228 (469)
T KOG1565|consen  211 LFLVAAHEIGHALGLGHS  228 (469)
T ss_pred             hHHHhhhhcccccccCCC
Confidence            345789999999854333


No 108
>PLN02791 Nudix hydrolase homolog
Probab=22.17  E-value=37  Score=35.49  Aligned_cols=20  Identities=0%  Similarity=-0.175  Sum_probs=15.0

Q ss_pred             ccHHHHHhHHhHHHhHHhhc
Q psy12854         34 KQVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        34 ~~v~tLfHEfGHalH~lls~   53 (265)
                      ..+.|++||+||.+=-++..
T Consensus       584 f~~~v~lHElgHGsG~~~~~  603 (770)
T PLN02791        584 FFTHTICHECCHGIGPHTIT  603 (770)
T ss_pred             HHHHHHHHHhhcccccccee
Confidence            34899999999987655543


No 109
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=22.04  E-value=51  Score=28.88  Aligned_cols=13  Identities=8%  Similarity=-0.230  Sum_probs=10.5

Q ss_pred             HHHHHhHHhHHHh
Q psy12854         36 VENLFFHFGSLLQ   48 (265)
Q Consensus        36 v~tLfHEfGHalH   48 (265)
                      ...|+||++||.=
T Consensus        94 gsiLAHE~mHa~L  106 (212)
T PF12315_consen   94 GSILAHELMHAWL  106 (212)
T ss_pred             hhHHHHHHHHHHh
Confidence            4568999999865


No 110
>PF13946 DUF4214:  Domain of unknown function (DUF4214)
Probab=21.91  E-value=72  Score=22.86  Aligned_cols=21  Identities=14%  Similarity=0.300  Sum_probs=16.3

Q ss_pred             HHHHHHHhhCCCCCchHHHHh
Q psy12854        241 SSEVFRRFRGRDPCFKPFLDM  261 (265)
Q Consensus       241 ~~e~l~~flGr~ps~~a~~~~  261 (265)
                      ...+++..|||+|+.+.+--|
T Consensus        25 V~~lY~~~LgR~~D~~Gl~yw   45 (75)
T PF13946_consen   25 VNSLYQNILGREPDPEGLAYW   45 (75)
T ss_pred             HHHHHHHHHCCCCCHHHHHHH
Confidence            356888889999988877555


No 111
>KOG2661|consensus
Probab=21.77  E-value=1.1e+02  Score=29.08  Aligned_cols=53  Identities=13%  Similarity=-0.035  Sum_probs=28.3

Q ss_pred             CcccHH-HHHhHHhHHHhHHhhc--CCCCCC--cccCCccchhHhhch-HHHHHHhccH
Q psy12854         32 TFKQVE-NLFFHFGSLLQRSLTA--THYSDV--SGVNNVEWDSVYIIN-YFLTHFLYEE   84 (265)
Q Consensus        32 ~~~~v~-tLfHEfGHalH~lls~--~~~~~~--sg~~~~~~d~~E~pS-~~~E~~~~~~   84 (265)
                      .-++|. .|.||+||++-.--++  +.-..+  -|.--+.++.+-.|+ .+.|.++.-|
T Consensus       271 ~ddglAtvLgHE~aHaVarH~AEki~k~~~~siLgLvlyt~~~a~~~n~~Ll~~flrlP  329 (424)
T KOG2661|consen  271 DDDGLATVLGHEIAHAVARHAAEKIGKVHLLSILGLVLYTMIWAICPNDKLLEYFLRLP  329 (424)
T ss_pred             ChHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccchHHHHHHHhcCc
Confidence            344544 4789999987443332  111111  121123566666777 6666666665


No 112
>KOG3607|consensus
Probab=21.54  E-value=48  Score=34.35  Aligned_cols=16  Identities=19%  Similarity=0.108  Sum_probs=13.8

Q ss_pred             CcccHHHHHhHHhHHH
Q psy12854         32 TFKQVENLFFHFGSLL   47 (265)
Q Consensus        32 ~~~~v~tLfHEfGHal   47 (265)
                      ....+.+++||+||.|
T Consensus       320 ~~~~a~v~AhelgH~l  335 (716)
T KOG3607|consen  320 LLAFAVVLAHELGHNL  335 (716)
T ss_pred             chhHHHHHHHHHHhhc
Confidence            6677899999999965


No 113
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=21.18  E-value=52  Score=32.60  Aligned_cols=15  Identities=7%  Similarity=0.131  Sum_probs=12.8

Q ss_pred             ccHHHHHhHHhHHHh
Q psy12854         34 KQVENLFFHFGSLLQ   48 (265)
Q Consensus        34 ~~v~tLfHEfGHalH   48 (265)
                      .-+.++.||+.|+|=
T Consensus       209 ~~~~~~~HEi~HaLG  223 (521)
T PF01457_consen  209 EFFRTVIHEIAHALG  223 (521)
T ss_dssp             HHHHHHHHHHHHHTT
T ss_pred             cccceeeeeeeeeee
Confidence            458899999999984


No 114
>PF02128 Peptidase_M36:  Fungalysin metallopeptidase (M36);  InterPro: IPR001842 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M36 (fungalysin family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Fungalysin is produced by fungi, Aspergillus and other species, to aid degradation of host lung cell walls on infection. The enzyme is a 42kDa single chain protein, with a pH optimum of 7.5-8.0 and optimal temperature of 60 celcius [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0005615 extracellular space
Probab=21.08  E-value=16  Score=34.82  Aligned_cols=40  Identities=18%  Similarity=0.242  Sum_probs=28.2

Q ss_pred             EEEEeccCCCCCCCCCccCc----------------ccHHHHHhHHhHHHhHHhhc
Q psy12854         14 NTLILNLTEPQYGKPCLLTF----------------KQVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        14 ~~l~~nf~~p~~~~p~ll~~----------------~~v~tLfHEfGHalH~lls~   53 (265)
                      ..-..||.-|+++.|..|+.                -|-..++||.||.|+.-|+-
T Consensus       148 g~NNAnf~TPpDG~~prMqMylw~~~~~~~p~rDg~~D~~Ii~HEy~HGiSnRLvg  203 (378)
T PF02128_consen  148 GTNNANFATPPDGQPPRMQMYLWTPSTPASPNRDGDFDNGIIAHEYGHGISNRLVG  203 (378)
T ss_pred             CcCCceeecCCCCCCceeeEEEecCCCCcCCCCCcccccCeeEEeecccccccccC
Confidence            35567888777766554443                24556899999999987765


No 115
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=20.59  E-value=56  Score=29.88  Aligned_cols=20  Identities=20%  Similarity=0.130  Sum_probs=15.7

Q ss_pred             ccHHHHHhHHhHHHhHHhhc
Q psy12854         34 KQVENLFFHFGSLLQRSLTA   53 (265)
Q Consensus        34 ~~v~tLfHEfGHalH~lls~   53 (265)
                      .+--.++||.||-+++++.-
T Consensus       166 AqaYViAHEVGHHVQnllGi  185 (295)
T COG2321         166 AQAYVIAHEVGHHVQNLLGI  185 (295)
T ss_pred             HHHHHHHhhhhHHHHHHhhh
Confidence            34446899999999998864


No 116
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=20.49  E-value=70  Score=25.77  Aligned_cols=17  Identities=6%  Similarity=0.198  Sum_probs=15.1

Q ss_pred             HHHHHhHHhHHHhHHhh
Q psy12854         36 VENLFFHFGSLLQRSLT   52 (265)
Q Consensus        36 v~tLfHEfGHalH~lls   52 (265)
                      ..||.||+=|++..+..
T Consensus        61 ~~tL~HEm~H~~~~~~~   77 (157)
T PF10263_consen   61 IDTLLHEMAHAAAYVFG   77 (157)
T ss_pred             HHHHHHHHHHHHhhhcc
Confidence            67999999999998884


Done!