Query         psy768
Match_columns 475
No_of_seqs    355 out of 1928
Neff          6.6 
Searched_HMMs 46136
Date          Sat Aug 17 00:39:39 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy768.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/768hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2661|consensus              100.0 5.3E-77 1.2E-81  581.7  22.4  386   15-459    22-417 (424)
  2 COG4783 Putative Zn-dependent  100.0 1.1E-36 2.3E-41  313.7  24.3  227  181-440    31-263 (484)
  3 COG4784 Putative Zn-dependent  100.0 3.5E-32 7.6E-37  268.1  20.2  228  179-439    21-253 (479)
  4 PRK01345 heat shock protein Ht  99.9   6E-26 1.3E-30  230.5  22.8  193  218-439    61-281 (317)
  5 PRK03982 heat shock protein Ht  99.9 1.3E-25 2.7E-30  225.7  22.8  190  228-439    69-283 (288)
  6 PRK03001 M48 family peptidase;  99.9   2E-25 4.3E-30  223.8  21.9  186  228-439    68-280 (283)
  7 PRK02870 heat shock protein Ht  99.9 3.1E-25 6.8E-30  226.0  20.8  187  227-435   115-333 (336)
  8 PF01435 Peptidase_M48:  Peptid  99.9   4E-26 8.6E-31  219.4  13.1  185  228-432    32-226 (226)
  9 PRK03072 heat shock protein Ht  99.9 3.4E-25 7.4E-30  222.4  19.0  193  218-439    64-285 (288)
 10 COG0501 HtpX Zn-dependent prot  99.9 1.3E-24 2.9E-29  218.2  18.9  188  227-440    99-300 (302)
 11 PRK04897 heat shock protein Ht  99.9 1.1E-23 2.3E-28  212.6  25.2  186  228-435    81-297 (298)
 12 PRK02391 heat shock protein Ht  99.9 5.6E-24 1.2E-28  214.2  19.0  198  217-439    69-292 (296)
 13 PRK05457 heat shock protein Ht  99.9 2.8E-23 6.2E-28  208.1  20.5  185  228-435    78-282 (284)
 14 PRK01265 heat shock protein Ht  99.9 5.8E-23 1.3E-27  208.3  20.1  196  217-436    74-321 (324)
 15 KOG2719|consensus               99.7 8.4E-17 1.8E-21  165.9  16.4  198  215-439   208-426 (428)
 16 PF05569 Peptidase_M56:  BlaR1   98.2 3.1E-05 6.6E-10   78.4  14.7  136  253-433   160-299 (299)
 17 PF10463 Peptidase_U49:  Peptid  97.7 9.8E-05 2.1E-09   70.8   6.9   60  369-433   123-189 (206)
 18 PRK09672 phage exclusion prote  97.7 0.00013 2.8E-09   72.9   8.0   60  369-433   187-253 (305)
 19 PF04228 Zn_peptidase:  Putativ  97.2  0.0015 3.2E-08   66.1   9.0  131  260-437   136-281 (292)
 20 PF06114 DUF955:  Domain of unk  97.1  0.0031 6.8E-08   53.6   8.5   50  251-305     6-60  (122)
 21 COG4219 MecR1 Antirepressor re  96.8   0.038 8.1E-07   56.1  15.2   53  253-306   154-209 (337)
 22 PF13203 DUF2201_N:  Putative m  96.1   0.009   2E-07   60.3   5.9   56  252-308    26-81  (292)
 23 COG2321 Predicted metalloprote  94.3    0.24 5.3E-06   49.3   8.9   39  263-301   136-181 (295)
 24 PRK04351 hypothetical protein;  93.8    0.22 4.8E-06   45.6   7.3   38  260-298    35-72  (149)
 25 PF10263 SprT-like:  SprT-like   92.3    0.34 7.3E-06   44.0   6.2   41  262-303    33-76  (157)
 26 PF14247 DUF4344:  Domain of un  92.1     5.1 0.00011   39.1  14.3   50  249-298    37-103 (220)
 27 PF13699 DUF4157:  Domain of un  92.1    0.12 2.6E-06   42.2   2.7   39  260-301    37-75  (79)
 28 smart00731 SprT SprT homologue  89.7    0.37   8E-06   43.7   3.7   33  268-301    41-73  (146)
 29 PF04298 Zn_peptidase_2:  Putat  88.8     3.6 7.8E-05   40.1   9.9   52  250-304    54-106 (222)
 30 PF01863 DUF45:  Protein of unk  87.4       3 6.4E-05   39.4   8.4   51  252-304   131-181 (205)
 31 COG1451 Predicted metal-depend  87.0     2.2 4.8E-05   41.6   7.5   44  260-305   150-193 (223)
 32 PF12388 Peptidase_M57:  Dual-a  86.2    0.61 1.3E-05   45.1   3.0   34  268-304   117-151 (211)
 33 PF10026 DUF2268:  Predicted Zn  86.0     1.9 4.1E-05   41.1   6.3   33  269-303    49-81  (195)
 34 PF04450 BSP:  Peptidase of pla  85.2     4.7  0.0001   38.9   8.6   49  255-303    57-112 (205)
 35 COG2856 Predicted Zn peptidase  82.0     1.4   3E-05   42.7   3.5   29  269-302    59-87  (213)
 36 PF02031 Peptidase_M7:  Strepto  82.0     2.7 5.9E-05   37.4   5.0   47  249-299    40-89  (132)
 37 COG2738 Predicted Zn-dependent  80.6     5.1 0.00011   38.2   6.5   20  286-305    91-110 (226)
 38 COG3590 PepO Predicted metallo  80.1    0.43 9.4E-06   51.9  -0.8   46  259-304   449-504 (654)
 39 PF08325 WLM:  WLM domain;  Int  79.5     2.4 5.1E-05   40.3   4.1   73  226-305    19-100 (186)
 40 PF13485 Peptidase_MA_2:  Pepti  79.3     2.5 5.5E-05   35.8   3.9   22  282-303    20-41  (128)
 41 PF12725 DUF3810:  Protein of u  79.1      16 0.00034   37.7  10.2   15  285-299   194-208 (318)
 42 cd04268 ZnMc_MMP_like Zinc-dep  78.5     2.3   5E-05   38.6   3.6   51  250-300    45-107 (165)
 43 PF13402 M60-like:  Peptidase M  75.5     6.7 0.00014   39.5   6.3   49  254-302   181-234 (307)
 44 PRK04860 hypothetical protein;  75.5     2.7 5.9E-05   38.9   3.2   33  268-301    45-77  (160)
 45 PF05572 Peptidase_M43:  Pregna  72.6     1.5 3.3E-05   40.3   0.7   57  249-305    16-88  (154)
 46 PF00413 Peptidase_M10:  Matrix  72.2     2.3 5.1E-05   38.0   1.9   35  269-303    86-122 (154)
 47 COG4324 Predicted aminopeptida  71.5     8.3 0.00018   38.6   5.5   37  272-308   180-221 (376)
 48 cd04279 ZnMc_MMP_like_1 Zinc-d  69.8     2.8   6E-05   38.1   1.8   16  284-299   101-116 (156)
 49 cd04270 ZnMc_TACE_like Zinc-de  68.0     3.2 6.9E-05   41.0   1.9   31  271-301   146-181 (244)
 50 smart00235 ZnMc Zinc-dependent  65.8     6.9 0.00015   34.6   3.5   44  250-299    51-98  (140)
 51 PF01431 Peptidase_M13:  Peptid  65.6     6.9 0.00015   37.1   3.7   44  262-305     1-54  (206)
 52 COG1164 Oligoendopeptidase F [  63.6      13 0.00028   41.6   5.9   49  252-304   346-397 (598)
 53 COG3091 SprT Zn-dependent meta  63.0     6.9 0.00015   35.9   2.9   30  270-300    45-74  (156)
 54 PF10023 DUF2265:  Predicted am  62.2     6.1 0.00013   40.9   2.7   48  260-310   139-191 (337)
 55 cd04276 ZnMc_MMP_like_2 Zinc-d  62.1     5.7 0.00012   38.1   2.4   47  253-299    65-128 (197)
 56 cd04269 ZnMc_adamalysin_II_lik  62.1     4.7  0.0001   37.8   1.8   16  284-299   128-143 (194)
 57 PF01447 Peptidase_M4:  Thermol  60.9      13 0.00027   34.1   4.3   49  249-301    97-149 (150)
 58 PF13688 Reprolysin_5:  Metallo  58.9     5.4 0.00012   37.4   1.6   17  284-300   139-155 (196)
 59 PRK06926 flagellar motor prote  58.5 2.1E+02  0.0045   28.9  13.3   35  272-306   112-148 (271)
 60 PF01421 Reprolysin:  Reprolysi  58.4     7.7 0.00017   36.6   2.6   17  283-299   127-143 (199)
 61 PF14891 Peptidase_M91:  Effect  57.9     5.7 0.00012   37.1   1.6   15  285-299   101-115 (174)
 62 PF13582 Reprolysin_3:  Metallo  57.5     5.4 0.00012   34.4   1.2   12  288-299   108-119 (124)
 63 cd04267 ZnMc_ADAM_like Zinc-de  56.9     4.3 9.3E-05   38.0   0.5   15  285-299   131-145 (192)
 64 COG1291 MotA Flagellar motor c  55.6 2.1E+02  0.0047   28.8  12.2  110  283-403   121-245 (266)
 65 cd04278 ZnMc_MMP Zinc-dependen  55.2       6 0.00013   35.9   1.2   32  268-299    85-119 (157)
 66 PF09768 Peptidase_M76:  Peptid  54.6      38 0.00082   31.9   6.4   36  261-299    48-83  (173)
 67 cd00203 ZnMc Zinc-dependent me  54.3     5.7 0.00012   35.9   0.9   15  285-299    94-108 (167)
 68 cd04277 ZnMc_serralysin_like Z  53.9     9.9 0.00021   35.5   2.5   36  269-304    94-131 (186)
 69 PF13574 Reprolysin_2:  Metallo  53.8     6.6 0.00014   36.6   1.2   15  287-301   111-125 (173)
 70 cd04272 ZnMc_salivary_gland_MP  53.1     7.6 0.00016   37.4   1.6   14  286-299   144-157 (220)
 71 TIGR02411 leuko_A4_hydro leuko  52.0      12 0.00027   41.8   3.2   37  263-303   259-295 (601)
 72 KOG3624|consensus               50.4      12 0.00025   42.6   2.7   46  260-305   481-536 (687)
 73 KOG1047|consensus               48.8      11 0.00024   41.3   2.1   32  270-304   274-305 (613)
 74 cd04271 ZnMc_ADAM_fungal Zinc-  48.5     6.1 0.00013   38.6   0.1   12  289-300   147-158 (228)
 75 TIGR00181 pepF oligoendopeptid  48.4     7.5 0.00016   43.2   0.8   49  252-304   345-395 (591)
 76 cd06459 M3B_Oligoendopeptidase  48.3      11 0.00024   39.6   2.0   49  252-304   188-239 (427)
 77 PF01433 Peptidase_M1:  Peptida  45.9      27 0.00059   36.1   4.5   52  251-303   254-311 (390)
 78 COG3824 Predicted Zn-dependent  45.8      13 0.00029   32.7   1.8   30  270-299    86-121 (136)
 79 PF13583 Reprolysin_4:  Metallo  44.5      11 0.00023   36.2   1.1   16  289-304   139-155 (206)
 80 PF01457 Peptidase_M8:  Leishma  43.7      25 0.00054   38.7   4.0   54  246-300   157-223 (521)
 81 PF12652 CotJB:  CotJB protein;  43.6      57  0.0012   26.6   5.0   39  417-455    19-59  (78)
 82 PF06262 DUF1025:  Possibl zinc  42.5      37  0.0008   28.8   3.9   33  269-301    50-87  (97)
 83 PF13150 DUF3989:  Protein of u  41.5      48   0.001   27.5   4.3   18  133-150    13-30  (85)
 84 PF06861 BALF1:  BALF1 protein;  41.2      18  0.0004   33.8   2.0   28  279-306   124-151 (182)
 85 PF12315 DUF3633:  Protein of u  41.2      14 0.00031   35.6   1.4   26  271-301    82-107 (212)
 86 COG3864 Uncharacterized protei  40.6      30 0.00064   35.6   3.5   37  268-305    51-87  (396)
 87 cd04327 ZnMc_MMP_like_3 Zinc-d  39.5      22 0.00048   33.7   2.5   17  285-301    90-106 (198)
 88 cd04273 ZnMc_ADAMTS_like Zinc-  39.4     4.4 9.6E-05   38.7  -2.4   14  286-299   139-152 (207)
 89 PF02163 Peptidase_M50:  Peptid  36.3      22 0.00047   33.2   1.8   15  287-301     7-21  (192)
 90 PF14521 Aspzincin_M35:  Lysine  35.8      47   0.001   30.2   3.8   39  260-298    65-107 (148)
 91 TIGR03818 MotA1 flagellar moto  35.4 4.9E+02   0.011   26.4  13.0   34  272-305   125-161 (282)
 92 TIGR02289 M3_not_pepF oligoend  35.3      19  0.0004   39.8   1.3   49  252-304   303-354 (549)
 93 PRK12482 flagellar motor prote  34.3 5.2E+02   0.011   26.3  14.3   25  282-306   138-162 (287)
 94 cd06161 S2P-M50_SpoIVFB SpoIVF  33.5      25 0.00055   33.7   1.8   15  286-300    37-51  (208)
 95 COG2976 Uncharacterized protei  31.4      61  0.0013   31.2   3.9   22  157-178    22-43  (207)
 96 TIGR02412 pepN_strep_liv amino  30.8      52  0.0011   38.4   4.0   36  269-304   265-304 (831)
 97 PTZ00337 surface protease GP63  30.5      78  0.0017   35.3   5.1   53  245-300   178-242 (567)
 98 PRK08990 flagellar motor prote  30.1 5.6E+02   0.012   25.5  13.0   24  282-305   116-139 (254)
 99 COG0308 PepN Aminopeptidase N   29.7      44 0.00096   39.1   3.2   56  249-304   262-324 (859)
100 TIGR03296 M6dom_TIGR03296 M6 f  29.5      13 0.00029   37.4  -0.9   11  288-298   166-176 (286)
101 PF13398 Peptidase_M50B:  Pepti  29.2      33 0.00072   32.7   1.8   16  286-301    21-36  (200)
102 cd05709 S2P-M50 Site-2 proteas  29.1      34 0.00073   31.6   1.8   13  287-299     8-20  (180)
103 PRK08124 flagellar motor prote  28.7   6E+02   0.013   25.3  12.4   24  282-305   120-143 (263)
104 KOG3658|consensus               27.7      21 0.00045   40.1   0.1   34  268-301   368-406 (764)
105 KOG1046|consensus               27.2 1.1E+02  0.0024   36.0   6.0   52  253-304   283-341 (882)
106 PF01432 Peptidase_M3:  Peptida  26.9      30 0.00065   37.0   1.2   18  283-301   239-256 (458)
107 cd06455 M3A_TOP Peptidase M3 T  26.9      33 0.00071   37.2   1.5   18  285-303   262-279 (472)
108 cd06164 S2P-M50_SpoIVFB_CBS Sp  26.7      39 0.00084   33.0   1.8   15  286-300    52-66  (227)
109 COG3426 Butyrate kinase [Energ  26.6      60  0.0013   33.1   3.1   40  259-298    67-120 (358)
110 PF12911 OppC_N:  N-terminal TM  26.5 1.3E+02  0.0028   22.1   4.3   17  149-165     7-23  (56)
111 TIGR02290 M3_fam_3 oligoendope  25.7      34 0.00074   38.0   1.4   48  252-303   341-391 (587)
112 TIGR00035 asp_race aspartate r  24.9 1.5E+02  0.0032   28.7   5.5   60  390-449     9-76  (229)
113 cd06160 S2P-M50_like_2 Unchara  24.6      45 0.00098   31.5   1.8   15  286-300    40-54  (183)
114 PRK08456 flagellar motor prote  24.5   7E+02   0.015   24.7  12.0   23  282-304   119-141 (257)
115 TIGR02414 pepN_proteo aminopep  23.1      62  0.0013   38.0   2.8   19  286-304   282-300 (863)
116 PTZ00257 Glycoprotein GP63 (le  23.0 1.3E+02  0.0029   33.8   5.2   55  246-301   203-270 (622)
117 cd06163 S2P-M50_PDZ_RseP-like   22.6      53  0.0011   31.0   1.8   13  288-300    10-22  (182)
118 PRK13267 archaemetzincin-like   22.6   1E+02  0.0023   29.0   3.8   17  286-302   124-141 (179)
119 KOG3607|consensus               22.0      50  0.0011   37.8   1.8   17  284-300   320-336 (716)
120 cd06258 Peptidase_M3_like The   21.9      41 0.00089   34.7   1.0   21  283-304   151-171 (365)
121 KOG2975|consensus               21.8 1.2E+02  0.0026   30.5   4.1   33  375-407   172-204 (288)
122 PF04612 T2SM:  Type II secreti  21.6      31 0.00067   31.2   0.0   14  134-147     3-16  (160)
123 cd04281 ZnMc_BMP1_TLD Zinc-dep  21.5      43 0.00093   32.2   1.0   15  288-302    88-102 (200)
124 cd04283 ZnMc_hatching_enzyme Z  21.4      44 0.00095   31.6   1.0   14  288-301    78-91  (182)
125 cd06159 S2P-M50_PDZ_Arch Uncha  21.1      57  0.0012   32.7   1.8   15  286-300   117-131 (263)
126 PF05960 DUF885:  Bacterial pro  21.0 4.2E+02  0.0091   28.9   8.7   74  227-301   300-388 (549)
127 PF01400 Astacin:  Astacin (Pep  20.5      55  0.0012   30.9   1.5   14  288-301    80-93  (191)
128 KOG3314|consensus               20.5 2.5E+02  0.0054   26.4   5.6   32  271-305    78-109 (194)

No 1  
>KOG2661|consensus
Probab=100.00  E-value=5.3e-77  Score=581.72  Aligned_cols=386  Identities=35%  Similarity=0.586  Sum_probs=353.7

Q ss_pred             HHHhccCCCccccCCCCCCCCCCCCCCCCChhhhhhhhhhhccccccccceeeeeecCcccccchh-------hcccccc
Q psy768           15 LLRYIGHTSPECLETLDTPRNIPASRSSINTELEQDTIGCILSCDKKYSTCLNFQITGHQDKHLTR-------VLFHQSA   87 (475)
Q Consensus        15 ~~~~~g~~~~ec~~~~~~~~n~p~~~p~~~~~~e~~~~~~il~~~~~e~~~v~~~~~~~~~s~~~q-------~~p~~~~   87 (475)
                      .+.|+|.++++|++..++|+|++|+.+++|+     +.+-+.+.+++|.|.++++|+.|||.+.+|       ++|+.++
T Consensus        22 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-----~~~~l~~~~~~~i~~~t~~~t~~n~~~~~~~~~~~~t~v~S~~v   96 (424)
T KOG2661|consen   22 VNKYQGLGVNQCDRWSFLPGNFHFYSTFNNK-----RTGGLSSTKSKEIWRITSKCTVWNDAFSRQLLIKEVTAVPSLSV   96 (424)
T ss_pred             HHHHhccCcchhhhHhhCCCcceeeeeecch-----hcccccccCCccceecccccceehhhhhhccchhhhhccccccc
Confidence            5789999999999999999999999999998     999999999999999999999999999999       8999999


Q ss_pred             cccccccccccccccccCCC-CCCcchHHHHhhccceeeeeeeecchhHHHHHHhcCchhhHHHHHHHHhcccch-HHHH
Q psy768           88 NHQLQVFNIRVLRCFHTSQP-KNALPGFIYLIFKPILRVVGAIVIARISRKWWSKLSPDQKKIILSQIKKHQDKI-AGIT  165 (475)
Q Consensus        88 ~~~~~~~~~~~~r~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~l~~~~r~~~~~~~~~~~~~~-~~~~  165 (475)
                      .+|++.-..|.. .|||.+| +..|.|...+.+                 +||+.+.|+++....+.++++.|++ +|+ 
T Consensus        97 ~l~~~~~s~Rn~-Ng~~y~rFqsnp~~ys~~~r-----------------~~~~sfkpn~~dk~f~n~rk~f~~lf~G~-  157 (424)
T KOG2661|consen   97 LLPLSPASIRNI-NGHTYPRFQSNPVPYSLMIR-----------------KWWQSFKPNKKDKVFENIRKNFWKLFLGL-  157 (424)
T ss_pred             ccccchhhhhcc-CccccchhhcCCccHHHHHH-----------------HHHHhcCCCCccchhhhHHHHHHHHHhhc-
Confidence            999988888888 9999999 888877776544                 3999999999999999888888877 443 


Q ss_pred             HHHHHHHHHHHHhhccccCCCCccceeccCHHHHHHHHHHHHHHHHHHhcCCCCCCCCHHHHHHHHHHHHHHHhcccccc
Q psy768          166 LTSIGLAYIYYFLHLETCPITGRQKFIIVKPNQLNDVTQIAYDNFIEEHGNQVLPLGHPAYKRVGAVVKRLIDANKVYME  245 (475)
Q Consensus       166 ~~~~~~~~~~~~~~~~~~P~tgr~~~~~ls~~~e~~lg~~~~~~~~~~~~~~~lp~~dp~~~~v~~i~~rL~~~~~~~p~  245 (475)
                          +++++|||+|+|.+|+|||++|+++|+++|+.||+.+|++++++++++++|..||++-+|++|+.++++++++.|+
T Consensus       158 ----~~~~vfyfthlE~~Pit~Rsrfillskp~ekllge~~ye~im~eyq~~mLp~~hp~sl~V~~vlk~iIea~~~~~s  233 (424)
T KOG2661|consen  158 ----LLFVVFYFTHLEVSPITGRSRFILLSKPQEKLLGELEYEAIMEEYQNDMLPEKHPRSLAVKEVLKHIIEANKDVPS  233 (424)
T ss_pred             ----cceEEEEEeeeeeeccCCcceeeeccCHHHHHhhHHHHHHHHHHhcCccCCcCCchhhHHHHHHHHHHHHhccCCc
Confidence                4467799999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCcceeEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHH
Q psy768          246 HNNFKYPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIW  325 (475)
Q Consensus       246 ~~~~~~~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~  325 (475)
                      +++..|+|+||++|.+|||++|||+|||++|+|..++|||+||+|||||+||.++||.+|.+++..+++++++++.+++|
T Consensus       234 lsgIkWeihVVndPipNAFvLPgGKvfVFtgiLn~ck~ddglAtvLgHE~aHaVarH~AEki~k~~~~siLgLvlyt~~~  313 (424)
T KOG2661|consen  234 LSGIKWEIHVVNDPIPNAFVLPGGKVFVFTGILNSCKDDDGLATVLGHEIAHAVARHAAEKIGKVHLLSILGLVLYTMIW  313 (424)
T ss_pred             ccCceeEEEEecCCCCceeeccCCeEEEEechhhcccChHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999888888


Q ss_pred             HhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHh
Q psy768          326 FLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMAL  405 (475)
Q Consensus       326 ~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~  405 (475)
                      +..+.          ..+.+++                  +.++|||++|.|||.+|+.+|++|||||+++..+||+|+.
T Consensus       314 a~~~n----------~~Ll~~f------------------lrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~  365 (424)
T KOG2661|consen  314 AICPN----------DKLLEYF------------------LRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEF  365 (424)
T ss_pred             hccch----------HHHHHHH------------------hcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHH
Confidence            76541          1233444                  7899999999999999999999999999999999999998


Q ss_pred             hccccccCC-CcccccCCCCChHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCCC
Q psy768          406 KETQDQVGP-KMEEYLSTHPSHENRANNLESKMKEALDIRKECNCLPLGPLFIPR  459 (475)
Q Consensus       406 ~~~~~~~g~-~~~~~lsTHP~~~~RI~~L~~~i~~a~~~~~~~~c~~~~~~~~~~  459 (475)
                      .+  ...|. ..++|+||||+..+||+++++++++|..+||.|+|++.... ++|
T Consensus       366 ~e--gqmg~~~~~eflSTHPSskkRie~~~~lLpqAnvirE~sdcy~e~~p-~~~  417 (424)
T KOG2661|consen  366 VE--GQMGQPKMPEFLSTHPSSKKRIEYLDRLLPQANVIREMSDCYPESNP-DPR  417 (424)
T ss_pred             hh--hhcCCCCCchhhhcCCCccchhHHHHHhchHHHHHHHHhcCChhcCc-chh
Confidence            87  33333 35999999999999999999999999999999999987776 444


No 2  
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=100.00  E-value=1.1e-36  Score=313.73  Aligned_cols=227  Identities=26%  Similarity=0.376  Sum_probs=178.2

Q ss_pred             cccCCCCccceeccCHHHHHHHHHHHHHHHHHHhcCCCCCCCCHHHH-HHHHHHHHHHHhccccccCCCcceeEEEEecC
Q psy768          181 ETCPITGRQKFIIVKPNQLNDVTQIAYDNFIEEHGNQVLPLGHPAYK-RVGAVVKRLIDANKVYMEHNNFKYPITIIDDP  259 (475)
Q Consensus       181 ~~~P~tgr~~~~~ls~~~e~~lg~~~~~~~~~~~~~~~lp~~dp~~~-~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~  259 (475)
                      ...|.+|......+|+++|..+|+..    +.+..+.+....||+++ +|++++++|++++.    ..+.+|+|++|+++
T Consensus        31 ~~lp~ig~~~~s~ls~~qev~~g~~~----~~Qlr~~~~~i~D~el~~yv~~~g~rL~~~a~----~~~~~f~f~lV~d~  102 (484)
T COG4783          31 DLLPDIGVSAGSTLSPAQEVALGDPA----NAQLRGSVPLIRDPELEEYVNSLGQRLAAAAD----LVKTPFTFFLVNDD  102 (484)
T ss_pred             hhcchhhhhhhhcCCHHHHHHhhHHH----HHHhccCCCCcCCHHHHHHHHHHHHHHHHhcC----CCCCCeEEEEecCC
Confidence            35788999999999999999999854    44555566778999985 99999999998752    34689999999999


Q ss_pred             CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHH---HHHHHHHHHHHHHHHh-ccch-HHH
Q psy768          260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKT---FLEILYIVPLMIIWFL-LPDL-GAI  334 (475)
Q Consensus       260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~---l~~~l~~~~~~~i~~~-~~~~-~~~  334 (475)
                      .+||||+|||+|+|++|||...+||+||++|||||+||+..||..+.+....   ...++++.++ ++.+. .++. .+.
T Consensus       103 ~iNAFA~~Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~aR~~e~~~r~~~~~i~~ml~g-i~aa~a~~~ag~a~  181 (484)
T COG4783         103 SINAFATPGGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLARSMEQQQRAAPMAIAGMLLG-ILAALAGADAGMAG  181 (484)
T ss_pred             ccchhhcCCceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHHHHHHHHhhhchhHHHHHHHH-HHHHHhCccccHHH
Confidence            9999999999999999999999999999999999999999999998776633   2222222222 22222 1221 111


Q ss_pred             HHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccccCC
Q psy768          335 VTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGP  414 (475)
Q Consensus       335 l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~g~  414 (475)
                      ++.....                     ...-.+.|||++|.|||++|+.++.++||||++|++||+|+....   ..++
T Consensus       182 iag~~a~---------------------~~~g~L~~sR~~E~eADr~Gi~~L~raGydp~gM~~ff~rl~~~~---~~~~  237 (484)
T COG4783         182 IAGALAG---------------------AAQGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQL---RYGG  237 (484)
T ss_pred             HHHHHHH---------------------hhhhhhhcchhhHHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHH---hcCC
Confidence            1111110                     001357899999999999999999999999999999999999754   2366


Q ss_pred             CcccccCCCCChHHHHHHHHHHHHHH
Q psy768          415 KMEEYLSTHPSHENRANNLESKMKEA  440 (475)
Q Consensus       415 ~~~~~lsTHP~~~~RI~~L~~~i~~a  440 (475)
                      ..|+|++|||-+++||..+++.+.+.
T Consensus       238 ~~p~yl~THPlp~~RIa~lr~ra~q~  263 (484)
T COG4783         238 QPPEYLLTHPLPEERIADLRNRAEQS  263 (484)
T ss_pred             CCChHHhcCCCchhHHHHHHHHHHhC
Confidence            77999999999999999999877653


No 3  
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=100.00  E-value=3.5e-32  Score=268.07  Aligned_cols=228  Identities=24%  Similarity=0.341  Sum_probs=174.4

Q ss_pred             hccccCCCCccceeccCHHHHH-HHHHHHHHHHHHHhcCCCCCCCCHHHH-HHHHHHHHHHHhccccccCCCcceeEEEE
Q psy768          179 HLETCPITGRQKFIIVKPNQLN-DVTQIAYDNFIEEHGNQVLPLGHPAYK-RVGAVVKRLIDANKVYMEHNNFKYPITII  256 (475)
Q Consensus       179 ~~~~~P~tgr~~~~~ls~~~e~-~lg~~~~~~~~~~~~~~~lp~~dp~~~-~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv  256 (475)
                      .+...|-++...+..+++.++. +||..+.++++.+|++..   +||.++ +|.+|+.+|...+. .|   +..|+|.|+
T Consensus        21 ~~~~~Ps~~p~~v~~v~~n~~laqmgar~hprIla~yggeY---~D~Kler~Vari~g~lt~~S~-~p---~q~YriTil   93 (479)
T COG4784          21 ESGVQPSDNPQTVDNVQKNDPLAQMGARQHPRILATYGGEY---RDPKLERMVARIVGALTAVSE-NP---QQTYRITIL   93 (479)
T ss_pred             ccccCCCCCcchHHHhhccChHhHhhhhhChhHHHhcCCcc---CCHHHHHHHHHHHhHhhhhcc-CC---CceEEEEEe
Confidence            3456788888778888887654 599988999999998654   677775 79999999987652 23   578999999


Q ss_pred             ecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhccchHHHHH
Q psy768          257 DDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVT  336 (475)
Q Consensus       257 ~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~~~~~~l~  336 (475)
                      |+|.+||||+|||++||+.|||..++|+.|||+||+|||||+..+|..++.++.....+.+-++...+   ..+..+-..
T Consensus        94 nSP~INAFALPGGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHgi~rQ~~e~a~~ia~rvva~vl---~~~~agk~A  170 (479)
T COG4784          94 NSPNINAFALPGGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHGIQRQQREAAEVIASRVVAEVL---GSDAAGKQA  170 (479)
T ss_pred             cCCCccccccCCceEEEehhHHHHcCCHHHHHHHHHhhhhheecchhHHHHHHHHHHHHHHHHHHHHh---CCcchhhHH
Confidence            99999999999999999999999999999999999999999999999998877655443332222111   001111111


Q ss_pred             HHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc-CC-
Q psy768          337 QVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV-GP-  414 (475)
Q Consensus       337 ~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~-g~-  414 (475)
                      ...+                       .+....|||+||.|||.+|++++.+|||||.++..|++.|+........ |. 
T Consensus       171 ~~rG-----------------------klrla~fsRnqELqAD~iG~~~lgeAGYDP~A~~rfl~sm~ay~~F~s~~g~a  227 (479)
T COG4784         171 LIRG-----------------------KLRLAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAYTDFRSVSGAA  227 (479)
T ss_pred             Hhhh-----------------------hHHHhhhccchhhhhhhhhHHHHHhcCCChHHHHHHHHHHHhhhhhcccCcch
Confidence            1110                       0145689999999999999999999999999999999999977732221 21 


Q ss_pred             -CcccccCCCCChHHHHHHHHHHHHH
Q psy768          415 -KMEEYLSTHPSHENRANNLESKMKE  439 (475)
Q Consensus       415 -~~~~~lsTHP~~~~RI~~L~~~i~~  439 (475)
                       ..-+||+|||.+.+||+..+.-..+
T Consensus       228 dqsldfl~sHPntpqRiqla~~hARq  253 (479)
T COG4784         228 DQSLDFLASHPNTPQRIQLARRHARQ  253 (479)
T ss_pred             hhhcchhhcCCCChHHHHHHHHHHHh
Confidence             1268999999999999877665433


No 4  
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.94  E-value=6e-26  Score=230.53  Aligned_cols=193  Identities=22%  Similarity=0.280  Sum_probs=124.8

Q ss_pred             CCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768          218 VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSH  293 (475)
Q Consensus       218 ~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAH  293 (475)
                      +.|.++|+   +.+++++|+... .+|     ..+|+++|++.+|||++|.    +.|+|++||++.+ |+|||++||||
T Consensus        61 v~~~~~p~---L~~~v~~La~~a-gi~-----~p~v~vid~~~~NAFa~G~~~~~~~V~vt~gLL~~L-~~dEL~aVlAH  130 (317)
T PRK01345         61 VDERSAPE---LYRMVRDLARRA-GLP-----MPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRL-SPEEVAGVMAH  130 (317)
T ss_pred             CCcccCHH---HHHHHHHHHHHc-CCC-----CCcEEEEcCCCcceEEecCCCCCeEEEechHHHhhC-CHHHHHHHHHH
Confidence            44555554   555666665543 122     2479999999999999963    3699999999998 57999999999


Q ss_pred             HHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-cc----hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhcc
Q psy768          294 ELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-PD----LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFEL  368 (475)
Q Consensus       294 ElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-~~----~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~  368 (475)
                      ||||++++|..+......+..++..+.....+... .+    ....++..+    ..++  .|+         ...++.+
T Consensus       131 ElgHi~~~d~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~i~~~~----~~~~--~~~---------~~~l~~~  195 (317)
T PRK01345        131 ELAHVKNRDTLTMTITATLAGAISMLANFAFFFGGNRENNNGPLGLVGTLA----AMIV--APL---------AAMLVQM  195 (317)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccchHHHHHHHH----HHHH--HHH---------HHHHHHH
Confidence            99999999998765443332222111111111110 00    001111110    0000  010         1113677


Q ss_pred             ccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc----cc---------------cCCCcccccCCCCChHHH
Q psy768          369 PFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ----DQ---------------VGPKMEEYLSTHPSHENR  429 (475)
Q Consensus       369 ~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~----~~---------------~g~~~~~~lsTHP~~~~R  429 (475)
                      .|||.+|++||++|++++    .||++++++|+||......    ..               .+....+||+|||++++|
T Consensus       196 ~~SR~rE~~AD~~A~~lt----g~p~~L~~AL~KL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lfsTHP~~~eR  271 (317)
T PRK01345        196 AISRTREYAADRRGAEIC----GNPLWLASALGKIERGAHGVPNEEAERNPATAHMFIINPLSGEGMDNLFSTHPATENR  271 (317)
T ss_pred             HHHHHHHHHHHHHHHHHh----CCHHHHHHHHHHHHhhhccCcccccccChHHHHHHhcCCccccchhHHhcCCcChHHH
Confidence            899999999999999875    3999999999999965310    00               011124799999999999


Q ss_pred             HHHHHHHHHH
Q psy768          430 ANNLESKMKE  439 (475)
Q Consensus       430 I~~L~~~i~~  439 (475)
                      |++|+++..+
T Consensus       272 I~~L~~~~~~  281 (317)
T PRK01345        272 IAALQRMAGE  281 (317)
T ss_pred             HHHHHHHHHh
Confidence            9999998766


No 5  
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.94  E-value=1.3e-25  Score=225.74  Aligned_cols=190  Identities=23%  Similarity=0.270  Sum_probs=120.2

Q ss_pred             HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      .+++.++++++..+ +|     ..++++++++.+|||++|    +|.|+|++||++.+ |+|||++|||||+||++++|.
T Consensus        69 ~L~~~v~~la~~~g-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~l-~~~El~AVlAHElgHi~~~h~  141 (288)
T PRK03982         69 ELYRIVERLAERAN-IP-----KPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLL-NEDELEGVIAHELTHIKNRDT  141 (288)
T ss_pred             HHHHHHHHHHHHcC-CC-----CCeEEEEeCCCcceEEeccCCCCeEEEeehHHHhhC-CHHHHHHHHHHHHHHHHcCCH
Confidence            45555566655431 22     238999999999999995    45688999999998 689999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHh-ccchH---HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768          304 AEKLSNKTFLEILYIVPLMIIWFL-LPDLG---AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD  379 (475)
Q Consensus       304 ~~~~~~~~l~~~l~~~~~~~i~~~-~~~~~---~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD  379 (475)
                      .+......+...+..+.....+.. ..+..   ...+..++..+..++  .++         ...++.+.|||++|+|||
T Consensus       142 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~---------~~~l~~~~~SR~~E~~AD  210 (288)
T PRK03982        142 LIQTIAATLAGAIMYLAQWLSWGLWFGGGGRDDRNGGNPIGSLLLIIL--API---------AATLIQFAISRQREFSAD  210 (288)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchHHHHHHHHHH--HHH---------HHHHHHHHHhHHHHHHHh
Confidence            877655433221111111111110 00100   000000000000010  000         012366789999999999


Q ss_pred             HHHHHHHHHcCCCchhHHHHHHHHHhhcccc-c----------------cCCCcccccCCCCChHHHHHHHHHHHHH
Q psy768          380 EVGLKLMARACYDVRVAPLFWQKMALKETQD-Q----------------VGPKMEEYLSTHPSHENRANNLESKMKE  439 (475)
Q Consensus       380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~-~----------------~g~~~~~~lsTHP~~~~RI~~L~~~i~~  439 (475)
                      ++|+++    +.+|.+++++++|++...+.. .                .+....+|++|||++++||+++++..++
T Consensus       211 ~~A~~~----~~~p~~l~~aL~kL~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~l~sTHP~~~eRI~~l~~~~~~  283 (288)
T PRK03982        211 EGGARL----TGNPLALANALQKLEKGVRYIPLKNGNPATAHMFIINPFRGQFLANLFSTHPPTEERIERLLEMAQE  283 (288)
T ss_pred             HHHHHH----hCCHHHHHHHHHHHHhhhccCCCCCCCHHHHhHhhcCCCCCchhhHHhCCCcCHHHHHHHHHHHHHh
Confidence            999886    468999999999998653110 0                0111257999999999999999987644


No 6  
>PRK03001 M48 family peptidase; Provisional
Probab=99.94  E-value=2e-25  Score=223.78  Aligned_cols=186  Identities=19%  Similarity=0.270  Sum_probs=120.2

Q ss_pred             HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      .+.+++++++...+ +|     .-+++++|++.+|||++|.+    +|+|++||++.+ |+|||++||||||||++++|.
T Consensus        68 ~L~~~v~~la~~~g-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlAHElgHi~~~h~  140 (283)
T PRK03001         68 QFYRMVRELAQRAG-LP-----MPKVYLINEDQPNAFATGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAHVKHRDI  140 (283)
T ss_pred             HHHHHHHHHHHHcC-CC-----CCeEEEecCCCcceEEecCCCCCeEEEecHHHHhhC-CHHHHHHHHHHHHHHHhCCCh
Confidence            45566666655431 22     23899999999999999753    599999999998 689999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhccch--H--HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768          304 AEKLSNKTFLEILYIVPLMIIWFLLPDL--G--AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD  379 (475)
Q Consensus       304 ~~~~~~~~l~~~l~~~~~~~i~~~~~~~--~--~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD  379 (475)
                      ........+..++..+.....+....+.  .  ......    +..++  .++         ...++.+.|||.+|++||
T Consensus       141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~---------~~~~~~~~~SR~~E~~AD  205 (283)
T PRK03001        141 LISTISATMAGAISALANFAMFFGGRDENGRPVNPIAGI----AVAIL--APL---------AASLIQMAISRAREFEAD  205 (283)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccchHHHH----HHHHH--HHH---------HHHHHHHHHhHHHHHHHh
Confidence            8765443222222211111111110000  0  000000    00000  000         011366789999999999


Q ss_pred             HHHHHHHHHcCCCchhHHHHHHHHHhhccccc-------------------cCCCcccccCCCCChHHHHHHHHHHHHH
Q psy768          380 EVGLKLMARACYDVRVAPLFWQKMALKETQDQ-------------------VGPKMEEYLSTHPSHENRANNLESKMKE  439 (475)
Q Consensus       380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-------------------~g~~~~~~lsTHP~~~~RI~~L~~~i~~  439 (475)
                      ++|+++    ++||++++++++|+...+....                   .++....|++|||++++||++++++.++
T Consensus       206 ~~a~~l----~~~p~~l~~AL~Kl~~~~~~~p~~~~~~~~~~~~~~~~~p~~~~~~~~l~~THP~~~eRI~~l~~~~~~  280 (283)
T PRK03001        206 RGGARI----SGDPQALASALDKIHRYASGIPFQAAEAHPATAQMMIINPLSGGGLANLFSTHPSTEERIARLMAMART  280 (283)
T ss_pred             HHHHHH----hCCHHHHHHHHHHHHhhhccCCcccccCCHHHHHHHhcCCCCcchHHHHHcCCcCHHHHHHHHHHHHHh
Confidence            999886    4699999999999997652100                   0011246899999999999999987644


No 7  
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.93  E-value=3.1e-25  Score=225.99  Aligned_cols=187  Identities=27%  Similarity=0.366  Sum_probs=122.4

Q ss_pred             HHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768          227 KRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH  302 (475)
Q Consensus       227 ~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH  302 (475)
                      .++.++++++....+ .|    ..++|++++++.+||||+|    +++|+||+||++.+ |+|||++||||||||+.++|
T Consensus       115 ~~L~~~ve~La~~ag-~p----~~p~V~vi~~~~~NAFA~G~~~~~~~Ivvt~GLL~~L-~~dEL~aVlAHELgHik~~d  188 (336)
T PRK02870        115 RQLYNVVEELLVAAG-LR----FMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKL-DRDELQAVMAHELSHIRHGD  188 (336)
T ss_pred             HHHHHHHHHHHHHcC-CC----CCCeEEEEcCCCCceEEecCCCCCcEEEEehHHhhhC-CHHHHHHHHHHHHHHHHccc
Confidence            478888888877542 22    2469999999999999994    67899999999998 68999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhccc---hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHH
Q psy768          303 VAEKLSNKTFLEILYIVPLMIIWFLLPD---LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEAD  379 (475)
Q Consensus       303 ~~~~~~~~~l~~~l~~~~~~~i~~~~~~---~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD  379 (475)
                      .........+..++..+....++...++   ...... .+...+..++  +|+         ...++.+.+||.+|++||
T Consensus       189 i~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~l~l~~~--~~~---------~~~ll~~~iSR~rEy~AD  256 (336)
T PRK02870        189 IRLTLCVGVLSNIMLIVADFLFYSFMGNRRNSGANRA-RMIILILRYV--LPI---------LTVLLMLFLSRTREYMAD  256 (336)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccchh-HHHHHHHHHH--HHH---------HHHHHHHHHHHHHHHHHh
Confidence            8665433332222111111111111111   000000 0000000000  111         112367889999999999


Q ss_pred             HHHHHHHHHcCCCchhHHHHHHHHHhhccccc-------------------cC------CCcccccCCCCChHHHHHHHH
Q psy768          380 EVGLKLMARACYDVRVAPLFWQKMALKETQDQ-------------------VG------PKMEEYLSTHPSHENRANNLE  434 (475)
Q Consensus       380 ~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-------------------~g------~~~~~~lsTHP~~~~RI~~L~  434 (475)
                      ++|++++.    ||.+++++++|+........                   ..      ....++|||||++++||++|+
T Consensus       257 ~~Aa~ltg----~p~aLasAL~KL~~~~~~~~~~~~~~~~~~~~~~a~~~i~~p~~~~~~~~~~LfsTHPp~e~RI~rL~  332 (336)
T PRK02870        257 AGAVELMR----DNEPMARALQKISNDHAQNDEQYAYKHTDHESTRRAAYLFDPAGISPGSLSDAFSTHPSIENRLAALG  332 (336)
T ss_pred             HHHHHHhC----CHHHHHHHHHHHHhccccCccccccccccCChhhhhhhccCCcccccccHhHHHcCCCCHHHHHHHHh
Confidence            99988864    89999999999986442110                   00      012468999999999999997


Q ss_pred             H
Q psy768          435 S  435 (475)
Q Consensus       435 ~  435 (475)
                      .
T Consensus       333 ~  333 (336)
T PRK02870        333 G  333 (336)
T ss_pred             h
Confidence            5


No 8  
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.93  E-value=4e-26  Score=219.44  Aligned_cols=185  Identities=28%  Similarity=0.319  Sum_probs=100.4

Q ss_pred             HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCCe---EEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDGR---IFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG~---IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      ++++++++++...+    ....+++++|++++.+|||++|+|.   |+|++||++.+ |+|||++|||||+||+.++|..
T Consensus        32 ~L~~~v~~l~~~~~----~~~~~~~v~v~~~~~~NA~~~g~~~~~~I~v~~~ll~~~-~~~el~aVlaHElgH~~~~h~~  106 (226)
T PF01435_consen   32 ELRRIVEELARRAG----LGIPPPRVYVIDSPSPNAFATGGGPRKRIVVTSGLLESL-SEDELAAVLAHELGHIKHRHIL  106 (226)
T ss_dssp             HHHHHHHHHHHHHH----CTSS--EEEEE--SSEEEEEETTTC--EEEEEHHHHHHS-SHHHHHHHHHHHHHHHHTTHCC
T ss_pred             HHHHHHHHHHHHhc----CCCCCCeEEEEcCCCCcEEEEccCCCcEEEEeChhhhcc-cHHHHHHHHHHHHHHHHcCCcc
Confidence            55555666555431    1124579999999999999999998   99999999665 7899999999999999999998


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHH
Q psy768          305 EKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLK  384 (475)
Q Consensus       305 ~~~~~~~l~~~l~~~~~~~i~~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~  384 (475)
                      +.+....+..++..+....+...  .......... ......+          .  ....++...+||.+|++||++|++
T Consensus       107 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~----------~--~~~~~~~~~~sr~~E~~AD~~a~~  171 (226)
T PF01435_consen  107 KSLLISLLLSILFFALLALLIGS--MSLFSAFGFI-DILGILI----------A--FLFQLLTNAFSRRQEYEADRYAAR  171 (226)
T ss_dssp             CCCCHHH-HHHHHHHHHT-------HHHHHHHH-----------------------HHSTT------HHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHhhh--hhhhhhhhhc-ccchhhH----------H--HHHHHhhcchhHHHHHHHHHHHHH
Confidence            77544443333222211111000  0000000000 0000000          0  111237789999999999999999


Q ss_pred             H------HHHcCCCchhHHHHHHHHHhhcccc-ccCCCcccccCCCCChHHHHHH
Q psy768          385 L------MARACYDVRVAPLFWQKMALKETQD-QVGPKMEEYLSTHPSHENRANN  432 (475)
Q Consensus       385 l------l~~AGydp~a~~~~l~kl~~~~~~~-~~g~~~~~~lsTHP~~~~RI~~  432 (475)
                      +      |.++|+++..+.++|+++....... ..+.....+++|||++.+||++
T Consensus       172 ~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~tHP~~~~Ri~~  226 (226)
T PF01435_consen  172 LGGDPALLARALYKPAAAISALEKLAEANSMRPDSDWRYSSLFSTHPSTEERIAA  226 (226)
T ss_dssp             H------HHHTTS-TTHHHHHHHHHT-----------------------HHHHHH
T ss_pred             hcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccCCccccchhcCCCcCHHHHhCc
Confidence            9      9999999999999999998733111 1122337889999999999975


No 9  
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.93  E-value=3.4e-25  Score=222.43  Aligned_cols=193  Identities=22%  Similarity=0.265  Sum_probs=122.7

Q ss_pred             CCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768          218 VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSH  293 (475)
Q Consensus       218 ~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAH  293 (475)
                      +.|.++|+   +.+++++|....+ +|     .-+++++|++.+|||+++.    +.|+|++||++.+ |+|||++||||
T Consensus        64 v~~~~~p~---L~~~v~~la~~~g-~p-----~p~vyv~~~~~~NAFa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAH  133 (288)
T PRK03072         64 VSEVQAPA---MYRIVRELSTAAR-QP-----MPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQIL-NERELRGVLGH  133 (288)
T ss_pred             CChhhhHH---HHHHHHHHHHHcC-CC-----CCCEEEecCCCCceEEecCCCCCcEEEecHHHHHhC-CHHHHHHHHHH
Confidence            45555554   4444455544321 22     2379999999999999963    2499999999998 67999999999


Q ss_pred             HHHhhhcccHHHHHHHHHHHHHHHHHHHHHHH-Hhcc---c--hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhc
Q psy768          294 ELSHTLLKHVAEKLSNKTFLEILYIVPLMIIW-FLLP---D--LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFE  367 (475)
Q Consensus       294 ElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~-~~~~---~--~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~  367 (475)
                      |+||+.++|.........+..++..+.....+ ....   +  ....++..+    ..++  .|+         ...++.
T Consensus       134 ElgHi~~~d~~~~~~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~--~~~---------~~~~~~  198 (288)
T PRK03072        134 ELSHVYNRDILISSVAGALASVITYLANMAMFAGMFGGRRDNDGPNPLALLL----VSLL--GPI---------AATVIQ  198 (288)
T ss_pred             HHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHH----HHHH--HHH---------HHHHHH
Confidence            99999999999765443332222211111111 1100   0  001111100    0011  010         112366


Q ss_pred             cccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccc-c------------------CCCcccccCCCCChHH
Q psy768          368 LPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQ-V------------------GPKMEEYLSTHPSHEN  428 (475)
Q Consensus       368 ~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~-~------------------g~~~~~~lsTHP~~~~  428 (475)
                      +.|||.+|++||++|+++.    .||++++++|+||........ .                  +.....|++|||++++
T Consensus       199 ~~~SR~rE~~AD~~A~~l~----~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~~~~~i~~p~~~~~~~~l~sTHP~~~e  274 (288)
T PRK03072        199 LAISRSREYQADESGAELT----GDPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPFRAGGIGRLFSTHPPMAD  274 (288)
T ss_pred             HHHHhHHHHHHhHHHHHHh----CCHHHHHHHHHHHHhccccCCCCccccchhhhhhhhcCcccchHHHHHHcCCcCHHH
Confidence            7899999999999998874    599999999999986441100 0                  0112578999999999


Q ss_pred             HHHHHHHHHHH
Q psy768          429 RANNLESKMKE  439 (475)
Q Consensus       429 RI~~L~~~i~~  439 (475)
                      ||++|++...+
T Consensus       275 RI~~L~~~~~~  285 (288)
T PRK03072        275 RIARLEQMAGR  285 (288)
T ss_pred             HHHHHHHHhhh
Confidence            99999977543


No 10 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.92  E-value=1.3e-24  Score=218.24  Aligned_cols=188  Identities=26%  Similarity=0.300  Sum_probs=123.1

Q ss_pred             HHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768          227 KRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH  302 (475)
Q Consensus       227 ~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH  302 (475)
                      ......+.++...+.     .+..++++|++++.+|||++|+    |+|+||+||++.+ |+|||++|||||+||+++||
T Consensus        99 ~~~~~~v~~~a~~~~-----~~~~~~v~i~~~~~~NAFa~g~~~~~~~V~vt~gLl~~l-~~dEl~aVlaHElgHi~~rd  172 (302)
T COG0501          99 AVLLLKVAELARQAG-----IPHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDLL-NDDELEAVLAHELGHIKNRH  172 (302)
T ss_pred             HHHHHHHHHHHHHCC-----CCCCCeeEEecCCCccceecCCCCCCeeEEecHHHHhhC-CHHHHHHHHHHHHHHHhccc
Confidence            344445555544321     1347899999999999999998    6899999999976 68999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHH-HHHHhccchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHH
Q psy768          303 VAEKLSNKTFLEILYIVPLM-IIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEV  381 (475)
Q Consensus       303 ~~~~~~~~~l~~~l~~~~~~-~i~~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~  381 (475)
                      ...++............+.. +.+.............+...+...               ....+.+.|||.+|++||++
T Consensus       173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~SR~~E~~AD~~  237 (302)
T COG0501         173 TLVRLTLRGLLASAFVLLATLALAAGLLGEAALALLLLLLLLALF---------------LATLLVLAFSRKREYEADRF  237 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHH---------------HHHHHHHHHHHHhHHhhhhc
Confidence            99865544433222222111 111110000000011100000000               01127889999999999999


Q ss_pred             HHHHHHHcCCCchhHHHHHHHHHhhcc----ccccCCC-----cccccCCCCChHHHHHHHHHHHHHH
Q psy768          382 GLKLMARACYDVRVAPLFWQKMALKET----QDQVGPK-----MEEYLSTHPSHENRANNLESKMKEA  440 (475)
Q Consensus       382 gl~ll~~AGydp~a~~~~l~kl~~~~~----~~~~g~~-----~~~~lsTHP~~~~RI~~L~~~i~~a  440 (475)
                      |++++.     |+.+...++|+....+    .....+.     ...+++|||++.+||++|+++.+..
T Consensus       238 a~~l~~-----~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~stHP~~~~Ri~~L~~~~~~~  300 (302)
T COG0501         238 AAKLTG-----PEKLASALQKLARLSGRANSKAFIASGFSGGRLQALFSTHPPLAERIAALRQLALTV  300 (302)
T ss_pred             hhhccC-----hHHHHHHHHHHHhhhcccchhhhhcCcccccchHHHHhcCCCHHHHHHHHHHHHHhh
Confidence            988874     7888999999887431    1111111     1479999999999999999987653


No 11 
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.92  E-value=1.1e-23  Score=212.64  Aligned_cols=186  Identities=19%  Similarity=0.265  Sum_probs=118.1

Q ss_pred             HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCC----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPD----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpg----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      ++.+++++++.... +|     .-++++++++.+|||++|.    +.|+|++||++.+ |+|||++|||||+||++++|.
T Consensus        81 ~L~~~v~~la~~~g-ip-----~p~v~v~~~~~~NAfa~G~~~~~~~v~vt~gLl~~l-~~~El~aVlAHElgHi~~~d~  153 (298)
T PRK04897         81 ELWHIVEDMAMVAQ-IP-----MPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIM-NREELEGVIGHEISHIRNYDI  153 (298)
T ss_pred             HHHHHHHHHHHHcC-CC-----CCcEEEecCCCCceEEeccCCCCcEEEeehHHHhhC-CHHHHHHHHHHHHHHHhcCCH
Confidence            46666666665432 22     2389999999999999973    3699999999999 679999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHH----HHHhc------cc-hHH-HHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccc
Q psy768          304 AEKLSNKTFLEILYIVPLMI----IWFLL------PD-LGA-IVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFE  371 (475)
Q Consensus       304 ~~~~~~~~l~~~l~~~~~~~----i~~~~------~~-~~~-~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~S  371 (475)
                      ........+..++..+....    .+...      .+ ..+ .+...+...+..++  .|+         ...++.+.+|
T Consensus       154 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~--~~~---------~~~ll~~~~S  222 (298)
T PRK04897        154 RLSTIAVALASAITLLSDIAGRMMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLIL--APL---------AATLIQLAIS  222 (298)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccchhhHHHHHHHHHHHHH--HHH---------HHHHHHHhhh
Confidence            87654433222221111100    11110      00 000 00000000000000  011         1123567899


Q ss_pred             hhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc----------CCC-----cccccCCCCChHHHHHHHHH
Q psy768          372 REMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV----------GPK-----MEEYLSTHPSHENRANNLES  435 (475)
Q Consensus       372 R~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~----------g~~-----~~~~lsTHP~~~~RI~~L~~  435 (475)
                      |.+|++||++|+++.    +||++++++++|+...+.....          .++     ...|++|||++++||++|++
T Consensus       223 R~rE~~AD~~A~~lt----~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~i~~p~~~~~~~~lfsTHP~~~eRI~~L~~  297 (298)
T PRK04897        223 RQREYLADASSVELT----RNPQGLISALEKISNSQPMKHPVDDASAALYISDPLKKKGLSKLFDTHPPIEERIERLKN  297 (298)
T ss_pred             HHHHHHHHHHHHHHh----CCHHHHHHHHHHHHhccccccccChHHHHhccCCCcccchHHHHHcCCcCHHHHHHHHHc
Confidence            999999999988875    6899999999999865421100          011     24689999999999999975


No 12 
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.92  E-value=5.6e-24  Score=214.21  Aligned_cols=198  Identities=24%  Similarity=0.222  Sum_probs=121.7

Q ss_pred             CCCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHH
Q psy768          217 QVLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLS  292 (475)
Q Consensus       217 ~~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLA  292 (475)
                      ++.|.++|+   +.+++++++...+ .|     .-++++++++.+|||++|.+    .|+|++||++.+ |+||+++|||
T Consensus        69 ~~~~~~~p~---L~~~v~~la~~~~-~~-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVla  138 (296)
T PRK02391         69 IVSEDEYPE---LHAMVERLCALAD-LP-----KPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRL-DPDELEAVLA  138 (296)
T ss_pred             ECChhhCHH---HHHHHHHHHHHcC-CC-----CCcEEEEeCCCCceEEecCCCCCcEEEecHHHHhhC-CHHHHHHHHH
Confidence            345556664   4455555554321 22     23789999999999999743    489999999998 5799999999


Q ss_pred             HHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-c-c-hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccc
Q psy768          293 HELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-P-D-LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELP  369 (475)
Q Consensus       293 HElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-~-~-~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~  369 (475)
                      ||+||+.++|.........+..+...+....++... . + .....+.++...+...+         .  .....++.+.
T Consensus       139 HElgHi~~~di~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~--~~~~~~l~~~  207 (296)
T PRK02391        139 HELSHVKNRDVAVMTIASFLSTIAFLIVRWGFYFGGFGGRGGGGGGGGILVVILVSLV---------V--WAISFLLIRA  207 (296)
T ss_pred             HHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCcccchHHHHHHHHHHH---------H--HHHHHHHHHH
Confidence            999999999998754433222222111111111110 0 0 00000000000000000         0  0011125678


Q ss_pred             cchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc----c-----------c----cCCCcccccCCCCChHHHH
Q psy768          370 FEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ----D-----------Q----VGPKMEEYLSTHPSHENRA  430 (475)
Q Consensus       370 ~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~----~-----------~----~g~~~~~~lsTHP~~~~RI  430 (475)
                      +||.+|++||++|+++.    .||.++++++.|+....+.    +           .    .+.....|++|||++++||
T Consensus       208 ~SR~rE~~AD~~Aa~lt----g~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~i~p~~~~~~~~~l~sTHP~~~eRI  283 (296)
T PRK02391        208 LSRYREFAADRGAAIIT----GRPSALASALMKISGRMDRVPTEDLREAEGMNAFFIIPALSGGSLGRLFSTHPPLEKRI  283 (296)
T ss_pred             HHHHHHHHHhHHHHHHh----CCHHHHHHHHHHHHcccccCCchhhcccHHHhhhcccCCCCcchHHHHhcCCcCHHHHH
Confidence            99999999999988765    4899999999999864210    0           0    0112257899999999999


Q ss_pred             HHHHHHHHH
Q psy768          431 NNLESKMKE  439 (475)
Q Consensus       431 ~~L~~~i~~  439 (475)
                      ++|++..++
T Consensus       284 ~~L~~~~~~  292 (296)
T PRK02391        284 AQLEKLERE  292 (296)
T ss_pred             HHHHHHHHh
Confidence            999976543


No 13 
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.91  E-value=2.8e-23  Score=208.05  Aligned_cols=185  Identities=22%  Similarity=0.230  Sum_probs=116.8

Q ss_pred             HHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeCCC----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          228 RVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDG----RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       228 ~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalpgG----~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      ++.+.+++++...+ .|     .-++++++++.+|||++|.+    .|+|++||++.+ |+||+++|+|||+||+.++|.
T Consensus        78 ~L~~~v~~la~~~g-~p-----~p~v~v~~~~~~NAfa~G~~~~~~~V~vt~gLl~~L-~~~El~aVlAHElgHi~~~d~  150 (284)
T PRK05457         78 WLVETVARQARQAG-IG-----MPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNM-SRDEVEAVLAHEISHIANGDM  150 (284)
T ss_pred             HHHHHHHHHHHhCC-CC-----CCCEEEEeCCCceEEEecCCCCCeEEEeehHHhhhC-CHHHHHHHHHHHHHHHHcCCH
Confidence            57777777766532 22     34789999999999999843    489999999999 679999999999999999998


Q ss_pred             HHHHHHHHHHHHHHHHHHHH----HHHhcc-c-hHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHH
Q psy768          304 AEKLSNKTFLEILYIVPLMI----IWFLLP-D-LGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETE  377 (475)
Q Consensus       304 ~~~~~~~~l~~~l~~~~~~~----i~~~~~-~-~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~E  377 (475)
                      ........+...+...+...    ...... + ....++..+...+..++  +++         ...++.+.+||.+|++
T Consensus       151 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~---------~~~ll~~~~SR~rEy~  219 (284)
T PRK05457        151 VTMTLIQGVVNTFVIFLSRIIAQIVDRFVSGNEEGNGIGYFIVSIVLEIV--FGI---------LASIIVMWFSRHREFR  219 (284)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccHHHHHHHHHHHHH--HHH---------HHHHHHHHHHhHHHHH
Confidence            86533222221111111100    000000 0 00001100000000000  000         1113567899999999


Q ss_pred             HHHHHHHHHHHcCCCchhHHHHHHHHHhhcccccc---------CC-CcccccCCCCChHHHHHHHHH
Q psy768          378 ADEVGLKLMARACYDVRVAPLFWQKMALKETQDQV---------GP-KMEEYLSTHPSHENRANNLES  435 (475)
Q Consensus       378 AD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~---------g~-~~~~~lsTHP~~~~RI~~L~~  435 (475)
                      ||++|.++.   |  |++++++++|+.+.......         +. ....|++|||++++||++|++
T Consensus       220 AD~~Aa~lt---g--p~~L~~AL~KL~~~~~~~~~~~~~~~~i~~~~~~~~lfsTHP~~~eRI~~L~~  282 (284)
T PRK05457        220 ADAGGAKLA---G--REKMIAALQRLKTSYEPQLPGSMAAFGINGKSGLSELFMSHPPLEKRIAALRS  282 (284)
T ss_pred             HHHHHHHhh---C--HHHHHHHHHHHHhhCccCCChHHHHhhccCchhHHHHHcCCcCHHHHHHHHHh
Confidence            999999885   3  89999999999876521000         10 135789999999999999976


No 14 
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.90  E-value=5.8e-23  Score=208.27  Aligned_cols=196  Identities=20%  Similarity=0.223  Sum_probs=121.4

Q ss_pred             CCCCCCCHHHHHHHHHHHHHHHhccccccCCCcceeEEEEecCCcceeEeC----CCeEEEehhhhhhcCCHHHHHHHHH
Q psy768          217 QVLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLS  292 (475)
Q Consensus       217 ~~lp~~dp~~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAFalp----gG~IvV~~GLL~~l~~edELaaVLA  292 (475)
                      ++.|.+ |...++.+++++++.... .|     ..+++++|++.+|||+++    +++|+|++||++.+ |+|||.+|||
T Consensus        74 ~~~p~~-~~~~~L~~~v~~la~~~g-~~-----~p~vyv~~~~~~NAfa~G~~~~~~~Ivvt~gLl~~l-~~~El~aVlA  145 (324)
T PRK01265         74 EVTPTD-PVYGWLYSIVAEVAKYNG-IR-----VPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKIL-NRDEIKAVAG  145 (324)
T ss_pred             eCCCCC-cccHHHHHHHHHHHHHcC-CC-----CCeEEEecCCCCCeEEeccCCCCCEEEEehHHHhhC-CHHHHHHHHH
Confidence            444443 222345666666665431 22     348999999999999975    45799999999999 6799999999


Q ss_pred             HHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhc-------cchHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhh
Q psy768          293 HELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLL-------PDLGAIVTQVLLDDLKSIIFELPFEREMETEADESII  365 (475)
Q Consensus       293 HElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~-------~~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~  365 (475)
                      ||+||++++|....+....+..++..+.....|...       .+.... +..+...+.     .++     ..+  ..+
T Consensus       146 HElgHik~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~l~~ll~-----~~~-----~~i--~~~  212 (324)
T PRK01265        146 HELGHLKHRDVELLMAIGLIPTLIYYLGYSLFWGGMFGGGGGGRGNNGG-LLFLIGIAL-----MAV-----SFV--FNL  212 (324)
T ss_pred             HHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCCCccch-HHHHHHHHH-----HHH-----HHH--HHH
Confidence            999999999998765443332222221111122110       000000 000000000     000     001  112


Q ss_pred             hccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc--------c---------------ccC---------
Q psy768          366 FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ--------D---------------QVG---------  413 (475)
Q Consensus       366 ~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~--------~---------------~~g---------  413 (475)
                      +.+.+||.+|++||++|.+.+   +-+|.++.++++|+......        .               ..+         
T Consensus       213 l~~aiSR~rEy~AD~~aa~~~---tg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~~~l~i~~~~~~~~~~~~~~~  289 (324)
T PRK01265        213 LVLSINRMREAYADVNSALTV---PGGAENLQTALAKITLSMDPGALERFKKKSTTNQMASMLFFSNAIEEVPTWDAREL  289 (324)
T ss_pred             HHHHHHHHHHHHHhHHHHHhh---cCChHHHHHHHHHHHhcccccccccccccccCChhhhhheeeCcccccccchhhhh
Confidence            667899999999999987643   23789999999999853210        0               001         


Q ss_pred             ---------CCcccccCCCCChHHHHHHHHHH
Q psy768          414 ---------PKMEEYLSTHPSHENRANNLESK  436 (475)
Q Consensus       414 ---------~~~~~~lsTHP~~~~RI~~L~~~  436 (475)
                               +...++|||||++++||++|++.
T Consensus       290 ~~~~~~~~~~~~~~lFsTHPp~e~Ri~rL~~~  321 (324)
T PRK01265        290 VEYWKTTKVPWYADIFSDHPHPAKRIQLLEKL  321 (324)
T ss_pred             hhhhcccchhHHHHHhCCCcChHHHHHHHHHh
Confidence                     01246799999999999999985


No 15 
>KOG2719|consensus
Probab=99.72  E-value=8.4e-17  Score=165.88  Aligned_cols=198  Identities=17%  Similarity=0.244  Sum_probs=127.4

Q ss_pred             cCCCCCCCCHHH-HHHHHHHHHHHHhccccccCCCcce-eEEEEec----CCcceeEeCCC---eEEEehhhhh--hcCC
Q psy768          215 GNQVLPLGHPAY-KRVGAVVKRLIDANKVYMEHNNFKY-PITIIDD----PLINAFVFPDG---RIFMFTGMFQ--LCQT  283 (475)
Q Consensus       215 ~~~~lp~~dp~~-~~v~~i~~rL~~~~~~~p~~~~~~~-~v~Vv~~----~~~NAFalpgG---~IvV~~GLL~--~l~~  283 (475)
                      -++..|..+... +.++++++++.           .|- +++|++.    ...|||.+|-+   +|+++|.|+.  ...|
T Consensus       208 f~K~~PLe~g~l~~~Ie~la~s~g-----------fp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~~~~~~  276 (428)
T KOG2719|consen  208 FGKFTPLEEGDLKEKIERLADSVG-----------FPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLEEEHLN  276 (428)
T ss_pred             hcCCCCCCCCchHHHHHHHHHhcC-----------CCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhhhhccc
Confidence            456667666544 46777766653           222 4667772    27999999855   6999999993  3336


Q ss_pred             HHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHH------hcc--c-hHHHHHHHHHHHHHHHHhhcchhh
Q psy768          284 DDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWF------LLP--D-LGAIVTQVLLDDLKSIIFELPFER  354 (475)
Q Consensus       284 edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~------~~~--~-~~~~l~~~~~~~l~~~l~~lp~~~  354 (475)
                      +||+++|+|||+||+.++|..++....++..++.+.+.++++.      .++  + ....+|-++   +..+. ..|...
T Consensus       277 ~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~~~P~~ig~li---vf~~~-l~py~~  352 (428)
T KOG2719|consen  277 NEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFIDEQPSLIGFLI---VFQFV-LAPYRA  352 (428)
T ss_pred             cHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCCCCcchhHHHH---HHHHH-HHhHHH
Confidence            6899999999999999999999988877766665554444432      111  1 001122110   00110 011111


Q ss_pred             hhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccccccCCCc-ccccCCCCChHHHHHHH
Q psy768          355 EMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGPKM-EEYLSTHPSHENRANNL  433 (475)
Q Consensus       355 ~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~~~~g~~~-~~~lsTHP~~~~RI~~L  433 (475)
                      .       .....+-+||..|++||++|    .+-||.- .+..++-||...+-.-...++. ..|-+|||+.-+|++++
T Consensus       353 l-------~~~~~n~~sR~fEyqAD~fA----~klGYg~-~L~~AL~KL~~dnlsf~~~D~LYs~~~~~HPtvleRl~~l  420 (428)
T KOG2719|consen  353 L-------LNFLMNLISRRFEYQADAFA----KKLGYGK-DLRQALIKLFVDNLSFPVSDPLYSAWHHLHPTVLERLDAL  420 (428)
T ss_pred             H-------HHHHHHHHHHHHHHHHHHHH----HHcCCch-hHHHHHHHHhhhhcCCCCCcHHHHHHHhcCchHHHHHHHH
Confidence            1       11267889999999999994    6668864 5677777887655111112222 56789999999999999


Q ss_pred             HHHHHH
Q psy768          434 ESKMKE  439 (475)
Q Consensus       434 ~~~i~~  439 (475)
                      +....+
T Consensus       421 ~~~~~k  426 (428)
T KOG2719|consen  421 DYLSEK  426 (428)
T ss_pred             HHHHhh
Confidence            987654


No 16 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=98.20  E-value=3.1e-05  Score=78.35  Aligned_cols=136  Identities=21%  Similarity=0.284  Sum_probs=83.3

Q ss_pred             EEEEecC-CcceeEeCC--CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy768          253 ITIIDDP-LINAFVFPD--GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLP  329 (475)
Q Consensus       253 v~Vv~~~-~~NAFalpg--G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~  329 (475)
                      +.++.++ ....|++|-  -+|++..++.+.+ |++|+..||.||++|+.++|....+-...        +..+.|. .|
T Consensus       160 ~~i~~s~~i~sP~~~G~~~p~I~lP~~~~~~~-~~~el~~il~HEl~Hikr~D~~~~~l~~l--------~~~l~Wf-nP  229 (299)
T PF05569_consen  160 IRIRVSSGISSPFVFGFLRPVIVLPESLLEDL-SEEELRAILLHELAHIKRRDLLWKLLAEL--------LCALHWF-NP  229 (299)
T ss_pred             eEEEEcCCCCCCeeecCcceEEEecCcccccc-CHHHHHHHHHHHHHHHHCCChHHHHHHHH--------HHHHHHh-hH
Confidence            3344333 456777753  3799999999888 57999999999999999999986542221        1122221 11


Q ss_pred             chHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhccc
Q psy768          330 DLGAIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQ  409 (475)
Q Consensus       330 ~~~~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~~~  409 (475)
                                   +.-+                   +...+.+.+|+.||+..++.+.  +-+.......+-++......
T Consensus       230 -------------~~~~-------------------~~~~~~~~~E~~cD~~vl~~l~--~~~~~~Y~~~Ll~~~~~~~~  275 (299)
T PF05569_consen  230 -------------LVWL-------------------LRRRIRRDRELACDEAVLRNLG--KEERKAYAETLLKVAKRSQQ  275 (299)
T ss_pred             -------------HHHH-------------------HHHHHHHHHHHhhhHHHHHhcC--chhHHHHHHHHHHHHHhhcC
Confidence                         1111                   4567889999999999877642  23455666666566544411


Q ss_pred             cccCCCc-ccccCCCCChHHHHHHH
Q psy768          410 DQVGPKM-EEYLSTHPSHENRANNL  433 (475)
Q Consensus       410 ~~~g~~~-~~~lsTHP~~~~RI~~L  433 (475)
                       ...... ..+..+.-..++||..|
T Consensus       276 -~~~~~~~~~~~~~~~~lk~RI~~I  299 (299)
T PF05569_consen  276 -FKRPPLASSFAFSKSQLKRRIKMI  299 (299)
T ss_pred             -CCcchhhhhccCChHHHHHHHHhC
Confidence             001111 12234666778887653


No 17 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=97.67  E-value=9.8e-05  Score=70.79  Aligned_cols=60  Identities=18%  Similarity=0.119  Sum_probs=34.0

Q ss_pred             ccchhhhHHHHHHHHHHHHHcCCC-----chhHHHHHHHHH--hhccccccCCCcccccCCCCChHHHHHHH
Q psy768          369 PFEREMETEADEVGLKLMARACYD-----VRVAPLFWQKMA--LKETQDQVGPKMEEYLSTHPSHENRANNL  433 (475)
Q Consensus       369 ~~SR~~E~EAD~~gl~ll~~AGyd-----p~a~~~~l~kl~--~~~~~~~~g~~~~~~lsTHP~~~~RI~~L  433 (475)
                      ..+++.|.+||.+|..++-....+     -+.+.-....+.  ...     -......-.|||+..+||.++
T Consensus       123 ~~~~~eE~~AD~~A~~~il~~~~~~~~~~~r~~gIa~al~~i~~l~-----~~~~~~~~~tHP~~~~Ri~~~  189 (206)
T PF10463_consen  123 SQSIQEEKEADSYATEMILSDVGNYAEERKRNLGIAIALFFISLLF-----PENSLNGTTTHPDIDDRIFNI  189 (206)
T ss_pred             chhHHHHHhhhHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHh-----cccccCCCCCCcCHHHHHHHH
Confidence            456778999999999998766555     111111111111  111     011122235999999999665


No 18 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=97.67  E-value=0.00013  Score=72.91  Aligned_cols=60  Identities=15%  Similarity=-0.004  Sum_probs=38.0

Q ss_pred             ccchhhhHHHHHHHHHHHHHcCCC--chh-----HHHHHHHHHhhccccccCCCcccccCCCCChHHHHHHH
Q psy768          369 PFEREMETEADEVGLKLMARACYD--VRV-----APLFWQKMALKETQDQVGPKMEEYLSTHPSHENRANNL  433 (475)
Q Consensus       369 ~~SR~~E~EAD~~gl~ll~~AGyd--p~a-----~~~~l~kl~~~~~~~~~g~~~~~~lsTHP~~~~RI~~L  433 (475)
                      ..|++.|.+||.+|..++-...++  +..     ..-++--+....     ........+|||+..+||.+.
T Consensus       187 e~s~~eE~eaDs~At~~il~~~~~~a~s~Kr~vgIaia~l~l~~l~-----~~~~~~g~~tHP~~~~RI~a~  253 (305)
T PRK09672        187 EDSIQEEKEADSYATNWLLSKSEKYAPSKKRSVGIAIALLFLQELE-----LENKSCGKGTHPDADQRIFAN  253 (305)
T ss_pred             hHHHHHHHHHHHHHHHHHHhccccccHHHhhhHHHHHHHHHHHHHh-----ccccCCCCCCCCCHHHHHHHh
Confidence            456788999999999999987777  322     111111222222     111122246999999999987


No 19 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=97.20  E-value=0.0015  Score=66.14  Aligned_cols=131  Identities=21%  Similarity=0.171  Sum_probs=71.2

Q ss_pred             CcceeEeC-CCeEEEehhhhhhcC------CHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHhccchH
Q psy768          260 LINAFVFP-DGRIFMFTGMFQLCQ------TDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLG  332 (475)
Q Consensus       260 ~~NAFalp-gG~IvV~~GLL~~l~------~edELaaVLAHElgHv~~rH~~~~~~~~~l~~~l~~~~~~~i~~~~~~~~  332 (475)
                      ..|||=.| +.+||+....++.+.      .+-..+.|||||.||.+.+..-          ++.-              
T Consensus       136 ~~gpFYCp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlAHEyGHHVQ~l~G----------il~~--------------  191 (292)
T PF04228_consen  136 ATGPFYCPADQTIYLDLSFFDELQQRFGASGDFAQAYVLAHEYGHHVQNLLG----------ILDA--------------  191 (292)
T ss_pred             CCCCEeCCCCCEEEechHHHHHHHHHhCCccHHHHHHHHHHHHHHHHHHHhh----------hHHH--------------
Confidence            35667666 568999988766552      2345678999999998654221          0000              


Q ss_pred             HHHHHHHHHHHHHHHhhcchhhhhhhhhhhhhhhccccchhhhHHHHHHHHHHHHHcC----CCchhHHHHHHHHHhhcc
Q psy768          333 AIVTQVLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEADEVGLKLMARAC----YDVRVAPLFWQKMALKET  408 (475)
Q Consensus       333 ~~l~~~~~~~l~~~l~~lp~~~~l~~~~~~~~~~~~~~SR~~E~EAD~~gl~ll~~AG----ydp~a~~~~l~kl~~~~~  408 (475)
                        +.... .....                   --.+..||..|.+||.||=-++..+-    +++....+.+.......+
T Consensus       192 --~~~~~-~~~~~-------------------~~~~~~svr~ELQADC~AGvw~~~~~~~~~l~~~di~~al~aa~aiGD  249 (292)
T PF04228_consen  192 --VRQAQ-QGRSP-------------------AEANELSVRLELQADCFAGVWAGHAAEDGSLTPGDIEEALNAASAIGD  249 (292)
T ss_pred             --HHHHh-hccCc-------------------cccchHHHHHHHHHHHHHHHHhhhccccCCcCHHHHHHHHHHHHHhcC
Confidence              00000 00000                   02356799999999999988876553    233333333333222211


Q ss_pred             ---ccc-cCCCcccccCCCCChHHHHHHHHHHH
Q psy768          409 ---QDQ-VGPKMEEYLSTHPSHENRANNLESKM  437 (475)
Q Consensus       409 ---~~~-~g~~~~~~lsTHP~~~~RI~~L~~~i  437 (475)
                         ++. .+...|+ -.||=+.++|+.-.++=.
T Consensus       250 D~iq~~~~g~v~pd-s~tHGSs~qR~~Wf~~G~  281 (292)
T PF04228_consen  250 DRIQKRAGGRVVPD-SFTHGSSEQRVRWFQRGY  281 (292)
T ss_pred             hHhhhhcCCCCCCC-CcCcCCHHHHHHHHHHHh
Confidence               011 1111122 139999999987665543


No 20 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.06  E-value=0.0031  Score=53.65  Aligned_cols=50  Identities=28%  Similarity=0.439  Sum_probs=31.2

Q ss_pred             eeEEEEecC--CcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768          251 YPITIIDDP--LINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       251 ~~v~Vv~~~--~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      .+|...+..  ....+..+   .+.|+|+..+     ++.+-.+++|||+||+..+|...
T Consensus         6 i~v~~~~~~~~~~~~~~~~~~~~~~I~in~~~-----~~~~~~f~laHELgH~~~~~~~~   60 (122)
T PF06114_consen    6 IDVVYTDLGPKDLGGFSIPKENNPIIFINSNL-----SPERQRFTLAHELGHILLHHGDE   60 (122)
T ss_dssp             -EEEEE--TT--EEEETT----TTEEEEESSS------HHHHHHHHHHHHHHHHHHH-HH
T ss_pred             EEEEEEcCCCCCcCEEEEecCcCCEEEECCCC-----CHHHHHHHHHHHHHHHHhhhccc
Confidence            344444443  33334333   5578888873     57888999999999999888764


No 21 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=96.85  E-value=0.038  Score=56.14  Aligned_cols=53  Identities=25%  Similarity=0.281  Sum_probs=40.3

Q ss_pred             EEEEecC-CcceeEeC--CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHH
Q psy768          253 ITIIDDP-LINAFVFP--DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEK  306 (475)
Q Consensus       253 v~Vv~~~-~~NAFalp--gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~  306 (475)
                      +.+..+. ..|-.+++  --.|++-+.+.+.++ ++|+..|++||++|++++|....
T Consensus       154 i~ir~s~~i~~P~v~gl~kp~IvlP~d~~~r~~-~ee~~yIilHEl~Hlk~gD~i~n  209 (337)
T COG4219         154 ILIRKSKAIDGPMVFGLVKPCIVLPADFVERLT-DEELKYIILHELSHLKRGDAIIN  209 (337)
T ss_pred             eeEeecccCCCceeeccCcceEEccHHHHhhcC-HHhhhhhHhHHHhhhhcccHHHH
Confidence            4555543 33444433  446999999999995 68999999999999999998764


No 22 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=96.13  E-value=0.009  Score=60.27  Aligned_cols=56  Identities=20%  Similarity=0.281  Sum_probs=47.7

Q ss_pred             eEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHHHHH
Q psy768          252 PITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLS  308 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~~~~  308 (475)
                      .+.++.++.+..+++.|.+|+++-..+..+ +.+|+.+||+||+-|++.+|....-.
T Consensus        26 ~l~~~~~~~~~t~~tDg~~l~~nP~~~~~l-~~~~~~~~l~HevlH~~~~H~~r~~~   81 (292)
T PF13203_consen   26 SLPPVEDDGIPTAATDGRRLYYNPEFLESL-SPEERVGLLLHEVLHCLLRHPWRRGG   81 (292)
T ss_pred             hCeeeeCCCCceeeEcCcEEEECcHHHhcC-CHHHHHHHHHHHHHHHHccchhhhcc
Confidence            344455777888999999999999999888 57899999999999999999987655


No 23 
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=94.27  E-value=0.24  Score=49.28  Aligned_cols=39  Identities=26%  Similarity=0.479  Sum_probs=29.5

Q ss_pred             eeEeC-CCeEEEehhhhhhcC------CHHHHHHHHHHHHHhhhcc
Q psy768          263 AFVFP-DGRIFMFTGMFQLCQ------TDDELATVLSHELSHTLLK  301 (475)
Q Consensus       263 AFalp-gG~IvV~~GLL~~l~------~edELaaVLAHElgHv~~r  301 (475)
                      -|-.| +.++|+.....+.|+      .+=.-+.|+|||.||++.+
T Consensus       136 PFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHVQn  181 (295)
T COG2321         136 PFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHVQN  181 (295)
T ss_pred             CeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHHHH
Confidence            45556 468999999988874      3345689999999998653


No 24 
>PRK04351 hypothetical protein; Provisional
Probab=93.85  E-value=0.22  Score=45.60  Aligned_cols=38  Identities=24%  Similarity=0.314  Sum_probs=31.3

Q ss_pred             CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhh
Q psy768          260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHT  298 (475)
Q Consensus       260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv  298 (475)
                      ..-++-+....|-++.-+++.. +++.|..||+|||+|+
T Consensus        35 tgG~~~l~~~~I~lnp~ll~~~-~~~~l~~vv~HElcH~   72 (149)
T PRK04351         35 TGGRYLLKDHHIEFNPKMLEEY-GLEELIGIIKHELCHY   72 (149)
T ss_pred             hhheeecCCCeEEeCHHHHhhc-cHHHHHhhHHHHHHHH
Confidence            3445556678899999998875 5799999999999997


No 25 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=92.33  E-value=0.34  Score=44.02  Aligned_cols=41  Identities=17%  Similarity=0.266  Sum_probs=32.1

Q ss_pred             ceeEeCCC---eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          262 NAFVFPDG---RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       262 NAFalpgG---~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      -.+...++   .|.++..+++. .+++++..+|.|||+|+...+.
T Consensus        33 G~~~~~~~~~~~I~ls~~~~~~-~~~~~~~~tL~HEm~H~~~~~~   76 (157)
T PF10263_consen   33 GRCRYKRRSPCEIRLSPKLLDR-NPEEELIDTLLHEMAHAAAYVF   76 (157)
T ss_pred             EEEEECCCCceEEEECHHHHHh-hHHHHHHHHHHHHHHHHHhhhc
Confidence            33445555   69999999988 4688999999999999976433


No 26 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=92.12  E-value=5.1  Score=39.10  Aligned_cols=50  Identities=22%  Similarity=0.183  Sum_probs=33.8

Q ss_pred             cceeEEEEecCCcceeEeCCC-eEEEehhhhhhcC----C------------HHHHHHHHHHHHHhh
Q psy768          249 FKYPITIIDDPLINAFVFPDG-RIFMFTGMFQLCQ----T------------DDELATVLSHELSHT  298 (475)
Q Consensus       249 ~~~~v~Vv~~~~~NAFalpgG-~IvV~~GLL~~l~----~------------edELaaVLAHElgHv  298 (475)
                      .+..|.+-...+.|||=-|.. .|.|--.+++...    +            .+-+.+++-||+||.
T Consensus        37 ~~l~i~~~~CGe~nA~ydPe~~~I~iCYEf~~~~~~~f~~~~~~~~~~~~~~~~~~~~~l~HE~GHA  103 (220)
T PF14247_consen   37 RDLTIRFAECGEDNAFYDPENRSITICYEFVDEILDRFAKANDPDEEYGQAAIGNVLFTLYHELGHA  103 (220)
T ss_pred             CCeEEEEeecCCCCCccCCCCCEEEECHHHHHHHHHHHHhCCcCcchHHHHHHHHHHHHHHHHHHHH
Confidence            345555555779999999954 6888777765531    0            234667788888876


No 27 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=92.12  E-value=0.12  Score=42.16  Aligned_cols=39  Identities=26%  Similarity=0.408  Sum_probs=28.0

Q ss_pred             CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768          260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK  301 (475)
Q Consensus       260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r  301 (475)
                      ...||+. |+.|++..|-.+.-..  +=..+||||++|+...
T Consensus        37 ~A~A~T~-G~~I~f~~g~~~~~s~--~~~~llaHEl~Hv~Qq   75 (79)
T PF13699_consen   37 GARAFTV-GNDIYFAPGKYNPDSP--EGRALLAHELAHVVQQ   75 (79)
T ss_pred             CCeEEEE-CCEEEEcCCCcCCCCC--CcchhHhHHHHHHHhh
Confidence            5677776 7789997776544322  3467999999999753


No 28 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=89.67  E-value=0.37  Score=43.73  Aligned_cols=33  Identities=27%  Similarity=0.315  Sum_probs=28.4

Q ss_pred             CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768          268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK  301 (475)
Q Consensus       268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r  301 (475)
                      .+.|.++..|++.. .++.+..||.|||+|+...
T Consensus        41 ~~~I~ln~~l~~~~-~~~~l~~~l~HEm~H~~~~   73 (146)
T smart00731       41 SAEIRLNPKLLTEN-GRDRLRETLLHELCHAALY   73 (146)
T ss_pred             CCEEEeCHHHHhhc-cHHHHHhhHHHHHHHHHHH
Confidence            67899999998865 4689999999999999873


No 29 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=88.77  E-value=3.6  Score=40.11  Aligned_cols=52  Identities=12%  Similarity=0.213  Sum_probs=29.6

Q ss_pred             ceeEEEEecCCcceeEeCCC-eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          250 KYPITIIDDPLINAFVFPDG-RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       250 ~~~v~Vv~~~~~NAFalpgG-~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      +.+|..++..--+=|- |.. .|-++++-.+.- |- .-.+|-|||.||..+++..
T Consensus        54 ~V~Ve~~~G~LtDHYd-P~~k~vrLS~~vy~~~-Si-aAvaVAAHEvGHAiQ~a~~  106 (222)
T PF04298_consen   54 DVRVERVPGELTDHYD-PRNKVVRLSEDVYNGR-SI-AAVAVAAHEVGHAIQHAEG  106 (222)
T ss_pred             CeeEEEeCCCCCCCcC-CCCCEEEeCCccCCCC-CH-HHHHHHHHHHhHHHhcccc
Confidence            3455555432122222 433 455666655442 32 4467999999999987753


No 30 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=87.36  E-value=3  Score=39.41  Aligned_cols=51  Identities=20%  Similarity=0.117  Sum_probs=35.9

Q ss_pred             eEEEEecCCcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          252 PITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      +|.|.+...-=.-+...|.|.++.-|+..  .++-+..|+.||++|..+.++.
T Consensus       131 ~i~ir~~ksrWGsc~~~~~I~ln~~L~~~--P~~~idYVvvHEL~Hl~~~nHs  181 (205)
T PF01863_consen  131 KIKIRDMKSRWGSCSSKGNITLNWRLVMA--PPEVIDYVVVHELCHLRHPNHS  181 (205)
T ss_pred             eEEEeehhhccccCCCCCcEEeecccccC--CccHHHHHHHHHHHHhccCCCC
Confidence            45555544333334457899999988764  4578999999999999665443


No 31 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=87.04  E-value=2.2  Score=41.64  Aligned_cols=44  Identities=18%  Similarity=0.123  Sum_probs=31.4

Q ss_pred             CcceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768          260 LINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       260 ~~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      ..=+-+...|.|.++.-|+..  .++.+..|++||++|....++..
T Consensus       150 ~~WGScs~~~~i~~~~~l~~~--p~~~i~YVvvHELaHLke~nHs~  193 (223)
T COG1451         150 RRWGSCSKAGEIRFNWRLVMA--PEEVIDYVVVHELAHLKEKNHSK  193 (223)
T ss_pred             ceeeeecCCCcEEeehhhhcC--CHHHHHHHHHHHHHHHhhhhccH
Confidence            333444566767776655443  57899999999999998877544


No 32 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=86.23  E-value=0.61  Score=45.05  Aligned_cols=34  Identities=21%  Similarity=0.409  Sum_probs=24.3

Q ss_pred             CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh-cccHH
Q psy768          268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTL-LKHVA  304 (475)
Q Consensus       268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~-~rH~~  304 (475)
                      ++.|.| .++ ... +-+.++.||+|||||.+ .||.-
T Consensus       117 ~~~I~I-~~~-~~~-~~~~~~hvi~HEiGH~IGfRHTD  151 (211)
T PF12388_consen  117 YKFIQI-YGL-SNY-SVNVIEHVITHEIGHCIGFRHTD  151 (211)
T ss_pred             CceEEE-Eec-CCC-chhHHHHHHHHHhhhhccccccC
Confidence            345888 354 332 45789999999999987 57865


No 33 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=86.02  E-value=1.9  Score=41.06  Aligned_cols=33  Identities=27%  Similarity=0.421  Sum_probs=25.0

Q ss_pred             CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          269 GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       269 G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      |+|+++-  +..-.+.++|.+++|||+-|+..-..
T Consensus        49 ~~i~l~~--~~~~~~~~~l~~~iaHE~hH~~r~~~   81 (195)
T PF10026_consen   49 GYIFLFL--LPNDYSLEELPALIAHEYHHNCRYEQ   81 (195)
T ss_pred             CEEEEEe--cCCcccHHHHHHHHHHHHHHHHHHhc
Confidence            4677775  44445678999999999999976443


No 34 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=85.16  E-value=4.7  Score=38.87  Aligned_cols=49  Identities=24%  Similarity=0.201  Sum_probs=35.1

Q ss_pred             EEecCCcceeEeC---CCeEEEehhhhhhcCC----HHHHHHHHHHHHHhhhcccH
Q psy768          255 IIDDPLINAFVFP---DGRIFMFTGMFQLCQT----DDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       255 Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~----edELaaVLAHElgHv~~rH~  303 (475)
                      ++++..-=|++.+   ...|.++...|....+    .+|+.+||-|||.|+.+...
T Consensus        57 ~~~~~~gVA~t~gd~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q~~~  112 (205)
T PF04450_consen   57 ILDDMDGVAYTSGDDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQWDG  112 (205)
T ss_pred             EEECCCeeEEEecCCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhhcCC
Confidence            3444333344433   1469999999998853    35999999999999977654


No 35 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=82.03  E-value=1.4  Score=42.75  Aligned_cols=29  Identities=28%  Similarity=0.385  Sum_probs=22.4

Q ss_pred             CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhccc
Q psy768          269 GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKH  302 (475)
Q Consensus       269 G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH  302 (475)
                      +.|+++.-.     +++.=.++||||+||++++-
T Consensus        59 ~~I~iN~n~-----~~~r~rFtlAHELGH~llH~   87 (213)
T COG2856          59 PVIYINANN-----SLERKRFTLAHELGHALLHT   87 (213)
T ss_pred             ceEEEeCCC-----CHHHHHHHHHHHHhHHHhcc
Confidence            367776644     56788999999999998753


No 36 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=82.03  E-value=2.7  Score=37.40  Aligned_cols=47  Identities=26%  Similarity=0.108  Sum_probs=23.8

Q ss_pred             cceeEEEEecC-CcceeEeCCC--eEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768          249 FKYPITIIDDP-LINAFVFPDG--RIFMFTGMFQLCQTDDELATVLSHELSHTL  299 (475)
Q Consensus       249 ~~~~v~Vv~~~-~~NAFalpgG--~IvV~~GLL~~l~~edELaaVLAHElgHv~  299 (475)
                      -++.++..+++ ..=|...+.|  +|++-..--+..   +.+. |.+||+||+.
T Consensus        40 a~~~~~~~~~~~~sya~~~g~G~G~I~l~~~~~qgy---~~~R-IaaHE~GHiL   89 (132)
T PF02031_consen   40 ADIRYYEGNDPRGSYASTDGLGSGYIFLDYQQNQGY---NSTR-IAAHELGHIL   89 (132)
T ss_dssp             -SEEEEEE--TT--EEEE-SSS-EEEEEEHHHHHHS----HHH-HHHHHHHHHH
T ss_pred             CcEEEEEecCCCCcccccCCCCcEEEEechHHhhCC---ccce-eeeehhcccc
Confidence            34555544333 3445555544  577765432222   2344 9999999984


No 37 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=80.62  E-value=5.1  Score=38.24  Aligned_cols=20  Identities=20%  Similarity=0.307  Sum_probs=15.8

Q ss_pred             HHHHHHHHHHHhhhcccHHH
Q psy768          286 ELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       286 ELaaVLAHElgHv~~rH~~~  305 (475)
                      .=.+|-|||.||....+...
T Consensus        91 a~~aVAAHEVGHAiQd~~~Y  110 (226)
T COG2738          91 AAIAVAAHEVGHAIQDQEDY  110 (226)
T ss_pred             HHHHHHHHHhhHHHhhhccc
Confidence            34679999999999877653


No 38 
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=80.10  E-value=0.43  Score=51.91  Aligned_cols=46  Identities=22%  Similarity=0.372  Sum_probs=32.0

Q ss_pred             CCcceeEeCCCeEEEe-hhhhhhc---------CCHHHHHHHHHHHHHhhhcccHH
Q psy768          259 PLINAFVFPDGRIFMF-TGMFQLC---------QTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       259 ~~~NAFalpgG~IvV~-~GLL~~l---------~~edELaaVLAHElgHv~~rH~~  304 (475)
                      -.+|||--|+...+|+ -++|..-         .|--.+-+||||||||.--....
T Consensus       449 q~VNAYYnp~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFDdqGa  504 (654)
T COG3590         449 QTVNAYYNPQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFDDQGA  504 (654)
T ss_pred             HHhhhhcCCCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhcccccCCcc
Confidence            3799999998764444 5555421         13346889999999998766554


No 39 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=79.47  E-value=2.4  Score=40.33  Aligned_cols=73  Identities=8%  Similarity=0.084  Sum_probs=42.1

Q ss_pred             HHHHHHHHHHHHHhccccccCCCcceeEEEEecCCccee-EeC-----CCeEEE--e-hhhhhhcCCHHHHHHHHHHHHH
Q psy768          226 YKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAF-VFP-----DGRIFM--F-TGMFQLCQTDDELATVLSHELS  296 (475)
Q Consensus       226 ~~~v~~i~~rL~~~~~~~p~~~~~~~~v~Vv~~~~~NAF-alp-----gG~IvV--~-~GLL~~l~~edELaaVLAHElg  296 (475)
                      ...|++++..+..      -+....|+|-.+..-.+|.- .++     |..|.|  - +.. ...-+-+.|..|+-|||+
T Consensus        19 ~~lL~rlA~~v~p------IM~~~~~~V~~L~E~~P~~~~llG~N~N~G~~I~lrLR~~~~-~~fl~~~~i~~t~lHELa   91 (186)
T PF08325_consen   19 LELLERLAADVKP------IMRKHGWRVGSLEEFYPNGERLLGLNVNKGEKICLRLRTPDD-GGFLPYETILGTMLHELA   91 (186)
T ss_pred             HHHHHHHHHHHHH------HHHHcCcccCeeeccCCCCCCCcceecCCCcEEEEEeCCCCC-CCEeeHHHHHHHHHHHHH
Confidence            3466666666433      23346787777665332221 111     223333  2 222 444456899999999999


Q ss_pred             hhhcccHHH
Q psy768          297 HTLLKHVAE  305 (475)
Q Consensus       297 Hv~~rH~~~  305 (475)
                      |++++.+-.
T Consensus        92 H~~~~~H~~  100 (186)
T PF08325_consen   92 HNVHGPHDD  100 (186)
T ss_pred             hcccCCccH
Confidence            999775543


No 40 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=79.28  E-value=2.5  Score=35.76  Aligned_cols=22  Identities=36%  Similarity=0.551  Sum_probs=18.1

Q ss_pred             CCHHHHHHHHHHHHHhhhcccH
Q psy768          282 QTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       282 ~~edELaaVLAHElgHv~~rH~  303 (475)
                      .+++.+..||+||++|......
T Consensus        20 ~~~~~~~~~l~HE~~H~~~~~~   41 (128)
T PF13485_consen   20 SDEDWLDRVLAHELAHQWFGNY   41 (128)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHH
Confidence            4677889999999999976554


No 41 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=79.05  E-value=16  Score=37.70  Aligned_cols=15  Identities=40%  Similarity=0.603  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHhhh
Q psy768          285 DELATVLSHELSHTL  299 (475)
Q Consensus       285 dELaaVLAHElgHv~  299 (475)
                      -.+-+++||||||..
T Consensus       194 ~~~P~T~~HElAHq~  208 (318)
T PF12725_consen  194 YSLPFTICHELAHQL  208 (318)
T ss_pred             ccccHHHHHHHHHHh
Confidence            467899999999984


No 42 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=78.45  E-value=2.3  Score=38.56  Aligned_cols=51  Identities=20%  Similarity=0.245  Sum_probs=28.9

Q ss_pred             ceeEEEEe----cCCcceeEe----C-CCeEEEehhhhhhc---CCHHHHHHHHHHHHHhhhc
Q psy768          250 KYPITIID----DPLINAFVF----P-DGRIFMFTGMFQLC---QTDDELATVLSHELSHTLL  300 (475)
Q Consensus       250 ~~~v~Vv~----~~~~NAFal----p-gG~IvV~~GLL~~l---~~edELaaVLAHElgHv~~  300 (475)
                      +..+..++    +...+++.-    | .|.|.+.+.-+..-   .+.+.+.+|+.|||||.+.
T Consensus        45 di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLG  107 (165)
T cd04268          45 DIRYSVIRWIPYNDGTWSYGPSQVDPLTGEILLARVYLYSSFVEYSGARLRNTAEHELGHALG  107 (165)
T ss_pred             CceEEEEEeecCCCCccccCCccCCCCCccEEeeEEEEchhHHHHHHHHHHHHHHHHHHHHhc
Confidence            34555554    234455542    2 35566655443221   0235789999999999864


No 43 
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=75.47  E-value=6.7  Score=39.53  Aligned_cols=49  Identities=16%  Similarity=0.080  Sum_probs=28.7

Q ss_pred             EEEecCCcc-eeEe-CCCeEEEehhhhhhcCCHHHH---HHHHHHHHHhhhccc
Q psy768          254 TIIDDPLIN-AFVF-PDGRIFMFTGMFQLCQTDDEL---ATVLSHELSHTLLKH  302 (475)
Q Consensus       254 ~Vv~~~~~N-AFal-pgG~IvV~~GLL~~l~~edEL---aaVLAHElgHv~~rH  302 (475)
                      .++.+...+ +|+. .|+.|....+-.+.+-+.+.+   .--+.||+||....=
T Consensus       181 r~v~~v~~~~g~m~a~g~~i~~~~~~~~~~l~~~~~~~~~WG~~HE~GH~~Q~~  234 (307)
T PF13402_consen  181 RFVFDVQISAGYMHASGYPIGFPPNWMNELLNPNPLRKGGWGPWHELGHNHQQG  234 (307)
T ss_dssp             EEEEETT----SEEEETTEEEEETT--HHHH-HHHHHHH-HHHHHHHHHHH-BG
T ss_pred             eEEEeccccccceeecCCcEEeeCcHHhcccCHhHcCCCCeeehhhhhhhcCcc
Confidence            555555444 5555 566777777766666566666   668999999986643


No 44 
>PRK04860 hypothetical protein; Provisional
Probab=75.47  E-value=2.7  Score=38.95  Aligned_cols=33  Identities=21%  Similarity=0.254  Sum_probs=27.0

Q ss_pred             CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768          268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLK  301 (475)
Q Consensus       268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~r  301 (475)
                      .+.|-++..++..- ++++|..||.|||+|+...
T Consensus        45 ~~~I~~Np~ll~~~-~~~~l~~~v~HEl~H~~~~   77 (160)
T PRK04860         45 SNEIRLNPVLLLEN-QQAFIDEVVPHELAHLLVY   77 (160)
T ss_pred             cCCeeeCHHHHhhC-cHHHHHhHHHHHHHHHHHH
Confidence            45688888887765 6789999999999999553


No 45 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=72.62  E-value=1.5  Score=40.29  Aligned_cols=57  Identities=21%  Similarity=0.265  Sum_probs=28.0

Q ss_pred             cceeEEEEecC------CcceeE-eCCC--------eEEEehhhhhhcCCHHHHHHHHHHHHHhhh-cccHHH
Q psy768          249 FKYPITIIDDP------LINAFV-FPDG--------RIFMFTGMFQLCQTDDELATVLSHELSHTL-LKHVAE  305 (475)
Q Consensus       249 ~~~~v~Vv~~~------~~NAFa-lpgG--------~IvV~~GLL~~l~~edELaaVLAHElgHv~-~rH~~~  305 (475)
                      .-..|+|++..      ...-+| +|..        .|++....+....+.....-+|.||+||+. +.|...
T Consensus        16 ~ylNIwv~~~~~~~g~~~~~G~A~~P~~~~~~~~~~~vv~~~~~l~~~~~~~~~g~TltHEvGH~LGL~HtF~   88 (154)
T PF05572_consen   16 KYLNIWVVNDLYDGGGTSILGYAYFPWSGMSDNGTDGVVINYRYLGGNNSQYNFGKTLTHEVGHWLGLYHTFG   88 (154)
T ss_dssp             T-EEEEEES-EE-TTS-EESEEE--TTS-GGG-SEEEEGGGSSSSTT--TTS-SSHHHHHHHHHHTT---TT-
T ss_pred             CEEEEEEecccccCCCCCCCeEEeCCCCCCCCCCCCEEEEcCcccCCCCCccccccchhhhhhhhhccccccc
Confidence            34578988732      223344 3644        233332333322233456889999999997 456654


No 46 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=72.18  E-value=2.3  Score=38.00  Aligned_cols=35  Identities=29%  Similarity=0.429  Sum_probs=21.2

Q ss_pred             CeEEEehhhhhhc-CCHHHHHHHHHHHHHhhh-cccH
Q psy768          269 GRIFMFTGMFQLC-QTDDELATVLSHELSHTL-LKHV  303 (475)
Q Consensus       269 G~IvV~~GLL~~l-~~edELaaVLAHElgHv~-~rH~  303 (475)
                      |.|.+...---.. .+...+..|+.|||||.. ++|.
T Consensus        86 ~~i~~~~~~~~~~~~~~~~~~~v~~HEiGHaLGL~H~  122 (154)
T PF00413_consen   86 GDIHFNDDESWTIDDSGNDLQSVAIHEIGHALGLDHS  122 (154)
T ss_dssp             TEEEEETTSHEESSSSSEEHHHHHHHHHHHHTTBESS
T ss_pred             ccccccccccchhhhhhhhhhhhhhhccccccCcCcC
Confidence            4555554331111 234578999999999984 4454


No 47 
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=71.51  E-value=8.3  Score=38.58  Aligned_cols=37  Identities=27%  Similarity=0.288  Sum_probs=25.5

Q ss_pred             EEehhhhhhc--CCHHHHHHHHHHHHHhhh---cccHHHHHH
Q psy768          272 FMFTGMFQLC--QTDDELATVLSHELSHTL---LKHVAEKLS  308 (475)
Q Consensus       272 vV~~GLL~~l--~~edELaaVLAHElgHv~---~rH~~~~~~  308 (475)
                      +..|.||..+  .+|.+|+..|-||++|.+   .++.+-+-+
T Consensus       180 wFdDPlLstmlr~dd~~lA~LIFHELAHQk~Y~~~DtAFNEs  221 (376)
T COG4324         180 WFDDPLLSTMLRQDDTYLASLIFHELAHQKIYVNNDTAFNES  221 (376)
T ss_pred             ccccHHHHHHhcCChHHHHHHHHHHHhhheEeecCcchHhHH
Confidence            3455666655  467899999999999964   556654433


No 48 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=69.84  E-value=2.8  Score=38.05  Aligned_cols=16  Identities=38%  Similarity=0.669  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHHHhhh
Q psy768          284 DDELATVLSHELSHTL  299 (475)
Q Consensus       284 edELaaVLAHElgHv~  299 (475)
                      ...+..|+.|||||++
T Consensus       101 ~~~~~~~~~HEiGHaL  116 (156)
T cd04279         101 AENLQAIALHELGHAL  116 (156)
T ss_pred             chHHHHHHHHHhhhhh
Confidence            3579999999999986


No 49 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=68.02  E-value=3.2  Score=40.99  Aligned_cols=31  Identities=26%  Similarity=0.364  Sum_probs=20.3

Q ss_pred             EEEehhhhhhcC-----CHHHHHHHHHHHHHhhhcc
Q psy768          271 IFMFTGMFQLCQ-----TDDELATVLSHELSHTLLK  301 (475)
Q Consensus       271 IvV~~GLL~~l~-----~edELaaVLAHElgHv~~r  301 (475)
                      .+..+||....+     .....+.++||||||...=
T Consensus       146 ~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnlGm  181 (244)
T cd04270         146 KYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNFGS  181 (244)
T ss_pred             eeeecceEeeeccCCccchhHHHHHHHHHHHHhcCC
Confidence            455566654211     1345789999999998754


No 50 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=65.80  E-value=6.9  Score=34.63  Aligned_cols=44  Identities=25%  Similarity=0.262  Sum_probs=25.7

Q ss_pred             ceeEEEEecC--Cc-ceeE-eCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768          250 KYPITIIDDP--LI-NAFV-FPDGRIFMFTGMFQLCQTDDELATVLSHELSHTL  299 (475)
Q Consensus       250 ~~~v~Vv~~~--~~-NAFa-lpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~  299 (475)
                      +..|.+.+.+  .. -|++ .|+|.+-++.+  ..+.    -.+|+.|||||.+
T Consensus        51 d~~I~f~~~~~~g~~~a~~g~~~g~~~~~~~--~~~~----~~~~~~HEigHaL   98 (140)
T smart00235       51 DIYISFGKGDGSGCTLSHAGRPGGDQHFSLG--NGCI----NTGVAAHELGHAL   98 (140)
T ss_pred             CeEEEEEECCCCCcceeeeecCCCceEEEcc--CCcC----CcccHHHHHHHHh
Confidence            3455555554  33 2665 35777666541  1122    2459999999986


No 51 
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=65.58  E-value=6.9  Score=37.06  Aligned_cols=44  Identities=27%  Similarity=0.369  Sum_probs=30.0

Q ss_pred             ceeEeCC-CeEEEehhhhhhc--C-------CHHHHHHHHHHHHHhhhcccHHH
Q psy768          262 NAFVFPD-GRIFMFTGMFQLC--Q-------TDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       262 NAFalpg-G~IvV~~GLL~~l--~-------~edELaaVLAHElgHv~~rH~~~  305 (475)
                      |||=.|. ..|+|..|+|..-  .       +-..|-+||||||+|..-.+...
T Consensus         1 na~Y~~~~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g~~   54 (206)
T PF01431_consen    1 NAYYSPRFNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEGIN   54 (206)
T ss_dssp             --EEETTTTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTGGG
T ss_pred             CCCCCcccCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhHhh
Confidence            6666654 5688888887642  1       23568899999999998776653


No 52 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=63.58  E-value=13  Score=41.64  Aligned_cols=49  Identities=14%  Similarity=0.164  Sum_probs=33.4

Q ss_pred             eEEEEecCCcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          252 PITIIDDPLINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      .++-.+....-|||.+   ++..+|.-.....+.|    ..+|+||+||..|....
T Consensus       346 D~~~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~d----V~TLaHElGHs~Hs~~s  397 (598)
T COG1164         346 DVYPRKGKRSGAYSIGFYKGDHPFILMNYDGTLRD----VFTLAHELGHSVHSYFS  397 (598)
T ss_pred             eccCCCCCCCCcccCCCCCCCCCeEEEeCCCchhH----HHHHHHHccHHHHHHHH
Confidence            4555556688899975   3556665544444433    57999999999887554


No 53 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=62.97  E-value=6.9  Score=35.86  Aligned_cols=30  Identities=27%  Similarity=0.380  Sum_probs=24.8

Q ss_pred             eEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768          270 RIFMFTGMFQLCQTDDELATVLSHELSHTLL  300 (475)
Q Consensus       270 ~IvV~~GLL~~l~~edELaaVLAHElgHv~~  300 (475)
                      .|=++--|+..- .+|.+..|+.||++|+..
T Consensus        45 ~I~lNP~ll~en-~~~f~~~vV~HELaHl~l   74 (156)
T COG3091          45 EIRLNPKLLEEN-GEDFIEQVVPHELAHLHL   74 (156)
T ss_pred             ccccCHHHHHHc-cHHHHHHHHHHHHHHHHH
Confidence            677778887775 579999999999999843


No 54 
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=62.16  E-value=6.1  Score=40.89  Aligned_cols=48  Identities=23%  Similarity=0.199  Sum_probs=33.0

Q ss_pred             CcceeEeCCCeEEEehhhhhhc--CCHHHHHHHHHHHHHhhh---cccHHHHHHHH
Q psy768          260 LINAFVFPDGRIFMFTGMFQLC--QTDDELATVLSHELSHTL---LKHVAEKLSNK  310 (475)
Q Consensus       260 ~~NAFalpgG~IvV~~GLL~~l--~~edELaaVLAHElgHv~---~rH~~~~~~~~  310 (475)
                      .+-||++-|   +..|.|+..+  -++.+||.+|-|||+|-.   .+|+.-+-+..
T Consensus       139 gV~AYSTLG---wF~DPlLSt~l~~~~~~LA~LIfHELaHq~~Yv~~dt~FNEsfA  191 (337)
T PF10023_consen  139 GVPAYSTLG---WFDDPLLSTMLRYPDGELARLIFHELAHQTLYVKGDTAFNESFA  191 (337)
T ss_pred             ccccccccc---ccCCcccccccCCCchHHHHHHHHHHhhceeecCCCchhhHHHH
Confidence            566777654   4467777665  367899999999999964   56666444333


No 55 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=62.10  E-value=5.7  Score=38.09  Aligned_cols=47  Identities=30%  Similarity=0.426  Sum_probs=31.5

Q ss_pred             EEEEecC-CcceeEe----C------CCeEEEehhhhhhcCCH------HHHHHHHHHHHHhhh
Q psy768          253 ITIIDDP-LINAFVF----P------DGRIFMFTGMFQLCQTD------DELATVLSHELSHTL  299 (475)
Q Consensus       253 v~Vv~~~-~~NAFal----p------gG~IvV~~GLL~~l~~e------dELaaVLAHElgHv~  299 (475)
                      |..++++ ..+|+.-    |      +|.|.++.+.+....+.      +-+..+++||+||..
T Consensus        65 ir~~~~~~~~~~~gps~~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~l  128 (197)
T cd04276          65 IRWIHSPNGGWAYGPSVVDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTL  128 (197)
T ss_pred             EEEEecCCCcceecccccCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHh
Confidence            4455665 5566642    2      23578988887664322      458899999999985


No 56 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=62.07  E-value=4.7  Score=37.84  Aligned_cols=16  Identities=38%  Similarity=0.459  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHHHhhh
Q psy768          284 DDELATVLSHELSHTL  299 (475)
Q Consensus       284 edELaaVLAHElgHv~  299 (475)
                      ....+.++||||||..
T Consensus       128 ~~~~a~~~AHElGH~l  143 (194)
T cd04269         128 LLLFAVTMAHELGHNL  143 (194)
T ss_pred             hHHHHHHHHHHHHhhc
Confidence            4578999999999986


No 57 
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=60.87  E-value=13  Score=34.13  Aligned_cols=49  Identities=22%  Similarity=0.261  Sum_probs=26.3

Q ss_pred             cceeEEEE-ecCCcceeEeCCCeEEEehhhhhhcCCHHHHHH---HHHHHHHhhhcc
Q psy768          249 FKYPITII-DDPLINAFVFPDGRIFMFTGMFQLCQTDDELAT---VLSHELSHTLLK  301 (475)
Q Consensus       249 ~~~~v~Vv-~~~~~NAFalpgG~IvV~~GLL~~l~~edELaa---VLAHElgHv~~r  301 (475)
                      .++...|- ...-.|||=- |..++.-+|-=..+.   .+++   |+||||+|-+-.
T Consensus        97 ~~~~~~Vhyg~~~~NAfW~-g~~m~yGdG~~~~f~---~~~~~lDVvaHEltHGVte  149 (150)
T PF01447_consen   97 MPIISRVHYGKNYNNAFWN-GSQMVYGDGDGQIFK---PFASSLDVVAHELTHGVTE  149 (150)
T ss_dssp             S-EEEEESESSSTT-EEE--SSSEEEE---SSSBS----GGG-HHHHHHHHHHHHHH
T ss_pred             cEEEEEEeCCCCccCcccc-CCEEEEECCCCcccc---cCccccceeeecccccccc
Confidence            44444442 3447899984 457888877643332   2333   999999997643


No 58 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=58.94  E-value=5.4  Score=37.40  Aligned_cols=17  Identities=29%  Similarity=0.344  Sum_probs=14.2

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy768          284 DDELATVLSHELSHTLL  300 (475)
Q Consensus       284 edELaaVLAHElgHv~~  300 (475)
                      ...-.-|+||||||...
T Consensus       139 ~~~~~~~~AHEiGH~lG  155 (196)
T PF13688_consen  139 TYNGAITFAHEIGHNLG  155 (196)
T ss_dssp             HHHHHHHHHHHHHHHTT
T ss_pred             CCceehhhHHhHHHhcC
Confidence            46778899999999864


No 59 
>PRK06926 flagellar motor protein MotP; Reviewed
Probab=58.47  E-value=2.1e+02  Score=28.87  Aligned_cols=35  Identities=11%  Similarity=0.264  Sum_probs=26.7

Q ss_pred             EEehhhhhhc--CCHHHHHHHHHHHHHhhhcccHHHH
Q psy768          272 FMFTGMFQLC--QTDDELATVLSHELSHTLLKHVAEK  306 (475)
Q Consensus       272 vV~~GLL~~l--~~edELaaVLAHElgHv~~rH~~~~  306 (475)
                      |+.+|+--..  .+++++..+|-+|+.....+|....
T Consensus       112 Fl~~gl~lvvDG~~~~~i~~iLe~ei~~~~~r~~~~~  148 (271)
T PRK06926        112 FIKKGLLLAIDGWEPETIRDIMMAEIAAMEERHRKGR  148 (271)
T ss_pred             HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHhHH
Confidence            5666663332  4789999999999999999987643


No 60 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=58.41  E-value=7.7  Score=36.59  Aligned_cols=17  Identities=35%  Similarity=0.661  Sum_probs=14.4

Q ss_pred             CHHHHHHHHHHHHHhhh
Q psy768          283 TDDELATVLSHELSHTL  299 (475)
Q Consensus       283 ~edELaaVLAHElgHv~  299 (475)
                      +...++.++||||||..
T Consensus       127 ~~~~~a~~~AHelGH~l  143 (199)
T PF01421_consen  127 SGLSFAVIIAHELGHNL  143 (199)
T ss_dssp             SHHHHHHHHHHHHHHHT
T ss_pred             hhHHHHHHHHHHHHHhc
Confidence            45688999999999974


No 61 
>PF14891 Peptidase_M91:  Effector protein
Probab=57.94  E-value=5.7  Score=37.07  Aligned_cols=15  Identities=40%  Similarity=0.412  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHhhh
Q psy768          285 DELATVLSHELSHTL  299 (475)
Q Consensus       285 dELaaVLAHElgHv~  299 (475)
                      ..=..||+|||+|..
T Consensus       101 ~~p~v~L~HEL~HA~  115 (174)
T PF14891_consen  101 RPPFVVLYHELIHAY  115 (174)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            455789999999983


No 62 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=57.47  E-value=5.4  Score=34.37  Aligned_cols=12  Identities=33%  Similarity=0.736  Sum_probs=10.4

Q ss_pred             HHHHHHHHHhhh
Q psy768          288 ATVLSHELSHTL  299 (475)
Q Consensus       288 aaVLAHElgHv~  299 (475)
                      ..+++|||||..
T Consensus       108 ~~~~~HEiGH~l  119 (124)
T PF13582_consen  108 VDTFAHEIGHNL  119 (124)
T ss_dssp             TTHHHHHHHHHT
T ss_pred             ceEeeehhhHhc
Confidence            389999999975


No 63 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=56.93  E-value=4.3  Score=38.04  Aligned_cols=15  Identities=40%  Similarity=0.476  Sum_probs=13.2

Q ss_pred             HHHHHHHHHHHHhhh
Q psy768          285 DELATVLSHELSHTL  299 (475)
Q Consensus       285 dELaaVLAHElgHv~  299 (475)
                      -..+.++|||+||..
T Consensus       131 ~~~~~~~aHElGH~l  145 (192)
T cd04267         131 LLTALTMAHELGHNL  145 (192)
T ss_pred             eeehhhhhhhHHhhc
Confidence            467889999999986


No 64 
>COG1291 MotA Flagellar motor component [Cell motility and secretion]
Probab=55.57  E-value=2.1e+02  Score=28.76  Aligned_cols=110  Identities=15%  Similarity=0.071  Sum_probs=55.9

Q ss_pred             CHHHHHHHHHHHHHhhhcccHHHHHHHHHHH------HHHHHHHHHHHH-HhccchHHHHHHHHHHHHHHHHhhcchhhh
Q psy768          283 TDDELATVLSHELSHTLLKHVAEKLSNKTFL------EILYIVPLMIIW-FLLPDLGAIVTQVLLDDLKSIIFELPFERE  355 (475)
Q Consensus       283 ~edELaaVLAHElgHv~~rH~~~~~~~~~l~------~~l~~~~~~~i~-~~~~~~~~~l~~~~~~~l~~~l~~lp~~~~  355 (475)
                      +.+++.+++=-|+.+...+|.........+.      .+++.+++.+.. ....+ -..+|..+...+...+.       
T Consensus       121 ~~~~I~~~me~Ei~~~ee~~~~~a~~~~~~g~~aPa~GivgaV~GlI~~l~~l~~-p~~LG~~iA~Alv~T~~-------  192 (266)
T COG1291         121 DPEEIEALMEEEIETMEERHEKPAHAFTTAGDYAPAFGIVGAVMGLIHALGNLDD-PAELGALIAAALVGTLY-------  192 (266)
T ss_pred             CHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhCchhhHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHHHHH-------
Confidence            5899999999999999999877544333322      223332222111 11112 23344443333322210       


Q ss_pred             hhhhhhhhhhhccccchhhhHHHHHHHH--------HHHHHcCCCchhHHHHHHHH
Q psy768          356 METEADESIIFELPFEREMETEADEVGL--------KLMARACYDVRVAPLFWQKM  403 (475)
Q Consensus       356 l~~~~~~~~~~~~~~SR~~E~EAD~~gl--------~ll~~AGydp~a~~~~l~kl  403 (475)
                         +...+..+..|+.-+.+...|.--.        -+....|.+|+-+.+.+.++
T Consensus       193 ---Gi~~ay~~~~P~a~kLk~~~~~e~~~~~~i~e~ll~i~~G~nP~i~~e~l~~~  245 (266)
T COG1291         193 ---GIFLAYGLFGPLANKLKQKSDEEVKLKEIIIEGLLAIQNGENPRILEEKLRKF  245 (266)
T ss_pred             ---HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcHHHHHHHHhh
Confidence               1111112445555555555554211        13556789998777766544


No 65 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=55.23  E-value=6  Score=35.93  Aligned_cols=32  Identities=25%  Similarity=0.293  Sum_probs=20.4

Q ss_pred             CCeEEEehhhhhhcC---CHHHHHHHHHHHHHhhh
Q psy768          268 DGRIFMFTGMFQLCQ---TDDELATVLSHELSHTL  299 (475)
Q Consensus       268 gG~IvV~~GLL~~l~---~edELaaVLAHElgHv~  299 (475)
                      .|.|.+.+.-.-...   ....+..|+.|||||..
T Consensus        85 ~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaL  119 (157)
T cd04278          85 GGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHAL  119 (157)
T ss_pred             ceeEEECCCcccccCCCCccchHHHHHHHHhcccc
Confidence            566776654321111   22469999999999984


No 66 
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=54.61  E-value=38  Score=31.87  Aligned_cols=36  Identities=22%  Similarity=0.316  Sum_probs=24.6

Q ss_pred             cceeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhh
Q psy768          261 INAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTL  299 (475)
Q Consensus       261 ~NAFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~  299 (475)
                      .-.|.-+...|+|-.   +.+.+.+++.-+|+|||-|..
T Consensus        48 ~Ggf~p~~~~I~lC~---N~~~~~~~l~~~l~HELIHay   83 (173)
T PF09768_consen   48 SGGFDPSKKGIVLCQ---NRIRSQGHLEDTLTHELIHAY   83 (173)
T ss_pred             cCCccCCCCCEEEee---CCCCCHHHHHHHHHHHHHHHH
Confidence            334432244566654   334788999999999999984


No 67 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=54.29  E-value=5.7  Score=35.94  Aligned_cols=15  Identities=47%  Similarity=0.680  Sum_probs=13.4

Q ss_pred             HHHHHHHHHHHHhhh
Q psy768          285 DELATVLSHELSHTL  299 (475)
Q Consensus       285 dELaaVLAHElgHv~  299 (475)
                      .....+++|||||..
T Consensus        94 ~~~~~~~~HElGH~L  108 (167)
T cd00203          94 KEGAQTIAHELGHAL  108 (167)
T ss_pred             ccchhhHHHHHHHHh
Confidence            478999999999986


No 68 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=53.95  E-value=9.9  Score=35.49  Aligned_cols=36  Identities=19%  Similarity=0.226  Sum_probs=23.0

Q ss_pred             CeEEEehhhhhhc-CCHHHHHHHHHHHHHhhhc-ccHH
Q psy768          269 GRIFMFTGMFQLC-QTDDELATVLSHELSHTLL-KHVA  304 (475)
Q Consensus       269 G~IvV~~GLL~~l-~~edELaaVLAHElgHv~~-rH~~  304 (475)
                      |.|++........ ..-...-.|+.|||||.+- +|.-
T Consensus        94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~~  131 (186)
T cd04277          94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHPG  131 (186)
T ss_pred             ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCCC
Confidence            4677776654432 1124678999999999863 4443


No 69 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=53.77  E-value=6.6  Score=36.57  Aligned_cols=15  Identities=33%  Similarity=0.381  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHhhhcc
Q psy768          287 LATVLSHELSHTLLK  301 (475)
Q Consensus       287 LaaVLAHElgHv~~r  301 (475)
                      -.-++||||||..--
T Consensus       111 ~~~~~aHElGH~lGa  125 (173)
T PF13574_consen  111 GIDTFAHELGHQLGA  125 (173)
T ss_dssp             HHHHHHHHHHHHHT-
T ss_pred             eeeeehhhhHhhcCC
Confidence            355799999998654


No 70 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=53.11  E-value=7.6  Score=37.42  Aligned_cols=14  Identities=36%  Similarity=0.449  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHhhh
Q psy768          286 ELATVLSHELSHTL  299 (475)
Q Consensus       286 ELaaVLAHElgHv~  299 (475)
                      ..+.++||||||..
T Consensus       144 ~~~~~~AHElGH~l  157 (220)
T cd04272         144 YGVYTMTHELAHLL  157 (220)
T ss_pred             ccHHHHHHHHHHHh
Confidence            45899999999987


No 71 
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=51.95  E-value=12  Score=41.82  Aligned_cols=37  Identities=22%  Similarity=0.332  Sum_probs=25.1

Q ss_pred             eeEeCCCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          263 AFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       263 AFalpgG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      |.--|| .+|.+..++..   +.+++.|+||||+|---+..
T Consensus       259 gMEN~~-ltf~~~~ll~~---d~s~~~viaHElAHqWfGNl  295 (601)
T TIGR02411       259 GMENPN-LTFATPTLIAG---DRSNVDVIAHELAHSWSGNL  295 (601)
T ss_pred             cccccc-ceeeccccccC---ChhhhhhHHHHHHhhccCce
Confidence            333343 56777777632   34577899999999877744


No 72 
>KOG3624|consensus
Probab=50.41  E-value=12  Score=42.58  Aligned_cols=46  Identities=26%  Similarity=0.402  Sum_probs=34.5

Q ss_pred             CcceeEeCC-CeEEEehhhhhh----c-----CCHHHHHHHHHHHHHhhhcccHHH
Q psy768          260 LINAFVFPD-GRIFMFTGMFQL----C-----QTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       260 ~~NAFalpg-G~IvV~~GLL~~----l-----~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      .+|||-.|. ..|+|--|++..    .     -+-.-+.+||||||+|.--.++..
T Consensus       481 ~~na~Y~~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~G~~  536 (687)
T KOG3624|consen  481 QVNAFYSPEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQGRQ  536 (687)
T ss_pred             eeeccccCCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhcccccccc
Confidence            789988775 468888888875    1     134568899999999987666543


No 73 
>KOG1047|consensus
Probab=48.79  E-value=11  Score=41.31  Aligned_cols=32  Identities=22%  Similarity=0.416  Sum_probs=24.9

Q ss_pred             eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          270 RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       270 ~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      ..||+..||..   +.-|+-||||||+|---+...
T Consensus       274 ltF~TpTllaG---Drsl~~vIaHEIAHSWtGNlV  305 (613)
T KOG1047|consen  274 LTFVTPTLLAG---DRSLVDVIAHEIAHSWTGNLV  305 (613)
T ss_pred             eeeecchhhcC---CcchhhHHHHHhhhhhccccc
Confidence            38899999755   356899999999997666443


No 74 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=48.47  E-value=6.1  Score=38.63  Aligned_cols=12  Identities=42%  Similarity=0.706  Sum_probs=10.6

Q ss_pred             HHHHHHHHhhhc
Q psy768          289 TVLSHELSHTLL  300 (475)
Q Consensus       289 aVLAHElgHv~~  300 (475)
                      .++||||||...
T Consensus       147 ~t~AHElGHnLG  158 (228)
T cd04271         147 QVFAHEIGHTFG  158 (228)
T ss_pred             eehhhhhhhhcC
Confidence            699999999974


No 75 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=48.42  E-value=7.5  Score=43.20  Aligned_cols=49  Identities=14%  Similarity=0.175  Sum_probs=28.4

Q ss_pred             eEEEEecCCcceeEeCCC--eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          252 PITIIDDPLINAFVFPDG--RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalpgG--~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      .++-.++....||+.+.+  .-+|   |++.-++-++ ..+|+||+||..|....
T Consensus       345 D~~~r~gK~~Ga~~~~~~~~~p~i---l~N~~~~~~d-v~TLaHElGHa~H~~~~  395 (591)
T TIGR00181       345 DYAENKGKRSGAYSIGGYKVKPYI---LMNWDGTLNS-VFTLAHELGHSMHSYFS  395 (591)
T ss_pred             eecCCCCCCCCcccCCCCCCCCeE---EEecCCCcch-HHHHHHHhhhHHHHHHH
Confidence            444445557789997532  2222   1122222222 56899999999877544


No 76 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=48.26  E-value=11  Score=39.63  Aligned_cols=49  Identities=18%  Similarity=0.230  Sum_probs=28.8

Q ss_pred             eEEEEecCCcceeEeC---CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          252 PITIIDDPLINAFVFP---DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalp---gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      .++..++...-||+.+   ++..+|....-.   +-++ ...|+||+||..|....
T Consensus       188 D~~~r~gK~~gaf~~~~~~~~~p~i~~n~~~---~~~~-v~tl~HE~GHa~h~~~~  239 (427)
T cd06459         188 DVEPRKGKRSGAYCTGLPPGKHPFILMNFNG---TLDD-VFTLAHELGHAFHSYLS  239 (427)
T ss_pred             eccCCCCCCCCeecCCCCCCCCCeEEecCCC---Chhh-HHHHHHHhhHHHHHHHH
Confidence            3443344456788865   334555433322   3334 56799999998776544


No 77 
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=45.94  E-value=27  Score=36.07  Aligned_cols=52  Identities=19%  Similarity=0.236  Sum_probs=28.7

Q ss_pred             eeEEEEecCCcceeEeCCCeEEEehhh-hhh--cC---CHHHHHHHHHHHHHhhhcccH
Q psy768          251 YPITIIDDPLINAFVFPDGRIFMFTGM-FQL--CQ---TDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       251 ~~v~Vv~~~~~NAFalpgG~IvV~~GL-L~~--l~---~edELaaVLAHElgHv~~rH~  303 (475)
                      +.+.++.+-..++..-+ |.|++.... +-.  ..   +..++..++||||+|.--++.
T Consensus       254 ~~~v~~p~~~~~~me~~-g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWfGn~  311 (390)
T PF01433_consen  254 LDIVAVPDFPFGGMENW-GLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWFGNL  311 (390)
T ss_dssp             EEEEEEST-SSSEE--T-TEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTBTTT
T ss_pred             eeEEEEecccccccccc-ccccccccccccCcccccchhhhhhHHHHHHHHHHHHhccC
Confidence            34444443234566555 445555443 322  11   235789999999999977764


No 78 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=45.78  E-value=13  Score=32.72  Aligned_cols=30  Identities=27%  Similarity=0.610  Sum_probs=21.5

Q ss_pred             eEEEehh-hhhhcCC-H----HHHHHHHHHHHHhhh
Q psy768          270 RIFMFTG-MFQLCQT-D----DELATVLSHELSHTL  299 (475)
Q Consensus       270 ~IvV~~G-LL~~l~~-e----dELaaVLAHElgHv~  299 (475)
                      +|.+|.+ |++.+.+ +    |++.-||-|||||.-
T Consensus        86 rItlYRrailDywae~eetlgd~vthvliHEIgHhF  121 (136)
T COG3824          86 RITLYRRALLDYWAENEETLGDQVTHVLIHEIGHHF  121 (136)
T ss_pred             eeeeeHHHHHHHHhhhhhhHhhHhhhhhhhhhhhhc
Confidence            6888876 5666643 3    467888999999863


No 79 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=44.47  E-value=11  Score=36.22  Aligned_cols=16  Identities=31%  Similarity=0.532  Sum_probs=11.5

Q ss_pred             HHHHHHHHhhhc-ccHH
Q psy768          289 TVLSHELSHTLL-KHVA  304 (475)
Q Consensus       289 aVLAHElgHv~~-rH~~  304 (475)
                      -+++|||||... +|..
T Consensus       139 ~~~aHEiGH~lGl~H~~  155 (206)
T PF13583_consen  139 QTFAHEIGHNLGLRHDF  155 (206)
T ss_pred             hHHHHHHHHHhcCCCCc
Confidence            459999999864 3443


No 80 
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=43.74  E-value=25  Score=38.67  Aligned_cols=54  Identities=22%  Similarity=0.271  Sum_probs=28.4

Q ss_pred             CCCcceeEEEEecCC---cceeEeC-----CC-----eEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768          246 HNNFKYPITIIDDPL---INAFVFP-----DG-----RIFMFTGMFQLCQTDDELATVLSHELSHTLL  300 (475)
Q Consensus       246 ~~~~~~~v~Vv~~~~---~NAFalp-----gG-----~IvV~~GLL~~l~~edELaaVLAHElgHv~~  300 (475)
                      +.+.++-|||...+.   .=|+|.+     +|     .|.+....+... ..+...-|+.|||+|+..
T Consensus       157 v~~~D~vlyV~~~~~~~~~~A~A~~C~~~~~~RP~~G~in~~p~~i~~~-~~~~~~~~~~HEi~HaLG  223 (521)
T PF01457_consen  157 VPNADFVLYVTARPSSSSTLAWAAPCQQDSDGRPIAGVININPSYIPSF-YFQEFFRTVIHEIAHALG  223 (521)
T ss_dssp             EES-SEEEEEE----STT---EEEEEEE-TTS-EEEEEEE--GGG---S---HHHHHHHHHHHHHHTT
T ss_pred             cCCccEEEEEEEeeccCCceEEEeehhhccCCCCeeEEEEEchhHccch-hhhcccceeeeeeeeeee
Confidence            345678888876552   2366643     33     477777666654 357788899999999975


No 81 
>PF12652 CotJB:  CotJB protein;  InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=43.58  E-value=57  Score=26.61  Aligned_cols=39  Identities=26%  Similarity=0.385  Sum_probs=26.5

Q ss_pred             ccccCCCCChHHHHHHHHHHHHHHHHHHHh--cCCCCCCCC
Q psy768          417 EEYLSTHPSHENRANNLESKMKEALDIRKE--CNCLPLGPL  455 (475)
Q Consensus       417 ~~~lsTHP~~~~RI~~L~~~i~~a~~~~~~--~~c~~~~~~  455 (475)
                      .-|+.|||...+=++..++...+-.++.+.  ..+++|--.
T Consensus        19 ~LyLDTHP~d~~Al~~y~~~~~~~~~l~~~Ye~~yGPLt~~   59 (78)
T PF12652_consen   19 NLYLDTHPDDQEALEYYNEYSKQRKQLKKEYEKRYGPLTNF   59 (78)
T ss_pred             HHHhcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC
Confidence            457899999999999888776655444442  255555433


No 82 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=42.52  E-value=37  Score=28.84  Aligned_cols=33  Identities=18%  Similarity=0.380  Sum_probs=21.7

Q ss_pred             CeEEEehhhh-hhcCCHH----HHHHHHHHHHHhhhcc
Q psy768          269 GRIFMFTGMF-QLCQTDD----ELATVLSHELSHTLLK  301 (475)
Q Consensus       269 G~IvV~~GLL-~~l~~ed----ELaaVLAHElgHv~~r  301 (475)
                      -+|+||..=| ..+.+.+    ++.-|+-||+||+..-
T Consensus        50 ~rI~lyR~pl~~~~~~~~eL~~~I~~tlvhEiah~fG~   87 (97)
T PF06262_consen   50 DRIVLYRRPLERRARSREELAELIRDTLVHEIAHHFGI   87 (97)
T ss_dssp             EEEEEEHHHHHHT-SSHHHHHHHHHHHHHHHHHHHTT-
T ss_pred             CEEEEehHHHHHHhCCHHHHHHHHHHHHHHHHHHHcCC
Confidence            3688888654 4566655    4566888999988653


No 83 
>PF13150 DUF3989:  Protein of unknown function (DUF3989)
Probab=41.51  E-value=48  Score=27.51  Aligned_cols=18  Identities=28%  Similarity=0.503  Sum_probs=13.8

Q ss_pred             hhHHHHHHhcCchhhHHH
Q psy768          133 RISRKWWSKLSPDQKKII  150 (475)
Q Consensus       133 ~~~~~w~~~l~~~~r~~~  150 (475)
                      ..+|.+|++|||.+|...
T Consensus        13 ~~Lr~~c~~Lsp~~R~~v   30 (85)
T PF13150_consen   13 DRLRRYCGRLSPKQRLRV   30 (85)
T ss_pred             HHHHHHHhcCCHHHHHHH
Confidence            457888899999887554


No 84 
>PF06861 BALF1:  BALF1 protein;  InterPro: IPR010677  Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members []. 
Probab=41.15  E-value=18  Score=33.77  Aligned_cols=28  Identities=29%  Similarity=0.294  Sum_probs=23.2

Q ss_pred             hhcCCHHHHHHHHHHHHHhhhcccHHHH
Q psy768          279 QLCQTDDELATVLSHELSHTLLKHVAEK  306 (475)
Q Consensus       279 ~~l~~edELaaVLAHElgHv~~rH~~~~  306 (475)
                      ..+.|+.|=++|++|++||...+|-.--
T Consensus       124 ~~l~~d~e~~s~v~~~lA~Fy~~~r~~W  151 (182)
T PF06861_consen  124 RNLLNDHENASLVSHALAHFYLRYRRAW  151 (182)
T ss_pred             HHhcCchHHHHHHHHHHHHHHHHHHHHH
Confidence            3456789999999999999999987643


No 85 
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=41.15  E-value=14  Score=35.60  Aligned_cols=26  Identities=31%  Similarity=0.535  Sum_probs=18.7

Q ss_pred             EEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768          271 IFMFTGMFQLCQTDDELATVLSHELSHTLLK  301 (475)
Q Consensus       271 IvV~~GLL~~l~~edELaaVLAHElgHv~~r  301 (475)
                      |.|-.||=.     .-..+|||||+.|..++
T Consensus        82 Ilvl~GLPr-----ll~gsiLAHE~mHa~Lr  107 (212)
T PF12315_consen   82 ILVLYGLPR-----LLTGSILAHELMHAWLR  107 (212)
T ss_pred             EEEECCCCH-----HHHhhHHHHHHHHHHhc
Confidence            566666632     33478999999999875


No 86 
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.65  E-value=30  Score=35.58  Aligned_cols=37  Identities=24%  Similarity=0.492  Sum_probs=32.2

Q ss_pred             CCeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768          268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       268 gG~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      +-+++.+-..+..+ ..+++.+.|-||+=|+.++|+-.
T Consensus        51 ~~~~y~NPei~~~~-p~~~~~aLl~HEV~Hi~l~Hi~r   87 (396)
T COG3864          51 YFTMYFNPEIFLNC-PISEMKALLKHEVYHIMLNHIKR   87 (396)
T ss_pred             ceEEEeCHHHHccC-CHHHHHHHHHHHHHHHHHHHHHh
Confidence            33688888888888 57999999999999999999974


No 87 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.50  E-value=22  Score=33.67  Aligned_cols=17  Identities=35%  Similarity=0.468  Sum_probs=13.5

Q ss_pred             HHHHHHHHHHHHhhhcc
Q psy768          285 DELATVLSHELSHTLLK  301 (475)
Q Consensus       285 dELaaVLAHElgHv~~r  301 (475)
                      ....+|+.|||||.+.=
T Consensus        90 ~~~~~~i~HElgHaLG~  106 (198)
T cd04327          90 PEFSRVVLHEFGHALGF  106 (198)
T ss_pred             hhHHHHHHHHHHHHhcC
Confidence            35668999999998754


No 88 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.44  E-value=4.4  Score=38.66  Aligned_cols=14  Identities=43%  Similarity=0.624  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHhhh
Q psy768          286 ELATVLSHELSHTL  299 (475)
Q Consensus       286 ELaaVLAHElgHv~  299 (475)
                      ..+.++||||||..
T Consensus       139 ~~a~~~aHElGH~L  152 (207)
T cd04273         139 SSAFTIAHELGHVL  152 (207)
T ss_pred             eeEEeeeeechhhc
Confidence            35889999999986


No 89 
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=36.33  E-value=22  Score=33.15  Aligned_cols=15  Identities=40%  Similarity=0.678  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHhhhcc
Q psy768          287 LATVLSHELSHTLLK  301 (475)
Q Consensus       287 LaaVLAHElgHv~~r  301 (475)
                      +.+++.||+||...-
T Consensus         7 ~i~i~~HE~gH~~~a   21 (192)
T PF02163_consen    7 LISIVLHELGHALAA   21 (192)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             ccccccccccccccc
Confidence            467899999998654


No 90 
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=35.80  E-value=47  Score=30.16  Aligned_cols=39  Identities=31%  Similarity=0.496  Sum_probs=26.2

Q ss_pred             CcceeEeCCC---eEEEehhhhhhc-CCHHHHHHHHHHHHHhh
Q psy768          260 LINAFVFPDG---RIFMFTGMFQLC-QTDDELATVLSHELSHT  298 (475)
Q Consensus       260 ~~NAFalpgG---~IvV~~GLL~~l-~~edELaaVLAHElgHv  298 (475)
                      ..-|++.|+.   .|+|-......- ...+-=+++|.||+.|.
T Consensus        65 ~~~a~~~~~~~~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf  107 (148)
T PF14521_consen   65 NTYAYVYPDSPTYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF  107 (148)
T ss_dssp             SSSEEE-TTST-TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred             ceEEEEECCCCceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence            5567778876   499988887742 23355689999999993


No 91 
>TIGR03818 MotA1 flagellar motor stator protein MotA. This model represents one family of MotA proteins which are often not identified by the "transporter, MotA/TolQ/ExbB proton channel family" model, pfam01618.
Probab=35.44  E-value=4.9e+02  Score=26.37  Aligned_cols=34  Identities=12%  Similarity=0.153  Sum_probs=23.7

Q ss_pred             EEehhh---hhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768          272 FMFTGM---FQLCQTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       272 vV~~GL---L~~l~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      |+-+|+   ++...+++++..+|--|+-....+|...
T Consensus       125 Fl~~gl~l~vdG~~~~~~i~~iLe~ei~~~~~~~~~~  161 (282)
T TIGR03818       125 FICDYLRLMVMGNMNPHELEALMEEEIETHHHELLKP  161 (282)
T ss_pred             HHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            444554   2333378999999999998887777653


No 92 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=35.32  E-value=19  Score=39.84  Aligned_cols=49  Identities=16%  Similarity=0.213  Sum_probs=28.7

Q ss_pred             eEEEEecCCcceeEeC--C-CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHH
Q psy768          252 PITIIDDPLINAFVFP--D-GRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalp--g-G~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      .++-.++....|||.+  . +.-+|.   ++.-.+-+. ..+|+||+||..|....
T Consensus       303 D~~~r~gK~~Gayc~~~~~~~~P~I~---~Nf~~t~~d-v~TL~HElGHa~H~~~s  354 (549)
T TIGR02289       303 DLESRKGKAAGGYCTYLPKYKAPFIF---SNFNGTSGD-IDVLTHEAGHAFHVYES  354 (549)
T ss_pred             ccCCCCCCCCCcccCCCCCCCCcEEE---EeCCCChhH-HHHHHHHhhHHHHHHHh
Confidence            3444445577899865  2 333442   222233334 45899999999877544


No 93 
>PRK12482 flagellar motor protein MotA; Provisional
Probab=34.29  E-value=5.2e+02  Score=26.33  Aligned_cols=25  Identities=20%  Similarity=0.234  Sum_probs=21.4

Q ss_pred             CCHHHHHHHHHHHHHhhhcccHHHH
Q psy768          282 QTDDELATVLSHELSHTLLKHVAEK  306 (475)
Q Consensus       282 ~~edELaaVLAHElgHv~~rH~~~~  306 (475)
                      .+++++..+|-.|+.....+|....
T Consensus       138 ~d~~~i~~iLe~eie~~~~r~~~~a  162 (287)
T PRK12482        138 INQHELEGILDQELDAVEEELLQPS  162 (287)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHhHH
Confidence            4789999999999999998887643


No 94 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=33.54  E-value=25  Score=33.70  Aligned_cols=15  Identities=47%  Similarity=0.693  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHhhhc
Q psy768          286 ELATVLSHELSHTLL  300 (475)
Q Consensus       286 ELaaVLAHElgHv~~  300 (475)
                      =+.+|+.||+||...
T Consensus        37 l~~~v~iHElgH~~~   51 (208)
T cd06161          37 LFLSVLLHELGHALV   51 (208)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            467899999999864


No 95 
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.40  E-value=61  Score=31.24  Aligned_cols=22  Identities=27%  Similarity=0.270  Sum_probs=14.1

Q ss_pred             cccchHHHHHHHHHHHHHHHHh
Q psy768          157 HQDKIAGITLTSIGLAYIYYFL  178 (475)
Q Consensus       157 ~~~~~~~~~~~~~~~~~~~~~~  178 (475)
                      .+|.++|++++++++|+--|+.
T Consensus        22 Gk~li~gviLg~~~lfGW~ywq   43 (207)
T COG2976          22 GKALIVGVILGLGGLFGWRYWQ   43 (207)
T ss_pred             CchhHHHHHHHHHHHHHHHHHH
Confidence            3677777777777776654443


No 96 
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=30.75  E-value=52  Score=38.39  Aligned_cols=36  Identities=19%  Similarity=0.362  Sum_probs=23.1

Q ss_pred             CeEEEehhhhhhc-CC---HHHHHHHHHHHHHhhhcccHH
Q psy768          269 GRIFMFTGMFQLC-QT---DDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       269 G~IvV~~GLL~~l-~~---edELaaVLAHElgHv~~rH~~  304 (475)
                      |.|.+...++..- .+   ...++.|++|||+|---++..
T Consensus       265 Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlV  304 (831)
T TIGR02412       265 GCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLV  304 (831)
T ss_pred             ceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEe
Confidence            4566666554321 11   235789999999998777543


No 97 
>PTZ00337 surface protease GP63; Provisional
Probab=30.46  E-value=78  Score=35.31  Aligned_cols=53  Identities=19%  Similarity=0.135  Sum_probs=33.9

Q ss_pred             cCCCcceeEEEEecCC--cceeEeC-----C-----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhc
Q psy768          245 EHNNFKYPITIIDDPL--INAFVFP-----D-----GRIFMFTGMFQLCQTDDELATVLSHELSHTLL  300 (475)
Q Consensus       245 ~~~~~~~~v~Vv~~~~--~NAFalp-----g-----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~  300 (475)
                      ++.+.++.++|.-.|.  .=|+|.+     .     |.|.++...+   .+.++.-.|+.|||+|+..
T Consensus       178 Gv~naD~vLyV~a~p~~~~~A~A~~C~~~~~~RPi~G~in~np~~i---~~~~~~~~v~~HEi~HALG  242 (567)
T PTZ00337        178 GVAGADMILYANIFPTSGPAAWAIPCFLLDDGRPFAAAVNFDPRQI---AVTNGDVRVAAHELGHALG  242 (567)
T ss_pred             CccccceEEEEEeccCCCcceEEEEcccccCCCceEEEEEECHHHc---cchhHHHHHHHHHHHHHHc
Confidence            4456678888865442  2366654     2     3466665444   3346778899999999974


No 98 
>PRK08990 flagellar motor protein PomA; Reviewed
Probab=30.13  E-value=5.6e+02  Score=25.48  Aligned_cols=24  Identities=17%  Similarity=0.266  Sum_probs=20.2

Q ss_pred             CCHHHHHHHHHHHHHhhhcccHHH
Q psy768          282 QTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       282 ~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      .+++++..+|-.|+.....+|...
T Consensus       116 ~~~~~i~~~l~~~~~~~~~~~~~~  139 (254)
T PRK08990        116 HDGDVVRAALEKDIALTEERHETG  139 (254)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHhH
Confidence            478999999999999988887653


No 99 
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=29.72  E-value=44  Score=39.13  Aligned_cols=56  Identities=20%  Similarity=0.363  Sum_probs=32.9

Q ss_pred             cceeEEEEecCCcceeEeCC-CeEEEehhh-hhh---cCC--HHHHHHHHHHHHHhhhcccHH
Q psy768          249 FKYPITIIDDPLINAFVFPD-GRIFMFTGM-FQL---CQT--DDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       249 ~~~~v~Vv~~~~~NAFalpg-G~IvV~~GL-L~~---l~~--edELaaVLAHElgHv~~rH~~  304 (475)
                      ++|...+|.-+..|+=++=+ |.|.+++.. +..   ..+  -...+.|++||++|---+.-.
T Consensus       262 y~l~~~~V~v~~f~~GaMEN~Gl~tf~~~~ll~~~~~at~~~~~~~~~viaHElaHqWfGnlV  324 (859)
T COG0308         262 YALPIDIVAVPDFSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLV  324 (859)
T ss_pred             CCCcccEEeccCCCCccccccceeEEeeeEEeeCcccchhHHHHHHHHHHHHHHhhhccccee
Confidence            33443556666677655533 455555553 221   111  245777999999998777654


No 100
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=29.49  E-value=13  Score=37.43  Aligned_cols=11  Identities=45%  Similarity=0.797  Sum_probs=10.0

Q ss_pred             HHHHHHHHHhh
Q psy768          288 ATVLSHELSHT  298 (475)
Q Consensus       288 aaVLAHElgHv  298 (475)
                      .+|++||+||.
T Consensus       166 igv~~HE~gH~  176 (286)
T TIGR03296       166 VGVIAHELGHD  176 (286)
T ss_pred             eeeeehhhhcc
Confidence            59999999995


No 101
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=29.16  E-value=33  Score=32.72  Aligned_cols=16  Identities=38%  Similarity=0.567  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHhhhcc
Q psy768          286 ELATVLSHELSHTLLK  301 (475)
Q Consensus       286 ELaaVLAHElgHv~~r  301 (475)
                      .+..++.||+||....
T Consensus        21 ~~l~t~~HE~gHal~a   36 (200)
T PF13398_consen   21 RLLVTFVHELGHALAA   36 (200)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4667999999998653


No 102
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=29.08  E-value=34  Score=31.64  Aligned_cols=13  Identities=54%  Similarity=0.779  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHhhh
Q psy768          287 LATVLSHELSHTL  299 (475)
Q Consensus       287 LaaVLAHElgHv~  299 (475)
                      +.+|+.||+||..
T Consensus         8 ~i~i~iHE~gH~~   20 (180)
T cd05709           8 LISVTVHELGHAL   20 (180)
T ss_pred             HHHHHHHHHHHHH
Confidence            4578999999986


No 103
>PRK08124 flagellar motor protein MotA; Validated
Probab=28.72  E-value=6e+02  Score=25.35  Aligned_cols=24  Identities=25%  Similarity=0.314  Sum_probs=19.4

Q ss_pred             CCHHHHHHHHHHHHHhhhcccHHH
Q psy768          282 QTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       282 ~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      .+++++..+|-.|+.....+|...
T Consensus       120 ~~~~~i~~~le~~i~~~~~~~~~~  143 (263)
T PRK08124        120 QSPEFIRDVLEEEIEAMEERHAAG  143 (263)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHhh
Confidence            378899999999999888776553


No 104
>KOG3658|consensus
Probab=27.67  E-value=21  Score=40.08  Aligned_cols=34  Identities=18%  Similarity=0.205  Sum_probs=25.4

Q ss_pred             CCeEEEehhhhhhcC-----CHHHHHHHHHHHHHhhhcc
Q psy768          268 DGRIFMFTGMFQLCQ-----TDDELATVLSHELSHTLLK  301 (475)
Q Consensus       268 gG~IvV~~GLL~~l~-----~edELaaVLAHElgHv~~r  301 (475)
                      |-+.++++||+...+     -..+-.-|+|||+||--..
T Consensus       368 G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNfGS  406 (764)
T KOG3658|consen  368 GKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNFGS  406 (764)
T ss_pred             cceEEeecceeeeeecCCccCcchhheeehhhhccccCC
Confidence            346899999987763     1357778999999997543


No 105
>KOG1046|consensus
Probab=27.19  E-value=1.1e+02  Score=35.97  Aligned_cols=52  Identities=17%  Similarity=0.275  Sum_probs=30.1

Q ss_pred             EEEEecCCcceeEeC-CCeEEEehhhhhh------cCCHHHHHHHHHHHHHhhhcccHH
Q psy768          253 ITIIDDPLINAFVFP-DGRIFMFTGMFQL------CQTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       253 v~Vv~~~~~NAFalp-gG~IvV~~GLL~~------l~~edELaaVLAHElgHv~~rH~~  304 (475)
                      +-+|--|+..+=|+= .|.|..-...|..      ..+...++.|+|||+||.--+...
T Consensus       283 ~D~iavPdf~~GAMENwGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGNLV  341 (882)
T KOG1046|consen  283 LDLVAVPDFSAGAMENWGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGNLV  341 (882)
T ss_pred             ccEEecCCccccchhcCcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcCcc
Confidence            444444544443332 3555544443322      123567999999999998777554


No 106
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=26.93  E-value=30  Score=37.02  Aligned_cols=18  Identities=28%  Similarity=0.589  Sum_probs=14.1

Q ss_pred             CHHHHHHHHHHHHHhhhcc
Q psy768          283 TDDELATVLSHELSHTLLK  301 (475)
Q Consensus       283 ~edELaaVLAHElgHv~~r  301 (475)
                      +-+++ ..|+||+||..|.
T Consensus       239 ~~~~v-~tLfHE~GHa~H~  256 (458)
T PF01432_consen  239 SHDDV-ETLFHEFGHAMHS  256 (458)
T ss_dssp             SHHHH-HHHHHHHHHHHHH
T ss_pred             ChhhH-HHHHHHHhHHHHH
Confidence            44666 7899999999764


No 107
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=26.90  E-value=33  Score=37.16  Aligned_cols=18  Identities=28%  Similarity=0.530  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHhhhcccH
Q psy768          285 DELATVLSHELSHTLLKHV  303 (475)
Q Consensus       285 dELaaVLAHElgHv~~rH~  303 (475)
                      +++ ..|+||+||..|...
T Consensus       262 ~~V-~TLfHEfGHalH~~l  279 (472)
T cd06455         262 DEV-ETFFHEFGHVIHHLL  279 (472)
T ss_pred             HHH-HHHHHHHHHHHHHHh
Confidence            554 579999999987543


No 108
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=26.72  E-value=39  Score=33.02  Aligned_cols=15  Identities=53%  Similarity=0.793  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHhhhc
Q psy768          286 ELATVLSHELSHTLL  300 (475)
Q Consensus       286 ELaaVLAHElgHv~~  300 (475)
                      =..+|+-||+||...
T Consensus        52 l~~~v~iHElgH~~~   66 (227)
T cd06164          52 LFASVLLHELGHSLV   66 (227)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457899999999864


No 109
>COG3426 Butyrate kinase [Energy production and conversion]
Probab=26.57  E-value=60  Score=33.11  Aligned_cols=40  Identities=15%  Similarity=0.363  Sum_probs=31.4

Q ss_pred             CCcceeE--------eCCCeEEEehhhhhhcCC------HHHHHHHHHHHHHhh
Q psy768          259 PLINAFV--------FPDGRIFMFTGMFQLCQT------DDELATVLSHELSHT  298 (475)
Q Consensus       259 ~~~NAFa--------lpgG~IvV~~GLL~~l~~------edELaaVLAHElgHv  298 (475)
                      +.++|++        .|||+.+|+.-|++.+.+      ...|.++|||+++--
T Consensus        67 ~~~dAvvgRGGLL~pi~gGTY~Vn~~M~~~lk~~~~G~haSnLGaiiA~~ia~~  120 (358)
T COG3426          67 SKFDAVVGRGGLLRPIPGGTYVVNEKMLKDLKNGVQGEHASNLGAIIANRIAKA  120 (358)
T ss_pred             CCccceeecCccccccCCceeEeCHHHHHHHHcCCCCcchhhhhHHHHHHHhhh
Confidence            3567776        478999999999988742      456999999999854


No 110
>PF12911 OppC_N:  N-terminal TM domain of oligopeptide transport permease C
Probab=26.51  E-value=1.3e+02  Score=22.08  Aligned_cols=17  Identities=6%  Similarity=0.233  Sum_probs=13.0

Q ss_pred             HHHHHHHhcccchHHHH
Q psy768          149 IILSQIKKHQDKIAGIT  165 (475)
Q Consensus       149 ~~~~~~~~~~~~~~~~~  165 (475)
                      .+.+++++||..++|++
T Consensus         7 ~~~~~f~~nk~a~~gl~   23 (56)
T PF12911_consen    7 DAWRRFRRNKLAVIGLI   23 (56)
T ss_pred             HHHHHHHhCchHHHHHH
Confidence            57778889988886663


No 111
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=25.69  E-value=34  Score=38.02  Aligned_cols=48  Identities=15%  Similarity=0.105  Sum_probs=26.8

Q ss_pred             eEEEEecCCcceeEeC--CC-eEEEehhhhhhcCCHHHHHHHHHHHHHhhhcccH
Q psy768          252 PITIIDDPLINAFVFP--DG-RIFMFTGMFQLCQTDDELATVLSHELSHTLLKHV  303 (475)
Q Consensus       252 ~v~Vv~~~~~NAFalp--gG-~IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~  303 (475)
                      .++-.+.....||+.+  ++ .-+|....-   ++-++ ...|+||+||..|...
T Consensus       341 D~~~r~gK~~Ga~~~~~~~~~~p~i~~N~~---~~~~~-v~TL~HE~GHa~H~~l  391 (587)
T TIGR02290       341 DAEPRPGKRGGAFCTGFPPSKEPRVLMNYD---GSRRD-VSTLAHELGHAYHSEL  391 (587)
T ss_pred             ccCCCCCCCCCcccCCCCCCCCCEEEEecC---CCchh-HHHHHHHhhHHHHHHH
Confidence            4454555577888854  22 223322221   12223 4579999999988543


No 112
>TIGR00035 asp_race aspartate racemase.
Probab=24.93  E-value=1.5e+02  Score=28.69  Aligned_cols=60  Identities=8%  Similarity=0.075  Sum_probs=39.9

Q ss_pred             CCCchhHHHHHHHHHhhccccccCCCcccccCCCCChHHHHHHHH--------HHHHHHHHHHHhcCC
Q psy768          390 CYDVRVAPLFWQKMALKETQDQVGPKMEEYLSTHPSHENRANNLE--------SKMKEALDIRKECNC  449 (475)
Q Consensus       390 Gydp~a~~~~l~kl~~~~~~~~~g~~~~~~lsTHP~~~~RI~~L~--------~~i~~a~~~~~~~~c  449 (475)
                      |..|.+..++++++.+.......++..+...-.||...+|.+.+.        ..+.++.+.-++++|
T Consensus         9 Gmgp~at~~~~~~i~~~~~a~~d~~~~~~i~~~~~~~~dr~~~~~~~~~~~~~~~l~~~~~~L~~~g~   76 (229)
T TIGR00035         9 GMGPLATAELFRRINEKTKAKRDQEHPAEVLFNNPNIPDRTAYILGRGEDRPRPILIDIAVKLENAGA   76 (229)
T ss_pred             CcCHHHHHHHHHHHHHHhHHhcCCCCCceeeeeCCCHHHHHHHHhcCCcchHHHHHHHHHHHHHHcCC
Confidence            778899999999988665212222334677889999999988754        244445444455554


No 113
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=24.63  E-value=45  Score=31.48  Aligned_cols=15  Identities=40%  Similarity=0.408  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHhhhc
Q psy768          286 ELATVLSHELSHTLL  300 (475)
Q Consensus       286 ELaaVLAHElgHv~~  300 (475)
                      =+..+..||+||...
T Consensus        40 l~~~l~iHElgH~~~   54 (183)
T cd06160          40 LLAILGIHEMGHYLA   54 (183)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456788999999854


No 114
>PRK08456 flagellar motor protein MotA; Validated
Probab=24.55  E-value=7e+02  Score=24.74  Aligned_cols=23  Identities=4%  Similarity=-0.012  Sum_probs=19.2

Q ss_pred             CCHHHHHHHHHHHHHhhhcccHH
Q psy768          282 QTDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       282 ~~edELaaVLAHElgHv~~rH~~  304 (475)
                      .+++++..+|-.|+.....+|..
T Consensus       119 ~~~~~i~~~le~ei~~~~~~~~~  141 (257)
T PRK08456        119 KDLEEIKESMEIQIEEMEEYYHG  141 (257)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHh
Confidence            47899999999999988887764


No 115
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=23.10  E-value=62  Score=38.00  Aligned_cols=19  Identities=21%  Similarity=0.378  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHhhhcccHH
Q psy768          286 ELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       286 ELaaVLAHElgHv~~rH~~  304 (475)
                      .++.|+|||++|---++..
T Consensus       282 ~i~~VIaHElaHqWfGNlV  300 (863)
T TIGR02414       282 RIESVIAHEYFHNWTGNRV  300 (863)
T ss_pred             HHHHHHHHHHHHHHhccee
Confidence            4889999999998776543


No 116
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=22.98  E-value=1.3e+02  Score=33.84  Aligned_cols=55  Identities=18%  Similarity=0.233  Sum_probs=33.6

Q ss_pred             CCCcceeEEEEecCC-cc--eeEeC-----C-----CeEEEehhhhhhcCCHHHHHHHHHHHHHhhhcc
Q psy768          246 HNNFKYPITIIDDPL-IN--AFVFP-----D-----GRIFMFTGMFQLCQTDDELATVLSHELSHTLLK  301 (475)
Q Consensus       246 ~~~~~~~v~Vv~~~~-~N--AFalp-----g-----G~IvV~~GLL~~l~~edELaaVLAHElgHv~~r  301 (475)
                      +.+.++-+||.-.|. .+  |+|.+     +     |.|.+.-.-+.. ..+++...|+.|||.|...=
T Consensus       203 v~naD~vLYVta~P~~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i~s-~~~~~~~rv~~HEi~HALGF  270 (622)
T PTZ00257        203 VSNADFVLYVASVPSEPGVLAWATTCQVFSDDHPAVGVMNIPAANIVS-RYDQGTTRTVTHEVAHALGF  270 (622)
T ss_pred             ccccCEEEEEEEecCCCCeeEEEEEeccccCCCceEEEEeeCHHHCCC-ccchHHHHHHHHHHHHHhcC
Confidence            445677788775552 23  66653     2     346665544432 12356789999999999753


No 117
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=22.61  E-value=53  Score=31.03  Aligned_cols=13  Identities=38%  Similarity=0.657  Sum_probs=10.7

Q ss_pred             HHHHHHHHHhhhc
Q psy768          288 ATVLSHELSHTLL  300 (475)
Q Consensus       288 aaVLAHElgHv~~  300 (475)
                      ..|+-||+||...
T Consensus        10 ~~v~iHElGH~~~   22 (182)
T cd06163          10 ILIFVHELGHFLV   22 (182)
T ss_pred             HHHHHHHHHHHHH
Confidence            4689999999864


No 118
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=22.59  E-value=1e+02  Score=29.01  Aligned_cols=17  Identities=47%  Similarity=0.800  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHhhh-ccc
Q psy768          286 ELATVLSHELSHTL-LKH  302 (475)
Q Consensus       286 ELaaVLAHElgHv~-~rH  302 (475)
                      -+.-++.||+||.. +.|
T Consensus       124 R~~k~~~HElGH~lGL~H  141 (179)
T PRK13267        124 RVRKEVTHELGHTLGLEH  141 (179)
T ss_pred             HHHHHHHHHHHHHcCCcc
Confidence            45556899999993 344


No 119
>KOG3607|consensus
Probab=22.05  E-value=50  Score=37.83  Aligned_cols=17  Identities=47%  Similarity=0.491  Sum_probs=14.2

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy768          284 DDELATVLSHELSHTLL  300 (475)
Q Consensus       284 edELaaVLAHElgHv~~  300 (475)
                      ....|.|+||||||..-
T Consensus       320 ~~~~a~v~AhelgH~lG  336 (716)
T KOG3607|consen  320 LLAFAVVLAHELGHNLG  336 (716)
T ss_pred             chhHHHHHHHHHHhhcC
Confidence            45789999999999854


No 120
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=21.91  E-value=41  Score=34.72  Aligned_cols=21  Identities=24%  Similarity=0.393  Sum_probs=14.8

Q ss_pred             CHHHHHHHHHHHHHhhhcccHH
Q psy768          283 TDDELATVLSHELSHTLLKHVA  304 (475)
Q Consensus       283 ~edELaaVLAHElgHv~~rH~~  304 (475)
                      +-+++. .|.||+||..+.-..
T Consensus       151 ~~~~v~-tl~HE~GHa~h~~l~  171 (365)
T cd06258         151 GHDDIN-TLFHEFGHAVHFLLI  171 (365)
T ss_pred             CHHHHH-HHHHHHhHHHHHHHh
Confidence            445554 689999999876433


No 121
>KOG2975|consensus
Probab=21.84  E-value=1.2e+02  Score=30.47  Aligned_cols=33  Identities=18%  Similarity=0.283  Sum_probs=27.7

Q ss_pred             hHHHHHHHHHHHHHcCCCchhHHHHHHHHHhhc
Q psy768          375 ETEADEVGLKLMARACYDVRVAPLFWQKMALKE  407 (475)
Q Consensus       375 E~EAD~~gl~ll~~AGydp~a~~~~l~kl~~~~  407 (475)
                      -++|+++|++++.+.+++|.....++..|++.+
T Consensus       172 ~~~~ervgl~li~kt~~sp~r~~~l~~dLqQv~  204 (288)
T KOG2975|consen  172 YYDAERVGLDLIEKTSFSPSRVAGLSSDLQQVE  204 (288)
T ss_pred             ecchhhhHHHHHHHhccChhhhccccchHHHHH
Confidence            468999999999999999998877776666655


No 122
>PF04612 T2SM:  Type II secretion system (T2SS), protein M;  InterPro: IPR007690 General secretion pathway (GSP) protein M is a membrane protein involved in the export of proteins in bacteria. It consists of a short cytosolic N-terminal domain, a transmembrane domain, and a C-terminal periplasmic domain. The precise function of this protein is unknown, though in Vibrio cholerae, the EpsM protein interacts with the EpsL protein, and also forms homodimers [],; GO: 0006858 extracellular transport; PDB: 1UV7_A.
Probab=21.59  E-value=31  Score=31.20  Aligned_cols=14  Identities=36%  Similarity=1.113  Sum_probs=0.0

Q ss_pred             hHHHHHHhcCchhh
Q psy768          134 ISRKWWSKLSPDQK  147 (475)
Q Consensus       134 ~~~~w~~~l~~~~r  147 (475)
                      .++.||..|++..|
T Consensus         3 ~l~~~w~~ls~REr   16 (160)
T PF04612_consen    3 QLKQWWQSLSPRER   16 (160)
T ss_dssp             --------------
T ss_pred             HHHHHHHhCCHHHH
Confidence            46789999888654


No 123
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=21.52  E-value=43  Score=32.16  Aligned_cols=15  Identities=27%  Similarity=0.330  Sum_probs=12.1

Q ss_pred             HHHHHHHHHhhhccc
Q psy768          288 ATVLSHELSHTLLKH  302 (475)
Q Consensus       288 aaVLAHElgHv~~rH  302 (475)
                      .+++.|||+|.+.=+
T Consensus        88 ~Gti~HEl~HaLGf~  102 (200)
T cd04281          88 FGIVVHELGHVIGFW  102 (200)
T ss_pred             CchHHHHHHHHhcCc
Confidence            479999999987543


No 124
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=21.42  E-value=44  Score=31.63  Aligned_cols=14  Identities=36%  Similarity=0.453  Sum_probs=11.6

Q ss_pred             HHHHHHHHHhhhcc
Q psy768          288 ATVLSHELSHTLLK  301 (475)
Q Consensus       288 aaVLAHElgHv~~r  301 (475)
                      .+++.|||+|.+.=
T Consensus        78 ~G~i~HEl~HaLG~   91 (182)
T cd04283          78 KGIIQHELLHALGF   91 (182)
T ss_pred             cchHHHHHHHHhCC
Confidence            48999999998753


No 125
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=21.14  E-value=57  Score=32.70  Aligned_cols=15  Identities=47%  Similarity=0.660  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHhhhc
Q psy768          286 ELATVLSHELSHTLL  300 (475)
Q Consensus       286 ELaaVLAHElgHv~~  300 (475)
                      =+.+|+.||+||...
T Consensus       117 L~isv~iHElgHa~~  131 (263)
T cd06159         117 LVVGVVVHELSHGIL  131 (263)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457799999999854


No 126
>PF05960 DUF885:  Bacterial protein of unknown function (DUF885);  InterPro: IPR010281 This family consists of hypothetical bacterial proteins.; PDB: 3O0Y_B 3U24_A 3IUK_A.
Probab=20.98  E-value=4.2e+02  Score=28.89  Aligned_cols=74  Identities=18%  Similarity=0.209  Sum_probs=39.3

Q ss_pred             HHHHHHHHHHHHhcc-ccccCCCcceeEEEEecC----CcceeEeCC--------CeEEEehhhhhhcCCHHHHHHHHHH
Q psy768          227 KRVGAVVKRLIDANK-VYMEHNNFKYPITIIDDP----LINAFVFPD--------GRIFMFTGMFQLCQTDDELATVLSH  293 (475)
Q Consensus       227 ~~v~~i~~rL~~~~~-~~p~~~~~~~~v~Vv~~~----~~NAFalpg--------G~IvV~~GLL~~l~~edELaaVLAH  293 (475)
                      ...+++.+++....+ ..+.+...++.|.-+...    ..-||..|+        |.++|+..-.... +.-++..+++|
T Consensus       300 ~~~~~~~~~~~~~~~~~f~~~p~~~~~v~~~p~~~~~~~~~a~y~~~~~~d~~~~~~~~in~~~~~~~-~~~~~~~~~~H  378 (549)
T PF05960_consen  300 DDYRELLERLKAKLPDLFPTIPDQPYEVRPVPPFLEPSAPPAYYQPPPPIDGSRPGRFYINPPDLRSR-NKYSLYTTAAH  378 (549)
T ss_dssp             HHHHHHHHHHHHHHCTCS---TCS-EEEEEEECCCHTTS-SEEEEE--TTTSSS-EEEEE--STHHCC-ECTCHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHhcccCCCCCCceeecCccccCcCCCCcccCCCCCCCCCCcEEEEeCCCcccc-ChhhHHHHHHh
Confidence            455566666543322 223344566777766653    234565553        5688888554444 33589999999


Q ss_pred             HH--Hhhhcc
Q psy768          294 EL--SHTLLK  301 (475)
Q Consensus       294 El--gHv~~r  301 (475)
                      |.  ||+.+-
T Consensus       379 E~~PGHh~Q~  388 (549)
T PF05960_consen  379 EAYPGHHLQI  388 (549)
T ss_dssp             HSTTTHHHHH
T ss_pred             hcCCcHHHHH
Confidence            98  566543


No 127
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=20.54  E-value=55  Score=30.91  Aligned_cols=14  Identities=36%  Similarity=0.396  Sum_probs=11.5

Q ss_pred             HHHHHHHHHhhhcc
Q psy768          288 ATVLSHELSHTLLK  301 (475)
Q Consensus       288 aaVLAHElgHv~~r  301 (475)
                      .+++.|||+|.+.=
T Consensus        80 ~~~i~HEl~HaLG~   93 (191)
T PF01400_consen   80 VGTILHELGHALGF   93 (191)
T ss_dssp             HHHHHHHHHHHHTB
T ss_pred             ccchHHHHHHHHhh
Confidence            56999999998654


No 128
>KOG3314|consensus
Probab=20.53  E-value=2.5e+02  Score=26.38  Aligned_cols=32  Identities=25%  Similarity=0.320  Sum_probs=24.5

Q ss_pred             EEEehhhhhhcCCHHHHHHHHHHHHHhhhcccHHH
Q psy768          271 IFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAE  305 (475)
Q Consensus       271 IvV~~GLL~~l~~edELaaVLAHElgHv~~rH~~~  305 (475)
                      |++-.   +.+.+.+++.-|++|||-|..-.|-++
T Consensus        78 IvlCq---N~l~~q~h~n~vv~HElIH~fDd~r~~  109 (194)
T KOG3314|consen   78 IVLCQ---NRLTIQDHVNQVVIHELIHAFDDCRAK  109 (194)
T ss_pred             eEEec---cccchHHHHHHHHHHHHHHHHHhhhhh
Confidence            76665   345667899999999999997766554


Done!