Query         psy4798
Match_columns 623
No_of_seqs    395 out of 2642
Neff          5.4 
Searched_HMMs 46136
Date          Fri Aug 16 17:54:05 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4798.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4798hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2719|consensus              100.0 7.8E-62 1.7E-66  518.0  33.7  285  264-614    82-399 (428)
  2 KOG2719|consensus              100.0 1.2E-38 2.6E-43  340.4  20.3  188   37-267   215-402 (428)
  3 PRK03001 M48 family peptidase; 100.0 7.1E-35 1.5E-39  302.4  24.3  201  370-606     2-229 (283)
  4 PRK03982 heat shock protein Ht 100.0 2.9E-33 6.4E-38  291.0  24.7  200  370-605     3-233 (288)
  5 PRK01265 heat shock protein Ht 100.0 1.2E-29 2.5E-34  267.5  27.2  170  405-607    48-252 (324)
  6 PRK01265 heat shock protein Ht 100.0 1.2E-30 2.6E-35  275.0  19.3  194   12-259    50-252 (324)
  7 PRK01345 heat shock protein Ht 100.0 2.6E-28 5.6E-33  257.3  25.8  172  402-605    31-229 (317)
  8 PRK04897 heat shock protein Ht 100.0 2.8E-29   6E-34  262.6  18.3  196   11-260    48-256 (298)
  9 PRK02391 heat shock protein Ht 100.0 4.3E-28 9.4E-33  253.4  25.6  169  405-605    43-240 (296)
 10 PRK03982 heat shock protein Ht 100.0 4.5E-29 9.8E-34  259.7  18.1  198   10-263    35-239 (288)
 11 PRK02391 heat shock protein Ht 100.0   1E-28 2.2E-33  258.1  20.4  197   10-262    43-245 (296)
 12 PRK03072 heat shock protein Ht 100.0 5.3E-28 1.1E-32  251.9  24.2  173  401-605    33-233 (288)
 13 PRK04897 heat shock protein Ht 100.0 5.4E-28 1.2E-32  252.9  24.2  167  407-606    49-254 (298)
 14 PRK02870 heat shock protein Ht 100.0 1.1E-28 2.3E-33  261.5  18.4  192   11-262    78-284 (336)
 15 PRK03072 heat shock protein Ht 100.0 1.8E-28 3.8E-33  255.5  19.2  193   11-262    38-238 (288)
 16 PRK03001 M48 family peptidase; 100.0 5.6E-28 1.2E-32  250.9  20.2  191   10-262    34-233 (283)
 17 PRK05457 heat shock protein Ht 100.0 8.4E-28 1.8E-32  250.0  20.1  191   13-260    44-246 (284)
 18 PRK02870 heat shock protein Ht 100.0 8.4E-27 1.8E-31  247.1  26.8  169  406-606    78-280 (336)
 19 PRK05457 heat shock protein Ht  99.9 3.9E-26 8.4E-31  237.5  23.9  167  407-607    43-245 (284)
 20 PRK01345 heat shock protein Ht  99.9 1.6E-26 3.5E-31  243.7  20.7  191   10-260    34-232 (317)
 21 PF01435 Peptidase_M48:  Peptid  99.9 8.4E-25 1.8E-29  216.2   4.7  183  408-622     3-224 (226)
 22 COG0501 HtpX Zn-dependent prot  99.9 4.4E-22 9.6E-27  205.8  18.8  187   11-259    61-259 (302)
 23 COG0501 HtpX Zn-dependent prot  99.9 3.2E-20 6.8E-25  192.0  22.6  170  404-606    59-258 (302)
 24 PF01435 Peptidase_M48:  Peptid  99.8 1.8E-21 3.9E-26  192.4  -0.1  156   38-256    30-188 (226)
 25 COG4783 Putative Zn-dependent   99.3 1.6E-11 3.5E-16  133.7  15.6  158  431-622    65-253 (484)
 26 PF05569 Peptidase_M56:  BlaR1   99.3 1.3E-10 2.8E-15  121.5  19.0  142  426-608   131-275 (299)
 27 PF05569 Peptidase_M56:  BlaR1   99.0 5.7E-09 1.2E-13  109.2  16.1  131   39-260   141-275 (299)
 28 KOG2661|consensus               99.0 2.2E-09 4.8E-14  112.3   9.5  135  455-623   240-390 (424)
 29 COG4783 Putative Zn-dependent   98.8 2.5E-08 5.4E-13  109.2  12.2  138   86-271   101-249 (484)
 30 COG4784 Putative Zn-dependent   98.7 1.7E-07 3.7E-12   98.8  12.8  138  434-605    62-217 (479)
 31 COG4219 MecR1 Antirepressor re  98.5 7.8E-06 1.7E-10   86.2  17.9  128  435-603   135-264 (337)
 32 COG4784 Putative Zn-dependent   98.1 1.5E-05 3.2E-10   84.5  10.4  117   86-256    95-216 (479)
 33 KOG2661|consensus               98.0 1.1E-05 2.5E-10   85.0   7.7  100   85-241   245-345 (424)
 34 COG4219 MecR1 Antirepressor re  97.7 0.00054 1.2E-08   72.6  13.7  106   88-261   162-270 (337)
 35 PF06114 DUF955:  Domain of unk  97.1  0.0046   1E-07   54.4  10.5   65  519-604    37-101 (122)
 36 PF04228 Zn_peptidase:  Putativ  96.6   0.011 2.4E-07   62.6  10.2  128  440-603   101-245 (292)
 37 PF06114 DUF955:  Domain of unk  95.4   0.034 7.4E-07   48.9   5.9   63  140-255    37-100 (122)
 38 PF01863 DUF45:  Protein of unk  95.3    0.04 8.7E-07   54.3   6.6   69  436-541   112-182 (205)
 39 PRK04351 hypothetical protein;  94.8    0.11 2.4E-06   50.1   7.8   71  433-535     1-72  (149)
 40 COG1451 Predicted metal-depend  94.7   0.085 1.8E-06   54.0   7.1   73  436-544   123-195 (223)
 41 PF01863 DUF45:  Protein of unk  94.2   0.061 1.3E-06   53.0   4.6   67   40-161   113-180 (205)
 42 PF13699 DUF4157:  Domain of un  93.3    0.15 3.2E-06   44.0   4.9   69  432-537     1-74  (79)
 43 PF04298 Zn_peptidase_2:  Putat  92.1     7.1 0.00015   40.2  15.9   86  419-540    17-105 (222)
 44 COG1451 Predicted metal-depend  90.9     0.3 6.4E-06   50.0   4.6   25  140-164   170-194 (223)
 45 PF10026 DUF2268:  Predicted Zn  90.7    0.75 1.6E-05   45.8   7.2   74  437-539     4-80  (195)
 46 PF10463 Peptidase_U49:  Peptid  90.2    0.23   5E-06   50.3   3.1   41  523-589   100-140 (206)
 47 smart00731 SprT SprT homologue  89.7    0.65 1.4E-05   44.2   5.5   66  437-537     2-72  (146)
 48 PF10263 SprT-like:  SprT-like   88.0     2.1 4.6E-05   40.5   7.8   33  480-540    44-76  (157)
 49 PF04450 BSP:  Peptidase of pla  85.7     1.8 3.9E-05   43.9   6.2   82  432-540    24-112 (205)
 50 PF04228 Zn_peptidase:  Putativ  85.2    0.69 1.5E-05   49.3   3.1   18  224-241   205-222 (292)
 51 PF04298 Zn_peptidase_2:  Putat  82.2      66  0.0014   33.3  15.7   17  145-161    89-105 (222)
 52 PF13203 DUF2201_N:  Putative m  80.6       2 4.3E-05   45.1   4.4   27  519-545    55-81  (292)
 53 PF08325 WLM:  WLM domain;  Int  80.1     2.4 5.2E-05   42.4   4.5   23  138-160    75-97  (186)
 54 PRK04351 hypothetical protein;  76.5       2 4.4E-05   41.4   2.7   31   98-156    42-72  (149)
 55 PF12388 Peptidase_M57:  Dual-a  74.7     5.8 0.00013   40.5   5.5   22  519-540   128-150 (211)
 56 PF13699 DUF4157:  Domain of un  73.7     2.7 5.8E-05   36.3   2.5   19  141-159    57-75  (79)
 57 PF04450 BSP:  Peptidase of pla  73.0     3.9 8.4E-05   41.5   3.8   53   86-161    60-112 (205)
 58 PF08325 WLM:  WLM domain;  Int  72.8     7.4 0.00016   38.9   5.7   21  519-539    77-97  (186)
 59 PF13203 DUF2201_N:  Putative m  72.3     3.6 7.8E-05   43.2   3.6   26  140-165    55-80  (292)
 60 PF10463 Peptidase_U49:  Peptid  71.9     1.7 3.7E-05   44.2   0.9   19  145-163   101-119 (206)
 61 PRK09672 phage exclusion prote  71.5     3.6 7.9E-05   43.9   3.3   41  524-590   165-205 (305)
 62 cd04269 ZnMc_adamalysin_II_lik  71.5     9.2  0.0002   37.5   6.1   16  521-536   128-143 (194)
 63 COG2321 Predicted metalloprote  71.4     5.7 0.00012   42.0   4.6  103  451-586   111-217 (295)
 64 smart00731 SprT SprT homologue  70.9     3.4 7.4E-05   39.2   2.7   32   99-158    41-72  (146)
 65 PF10026 DUF2268:  Predicted Zn  70.8     8.9 0.00019   38.2   5.8   21  140-160    60-80  (195)
 66 PF10263 SprT-like:  SprT-like   70.8     2.9 6.2E-05   39.6   2.2   32  100-159    43-74  (157)
 67 COG4900 Predicted metallopepti  70.6      13 0.00028   34.5   6.2   18  519-536    75-92  (133)
 68 COG2321 Predicted metalloprote  69.1     2.9 6.2E-05   44.2   1.9   18  224-241   202-219 (295)
 69 cd04270 ZnMc_TACE_like Zinc-de  68.3     3.4 7.4E-05   42.7   2.3   22  137-158   159-180 (244)
 70 COG3091 SprT Zn-dependent meta  68.1     9.4  0.0002   37.1   5.0   84  435-552     4-98  (156)
 71 cd04279 ZnMc_MMP_like_1 Zinc-d  64.0       4 8.6E-05   38.8   1.7   15  142-156   101-115 (156)
 72 PRK04860 hypothetical protein;  64.0      18 0.00039   35.4   6.2   64  441-538    12-77  (160)
 73 PRK09672 phage exclusion prote  64.0     1.8 3.8E-05   46.3  -0.8   18  145-162   165-182 (305)
 74 TIGR00181 pepF oligoendopeptid  64.0       6 0.00013   45.7   3.5   22   83-104   348-370 (591)
 75 cd04269 ZnMc_adamalysin_II_lik  63.9     6.6 0.00014   38.5   3.3   16  142-157   128-143 (194)
 76 cd04268 ZnMc_MMP_like Zinc-dep  63.9       4 8.7E-05   38.6   1.7   16  520-535    90-105 (165)
 77 PF01432 Peptidase_M3:  Peptida  63.8     6.9 0.00015   43.7   3.8   55   83-159   194-256 (458)
 78 PF12388 Peptidase_M57:  Dual-a  63.5       6 0.00013   40.4   2.9   17  140-156   128-144 (211)
 79 PF05572 Peptidase_M43:  Pregna  63.4     3.6 7.9E-05   39.7   1.3   23  522-544    67-90  (154)
 80 cd04270 ZnMc_TACE_like Zinc-de  63.1     5.2 0.00011   41.3   2.5   18  519-536   162-179 (244)
 81 cd04279 ZnMc_MMP_like_1 Zinc-d  63.1     4.5 9.7E-05   38.4   1.8   16  521-536   101-116 (156)
 82 PF00413 Peptidase_M10:  Matrix  62.9     3.3 7.1E-05   38.6   0.9   17  140-156   100-116 (154)
 83 cd04268 ZnMc_MMP_like Zinc-dep  61.5     4.3 9.4E-05   38.3   1.5   16  141-156    90-105 (165)
 84 PF13485 Peptidase_MA_2:  Pepti  60.3     6.6 0.00014   34.5   2.3   22  519-540    20-41  (128)
 85 PF00413 Peptidase_M10:  Matrix  58.7     5.7 0.00012   37.0   1.7   22  519-540   100-122 (154)
 86 TIGR02289 M3_not_pepF oligoend  58.4     8.6 0.00019   44.2   3.4   19   82-100   305-323 (549)
 87 KOG2765|consensus               57.7      47   0.001   36.9   8.6  103  291-415   275-387 (416)
 88 PF13688 Reprolysin_5:  Metallo  57.6     5.1 0.00011   39.2   1.2   16  520-535   138-153 (196)
 89 cd06455 M3A_TOP Peptidase M3 T  57.4      22 0.00047   40.2   6.3   20  519-539   259-278 (472)
 90 PF13485 Peptidase_MA_2:  Pepti  57.0     7.8 0.00017   34.0   2.2   22  140-161    20-41  (128)
 91 COG2856 Predicted Zn peptidase  55.8     7.8 0.00017   39.6   2.2   20  519-538    67-86  (213)
 92 PF05572 Peptidase_M43:  Pregna  55.7     5.5 0.00012   38.5   1.1   14  144-157    68-81  (154)
 93 cd04267 ZnMc_ADAM_like Zinc-de  54.5     5.5 0.00012   38.9   0.9   15  522-536   131-145 (192)
 94 PF15061 DUF4538:  Domain of un  54.4      17 0.00036   29.9   3.4   43  404-454     8-50  (58)
 95 COG2856 Predicted Zn peptidase  54.4     8.4 0.00018   39.4   2.2   21  140-160    67-87  (213)
 96 PF13582 Reprolysin_3:  Metallo  54.2     6.4 0.00014   35.5   1.2   10  526-535   109-118 (124)
 97 PF13688 Reprolysin_5:  Metallo  53.9     5.9 0.00013   38.7   1.0   17  140-156   137-153 (196)
 98 cd06455 M3A_TOP Peptidase M3 T  53.0      18  0.0004   40.8   4.8   21  138-159   257-277 (472)
 99 PF01421 Reprolysin:  Reprolysi  52.1     7.1 0.00015   38.5   1.3   16  520-535   127-142 (199)
100 cd04272 ZnMc_salivary_gland_MP  51.5      12 0.00025   37.8   2.7   14  523-536   144-157 (220)
101 PF13582 Reprolysin_3:  Metallo  51.3     7.1 0.00015   35.2   1.0   11  147-157   109-119 (124)
102 COG3824 Predicted Zn-dependent  51.1      12 0.00026   35.1   2.5   14  522-535   107-120 (136)
103 COG1164 Oligoendopeptidase F [  50.2      14 0.00031   43.2   3.5   44   83-160   349-395 (598)
104 PF13574 Reprolysin_2:  Metallo  50.2     7.9 0.00017   37.8   1.2   13  524-536   111-123 (173)
105 TIGR02289 M3_not_pepF oligoend  49.7      11 0.00024   43.3   2.5   21  520-541   334-354 (549)
106 PF01421 Reprolysin:  Reprolysi  49.5     9.7 0.00021   37.6   1.7   16  141-156   127-142 (199)
107 cd06459 M3B_Oligoendopeptidase  49.3      16 0.00034   40.0   3.5   18   83-100   191-208 (427)
108 cd04267 ZnMc_ADAM_like Zinc-de  49.3     7.1 0.00015   38.2   0.7   15  143-157   131-145 (192)
109 cd04278 ZnMc_MMP Zinc-dependen  49.2     8.7 0.00019   36.5   1.3   15  521-535   104-118 (157)
110 PF01432 Peptidase_M3:  Peptida  48.4      23 0.00049   39.6   4.6   62  454-537   186-255 (458)
111 cd04278 ZnMc_MMP Zinc-dependen  48.0     9.5 0.00021   36.3   1.4   16  141-156   103-118 (157)
112 cd04272 ZnMc_salivary_gland_MP  47.9      15 0.00032   37.1   2.7   20    6-25     21-40  (220)
113 COG2738 Predicted Zn-dependent  47.3 1.6E+02  0.0034   30.2   9.6   83  423-542    24-110 (226)
114 PF02031 Peptidase_M7:  Strepto  47.3     9.1  0.0002   36.3   1.1   10  147-156    79-88  (132)
115 PF13574 Reprolysin_2:  Metallo  46.7     9.1  0.0002   37.4   1.0   11  146-156   112-122 (173)
116 cd06459 M3B_Oligoendopeptidase  46.7      14  0.0003   40.4   2.5   20  521-541   220-239 (427)
117 TIGR00181 pepF oligoendopeptid  46.3      16 0.00035   42.2   3.2   17  525-541   379-395 (591)
118 TIGR02290 M3_fam_3 oligoendope  45.5      15 0.00033   42.5   2.8   16   83-98    344-359 (587)
119 PF09768 Peptidase_M76:  Peptid  44.5      22 0.00047   35.3   3.2   40   92-161    47-87  (173)
120 PF01447 Peptidase_M4:  Thermol  44.1      11 0.00023   36.5   1.0   11  146-156   136-146 (150)
121 COG1164 Oligoendopeptidase F [  43.8      15 0.00032   43.1   2.3   45  466-541   353-397 (598)
122 COG2738 Predicted Zn-dependent  43.8      38 0.00083   34.4   4.8   20  145-164    92-111 (226)
123 PRK04860 hypothetical protein;  43.8      16 0.00034   35.8   2.1   19  140-158    58-76  (160)
124 PF01447 Peptidase_M4:  Thermol  43.3      12 0.00026   36.2   1.2   17  466-486   108-124 (150)
125 PF02031 Peptidase_M7:  Strepto  42.4      13 0.00028   35.2   1.3    9  527-535    80-88  (132)
126 cd00203 ZnMc Zinc-dependent me  41.1      12 0.00025   35.4   0.8   16  521-536    93-108 (167)
127 COG3091 SprT Zn-dependent meta  40.8      17 0.00038   35.3   1.9   17  140-156    56-72  (156)
128 PRK13267 archaemetzincin-like   40.7      84  0.0018   31.3   6.8   19  521-539   122-141 (179)
129 PF14247 DUF4344:  Domain of un  40.6      46   0.001   34.3   5.0   57  466-535    47-103 (220)
130 cd04271 ZnMc_ADAM_fungal Zinc-  39.8      10 0.00023   38.8   0.3   11  526-536   147-157 (228)
131 TIGR00782 ccoP cytochrome c ox  39.7      44 0.00095   35.3   4.9   33  396-430    25-57  (285)
132 cd00203 ZnMc Zinc-dependent me  39.4      12 0.00026   35.2   0.6   16  142-157    93-108 (167)
133 cd06460 M32_Taq Peptidase fami  38.4      42 0.00092   37.4   4.7   73  431-540   102-175 (396)
134 cd06258 Peptidase_M3_like The   38.3      44 0.00095   36.0   4.8   60   79-160   104-169 (365)
135 KOG1047|consensus               38.1      41  0.0009   39.0   4.6   60  442-538   239-302 (613)
136 cd04271 ZnMc_ADAM_fungal Zinc-  37.5      12 0.00025   38.4   0.2   19    6-24     21-39  (228)
137 smart00235 ZnMc Zinc-dependent  37.0      15 0.00032   34.0   0.8   10  526-535    88-97  (140)
138 PF09768 Peptidase_M76:  Peptid  36.9      92   0.002   31.0   6.3   16  520-535    67-82  (173)
139 PF13583 Reprolysin_4:  Metallo  36.8      16 0.00035   36.8   1.0   16  526-541   139-155 (206)
140 TIGR02290 M3_fam_3 oligoendope  36.8      20 0.00044   41.5   2.0   15  525-539   376-390 (587)
141 cd06456 M3A_DCP_Oligopeptidase  36.6      46   0.001   37.3   4.7   22  138-160   202-223 (422)
142 COG4900 Predicted metallopepti  35.6      26 0.00055   32.6   2.0   21  137-157    72-92  (133)
143 COG1054 Predicted sulfurtransf  34.5      74  0.0016   34.3   5.5   45  425-475     8-53  (308)
144 PF02163 Peptidase_M50:  Peptid  34.3      23  0.0005   34.5   1.7   12  524-535     7-18  (192)
145 PF13583 Reprolysin_4:  Metallo  34.2      18 0.00039   36.4   0.9   11  147-157   139-149 (206)
146 KOG3314|consensus               33.2      42 0.00092   33.2   3.2   39   95-163    71-109 (194)
147 COG3824 Predicted Zn-dependent  32.8      23 0.00051   33.3   1.3   15  143-157   107-121 (136)
148 cd06258 Peptidase_M3_like The   32.4      82  0.0018   33.9   5.7   21  519-540   150-170 (365)
149 KOG3607|consensus               32.3      36 0.00078   40.9   3.1   16  520-535   319-334 (716)
150 PF14891 Peptidase_M91:  Effect  31.7      24 0.00052   34.5   1.3   18  141-158    99-116 (174)
151 COG2928 Uncharacterized conser  31.6 5.2E+02   0.011   26.9  10.8   35  437-475   103-137 (222)
152 smart00235 ZnMc Zinc-dependent  31.4      20 0.00044   33.1   0.7   10  147-156    88-97  (140)
153 PF14891 Peptidase_M91:  Effect  31.2      33 0.00072   33.5   2.2   18  520-537    99-116 (174)
154 cd06456 M3A_DCP_Oligopeptidase  30.7      61  0.0013   36.3   4.4   21  519-540   204-224 (422)
155 PF12725 DUF3810:  Protein of u  30.6      26 0.00056   37.9   1.4   26  133-158   184-209 (318)
156 COG3590 PepO Predicted metallo  30.4      14  0.0003   42.9  -0.7   15  522-536   485-499 (654)
157 COG3590 PepO Predicted metallo  29.8      15 0.00032   42.6  -0.6   17  141-157   483-499 (654)
158 PF14715 FixP_N:  N-terminal do  29.1      61  0.0013   25.9   2.9   26  396-421    19-44  (51)
159 PF02163 Peptidase_M50:  Peptid  29.1      30 0.00066   33.7   1.5   12  145-156     7-18  (192)
160 KOG1047|consensus               29.0      34 0.00073   39.7   2.0   27  320-347   391-417 (613)
161 PF01433 Peptidase_M1:  Peptida  28.8   1E+02  0.0022   33.1   5.7   20  521-540   292-311 (390)
162 PRK05320 rhodanese superfamily  28.5      69  0.0015   33.5   4.1   57  426-490     7-64  (257)
163 COG0475 KefB Kef-type K+ trans  28.5 1.7E+02  0.0037   32.5   7.4   97  342-438   120-222 (397)
164 cd05709 S2P-M50 Site-2 proteas  28.4      33 0.00072   33.2   1.7   13  524-536     8-20  (180)
165 COG3864 Uncharacterized protei  27.9      63  0.0014   35.2   3.7   24  519-542    64-87  (396)
166 PRK01415 hypothetical protein;  27.6      74  0.0016   33.3   4.1   45  425-475     8-53  (247)
167 PF12315 DUF3633:  Protein of u  27.4      47   0.001   34.1   2.5   20  518-537    87-106 (212)
168 TIGR02412 pepN_strep_liv amino  27.4      74  0.0016   38.7   4.7   18  522-539   285-302 (831)
169 cd04327 ZnMc_MMP_like_3 Zinc-d  27.4      38 0.00082   33.7   1.9   14  522-535    90-103 (198)
170 TIGR03296 M6dom_TIGR03296 M6 f  27.3      14 0.00031   39.0  -1.2   11  525-535   166-176 (286)
171 PF11694 DUF3290:  Protein of u  27.3 4.3E+02  0.0094   25.6   9.0   24  436-459    79-102 (149)
172 cd04275 ZnMc_pappalysin_like Z  27.2      11 0.00024   38.7  -1.9   20  523-542   136-156 (225)
173 COG4227 Antirestriction protei  27.1      34 0.00074   36.2   1.5   21  520-541   199-219 (316)
174 KOG3733|consensus               27.1 1.5E+02  0.0033   33.6   6.4   61  410-480   487-555 (566)
175 PF12315 DUF3633:  Protein of u  26.7      44 0.00094   34.3   2.2   20  138-157    86-105 (212)
176 cd06460 M32_Taq Peptidase fami  26.6      84  0.0018   35.2   4.6   14  147-160   161-174 (396)
177 cd04273 ZnMc_ADAMTS_like Zinc-  26.6      14 0.00031   36.8  -1.3   12  146-157   141-152 (207)
178 cd04277 ZnMc_serralysin_like Z  26.4      37  0.0008   33.1   1.6   19  522-540   111-130 (186)
179 cd04277 ZnMc_serralysin_like Z  26.3      35 0.00076   33.3   1.4   15  142-156   110-124 (186)
180 COG3864 Uncharacterized protei  26.3      48   0.001   36.0   2.5   25  139-163    63-87  (396)
181 TIGR03513 GldL_gliding gliding  26.2 2.3E+02   0.005   29.0   7.1   46  375-420     6-51  (202)
182 cd06161 S2P-M50_SpoIVFB SpoIVF  25.8      39 0.00084   34.0   1.7   13  523-535    37-49  (208)
183 PF13398 Peptidase_M50B:  Pepti  25.2      41 0.00088   33.7   1.7   15  521-535    19-33  (200)
184 cd04327 ZnMc_MMP_like_3 Zinc-d  25.2      41 0.00089   33.4   1.7   14  143-156    90-103 (198)
185 KOG3607|consensus               24.8      49  0.0011   39.8   2.4   16  142-157   320-335 (716)
186 COG3402 Uncharacterized conser  24.5 1.9E+02  0.0042   28.5   6.0   44  377-420    25-69  (161)
187 TIGR02140 permease_CysW sulfat  24.1 7.7E+02   0.017   25.0  11.7   36  417-457   139-174 (261)
188 COG4227 Antirestriction protei  24.0      44 0.00095   35.5   1.6   24  140-163   198-221 (316)
189 cd04276 ZnMc_MMP_like_2 Zinc-d  23.8      46   0.001   33.6   1.7   14  143-156   114-127 (197)
190 cd04276 ZnMc_MMP_like_2 Zinc-d  23.7      50  0.0011   33.4   2.0   19  522-540   114-133 (197)
191 COG4324 Predicted aminopeptida  23.7      57  0.0012   34.7   2.4   26  139-164   191-219 (376)
192 TIGR02414 pepN_proteo aminopep  23.6      91   0.002   38.3   4.5   18  522-539   281-298 (863)
193 cd06163 S2P-M50_PDZ_RseP-like   23.6      48   0.001   33.0   1.8   11  525-535    10-20  (182)
194 cd05709 S2P-M50 Site-2 proteas  23.6      43 0.00094   32.4   1.5   13  145-157     8-20  (180)
195 KOG3714|consensus               23.3      34 0.00074   38.2   0.8   18  515-535   153-170 (411)
196 COG2274 SunT ABC-type bacterio  23.2 3.3E+02  0.0071   32.9   8.9   20  300-319   229-248 (709)
197 cd06164 S2P-M50_SpoIVFB_CBS Sp  22.8      48   0.001   34.0   1.7   13  523-535    52-64  (227)
198 cd06161 S2P-M50_SpoIVFB SpoIVF  22.4      47   0.001   33.4   1.5   12  145-156    38-49  (208)
199 cd06159 S2P-M50_PDZ_Arch Uncha  22.2      50  0.0011   34.8   1.7   13  523-535   117-129 (263)
200 cd06162 S2P-M50_PDZ_SREBP Ster  21.7      52  0.0011   35.1   1.7   14  523-536   134-147 (277)
201 PF06262 DUF1025:  Possibl zinc  21.6      61  0.0013   29.2   1.9   19  142-160    70-88  (97)
202 PRK14015 pepN aminopeptidase N  21.3      64  0.0014   39.7   2.6   19  522-540   294-312 (875)
203 smart00793 AgrB Accessory gene  21.1 4.3E+02  0.0093   26.1   8.0   24  423-446   118-141 (184)
204 COG4662 TupA ABC-type tungstat  20.9 5.4E+02   0.012   26.5   8.4   63  389-456    80-143 (227)
205 PF11677 DUF3273:  Protein of u  20.8 2.9E+02  0.0063   29.4   6.9   48  393-445   207-255 (265)
206 cd04281 ZnMc_BMP1_TLD Zinc-dep  20.8      44 0.00095   33.8   0.9   17  516-535    82-98  (200)
207 KOG3314|consensus               20.6   1E+02  0.0022   30.7   3.2   44  469-542    65-109 (194)
208 COG4324 Predicted aminopeptida  20.5      76  0.0016   33.8   2.6   28  519-546   192-222 (376)
209 cd06163 S2P-M50_PDZ_RseP-like   20.3      51  0.0011   32.8   1.2   11  146-156    10-20  (182)

No 1  
>KOG2719|consensus
Probab=100.00  E-value=7.8e-62  Score=517.95  Aligned_cols=285  Identities=45%  Similarity=0.762  Sum_probs=249.6

Q ss_pred             hhccchhhhhHHhhhhhhhhccCCccchHHHHHHHHHHHHHHHHhhhccHHHhhhcccccccCCCcccHHHHHHHHHHHH
Q psy4798         264 LLVCNGLPYFWSKSEELGETYFGFHKNEIVTSCLFIVLFNIFSTVIGLPISFYHHFVLEEKHGFNKQTYGFFVKDQIKSF  343 (623)
Q Consensus       264 iL~~g~~~~lw~~s~~~~~~~~~~~~~~~~~~~~f~~~~~l~~~li~lP~~~Y~~f~le~~~Gfn~~T~~~f~~d~vk~~  343 (623)
                      .++++.+|++|.+++++...     ..|++++++|+.+.+.+++++++|+++|++|++|+|||||+||.+.|++|.+|+.
T Consensus        82 ~L~~g~~~~lw~lt~~~~~~-----~~~~~~s~vF~~~~s~~~~l~~LP~~iys~fvie~~~gfnk~t~~~~~~d~lk~~  156 (428)
T KOG2719|consen   82 ELYLGALPFLWKLTGKFLGK-----AGEIWLSLVFLLLLSIYSTLLDLPSSIYSTFVLEEKHGFNKQTLSLFIIDGLKSL  156 (428)
T ss_pred             HHHHhHHHHHHHHHHHHhhh-----hhhhhhhhHHHHHHHHHHHHHhccHHHHHHHHHHhhcCCCcCcchhHHHHHHHHH
Confidence            46678999999999987541     2389999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHhhccccccchhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhh
Q psy4798         344 IVSLILSIPLTGAVVYIIQTYGFFVKDQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFI  423 (623)
Q Consensus       344 ~l~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I  423 (623)
                      ++++                                ++++|+++++.+++..+|++|.+|+|++..++++++++++|.+|
T Consensus       157 ~v~~--------------------------------~l~~Pi~~~iv~i~~~~g~yF~i~~w~f~~v~~l~l~ti~p~~i  204 (428)
T KOG2719|consen  157 LVGV--------------------------------VLTIPIVAAIVMILKKFGPYFFIYLWLFYFVLSLLLLTIYPGFI  204 (428)
T ss_pred             HHHH--------------------------------HHHHHHHHHHHHHHHHcCccHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            9999                                89999999999999999999999999999999999999999999


Q ss_pred             hhhhccCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCC
Q psy4798         424 APLFDKYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAG  503 (623)
Q Consensus       424 ~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~  503 (623)
                      +|+|+|++|+++++||++||++|++.|||.++++|++||+||+|+|||++|+|++||||+||||+.+             
T Consensus       205 ~P~f~K~~PLe~g~l~~~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~-------------  271 (428)
T KOG2719|consen  205 APLFGKFTPLEEGDLKEKIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE-------------  271 (428)
T ss_pred             hhhhcCCCCCCCCchHHHHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhh-------------
Confidence            9999999999999999999999999999999999999999999999999999999999999999941             


Q ss_pred             CCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHH--------HHHhhhhhh---------------H-
Q psy4798         504 DSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKE--------FGVANKERE---------------A-  559 (623)
Q Consensus       504 ~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~--------~g~~~~~~~---------------~-  559 (623)
                               +++   |++||++||+|||+|||++||+.|++++.++        ||++++...               + 
T Consensus       272 ---------~~~---~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~~~P~~ig~  339 (428)
T KOG2719|consen  272 ---------EEH---LNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFIDEQPSLIGF  339 (428)
T ss_pred             ---------hhc---cccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCCCCcchhHH
Confidence                     011   7899999999999999999999999998875        455444211               0 


Q ss_pred             ---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcccccccccc
Q psy4798         560 ---------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDKARLTQLYD  614 (623)
Q Consensus       560 ---------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~~~~~~l~~  614 (623)
                               +....+....+.. ||+.|++||++|.+.. -.+++.+||.||..  +|++.+.+
T Consensus       340 livf~~~l~py~~l~~~~~n~~-sR~fEyqAD~fA~klG-Yg~~L~~AL~KL~~--dnlsf~~~  399 (428)
T KOG2719|consen  340 LIVFQFVLAPYRALLNFLMNLI-SRRFEYQADAFAKKLG-YGKDLRQALIKLFV--DNLSFPVS  399 (428)
T ss_pred             HHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHHcC-CchhHHHHHHHHhh--hhcCCCCC
Confidence                     1111122233444 9999999999999986 67789999999998  44444433


No 2  
>KOG2719|consensus
Probab=100.00  E-value=1.2e-38  Score=340.41  Aligned_cols=188  Identities=52%  Similarity=0.850  Sum_probs=176.8

Q ss_pred             cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCCc
Q psy4798          37 ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLN  116 (623)
Q Consensus        37 ~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~  116 (623)
                      +.+++++.+|+++++.|||.+++++.                  ++|+|++|+|||++|+|++||||++|||+..     
T Consensus       215 e~g~l~~~Ie~la~s~gfp~~k~~vi------------------~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~-----  271 (428)
T KOG2719|consen  215 EEGDLKEKIERLADSVGFPLSKYRVI------------------DGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE-----  271 (428)
T ss_pred             CCCchHHHHHHHHHhcCCCceEEEEE------------------ecCCCCCCCCeeeeeccccceEEEehhhhhh-----
Confidence            45689999999999999999998874                  5899999999999999999999999999951     


Q ss_pred             cccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCC
Q psy4798         117 ADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD  196 (623)
Q Consensus       117 ~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~  196 (623)
                                |+          .||+||+.||+|||+|||+++|+.|+++++++..++.+++|+.+++++.+|.++||.+
T Consensus       272 ----------~~----------~~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~  331 (428)
T KOG2719|consen  272 ----------EE----------HLNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFID  331 (428)
T ss_pred             ----------hh----------ccccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCC
Confidence                      00          1799999999999999999999999999999999999999999999999999999999


Q ss_pred             CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCHHHHHHHHHHHhhCcCCCchhhcc
Q psy4798         197 SQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKAIFLRKALLKINKDNLGFPALLVC  267 (623)
Q Consensus       197 ~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~~~L~sAL~KLa~~~ls~P~iL~~  267 (623)
                      .+|..+|+++++.++++|+..+++++++.+||++|||||+||.++|+++.|++||.|+..+|.++|....+
T Consensus       332 ~~P~~ig~livf~~~l~py~~l~~~~~n~~sR~fEyqAD~fA~klGYg~~L~~AL~KL~~dnlsf~~~D~L  402 (428)
T KOG2719|consen  332 EQPSLIGFLIVFQFVLAPYRALLNFLMNLISRRFEYQADAFAKKLGYGKDLRQALIKLFVDNLSFPVSDPL  402 (428)
T ss_pred             CCcchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchhHHHHHHHHhhhhcCCCCCcHH
Confidence            89999999999999999999999999999999999999999999999999999999999999999998644


No 3  
>PRK03001 M48 family peptidase; Provisional
Probab=100.00  E-value=7.1e-35  Score=302.38  Aligned_cols=201  Identities=15%  Similarity=0.231  Sum_probs=163.3

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCcCCc---hhHHHHHHHHH
Q psy4798         370 DQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTPLPD---GELKSRIEQLS  446 (623)
Q Consensus       370 ~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~~---~~L~~~I~~la  446 (623)
                      |.+|++++.++++++++.+.+++.   |..+|+|+|++.++++++.++++|.++.|++++ +|+++   ++|++.++++|
T Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~l~~~~~p~L~~~v~~la   77 (283)
T PRK03001          2 NWVKTAMLMAAITALFIVIGGMIG---GSQGMLIALLFALGMNFFSYWFSDKMVLKMYNA-QEVDENTAPQFYRMVRELA   77 (283)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHh---chhhHHHHHHHHHHHHHHHHHHhHHHHHHHcCC-EECCccccHHHHHHHHHHH
Confidence            456667767788888877777665   566799999999999999999999999999998 88876   58999999999


Q ss_pred             HHcCCCCCeEEEEeCCCCCCcccEEEeccCCCc-eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798         447 ASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL  525 (623)
Q Consensus       447 ~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~k-rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~  525 (623)
                      ++.|+|.+++|++|+    +++|||++|.+++| +||++|+|++.                            +++||+.
T Consensus        78 ~~~g~~~p~v~v~~~----~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~  125 (283)
T PRK03001         78 QRAGLPMPKVYLINE----DQPNAFATGRNPEHAAVAATTGILRV----------------------------LSEREIR  125 (283)
T ss_pred             HHcCCCCCeEEEecC----CCcceEEecCCCCCeEEEecHHHHhh----------------------------CCHHHHH
Confidence            999999999999996    36999999998765 69999999998                            8999999


Q ss_pred             HHHHHHhhhhhccCHHHHHHHHHHHHHhhhh--------------h--hH-------HHHHHHHHHHHHHHHHHHHHHHH
Q psy4798         526 AVLAHELGHWKYNHVLKSMILKKEFGVANKE--------------R--EA-------SVMRYVTKESELITARQDREAAE  582 (623)
Q Consensus       526 AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~--------------~--~~-------~v~~~~~~~~~~~~sR~~E~~AD  582 (623)
                      ||+|||+|||+++|+.++++.+.+.++....              .  ..       .+......+....+||++|++||
T Consensus       126 aVlAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD  205 (283)
T PRK03001        126 GVMAHELAHVKHRDILISTISATMAGAISALANFAMFFGGRDENGRPVNPIAGIAVAILAPLAASLIQMAISRAREFEAD  205 (283)
T ss_pred             HHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccchHHHHHHHHHHHHHHHHHHHHHhHHHHHHHh
Confidence            9999999999999999998765332211100              0  00       01111122333446999999999


Q ss_pred             HHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798         583 KKYKEAMKEKDNVMAKLKVLSTDK  606 (623)
Q Consensus       583 ~~a~~~~~~~~~l~~aL~kL~~~~  606 (623)
                      ++|++.+++|++++++|+||+.+.
T Consensus       206 ~~a~~l~~~p~~l~~AL~Kl~~~~  229 (283)
T PRK03001        206 RGGARISGDPQALASALDKIHRYA  229 (283)
T ss_pred             HHHHHHhCCHHHHHHHHHHHHhhh
Confidence            999999999999999999999853


No 4  
>PRK03982 heat shock protein HtpX; Provisional
Probab=100.00  E-value=2.9e-33  Score=290.98  Aligned_cols=200  Identities=19%  Similarity=0.180  Sum_probs=157.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCcCC---chhHHHHHHHHH
Q psy4798         370 DQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTPLP---DGELKSRIEQLS  446 (623)
Q Consensus       370 ~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~---~~~L~~~I~~la  446 (623)
                      |.+|+++..++++.+++++.+   ..+|..||+++|++.+++.++ .+++|..|.|++++++|++   +++|++.++++|
T Consensus         3 ~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~l~~~~~p~L~~~v~~la   78 (288)
T PRK03982          3 NQLKTGLLMALLTGLLYAIGY---LLGGSIGPIIAILLALIPNLI-SYYYSDKIVLASYNARIVSEEEAPELYRIVERLA   78 (288)
T ss_pred             hHHHHHHHHHHHHHHHHHHHH---HHhchhHHHHHHHHHHHHHHH-HHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHH
Confidence            455666666666666655544   235788999999998888776 7899999999999999994   567999999999


Q ss_pred             HHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798         447 ASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL  525 (623)
Q Consensus       447 ~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~  525 (623)
                      ++.|+|.+++|++|++    ++|||++|++++++ |+++|+|++.                            +++||+.
T Consensus        79 ~~~g~~~p~v~v~~~~----~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------l~~~El~  126 (288)
T PRK03982         79 ERANIPKPKVAIVPTQ----TPNAFATGRDPKHAVVAVTEGILNL----------------------------LNEDELE  126 (288)
T ss_pred             HHcCCCCCeEEEEeCC----CcceEEeccCCCCeEEEeehHHHhh----------------------------CCHHHHH
Confidence            9999999999999854    69999999987554 7799999998                            8999999


Q ss_pred             HHHHHHhhhhhccCHHHHHHHHHHHHHhh--------hhh--------------hHH-----HHHHHHHHHHHHHHHHHH
Q psy4798         526 AVLAHELGHWKYNHVLKSMILKKEFGVAN--------KER--------------EAS-----VMRYVTKESELITARQDR  578 (623)
Q Consensus       526 AVLAHELGH~k~~H~~k~~~~~~~~g~~~--------~~~--------------~~~-----v~~~~~~~~~~~~sR~~E  578 (623)
                      ||+|||+||+|++|+.++++.+.+.+...        ...              .+.     +......+....+||++|
T Consensus       127 AVlAHElgHi~~~h~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~SR~~E  206 (288)
T PRK03982        127 GVIAHELTHIKNRDTLIQTIAATLAGAIMYLAQWLSWGLWFGGGGRDDRNGGNPIGSLLLIILAPIAATLIQFAISRQRE  206 (288)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence            99999999999999999987654322111        000              000     111112233344699999


Q ss_pred             HHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798         579 EAAEKKYKEAMKEKDNVMAKLKVLSTD  605 (623)
Q Consensus       579 ~~AD~~a~~~~~~~~~l~~aL~kL~~~  605 (623)
                      ++||++|++.+++|++++++|.||+.+
T Consensus       207 ~~AD~~A~~~~~~p~~l~~aL~kL~~~  233 (288)
T PRK03982        207 FSADEGGARLTGNPLALANALQKLEKG  233 (288)
T ss_pred             HHHhHHHHHHhCCHHHHHHHHHHHHhh
Confidence            999999999999999999999999875


No 5  
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.97  E-value=1.2e-29  Score=267.47  Aligned_cols=170  Identities=21%  Similarity=0.233  Sum_probs=137.6

Q ss_pred             HHHHHHHHHHHHhhchHhhhhhhc--cCCcCC--chhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccC-CCc
Q psy4798         405 WVFIILMSLFLMTIYPEFIAPLFD--KYTPLP--DGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFF-KNK  479 (623)
Q Consensus       405 w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~--~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~-~~k  479 (623)
                      .++.+++.++..+++|.++.|+|+  +++|.+  +++|++.++++|++.|+|.+++|++|++    .+|||++|.+ ++|
T Consensus        48 ~~~~~~~~~~~~~~sp~li~~~~~a~~~~p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~~~----~~NAfa~G~~~~~~  123 (324)
T PRK01265         48 LIFVFFLNIIQWLFGPYMINAAYRTVEVTPTDPVYGWLYSIVAEVAKYNGIRVPKVYIADVP----FPNAFAYGSPIAGK  123 (324)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHcCCeeCCCCCcccHHHHHHHHHHHHHcCCCCCeEEEecCC----CCCeEEeccCCCCC
Confidence            345567788999999999999997  899987  7899999999999999999999999975    4999999996 579


Q ss_pred             eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh----
Q psy4798         480 RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK----  555 (623)
Q Consensus       480 rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~----  555 (623)
                      +|+++|+|++.                            +++||+.||+|||+|||||+|+.++++++.+.+++..    
T Consensus       124 ~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHik~~d~~~~~~~~~i~~~~~~~~~~  175 (324)
T PRK01265        124 RIAITLPLLKI----------------------------LNRDEIKAVAGHELGHLKHRDVELLMAIGLIPTLIYYLGYS  175 (324)
T ss_pred             EEEEehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHH
Confidence            99999999998                            8999999999999999999999998876543111110    


Q ss_pred             ----hh-----------------hHH---HHHH-HHHHHHHHHHHHHHHHHHHHHHH-HcCCHHHHHHHHHHhhhccc
Q psy4798         556 ----ER-----------------EAS---VMRY-VTKESELITARQDREAAEKKYKE-AMKEKDNVMAKLKVLSTDKA  607 (623)
Q Consensus       556 ----~~-----------------~~~---v~~~-~~~~~~~~~sR~~E~~AD~~a~~-~~~~~~~l~~aL~kL~~~~~  607 (623)
                          ..                 ...   +... ...+..+ .||++||+||++|++ .+++|+++++||+||+...+
T Consensus       176 ~~~~~~~~~~~~~~~~~~~~l~l~~ll~~~~~~i~~~l~~a-iSR~rEy~AD~~aa~~~tg~p~~LasAL~KL~~~~~  252 (324)
T PRK01265        176 LFWGGMFGGGGGGRGNNGGLLFLIGIALMAVSFVFNLLVLS-INRMREAYADVNSALTVPGGAENLQTALAKITLSMD  252 (324)
T ss_pred             HHHHHHhcccccCCCccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhHHHHHhhcCChHHHHHHHHHHHhccc
Confidence                00                 000   1111 2223344 499999999999998 55999999999999998544


No 6  
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.97  E-value=1.2e-30  Score=274.99  Aligned_cols=194  Identities=23%  Similarity=0.235  Sum_probs=153.4

Q ss_pred             HHHHHHHHHHHHHHHHHHhhhhccc----c--CCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          12 GIIGFSWIVFLFEFYLSIRQRRVYH----E--TTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        12 ~~~~~~~~~~~~~~~l~~~~~~~~~----~--~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      ..+++++++|+.++++..+..++++    +  .|++++.++++|++.|+|+|++|++|                      
T Consensus        50 ~~~~~~~~~~~~sp~li~~~~~a~~~~p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~----------------------  107 (324)
T PRK01265         50 FVFFLNIIQWLFGPYMINAAYRTVEVTPTDPVYGWLYSIVAEVAKYNGIRVPKVYIAD----------------------  107 (324)
T ss_pred             HHHHHHHHHHHHhHHHHHHHcCCeeCCCCCcccHHHHHHHHHHHHHcCCCCCeEEEec----------------------
Confidence            3467789999999999999887553    1  34799999999999999999999987                      


Q ss_pred             CCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      ++.+|||++|.+. +++|+++|+|++.                            ||+||++||+|||+|||||+|+.++
T Consensus       108 ~~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHik~~d~~~~  159 (324)
T PRK01265        108 VPFPNAFAYGSPIAGKRIAITLPLLKI----------------------------LNRDEIKAVAGHELGHLKHRDVELL  159 (324)
T ss_pred             CCCCCeEEeccCCCCCEEEEehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHH
Confidence            4569999999974 7999999999997                            8999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc--C
Q psy4798         165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL--G  242 (623)
Q Consensus       165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l--~  242 (623)
                      ++++.+..++.++..  ...+..+++.+|..+...  .+.+.++..++.++..++.++++++||+|||+||++|++.  +
T Consensus       160 ~~~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~l~l~~ll~~~~~~i~~~l~~aiSR~rEy~AD~~aa~~~tg  235 (324)
T PRK01265        160 MAIGLIPTLIYYLGY--SLFWGGMFGGGGGGRGNN--GGLLFLIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPG  235 (324)
T ss_pred             HHHHHHHHHHHHHHH--HHHHHHHhcccccCCCcc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhcC
Confidence            988888776555432  222333444444321111  1122334555566677788899999999999999999995  8


Q ss_pred             CHHHHHHHHHHHhhCcC
Q psy4798         243 KAIFLRKALLKINKDNL  259 (623)
Q Consensus       243 ~~~~L~sAL~KLa~~~l  259 (623)
                      +++.+++||.|++....
T Consensus       236 ~p~~LasAL~KL~~~~~  252 (324)
T PRK01265        236 GAENLQTALAKITLSMD  252 (324)
T ss_pred             ChHHHHHHHHHHHhccc
Confidence            89999999999998543


No 7  
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.96  E-value=2.6e-28  Score=257.28  Aligned_cols=172  Identities=20%  Similarity=0.239  Sum_probs=137.9

Q ss_pred             HHHHHHHHHHHHHHHhhchHhhhhhhcc--CCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCC-C
Q psy4798         402 LYLWVFIILMSLFLMTIYPEFIAPLFDK--YTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFK-N  478 (623)
Q Consensus       402 l~~w~~~~~~~l~~~~i~P~~I~plf~k--~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~-~  478 (623)
                      +...++.+++.++..+..|..+.+.++.  .+|.++++|++.++++|++.|+|.++||++|+    .++|||++|.++ +
T Consensus        31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~La~~agi~~p~v~vid~----~~~NAFa~G~~~~~  106 (317)
T PRK01345         31 MIALVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPELYRMVRDLARRAGLPMPKVYIIDN----PQPNAFATGRNPEN  106 (317)
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHcCCeECCcccCHHHHHHHHHHHHHcCCCCCcEEEEcC----CCcceEEecCCCCC
Confidence            3344666777888889999999999985  78888899999999999999999999999995    469999999975 5


Q ss_pred             ceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh--
Q psy4798         479 KRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE--  556 (623)
Q Consensus       479 krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~--  556 (623)
                      ++|+++|+|++.                            +++||+.||+|||+||++++|+.++++.+.+.++....  
T Consensus       107 ~~V~vt~gLL~~----------------------------L~~dEL~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~l~~  158 (317)
T PRK01345        107 AAVAATTGLLQR----------------------------LSPEEVAGVMAHELAHVKNRDTLTMTITATLAGAISMLAN  158 (317)
T ss_pred             eEEEechHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHH
Confidence            579999999998                            89999999999999999999999988765432211100  


Q ss_pred             -----------------hhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798         557 -----------------REAS-----VMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD  605 (623)
Q Consensus       557 -----------------~~~~-----v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~  605 (623)
                                       ..+.     +......+....+||++|++||++|++++++|++++++|+||+.+
T Consensus       159 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~~~SR~rE~~AD~~A~~ltg~p~~L~~AL~KL~~~  229 (317)
T PRK01345        159 FAFFFGGNRENNNGPLGLVGTLAAMIVAPLAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERG  229 (317)
T ss_pred             HHHHhcCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhh
Confidence                             0000     011111233444699999999999999999999999999999974


No 8  
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.96  E-value=2.8e-29  Score=262.63  Aligned_cols=196  Identities=19%  Similarity=0.190  Sum_probs=147.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCC
Q psy4798          11 YGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRS   86 (623)
Q Consensus        11 ~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~   86 (623)
                      ...+++++++|+.++++.+++.++.+    +.|++++.++++|++.|+|+|++|++|                      +
T Consensus        48 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~la~~~gip~p~v~v~~----------------------~  105 (298)
T PRK04897         48 IIGVIYALIMIFQSTNVVMSMNHAREVTEEEAPELWHIVEDMAMVAQIPMPRVFIID----------------------D  105 (298)
T ss_pred             HHHHHHHHHHHHhhHHHHHHhCCCEECChhhhHHHHHHHHHHHHHcCCCCCcEEEec----------------------C
Confidence            44567788999999999999988765    347899999999999999999999987                      5


Q ss_pred             CCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798          87 EHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM  165 (623)
Q Consensus        87 ~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~  165 (623)
                      ..+|||++|+++++ .|+++++|++.                            ||+||++||+|||+|||+|+|+.+++
T Consensus       106 ~~~NAfa~G~~~~~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~  157 (298)
T PRK04897        106 PSPNAFATGSSPKNAAVAVTTGLLAI----------------------------MNREELEGVIGHEISHIRNYDIRLST  157 (298)
T ss_pred             CCCceEEeccCCCCcEEEeehHHHhh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHH
Confidence            67999999998865 57888889997                            89999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCcccCCCC---CC--C--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhh
Q psy4798         166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGF---YD--S--QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFG  238 (623)
Q Consensus       166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~---~~--~--~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA  238 (623)
                      +...+..+++++ ...+.. ..++..++.   .+  .  .+++..++.++..++.|+.  ..++++++||++||+||++|
T Consensus       158 ~~~~~~~~~~~l-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~--~~ll~~~~SR~rE~~AD~~A  233 (298)
T PRK04897        158 IAVALASAITLL-SDIAGR-MMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLILAPLA--ATLIQLAISRQREYLADASS  233 (298)
T ss_pred             HHHHHHHHHHHH-HHHHHH-HHHhcccccccccccccchhhHHHHHHHHHHHHHHHHH--HHHHHHhhhHHHHHHHHHHH
Confidence            887665554432 111111 111111111   01  1  1122222333444555543  46778999999999999999


Q ss_pred             hhcC-CHHHHHHHHHHHhhCcCC
Q psy4798         239 KSLG-KAIFLRKALLKINKDNLG  260 (623)
Q Consensus       239 ~~l~-~~~~L~sAL~KLa~~~ls  260 (623)
                      ++++ +|+.+++||.|+++++..
T Consensus       234 ~~lt~~p~~La~AL~KL~~~~~~  256 (298)
T PRK04897        234 VELTRNPQGLISALEKISNSQPM  256 (298)
T ss_pred             HHHhCCHHHHHHHHHHHHhcccc
Confidence            9995 699999999999988764


No 9  
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.96  E-value=4.3e-28  Score=253.43  Aligned_cols=169  Identities=20%  Similarity=0.109  Sum_probs=129.7

Q ss_pred             HHHHHHHHHHHHhhchHhhhhhhc--cCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-E
Q psy4798         405 WVFIILMSLFLMTIYPEFIAPLFD--KYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-I  481 (623)
Q Consensus       405 w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-I  481 (623)
                      .++.+++.++..++.|..+....+  +.+|.++++|++.++++|++.|+|.+++|++|.    +++|||++|++++++ |
T Consensus        43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~L~~~v~~la~~~~~~~p~v~v~~~----~~~NAfa~G~~~~~~~V  118 (296)
T PRK02391         43 VVIAGGFLLAQYFFSDKLALWSMGARIVSEDEYPELHAMVERLCALADLPKPRVAVADS----DVPNAFATGRSPKNAVV  118 (296)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHcCCEECChhhCHHHHHHHHHHHHHcCCCCCcEEEEeC----CCCceEEecCCCCCcEE
Confidence            344555666666677777665543  578888999999999999999999999999993    579999999987665 6


Q ss_pred             EEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh-----
Q psy4798         482 VLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE-----  556 (623)
Q Consensus       482 VL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~-----  556 (623)
                      +++|+|++.                            +++||+.||+|||+||||++|+.++++.+.+..++...     
T Consensus       119 ~vt~gLl~~----------------------------L~~~El~aVlaHElgHi~~~di~~~~i~~~~~~~~~~l~~~~~  170 (296)
T PRK02391        119 CVTTGLMRR----------------------------LDPDELEAVLAHELSHVKNRDVAVMTIASFLSTIAFLIVRWGF  170 (296)
T ss_pred             EecHHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            788999998                            89999999999999999999999988665431111100     


Q ss_pred             ----------hh--H-----H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798         557 ----------RE--A-----S----VMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD  605 (623)
Q Consensus       557 ----------~~--~-----~----v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~  605 (623)
                                ..  .     .    +...+........||++|++||++|++++++|+++++||.||+.+
T Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~p~~LasAL~KL~~~  240 (296)
T PRK02391        171 YFGGFGGRGGGGGGGGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGRPSALASALMKISGR  240 (296)
T ss_pred             HhccccCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHcc
Confidence                      00  0     0    011111222233499999999999999999999999999999975


No 10 
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.96  E-value=4.5e-29  Score=259.66  Aligned_cols=198  Identities=19%  Similarity=0.229  Sum_probs=148.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      +.+.+++.+++|++++++..+.++.++    +.|++++.++++|++.|+|+|++|++|                      
T Consensus        35 ~~~~~~~~~~~~~~~~~i~~~~~~~~~l~~~~~p~L~~~v~~la~~~g~~~p~v~v~~----------------------   92 (288)
T PRK03982         35 ILLALIPNLISYYYSDKIVLASYNARIVSEEEAPELYRIVERLAERANIPKPKVAIVP----------------------   92 (288)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCeEEEEe----------------------
Confidence            345677788899999999988887764    346799999999999999999999976                      


Q ss_pred             CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      +..+|||++|+++++ .|+++|+|++.                            ||+||++||+|||+||++|+|+.++
T Consensus        93 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------l~~~El~AVlAHElgHi~~~h~~~~  144 (288)
T PRK03982         93 TQTPNAFATGRDPKHAVVAVTEGILNL----------------------------LNEDELEGVIAHELTHIKNRDTLIQ  144 (288)
T ss_pred             CCCcceEEeccCCCCeEEEeehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence            568999999998854 57799999997                            8999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcC-
Q psy4798         165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQ-PILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLG-  242 (623)
Q Consensus       165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~-p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~-  242 (623)
                      ++.+.+...+.++.  .+..+..++..+|+.... +..++.+ ++. ++.|+  ...++++++||++||+||++|++++ 
T Consensus       145 ~~~~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~--~~~l~~~~~SR~~E~~AD~~A~~~~~  218 (288)
T PRK03982        145 TIAATLAGAIMYLA--QWLSWGLWFGGGGRDDRNGGNPIGSL-LLI-ILAPI--AATLIQFAISRQREFSADEGGARLTG  218 (288)
T ss_pred             HHHHHHHHHHHHHH--HHHHHHHHhcccCccccccchHHHHH-HHH-HHHHH--HHHHHHHHHhHHHHHHHhHHHHHHhC
Confidence            98876543332221  111122344444443222 2223322 122 22332  2466778999999999999999995 


Q ss_pred             CHHHHHHHHHHHhhCcCCCch
Q psy4798         243 KAIFLRKALLKINKDNLGFPA  263 (623)
Q Consensus       243 ~~~~L~sAL~KLa~~~ls~P~  263 (623)
                      +++.+++||.|+++.+.+.|.
T Consensus       219 ~p~~l~~aL~kL~~~~~~~~~  239 (288)
T PRK03982        219 NPLALANALQKLEKGVRYIPL  239 (288)
T ss_pred             CHHHHHHHHHHHHhhhccCCC
Confidence            899999999999988665543


No 11 
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.96  E-value=1e-28  Score=258.14  Aligned_cols=197  Identities=24%  Similarity=0.242  Sum_probs=145.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      +...+++++++|+.++.+.++.++..+    +.|++++.++++|++.|+|+|++|++|                      
T Consensus        43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~L~~~v~~la~~~~~~~p~v~v~~----------------------  100 (296)
T PRK02391         43 VVIAGGFLLAQYFFSDKLALWSMGARIVSEDEYPELHAMVERLCALADLPKPRVAVAD----------------------  100 (296)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHcCCEECChhhCHHHHHHHHHHHHHcCCCCCcEEEEe----------------------
Confidence            445677889999999999999888765    347899999999999999999999987                      


Q ss_pred             CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      ++++|||++|+++++ .|+++|+|++.                            ||+||++||+|||+|||+|+|+.++
T Consensus       101 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------L~~~El~aVlaHElgHi~~~di~~~  152 (296)
T PRK02391        101 SDVPNAFATGRSPKNAVVCVTTGLMRR----------------------------LDPDELEAVLAHELSHVKNRDVAVM  152 (296)
T ss_pred             CCCCceEEecCCCCCcEEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence            678999999998755 57778999997                            8999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-CC
Q psy4798         165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-GK  243 (623)
Q Consensus       165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-~~  243 (623)
                      ++++.+..++.+.. ....    ++..++..+..+.. +.+++...+..+...+..++++++||++||+||++|+++ ++
T Consensus       153 ~i~~~~~~~~~~l~-~~~~----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~  226 (296)
T PRK02391        153 TIASFLSTIAFLIV-RWGF----YFGGFGGRGGGGGG-GGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGR  226 (296)
T ss_pred             HHHHHHHHHHHHHH-HHHH----HhccccCCCCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCC
Confidence            98877765544321 1110    11122211111100 111111111112233456778999999999999999998 57


Q ss_pred             HHHHHHHHHHHhhCcCCCc
Q psy4798         244 AIFLRKALLKINKDNLGFP  262 (623)
Q Consensus       244 ~~~L~sAL~KLa~~~ls~P  262 (623)
                      |+.+++||.|++......|
T Consensus       227 p~~LasAL~KL~~~~~~~~  245 (296)
T PRK02391        227 PSALASALMKISGRMDRVP  245 (296)
T ss_pred             HHHHHHHHHHHHcccccCC
Confidence            9999999999998754433


No 12 
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.96  E-value=5.3e-28  Score=251.89  Aligned_cols=173  Identities=16%  Similarity=0.167  Sum_probs=131.9

Q ss_pred             HHHHHHHHHHHHHHHHhhchHhhhhhhc--cCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCC
Q psy4798         401 FLYLWVFIILMSLFLMTIYPEFIAPLFD--KYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKN  478 (623)
Q Consensus       401 ~l~~w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~  478 (623)
                      .+++.++.+++.++..+..|..+....+  ..+|.++++|++.++++|++.|+|.+++|++|.+    .+|||++|..++
T Consensus        33 ~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~~~~~p~L~~~v~~la~~~g~p~p~vyv~~~~----~~NAFa~G~~~~  108 (288)
T PRK03072         33 LGIAVLIAVGMNAYVYWNSDKLALRAMHAQPVSEVQAPAMYRIVRELSTAARQPMPRLYISPTA----APNAFATGRNPR  108 (288)
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCCEEEecCC----CCceEEecCCCC
Confidence            3444455666666777777776666554  3577888999999999999999999999999954    599999997544


Q ss_pred             c-eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh--
Q psy4798         479 K-RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK--  555 (623)
Q Consensus       479 k-rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~--  555 (623)
                      + .|+++|+|++.                            +++||++||+|||+||++++|+.++.+.+.+.++...  
T Consensus       109 ~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~~~i~~l~  160 (288)
T PRK03072        109 NAAVCCTEGILQI----------------------------LNERELRGVLGHELSHVYNRDILISSVAGALASVITYLA  160 (288)
T ss_pred             CcEEEecHHHHHh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHH
Confidence            3 48899999988                            8999999999999999999999998876543221110  


Q ss_pred             ----h-------h----hH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798         556 ----E-------R----EA--------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD  605 (623)
Q Consensus       556 ----~-------~----~~--------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~  605 (623)
                          .       .    ..        .+......+....+||++|++||++|++++++|++++++|+||+.+
T Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~SR~rE~~AD~~A~~l~~~p~~La~AL~KL~~~  233 (288)
T PRK03072        161 NMAMFAGMFGGRRDNDGPNPLALLLVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGG  233 (288)
T ss_pred             HHHHHHHHhccccccccchHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHhCCHHHHHHHHHHHHhc
Confidence                0       0    00        0111112233344699999999999999999999999999999874


No 13 
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.96  E-value=5.4e-28  Score=252.90  Aligned_cols=167  Identities=16%  Similarity=0.234  Sum_probs=125.5

Q ss_pred             HHHHHHHHHHhhchHhhhhhhccCCcC---CchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EE
Q psy4798         407 FIILMSLFLMTIYPEFIAPLFDKYTPL---PDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IV  482 (623)
Q Consensus       407 ~~~~~~l~~~~i~P~~I~plf~k~~pl---~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IV  482 (623)
                      +.+++.+ +.+.++..+...+++.+|+   ++++|++.++++|++.|+|.++||+++++    ++|||++|++++|+ |+
T Consensus        49 ~~~~~~~-~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~la~~~gip~p~v~v~~~~----~~NAfa~G~~~~~~~v~  123 (298)
T PRK04897         49 IGVIYAL-IMIFQSTNVVMSMNHAREVTEEEAPELWHIVEDMAMVAQIPMPRVFIIDDP----SPNAFATGSSPKNAAVA  123 (298)
T ss_pred             HHHHHHH-HHHHhhHHHHHHhCCCEECChhhhHHHHHHHHHHHHHcCCCCCcEEEecCC----CCceEEeccCCCCcEEE
Confidence            3333333 4444555555666678887   46789999999999999999999999954    59999999987665 88


Q ss_pred             EehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh-------
Q psy4798         483 LFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK-------  555 (623)
Q Consensus       483 L~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~-------  555 (623)
                      ++++|++.                            +++||+.||+|||+|||||+|+.++++...+.++...       
T Consensus       124 vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~~~~~~l~~~~~~  175 (298)
T PRK04897        124 VTTGLLAI----------------------------MNREELEGVIGHEISHIRNYDIRLSTIAVALASAITLLSDIAGR  175 (298)
T ss_pred             eehHHHhh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence            88899998                            8999999999999999999999998876443211100       


Q ss_pred             --hh------------hH------H----HHHHH----HHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798         556 --ER------------EA------S----VMRYV----TKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK  606 (623)
Q Consensus       556 --~~------------~~------~----v~~~~----~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~  606 (623)
                        ..            .+      .    +...+    ..+....+||++|++||++|++++++|+++++||+||+.+.
T Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~ll~~~~SR~rE~~AD~~A~~lt~~p~~La~AL~KL~~~~  254 (298)
T PRK04897        176 MMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQ  254 (298)
T ss_pred             HHHhcccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhcc
Confidence              00            00      0    01111    11223345999999999999999999999999999999753


No 14 
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.96  E-value=1.1e-28  Score=261.49  Aligned_cols=192  Identities=22%  Similarity=0.285  Sum_probs=147.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhccccC---------CcchHHHHhhhhhcCCC-CccccccccchhhhHHHHHHhHhhc
Q psy4798          11 YGIIGFSWIVFLFEFYLSIRQRRVYHET---------TIVPHQIAHGMDAESFE-KSRRYSLDKNVFSMFKETVSNVMNT   80 (623)
Q Consensus        11 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~---------~~l~~~l~~l~~~~g~~-~~~ly~~d~~~~~~~~~~v~~v~~~   80 (623)
                      ...+++++++|++++.+.++++++++.+         |++++.++++|++.|+| +|++|++|                 
T Consensus        78 ~~~~~~~~~~y~~~~~~~l~~~~a~~I~~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~-----------------  140 (336)
T PRK02870         78 LVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGLRFMPKVYIID-----------------  140 (336)
T ss_pred             HHHHHHHHHHHHhhHHHHHHHcCCEEcCCCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEc-----------------
Confidence            4567789999999999999999986532         36999999999999999 89999987                 


Q ss_pred             ccCCCCCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798          81 VSSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN  159 (623)
Q Consensus        81 ~~s~~~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~  159 (623)
                           ++++|||++|+++ +++|+++|+|++.                            ||+||++||+|||+||++|+
T Consensus       141 -----~~~~NAFA~G~~~~~~~Ivvt~GLL~~----------------------------L~~dEL~aVlAHELgHik~~  187 (336)
T PRK02870        141 -----APYMNAFASGYSEKSAMVAITTGLLEK----------------------------LDRDELQAVMAHELSHIRHG  187 (336)
T ss_pred             -----CCCCceEEecCCCCCcEEEEehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcc
Confidence                 5689999999986 7899999999997                            89999999999999999999


Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCC---CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccch
Q psy4798         160 HVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD---SQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADA  236 (623)
Q Consensus       160 h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~---~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~  236 (623)
                      |+.+++++..+..++.+.. ...     +|..+|..+   ..+..++ ++++.+++.+   +..++++++||++||+||+
T Consensus       188 di~~~~~~~~l~~~~~~~~-~~~-----~~~~~g~~~~~~~~~~~~~-~l~l~~~~~~---~~~ll~~~iSR~rEy~AD~  257 (336)
T PRK02870        188 DIRLTLCVGVLSNIMLIVA-DFL-----FYSFMGNRRNSGANRARMI-ILILRYVLPI---LTVLLMLFLSRTREYMADA  257 (336)
T ss_pred             cHHHHHHHHHHHHHHHHHH-HHH-----HHHHhcCCcccccchhHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHHHhH
Confidence            9999887766654333221 111     111122211   1111112 2223333322   3467889999999999999


Q ss_pred             hhhhc-CCHHHHHHHHHHHhhCcCCCc
Q psy4798         237 FGKSL-GKAIFLRKALLKINKDNLGFP  262 (623)
Q Consensus       237 fA~~l-~~~~~L~sAL~KLa~~~ls~P  262 (623)
                      +|+++ ++|+++++||.|++.++...|
T Consensus       258 ~Aa~ltg~p~aLasAL~KL~~~~~~~~  284 (336)
T PRK02870        258 GAVELMRDNEPMARALQKISNDHAQND  284 (336)
T ss_pred             HHHHHhCCHHHHHHHHHHHHhccccCc
Confidence            99999 579999999999999887765


No 15 
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.96  E-value=1.8e-28  Score=255.47  Aligned_cols=193  Identities=21%  Similarity=0.257  Sum_probs=147.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCC
Q psy4798          11 YGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRS   86 (623)
Q Consensus        11 ~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~   86 (623)
                      ...+++++++|+.++++.+++++..+    +.|++++.++++|++.|+|+|++|++|                      +
T Consensus        38 ~~~~~~~~~~~~~s~~~~~~~~~~~~v~~~~~p~L~~~v~~la~~~g~p~p~vyv~~----------------------~   95 (288)
T PRK03072         38 LIAVGMNAYVYWNSDKLALRAMHAQPVSEVQAPAMYRIVRELSTAARQPMPRLYISP----------------------T   95 (288)
T ss_pred             HHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCCEEEec----------------------C
Confidence            44567788999999999999988764    347899999999999999999999987                      5


Q ss_pred             CCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798          87 EHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM  165 (623)
Q Consensus        87 ~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~  165 (623)
                      ..+|||++|..+.+ .|+++++|++.                            +|+||++||+|||+|||+|+|+++++
T Consensus        96 ~~~NAFa~G~~~~~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~  147 (288)
T PRK03072         96 AAPNAFATGRNPRNAAVCCTEGILQI----------------------------LNERELRGVLGHELSHVYNRDILISS  147 (288)
T ss_pred             CCCceEEecCCCCCcEEEecHHHHHh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHH
Confidence            67999999986644 58889999997                            89999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCcccCCCCCC-C-chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-C
Q psy4798         166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD-S-QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-G  242 (623)
Q Consensus       166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~-~-~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-~  242 (623)
                      +++.+..++.+..  .+.   .++..+|... . .+..++++ ++. ++.|+  ...++++++||++||+||++|+++ +
T Consensus       148 ~~~~~~~~i~~l~--~~~---~~~~~~~~~~~~~~~~~~~~~-~~~-l~~~~--~~~~~~~~~SR~rE~~AD~~A~~l~~  218 (288)
T PRK03072        148 VAGALASVITYLA--NMA---MFAGMFGGRRDNDGPNPLALL-LVS-LLGPI--AATVIQLAISRSREYQADESGAELTG  218 (288)
T ss_pred             HHHHHHHHHHHHH--HHH---HHHHHhccccccccchHHHHH-HHH-HHHHH--HHHHHHHHHHhHHHHHHhHHHHHHhC
Confidence            8877665544321  110   0112233221 1 12222222 222 23332  356778999999999999999999 5


Q ss_pred             CHHHHHHHHHHHhhCcCCCc
Q psy4798         243 KAIFLRKALLKINKDNLGFP  262 (623)
Q Consensus       243 ~~~~L~sAL~KLa~~~ls~P  262 (623)
                      +++.+++||.|+++.+.+.|
T Consensus       219 ~p~~La~AL~KL~~~~~~~~  238 (288)
T PRK03072        219 DPLALASALRKISGGVQAAP  238 (288)
T ss_pred             CHHHHHHHHHHHHhccccCC
Confidence            79999999999999876554


No 16 
>PRK03001 M48 family peptidase; Provisional
Probab=99.96  E-value=5.6e-28  Score=250.90  Aligned_cols=191  Identities=19%  Similarity=0.284  Sum_probs=144.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      +...+++++++++.++.+..+..+..+    +.|++++.++++|++.|+|+|++|++|                      
T Consensus        34 ~~~~~~~~~~~~~~~~~~i~~~~~~~~l~~~~~p~L~~~v~~la~~~g~~~p~v~v~~----------------------   91 (283)
T PRK03001         34 LLFALGMNFFSYWFSDKMVLKMYNAQEVDENTAPQFYRMVRELAQRAGLPMPKVYLIN----------------------   91 (283)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHcCCEECCccccHHHHHHHHHHHHHcCCCCCeEEEec----------------------
Confidence            455677788899999988888776644    236899999999999999999999987                      


Q ss_pred             CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      ++++|||++|++++| +|+++|+|++.                            +|+||++||+|||+|||+|+|+.++
T Consensus        92 ~~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~h~~~~  143 (283)
T PRK03001         92 EDQPNAFATGRNPEHAAVAATTGILRV----------------------------LSEREIRGVMAHELAHVKHRDILIS  143 (283)
T ss_pred             CCCcceEEecCCCCCeEEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhCCChHHH
Confidence            568999999998865 79999999997                            8999999999999999999999999


Q ss_pred             HHHHHHHHHHHH-HHHHHHhcCCCcccCCCCCCC--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc
Q psy4798         165 MIVMQLNLLFML-YSFQYLFQYPPLYSAFGFYDS--QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL  241 (623)
Q Consensus       165 ~~~~~~~~~~~~-~lf~~~~~~~~l~~~~G~~~~--~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l  241 (623)
                      ++...+..++.. ..++.+      +++++..+.  .+..   .+++.++ .|+  ...++++++||++||+||++|.++
T Consensus       144 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~-~~~--~~~~~~~~~SR~~E~~AD~~a~~l  211 (283)
T PRK03001        144 TISATMAGAISALANFAMF------FGGRDENGRPVNPIA---GIAVAIL-APL--AASLIQMAISRAREFEADRGGARI  211 (283)
T ss_pred             HHHHHHHHHHHHHHHHHHH------hcCCCccccccchHH---HHHHHHH-HHH--HHHHHHHHHhHHHHHHHhHHHHHH
Confidence            876543333222 112222      222332111  1211   1122222 222  245677899999999999999998


Q ss_pred             C-CHHHHHHHHHHHhhCcCCCc
Q psy4798         242 G-KAIFLRKALLKINKDNLGFP  262 (623)
Q Consensus       242 ~-~~~~L~sAL~KLa~~~ls~P  262 (623)
                      + +++.+++||.|+++++.+.|
T Consensus       212 ~~~p~~l~~AL~Kl~~~~~~~p  233 (283)
T PRK03001        212 SGDPQALASALDKIHRYASGIP  233 (283)
T ss_pred             hCCHHHHHHHHHHHHhhhccCC
Confidence            5 79999999999999887755


No 17 
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.96  E-value=8.4e-28  Score=249.97  Aligned_cols=191  Identities=22%  Similarity=0.272  Sum_probs=139.1

Q ss_pred             HHHHHHHHHHHHHHHHHhhhhccccC-------CcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          13 IIGFSWIVFLFEFYLSIRQRRVYHET-------TIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        13 ~~~~~~~~~~~~~~l~~~~~~~~~~~-------~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      .++.++++|+.++++..+++++++.+       |.+++.++++|++.|+|+|++|++|                      
T Consensus        44 ~~~~~~~~~~~~~~i~~~~~~a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~----------------------  101 (284)
T PRK05457         44 GFGGSFISLLMSKWMAKRSTGAEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYH----------------------  101 (284)
T ss_pred             HHHHHHHHHHHHHHHHHHhcCCeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEe----------------------
Confidence            44567899999999999998876532       1388999999999999999999987                      


Q ss_pred             CCCCceeeecCCCCeE-EEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~~kr-Ivl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      ++++|||++|.+++++ |+++++|++.                            ||+||++||+|||+||++|+|+.++
T Consensus       102 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------L~~~El~aVlAHElgHi~~~d~~~~  153 (284)
T PRK05457        102 SPEINAFATGASKNNSLVAVSTGLLQN----------------------------MSRDEVEAVLAHEISHIANGDMVTM  153 (284)
T ss_pred             CCCceEEEecCCCCCeEEEeehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence            5789999999988776 5566668886                            8999999999999999999999998


Q ss_pred             HHHHHH-HHHHHHH--HHHHHhcCCCcccCCCCCCCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhh
Q psy4798         165 MIVMQL-NLLFMLY--SFQYLFQYPPLYSAFGFYDSQPI-LLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKS  240 (623)
Q Consensus       165 ~~~~~~-~~~~~~~--lf~~~~~~~~l~~~~G~~~~~p~-~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~  240 (623)
                      +++..+ ..+++++  +++.+...  ...  +..+.... .....+++..+   +.++.+++.+++||++||+||++|++
T Consensus       154 ~l~~~~~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~l~~~~---~~~~~~ll~~~~SR~rEy~AD~~Aa~  226 (284)
T PRK05457        154 TLIQGVVNTFVIFLSRIIAQIVDR--FVS--GNEEGNGIGYFIVSIVLEIV---FGILASIIVMWFSRHREFRADAGGAK  226 (284)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHh--hcc--cCcccccHHHHHHHHHHHHH---HHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence            766443 3333332  12222111  000  11111111 11111122222   23456788899999999999999999


Q ss_pred             cCCHHHHHHHHHHHhhCcCC
Q psy4798         241 LGKAIFLRKALLKINKDNLG  260 (623)
Q Consensus       241 l~~~~~L~sAL~KLa~~~ls  260 (623)
                      +++|+.+++||.|+++.+..
T Consensus       227 ltgp~~L~~AL~KL~~~~~~  246 (284)
T PRK05457        227 LAGREKMIAALQRLKTSYEP  246 (284)
T ss_pred             hhCHHHHHHHHHHHHhhCcc
Confidence            98899999999999998773


No 18 
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.95  E-value=8.4e-27  Score=247.11  Aligned_cols=169  Identities=18%  Similarity=0.291  Sum_probs=128.7

Q ss_pred             HHHHHHHHHHHhhchHhhhhhh-------ccCCcCCchhHHHHHHHHHHHcCCC-CCeEEEEeCCCCCCcccEEEeccCC
Q psy4798         406 VFIILMSLFLMTIYPEFIAPLF-------DKYTPLPDGELKSRIEQLSASVKFP-LKKLYVVEGSKRSEHSNAYFYGFFK  477 (623)
Q Consensus       406 ~~~~~~~l~~~~i~P~~I~plf-------~k~~pl~~~~L~~~I~~la~~~gfp-~~~v~vv~~s~rs~~~NAy~~G~~~  477 (623)
                      ++.+++.++..+..+..+....       +++.|+++++|++.++++|++.|+| .++||++|++    ++|||++|+++
T Consensus        78 ~~~~~~~~~~y~~~~~~~l~~~~a~~I~~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~~~----~~NAFA~G~~~  153 (336)
T PRK02870         78 LVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGLRFMPKVYIIDAP----YMNAFASGYSE  153 (336)
T ss_pred             HHHHHHHHHHHHhhHHHHHHHcCCEEcCCCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEcCC----CCceEEecCCC
Confidence            4445555555556665554443       3567777899999999999999999 7999999964    69999999985


Q ss_pred             -CceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHh---
Q psy4798         478 -NKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVA---  553 (623)
Q Consensus       478 -~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~---  553 (623)
                       +++|+++|+|++.                            +++||+.||+|||+||++|+|+.++++.+.+.++.   
T Consensus       154 ~~~~Ivvt~GLL~~----------------------------L~~dEL~aVlAHELgHik~~di~~~~~~~~l~~~~~~~  205 (336)
T PRK02870        154 KSAMVAITTGLLEK----------------------------LDRDELQAVMAHELSHIRHGDIRLTLCVGVLSNIMLIV  205 (336)
T ss_pred             CCcEEEEehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHH
Confidence             6899999999997                            89999999999999999999998887654331110   


Q ss_pred             ---------hhh--h------hH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798         554 ---------NKE--R------EA-----SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK  606 (623)
Q Consensus       554 ---------~~~--~------~~-----~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~  606 (623)
                               ...  .      ..     .+...+..+....+||++||+||++|++++++|+++++||+||+.+.
T Consensus       206 ~~~~~~~~~g~~~~~~~~~~~~~~l~l~~~~~~~~~ll~~~iSR~rEy~AD~~Aa~ltg~p~aLasAL~KL~~~~  280 (336)
T PRK02870        206 ADFLFYSFMGNRRNSGANRARMIILILRYVLPILTVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDH  280 (336)
T ss_pred             HHHHHHHHhcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhcc
Confidence                     000  0      00     11122223333445999999999999999999999999999999754


No 19 
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.95  E-value=3.9e-26  Score=237.51  Aligned_cols=167  Identities=17%  Similarity=0.179  Sum_probs=120.1

Q ss_pred             HHHHHHHHHHhhchHhhhhhhccCC----cCCchh--HHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce
Q psy4798         407 FIILMSLFLMTIYPEFIAPLFDKYT----PLPDGE--LKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR  480 (623)
Q Consensus       407 ~~~~~~l~~~~i~P~~I~plf~k~~----pl~~~~--L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr  480 (623)
                      +..+..++..++.|..+....+ .+    |.++++  |++.++++|++.|+|.+++|++|.    .++|||++|.+++++
T Consensus        43 ~~~~~~~~~~~~~~~i~~~~~~-a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~~----~~~NAfa~G~~~~~~  117 (284)
T PRK05457         43 FGFGGSFISLLMSKWMAKRSTG-AEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYHS----PEINAFATGASKNNS  117 (284)
T ss_pred             HHHHHHHHHHHHHHHHHHHhcC-CeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEeC----CCceEEEecCCCCCe
Confidence            3334455566666666654443 33    433433  999999999999999999999994    469999999988777


Q ss_pred             -EEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHH-----HH--
Q psy4798         481 -IVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEF-----GV--  552 (623)
Q Consensus       481 -IVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~-----g~--  552 (623)
                       |+++++|++.                            +++||+.||+|||+||++++|+.++.++..+.     .+  
T Consensus       118 ~V~vt~gLl~~----------------------------L~~~El~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~~~~~  169 (284)
T PRK05457        118 LVAVSTGLLQN----------------------------MSRDEVEAVLAHEISHIANGDMVTMTLIQGVVNTFVIFLSR  169 (284)
T ss_pred             EEEeehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHH
Confidence             5566679987                            89999999999999999999999875543221     00  


Q ss_pred             -----hhhhh-----h-H-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhccc
Q psy4798         553 -----ANKER-----E-A-----------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDKA  607 (623)
Q Consensus       553 -----~~~~~-----~-~-----------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~~  607 (623)
                           .....     . +           .+...+..+....+||++||+||++|++.++ |+++++||+||+.+..
T Consensus       170 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ll~~~~SR~rEy~AD~~Aa~ltg-p~~L~~AL~KL~~~~~  245 (284)
T PRK05457        170 IIAQIVDRFVSGNEEGNGIGYFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE  245 (284)
T ss_pred             HHHHHHHhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhC-HHHHHHHHHHHHhhCc
Confidence                 00000     0 0           0111122223334599999999999999995 9999999999997544


No 20 
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.95  E-value=1.6e-26  Score=243.75  Aligned_cols=191  Identities=22%  Similarity=0.317  Sum_probs=143.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798          10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR   85 (623)
Q Consensus        10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~   85 (623)
                      +...+++++++++.++++..++++..+    +.|++++.++++|++.|+|+|++|++|                      
T Consensus        34 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~La~~agi~~p~v~vid----------------------   91 (317)
T PRK01345         34 LVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPELYRMVRDLARRAGLPMPKVYIID----------------------   91 (317)
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHcCCeECCcccCHHHHHHHHHHHHHcCCCCCcEEEEc----------------------
Confidence            345566788999999999998877643    357899999999999999999999987                      


Q ss_pred             CCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          86 SEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        86 ~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      ++++|||++|.++ +++|+++++|++.                            ||+||++||+|||+|||+|+|+.++
T Consensus        92 ~~~~NAFa~G~~~~~~~V~vt~gLL~~----------------------------L~~dEL~aVlAHElgHi~~~d~~~~  143 (317)
T PRK01345         92 NPQPNAFATGRNPENAAVAATTGLLQR----------------------------LSPEEVAGVMAHELAHVKNRDTLTM  143 (317)
T ss_pred             CCCcceEEecCCCCCeEEEechHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence            6789999999976 5689999999997                            8999999999999999999999999


Q ss_pred             HHHHHHHHHHHHH-HHHHHhcCCCcccCCCCCCCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-
Q psy4798         165 MIVMQLNLLFMLY-SFQYLFQYPPLYSAFGFYDSQPI-LLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-  241 (623)
Q Consensus       165 ~~~~~~~~~~~~~-lf~~~~~~~~l~~~~G~~~~~p~-~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-  241 (623)
                      ++.+.+..++..+ .+..++..      .......|. .++.  ++..++.|+.  ..+++..+||++||+||++|+++ 
T Consensus       144 ~l~~~~~~~~~~l~~~~~~~~~------~~~~~~~~~~~i~~--~~~~~~~~~~--~~l~~~~~SR~rE~~AD~~A~~lt  213 (317)
T PRK01345        144 TITATLAGAISMLANFAFFFGG------NRENNNGPLGLVGT--LAAMIVAPLA--AMLVQMAISRTREYAADRRGAEIC  213 (317)
T ss_pred             HHHHHHHHHHHHHHHHHHHhcC------CcccccchHHHHHH--HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHh
Confidence            8876654333221 12222211      011111221 2221  2223444542  34677899999999999999998 


Q ss_pred             CCHHHHHHHHHHHhhCcCC
Q psy4798         242 GKAIFLRKALLKINKDNLG  260 (623)
Q Consensus       242 ~~~~~L~sAL~KLa~~~ls  260 (623)
                      ++++.+.+||.|+++.+..
T Consensus       214 g~p~~L~~AL~KL~~~~~~  232 (317)
T PRK01345        214 GNPLWLASALGKIERGAHG  232 (317)
T ss_pred             CCHHHHHHHHHHHHhhhcc
Confidence            6899999999999986544


No 21 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.90  E-value=8.4e-25  Score=216.17  Aligned_cols=183  Identities=28%  Similarity=0.429  Sum_probs=112.6

Q ss_pred             HHHHHHHHHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHc--CCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEeh
Q psy4798         408 IILMSLFLMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASV--KFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFD  485 (623)
Q Consensus       408 ~~~~~l~~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~--gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~d  485 (623)
                      .++++.++..++|.+++|++++++|.++++|++.++++|+++  +.|.+++|+++++.    .|||++|.+++++|++++
T Consensus         3 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~L~~~v~~l~~~~~~~~~~~~v~v~~~~~----~NA~~~g~~~~~~I~v~~   78 (226)
T PF01435_consen    3 MLVVSLLLAFIGPPLIAPLFNKFTPLEDPELRRIVEELARRAGLGIPPPRVYVIDSPS----PNAFATGGGPRKRIVVTS   78 (226)
T ss_dssp             --TTTSTTGHHCCCCHCCHCTC--B-HHHHHHHHHHHHHHHHHCTSS--EEEEE--SS----EEEEEETTTC--EEEEEH
T ss_pred             eeeHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHhcCCCCCCeEEEEcCCC----CcEEEEccCCCcEEEEeC
Confidence            345566777889999999999999999999999999999999  88899999999653    999999999999999999


Q ss_pred             hHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh----hhH--
Q psy4798         486 TLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE----REA--  559 (623)
Q Consensus       486 tLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~----~~~--  559 (623)
                      +|++.                            +++||+.||||||+||++++|..+++....+..+....    ...  
T Consensus        79 ~ll~~----------------------------~~~~el~aVlaHElgH~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~  130 (226)
T PF01435_consen   79 GLLES----------------------------LSEDELAAVLAHELGHIKHRHILKSLLISLLLSILFFALLALLIGSM  130 (226)
T ss_dssp             HHHHH----------------------------SSHHHHHHHHHHHHHHHHTTHCCCCCCHHH-HHHHHHHHHT-----H
T ss_pred             hhhhc----------------------------ccHHHHHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHHHHHHHhhhh
Confidence            99977                            89999999999999999999998875443322211100    000  


Q ss_pred             ------H-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHH----------HHHHHHhhh------ccc-cc
Q psy4798         560 ------S-------VMRYVTKESELITARQDREAAEKKYKEAMKEKDNV----------MAKLKVLST------DKA-RL  609 (623)
Q Consensus       560 ------~-------v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l----------~~aL~kL~~------~~~-~~  609 (623)
                            .       +...........+||++|++||++|++.+++|..+          .+.+.++..      ... ..
T Consensus       131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~sr~~E~~AD~~a~~~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~~~~~~  210 (226)
T PF01435_consen  131 SLFSAFGFIDILGILIAFLFQLLTNAFSRRQEYEADRYAARLGGDPALLARALYKPAAAISALEKLAEANSMRPDSDWRY  210 (226)
T ss_dssp             HHHHHHH----------HHSTT------HHHHHHHHHHHHHH------HHHTTS-TTHHHHHHHHHT-------------
T ss_pred             hhhhhhhhcccchhhHHHHHHHhhcchhHHHHHHHHHHHHHhcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccCCcccc
Confidence                  0       00011122333459999999999999998776653          667777765      111 12


Q ss_pred             -cccccCccccccc
Q psy4798         610 -TQLYDDKVRMYNV  622 (623)
Q Consensus       610 -~~l~~~~~~e~~~  622 (623)
                       .-...||..+-|+
T Consensus       211 ~~~~~tHP~~~~Ri  224 (226)
T PF01435_consen  211 SSLFSTHPSTEERI  224 (226)
T ss_dssp             ----------HHHH
T ss_pred             chhcCCCcCHHHHh
Confidence             2346677766554


No 22 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.89  E-value=4.4e-22  Score=205.79  Aligned_cols=187  Identities=24%  Similarity=0.238  Sum_probs=137.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhhhcccc------CCcchH----HHHhhhhhcCCC-CccccccccchhhhHHHHHHhHhh
Q psy4798          11 YGIIGFSWIVFLFEFYLSIRQRRVYHE------TTIVPH----QIAHGMDAESFE-KSRRYSLDKNVFSMFKETVSNVMN   79 (623)
Q Consensus        11 ~~~~~~~~~~~~~~~~l~~~~~~~~~~------~~~l~~----~l~~l~~~~g~~-~~~ly~~d~~~~~~~~~~v~~v~~   79 (623)
                      ...+....+.+++++.+..+..+....      .+..+.    .+++++.+.+.+ +|++|+.+                
T Consensus        61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~----------------  124 (302)
T COG0501          61 LLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILE----------------  124 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEec----------------
Confidence            345667778888888888877775321      123444    889999999999 89999976                


Q ss_pred             cccCCCCCCCceeeecCC-CCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798          80 TVSSKRSEHSNAYFYGFF-KNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKY  158 (623)
Q Consensus        80 ~~~s~~~~~~NA~~~G~~-~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~  158 (623)
                            ++.||||++|.+ ++++|++|++|++.                            +|+||++||+|||+||++|
T Consensus       125 ------~~~~NAFa~g~~~~~~~V~vt~gLl~~----------------------------l~~dEl~aVlaHElgHi~~  170 (302)
T COG0501         125 ------TPQPNAFALGGGPKNGRVVVTTGLLDL----------------------------LNDDELEAVLAHELGHIKN  170 (302)
T ss_pred             ------CCCccceecCCCCCCeeEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhc
Confidence                  789999999997 57999999999996                            8999999999999999999


Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhh
Q psy4798         159 NHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFG  238 (623)
Q Consensus       159 ~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA  238 (623)
                      +|+.+++.+..+.......++.........+      +      +...+...+.....++++.+.+.+||++||+||++|
T Consensus       171 rd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a  238 (302)
T COG0501         171 RHTLVRLTLRGLLASAFVLLATLALAAGLLG------E------AALALLLLLLLLALFLATLLVLAFSRKREYEADRFA  238 (302)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------c------hHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhch
Confidence            9999966554443333333322222111110      0      011111222233334567788999999999999999


Q ss_pred             hhcCCHHHHHHHHHHHhhCcC
Q psy4798         239 KSLGKAIFLRKALLKINKDNL  259 (623)
Q Consensus       239 ~~l~~~~~L~sAL~KLa~~~l  259 (623)
                      +++.+++.+.++|.|+...+.
T Consensus       239 ~~l~~~~~l~~aL~kl~~~~~  259 (302)
T COG0501         239 AKLTGPEKLASALQKLARLSG  259 (302)
T ss_pred             hhccChHHHHHHHHHHHhhhc
Confidence            999779999999999998654


No 23 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.86  E-value=3.2e-20  Score=191.99  Aligned_cols=170  Identities=21%  Similarity=0.258  Sum_probs=120.7

Q ss_pred             HHHHHHHHHHHHHhhchHhhhhhhcc--CCcCC------chhHHHHHHHHHHHcCCC-CCeEEEEeCCCCCCcccEEEec
Q psy4798         404 LWVFIILMSLFLMTIYPEFIAPLFDK--YTPLP------DGELKSRIEQLSASVKFP-LKKLYVVEGSKRSEHSNAYFYG  474 (623)
Q Consensus       404 ~w~~~~~~~l~~~~i~P~~I~plf~k--~~pl~------~~~L~~~I~~la~~~gfp-~~~v~vv~~s~rs~~~NAy~~G  474 (623)
                      .+.+.....++..+..+.++.-....  ..+..      ...+..++++++++.+.| .+++++++    +..+|||++|
T Consensus        59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~----~~~~NAFa~g  134 (302)
T COG0501          59 ALLLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILE----TPQPNAFALG  134 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEec----CCCccceecC
Confidence            34444555555555666555544443  22211      233455899999999999 79999999    4569999999


Q ss_pred             cC-CCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHH---
Q psy4798         475 FF-KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEF---  550 (623)
Q Consensus       475 ~~-~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~---  550 (623)
                      .+ ++++|+++++|++.                            +|+||+.||+|||+||++++|+.++..+....   
T Consensus       135 ~~~~~~~V~vt~gLl~~----------------------------l~~dEl~aVlaHElgHi~~rd~~~~~~~~~~~~~~  186 (302)
T COG0501         135 GGPKNGRVVVTTGLLDL----------------------------LNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASA  186 (302)
T ss_pred             CCCCCeeEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHH
Confidence            87 67999999999997                            89999999999999999999999855433221   


Q ss_pred             -HH-hhhhh---------------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798         551 -GV-ANKER---------------EASVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK  606 (623)
Q Consensus       551 -g~-~~~~~---------------~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~  606 (623)
                       .. .....               ...+...+.......+||.+|++||+.|++.++ |+.+.++|.|+....
T Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~-~~~l~~aL~kl~~~~  258 (302)
T COG0501         187 FVLLATLALAAGLLGEAALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQKLARLS  258 (302)
T ss_pred             HHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhchhhccC-hHHHHHHHHHHHhhh
Confidence             10 00000               001122222233444699999999999999997 999999999999843


No 24 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.81  E-value=1.8e-21  Score=192.43  Aligned_cols=156  Identities=33%  Similarity=0.433  Sum_probs=95.5

Q ss_pred             CCcchHHHHhhhhhc--CCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCC
Q psy4798          38 TTIVPHQIAHGMDAE--SFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPL  115 (623)
Q Consensus        38 ~~~l~~~l~~l~~~~--g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~  115 (623)
                      .+.+++.++++|++.  +.|++++|+.+                      ++.+|||++|.+++++|+++++|++.    
T Consensus        30 ~~~L~~~v~~l~~~~~~~~~~~~v~v~~----------------------~~~~NA~~~g~~~~~~I~v~~~ll~~----   83 (226)
T PF01435_consen   30 DPELRRIVEELARRAGLGIPPPRVYVID----------------------SPSPNAFATGGGPRKRIVVTSGLLES----   83 (226)
T ss_dssp             HHHHHHHHHHHHHHHHCTSS--EEEEE------------------------SSEEEEEETTTC--EEEEEHHHHHH----
T ss_pred             HHHHHHHHHHHHHHhcCCCCCCeEEEEc----------------------CCCCcEEEEccCCCcEEEEeChhhhc----
Confidence            467999999999999  99999999976                      55699999999999999999999976    


Q ss_pred             ccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCC
Q psy4798         116 NADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFY  195 (623)
Q Consensus       116 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~  195 (623)
                                              +++||+.||||||+||++++|..+.+....+..++..+....+.........+++.
T Consensus        84 ------------------------~~~~el~aVlaHElgH~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  139 (226)
T PF01435_consen   84 ------------------------LSEDELAAVLAHELGHIKHRHILKSLLISLLLSILFFALLALLIGSMSLFSAFGFI  139 (226)
T ss_dssp             ------------------------SSHHHHHHHHHHHHHHHHTTHCCCCCCHHH-HHHHHHHHHT-----HHHHHHHH--
T ss_pred             ------------------------ccHHHHHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc
Confidence                                    89999999999999999999999986666655554444322211111100000110


Q ss_pred             CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCHHH-HHHHHHHHhh
Q psy4798         196 DSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKAIF-LRKALLKINK  256 (623)
Q Consensus       196 ~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~~~-L~sAL~KLa~  256 (623)
                      ...+             ..+..+...+.+.+||++||+||++|+++++... +.++++++..
T Consensus       140 ~~~~-------------~~~~~~~~~~~~~~sr~~E~~AD~~a~~~~~~~~~l~~a~~~~~~  188 (226)
T PF01435_consen  140 DILG-------------ILIAFLFQLLTNAFSRRQEYEADRYAARLGGDPALLARALYKPAA  188 (226)
T ss_dssp             ---------------------HHSTT------HHHHHHHHHHHHHH------HHHTTS-TTH
T ss_pred             ccch-------------hhHHHHHHHhhcchhHHHHHHHHHHHHHhcCCcHHHHHhCCCHHH
Confidence            0000             0000334557799999999999999999965544 5555555443


No 25 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.34  E-value=1.6e-11  Score=133.72  Aligned_cols=158  Identities=18%  Similarity=0.206  Sum_probs=107.9

Q ss_pred             CcCCchhHHHHHHHHHHH----cCCCC--CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798         431 TPLPDGELKSRIEQLSAS----VKFPL--KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD  504 (623)
Q Consensus       431 ~pl~~~~L~~~I~~la~~----~gfp~--~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~  504 (623)
                      ...+|++|.+-|+++..+    .|-+.  -.+++|+.    ...|||++   .+.+|++..+|+...             
T Consensus        65 ~~i~D~el~~yv~~~g~rL~~~a~~~~~~f~f~lV~d----~~iNAFA~---~Gg~v~vntGLll~a-------------  124 (484)
T COG4783          65 PLIRDPELEEYVNSLGQRLAAAADLVKTPFTFFLVND----DSINAFAT---PGGYVVVNTGLLLTA-------------  124 (484)
T ss_pred             CCcCCHHHHHHHHHHHHHHHHhcCCCCCCeEEEEecC----Cccchhhc---CCceEEEehHHHHhc-------------
Confidence            556899988888776654    45553  35677774    35899998   236899999999873             


Q ss_pred             CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHH----------HHHHHHhhhhhh--H----HHHHHHHHH
Q psy4798         505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIL----------KKEFGVANKERE--A----SVMRYVTKE  568 (623)
Q Consensus       505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~----------~~~~g~~~~~~~--~----~v~~~~~~~  568 (623)
                                    -+++|+.+|||||+||+..+|..+.+=-          +.+.|+....-.  +    .........
T Consensus       125 --------------e~esElagViAHEigHv~qrH~aR~~e~~~r~~~~~i~~ml~gi~aa~a~~~ag~a~iag~~a~~~  190 (484)
T COG4783         125 --------------ENESELAGVIAHEIGHVAQRHLARSMEQQQRAAPMAIAGMLLGILAALAGADAGMAGIAGALAGAA  190 (484)
T ss_pred             --------------CCHHHHHHHHHHHHHHHhhhhHHHHHHHHhhhchhHHHHHHHHHHHHHhCccccHHHHHHHHHHhh
Confidence                          4899999999999999999999987632          222222221110  0    111111111


Q ss_pred             H-HHHHHHHHHHHHHHHHHHHc----CCHHHHHHHHHHhh---hccccc-cccccCccccccc
Q psy4798         569 S-ELITARQDREAAEKKYKEAM----KEKDNVMAKLKVLS---TDKARL-TQLYDDKVRMYNV  622 (623)
Q Consensus       569 ~-~~~~sR~~E~~AD~~a~~~~----~~~~~l~~aL~kL~---~~~~~~-~~l~~~~~~e~~~  622 (623)
                      . .+-|||+.|.+||+++...+    .||.+|.+.++|+.   .++++. .-+..||.++=|+
T Consensus       191 ~g~L~~sR~~E~eADr~Gi~~L~raGydp~gM~~ff~rl~~~~~~~~~~p~yl~THPlp~~RI  253 (484)
T COG4783         191 QGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQLRYGGQPPEYLLTHPLPEERI  253 (484)
T ss_pred             hhhhhcchhhHHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHHhcCCCCChHHhcCCCchhHH
Confidence            1 22269999999999988765    38999999999999   455554 3557787766443


No 26 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=99.29  E-value=1.3e-10  Score=121.48  Aligned_cols=142  Identities=23%  Similarity=0.347  Sum_probs=108.8

Q ss_pred             hhccCCcCCchhHHHHHHHHHHHcCCCC-CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798         426 LFDKYTPLPDGELKSRIEQLSASVKFPL-KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD  504 (623)
Q Consensus       426 lf~k~~pl~~~~L~~~I~~la~~~gfp~-~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~  504 (623)
                      +..+.++.+++++.+.+++.+++.|++. .++++.++     -...+.+|+.+ .+|++-+.+.+.              
T Consensus       131 l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~i~~s~~-----i~sP~~~G~~~-p~I~lP~~~~~~--------------  190 (299)
T PF05569_consen  131 LLRKARPVEDEELQALLEECKEELGIKRPIRIRVSSG-----ISSPFVFGFLR-PVIVLPESLLED--------------  190 (299)
T ss_pred             HHHhccccCcHHHHHHHHHHHHHhCCCCceEEEEcCC-----CCCCeeecCcc-eEEEecCccccc--------------
Confidence            5677899999999999999999999763 44544432     35778999875 589999999977              


Q ss_pred             CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4798         505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKK  584 (623)
Q Consensus       505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~  584 (623)
                                    ++++|+.+|++||+.|+|++|.+.+++...+..+..   ..++..+   ....+ .+.+|+.||+.
T Consensus       191 --------------~~~~el~~il~HEl~Hikr~D~~~~~l~~l~~~l~W---fnP~~~~---~~~~~-~~~~E~~cD~~  249 (299)
T PF05569_consen  191 --------------LSEEELRAILLHELAHIKRRDLLWKLLAELLCALHW---FNPLVWL---LRRRI-RRDRELACDEA  249 (299)
T ss_pred             --------------cCHHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHHH---hhHHHHH---HHHHH-HHHHHHhhhHH
Confidence                          899999999999999999999999887654332211   1122222   22333 88899999999


Q ss_pred             HHHHc--CCHHHHHHHHHHhhhcccc
Q psy4798         585 YKEAM--KEKDNVMAKLKVLSTDKAR  608 (623)
Q Consensus       585 a~~~~--~~~~~l~~aL~kL~~~~~~  608 (623)
                      +.+.+  +++.....+|.+++.....
T Consensus       250 vl~~l~~~~~~~Y~~~Ll~~~~~~~~  275 (299)
T PF05569_consen  250 VLRNLGKEERKAYAETLLKVAKRSQQ  275 (299)
T ss_pred             HHHhcCchhHHHHHHHHHHHHHhhcC
Confidence            99994  5778899999999875443


No 27 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=99.03  E-value=5.7e-09  Score=109.17  Aligned_cols=131  Identities=27%  Similarity=0.322  Sum_probs=100.9

Q ss_pred             CcchHHHHhhhhhcCCCC-ccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCCcc
Q psy4798          39 TIVPHQIAHGMDAESFEK-SRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNA  117 (623)
Q Consensus        39 ~~l~~~l~~l~~~~g~~~-~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~  117 (623)
                      ++..+.+++.+.+.|+++ .++++-                       +.-...+++|+. ..+|++.+++.+.      
T Consensus       141 ~~~~~~l~~~~~~~~~~~~~~i~~s-----------------------~~i~sP~~~G~~-~p~I~lP~~~~~~------  190 (299)
T PF05569_consen  141 EELQALLEECKEELGIKRPIRIRVS-----------------------SGISSPFVFGFL-RPVIVLPESLLED------  190 (299)
T ss_pred             HHHHHHHHHHHHHhCCCCceEEEEc-----------------------CCCCCCeeecCc-ceEEEecCccccc------
Confidence            357788899999999764 555553                       234567889986 4799999999876      


Q ss_pred             ccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCC
Q psy4798         118 DKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDS  197 (623)
Q Consensus       118 ~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~  197 (623)
                                            ++++|+.+|++||+.|+|++|.+..++...+..+.                       
T Consensus       191 ----------------------~~~~el~~il~HEl~Hikr~D~~~~~l~~l~~~l~-----------------------  225 (299)
T PF05569_consen  191 ----------------------LSEEELRAILLHELAHIKRRDLLWKLLAELLCALH-----------------------  225 (299)
T ss_pred             ----------------------cCHHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHH-----------------------
Confidence                                  79999999999999999999999987755443221                       


Q ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc---CCHHHHHHHHHHHhhCcCC
Q psy4798         198 QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL---GKAIFLRKALLKINKDNLG  260 (623)
Q Consensus       198 ~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l---~~~~~L~sAL~KLa~~~ls  260 (623)
                                   -+.|   +..++...+.+.+|+.||+.+.+.   +......++|.+++.....
T Consensus       226 -------------WfnP---~~~~~~~~~~~~~E~~cD~~vl~~l~~~~~~~Y~~~Ll~~~~~~~~  275 (299)
T PF05569_consen  226 -------------WFNP---LVWLLRRRIRRDRELACDEAVLRNLGKEERKAYAETLLKVAKRSQQ  275 (299)
T ss_pred             -------------HhhH---HHHHHHHHHHHHHHHhhhHHHHHhcCchhHHHHHHHHHHHHHhhcC
Confidence                         1122   345667899999999999999888   3466788888888877766


No 28 
>KOG2661|consensus
Probab=98.95  E-value=2.2e-09  Score=112.28  Aligned_cols=135  Identities=13%  Similarity=0.093  Sum_probs=91.2

Q ss_pred             eEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhh
Q psy4798         455 KLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGH  534 (623)
Q Consensus       455 ~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH  534 (623)
                      .|.||+..    .+|||+.   ++++|++|.++|...                           -++||+.+|||||+||
T Consensus       240 eihVVndP----ipNAFvL---PgGKvfVFtgiLn~c---------------------------k~ddglAtvLgHE~aH  285 (424)
T KOG2661|consen  240 EIHVVNDP----IPNAFVL---PGGKVFVFTGILNSC---------------------------KDDDGLATVLGHEIAH  285 (424)
T ss_pred             EEEEecCC----CCceeec---cCCeEEEEechhhcc---------------------------cChHHHHHHHHHHHHH
Confidence            56788854    4999988   567899999999872                           3789999999999999


Q ss_pred             hhccCHHHHHHHHHHHHHhhhhhhHHHH----H-H-HHHHHHHHHHHHHHHHHHHHHHHHc----CCHHHHHHHHHHhhh
Q psy4798         535 WKYNHVLKSMILKKEFGVANKEREASVM----R-Y-VTKESELITARQDREAAEKKYKEAM----KEKDNVMAKLKVLST  604 (623)
Q Consensus       535 ~k~~H~~k~~~~~~~~g~~~~~~~~~v~----~-~-~~~~~~~~~sR~~E~~AD~~a~~~~----~~~~~l~~aL~kL~~  604 (623)
                      -..+|+.-.+.....+.+........++    + . ....+++-+||++|.+||.++.-+|    -||.+...-.++++.
T Consensus       286 aVarH~AEki~k~~~~siLgLvlyt~~~a~~~n~~Ll~~flrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~  365 (424)
T KOG2661|consen  286 AVARHAAEKIGKVHLLSILGLVLYTMIWAICPNDKLLEYFLRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEF  365 (424)
T ss_pred             HHHHHHHHhhhhhHHHHHHHHHHHHHHhhccchHHHHHHHhcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHH
Confidence            9999998766544333222211111111    1 1 1122334479999999999876554    478777666666654


Q ss_pred             cc------ccccccccCcccccccC
Q psy4798         605 DK------ARLTQLYDDKVRMYNVR  623 (623)
Q Consensus       605 ~~------~~~~~l~~~~~~e~~~~  623 (623)
                      -+      +++.=|+-||.+..|++
T Consensus       366 ~egqmg~~~~~eflSTHPSskkRie  390 (424)
T KOG2661|consen  366 VEGQMGQPKMPEFLSTHPSSKKRIE  390 (424)
T ss_pred             hhhhcCCCCCchhhhcCCCccchhH
Confidence            22      23445678888877764


No 29 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=98.82  E-value=2.5e-08  Score=109.25  Aligned_cols=138  Identities=21%  Similarity=0.194  Sum_probs=88.4

Q ss_pred             CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798          86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM  165 (623)
Q Consensus        86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~  165 (623)
                      ++.-|||++   ...+|+|+++|+...                           -|+.|+.+|||||+||+..+|..+.+
T Consensus       101 d~~iNAFA~---~Gg~v~vntGLll~a---------------------------e~esElagViAHEigHv~qrH~aR~~  150 (484)
T COG4783         101 DDSINAFAT---PGGYVVVNTGLLLTA---------------------------ENESELAGVIAHEIGHVAQRHLARSM  150 (484)
T ss_pred             CCccchhhc---CCceEEEehHHHHhc---------------------------CCHHHHHHHHHHHHHHHhhhhHHHHH
Confidence            678999999   137999999999963                           58999999999999999999999887


Q ss_pred             HHHHHH---HHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-
Q psy4798         166 IVMQLN---LLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-  241 (623)
Q Consensus       166 ~~~~~~---~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-  241 (623)
                      =-.+=.   .++.+. .+.+...      .|-..+...+.+    .+...       ..-+..+||..|.+||+.+... 
T Consensus       151 e~~~r~~~~~i~~ml-~gi~aa~------a~~~ag~a~iag----~~a~~-------~~g~L~~sR~~E~eADr~Gi~~L  212 (484)
T COG4783         151 EQQQRAAPMAIAGML-LGILAAL------AGADAGMAGIAG----ALAGA-------AQGQLNFSRQNEQEADRIGITTL  212 (484)
T ss_pred             HHHhhhchhHHHHHH-HHHHHHH------hCccccHHHHHH----HHHHh-------hhhhhhcchhhHHHHHHHHHHHH
Confidence            542111   111111 1111000      011111111111    00000       0114679999999999999766 


Q ss_pred             ---C-CHHHHHHHHHHHh---hCcCCCchhhccchhh
Q psy4798         242 ---G-KAIFLRKALLKIN---KDNLGFPALLVCNGLP  271 (623)
Q Consensus       242 ---~-~~~~L~sAL~KLa---~~~ls~P~iL~~g~~~  271 (623)
                         | ++..|.+.+.|+.   .+.-.+|.++..--+|
T Consensus       213 ~raGydp~gM~~ff~rl~~~~~~~~~~p~yl~THPlp  249 (484)
T COG4783         213 VRAGYDPQGMPEFFERLADQLRYGGQPPEYLLTHPLP  249 (484)
T ss_pred             HHcCCCchhHHHHHHHHHHHHhcCCCCChHHhcCCCc
Confidence               4 4899999999999   5666677776665554


No 30 
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=98.69  E-value=1.7e-07  Score=98.76  Aligned_cols=138  Identities=19%  Similarity=0.175  Sum_probs=94.7

Q ss_pred             CchhHHHHHHHHHHHcC----CCC--CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCc
Q psy4798         434 PDGELKSRIEQLSASVK----FPL--KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEP  507 (623)
Q Consensus       434 ~~~~L~~~I~~la~~~g----fp~--~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~  507 (623)
                      .|+.|...+.++..++-    =|-  -++-|+|    |...|||+.   +..-+.++.+|+...                
T Consensus        62 ~D~Kler~Vari~g~lt~~S~~p~q~YriTiln----SP~INAFAL---PGGYlYitRGLlAla----------------  118 (479)
T COG4784          62 RDPKLERMVARIVGALTAVSENPQQTYRITILN----SPNINAFAL---PGGYLYITRGLLALA----------------  118 (479)
T ss_pred             CCHHHHHHHHHHHhHhhhhccCCCceEEEEEec----CCCcccccc---CCceEEEehhHHHHc----------------
Confidence            35555555555444432    121  3566677    456899987   466899999999862                


Q ss_pred             cccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhH--------HHHHHHHHHHHHHHHHHHHH
Q psy4798         508 LISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREA--------SVMRYVTKESELITARQDRE  579 (623)
Q Consensus       508 ~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~--------~v~~~~~~~~~~~~sR~~E~  579 (623)
                                 -+..|+.|||+||+||.--+|.+.+--.-....++......        -......++.-..|||++|.
T Consensus       119 -----------nd~sEvAAVl~HEmgHVtAnHgi~rQ~~e~a~~ia~rvva~vl~~~~agk~A~~rGklrla~fsRnqEL  187 (479)
T COG4784         119 -----------NDSSEVAAVLAHEMGHVTANHGIQRQQREAAEVIASRVVAEVLGSDAAGKQALIRGKLRLAQFSRNQEL  187 (479)
T ss_pred             -----------CCHHHHHHHHHhhhhheecchhHHHHHHHHHHHHHHHHHHHHhCCcchhhHHHhhhhHHHhhhccchhh
Confidence                       37799999999999999999998765443334444333211        11222233333446999999


Q ss_pred             HHHHHHHHHcC----CHHHHHHHHHHhhhc
Q psy4798         580 AAEKKYKEAMK----EKDNVMAKLKVLSTD  605 (623)
Q Consensus       580 ~AD~~a~~~~~----~~~~l~~aL~kL~~~  605 (623)
                      +||.++.+..+    ||.+..+.|..++..
T Consensus       188 qAD~iG~~~lgeAGYDP~A~~rfl~sm~ay  217 (479)
T COG4784         188 QADAIGIKMLGEAGYDPYAAARFLQSMAAY  217 (479)
T ss_pred             hhhhhhHHHHHhcCCChHHHHHHHHHHHhh
Confidence            99999998876    799999999999864


No 31 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=98.47  E-value=7.8e-06  Score=86.19  Aligned_cols=128  Identities=17%  Similarity=0.177  Sum_probs=85.4

Q ss_pred             chhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCcccccccc
Q psy4798         435 DGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGA  514 (623)
Q Consensus       435 ~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~  514 (623)
                      +.|.+..++++.++..  .+++ ++.+|+  ...+=-++|..+. +||+-+.+.++                        
T Consensus       135 e~d~~~~~~~~~~~~~--~k~i-~ir~s~--~i~~P~v~gl~kp-~IvlP~d~~~r------------------------  184 (337)
T COG4219         135 EVDKRKIVTILKNHQY--KKHI-LIRKSK--AIDGPMVFGLVKP-CIVLPADFVER------------------------  184 (337)
T ss_pred             cccHHHHHHHHHHhhh--ccCe-eEeecc--cCCCceeeccCcc-eEEccHHHHhh------------------------
Confidence            4566777777766653  3444 345554  3456677788754 79999999988                        


Q ss_pred             CCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC--CH
Q psy4798         515 NKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMK--EK  592 (623)
Q Consensus       515 ~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~--~~  592 (623)
                          ++++|+.-|++||++|.|++|.+.+.+...+-+++.      .-..+......+ .-++|.+||+.+.....  +.
T Consensus       185 ----~~~ee~~yIilHEl~Hlk~gD~i~n~i~~~~~~l~W------fNP~v~l~~~~~-~~D~E~aCDa~vL~~~~~~er  253 (337)
T COG4219         185 ----LTDEELKYIILHELSHLKRGDAIINLIVVVLGVLFW------FNPLVHLGKRKI-RIDQEIACDAAVLARINPEER  253 (337)
T ss_pred             ----cCHHhhhhhHhHHHhhhhcccHHHHHHHHHHhHHhh------cChHHHHHHHHH-HhhHHHHhhHHHHhccChHHH
Confidence                899999999999999999999998876533222221      111122222223 66799999999998772  23


Q ss_pred             HHHHHHHHHhh
Q psy4798         593 DNVMAKLKVLS  603 (623)
Q Consensus       593 ~~l~~aL~kL~  603 (623)
                      .....++.|+-
T Consensus       254 r~YaEsil~~l  264 (337)
T COG4219         254 RTYAESILKLL  264 (337)
T ss_pred             HHHHHHHHHHH
Confidence            44556665553


No 32 
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=98.11  E-value=1.5e-05  Score=84.47  Aligned_cols=117  Identities=26%  Similarity=0.272  Sum_probs=74.7

Q ss_pred             CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798          86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM  165 (623)
Q Consensus        86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~  165 (623)
                      |+..|||+.   +..++.++.+||...                           -+..|+.|||+||+||+.-+|-...-
T Consensus        95 SP~INAFAL---PGGYlYitRGLlAla---------------------------nd~sEvAAVl~HEmgHVtAnHgi~rQ  144 (479)
T COG4784          95 SPNINAFAL---PGGYLYITRGLLALA---------------------------NDSSEVAAVLAHEMGHVTANHGIQRQ  144 (479)
T ss_pred             CCCcccccc---CCceEEEehhHHHHc---------------------------CCHHHHHHHHHhhhhheecchhHHHH
Confidence            578999997   478999999999952                           47789999999999999999987664


Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc----
Q psy4798         166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL----  241 (623)
Q Consensus       166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l----  241 (623)
                      ---....+..-..- ..+...    ..|.   ...+.                ...-...+||..|.+||..+.+.    
T Consensus       145 ~~e~a~~ia~rvva-~vl~~~----~agk---~A~~r----------------Gklrla~fsRnqELqAD~iG~~~lgeA  200 (479)
T COG4784         145 QREAAEVIASRVVA-EVLGSD----AAGK---QALIR----------------GKLRLAQFSRNQELQADAIGIKMLGEA  200 (479)
T ss_pred             HHHHHHHHHHHHHH-HHhCCc----chhh---HHHhh----------------hhHHHhhhccchhhhhhhhhHHHHHhc
Confidence            32222222111110 111100    0111   11111                12234789999999999999877    


Q ss_pred             C-CHHHHHHHHHHHhh
Q psy4798         242 G-KAIFLRKALLKINK  256 (623)
Q Consensus       242 ~-~~~~L~sAL~KLa~  256 (623)
                      | +|.++.+-|..++.
T Consensus       201 GYDP~A~~rfl~sm~a  216 (479)
T COG4784         201 GYDPYAAARFLQSMAA  216 (479)
T ss_pred             CCChHHHHHHHHHHHh
Confidence            3 36666666666544


No 33 
>KOG2661|consensus
Probab=98.03  E-value=1.1e-05  Score=84.98  Aligned_cols=100  Identities=23%  Similarity=0.367  Sum_probs=70.5

Q ss_pred             CCCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798          85 RSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus        85 ~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      +++.+|||+.   ++..|++++++|...                           -+++|+.+|||||+||-.-+|..-.
T Consensus       245 ndPipNAFvL---PgGKvfVFtgiLn~c---------------------------k~ddglAtvLgHE~aHaVarH~AEk  294 (424)
T KOG2661|consen  245 NDPIPNAFVL---PGGKVFVFTGILNSC---------------------------KDDDGLATVLGHEIAHAVARHAAEK  294 (424)
T ss_pred             cCCCCceeec---cCCeEEEEechhhcc---------------------------cChHHHHHHHHHHHHHHHHHHHHHh
Confidence            3567999997   477899999999963                           4789999999999999999999988


Q ss_pred             HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccccchhhhhc
Q psy4798         165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYN-QLVQFLMTCMTRRFEFQADAFGKSL  241 (623)
Q Consensus       165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~-~l~~~l~~~~sR~~Ey~AD~fA~~l  241 (623)
                      +....+..++.+.++....  +.                         .|=. .+-.++.+-+||+.|-+||-.+.-+
T Consensus       295 i~k~~~~siLgLvlyt~~~--a~-------------------------~~n~~Ll~~flrlPfSRKMEtEADyIGLlL  345 (424)
T KOG2661|consen  295 IGKVHLLSILGLVLYTMIW--AI-------------------------CPNDKLLEYFLRLPFSRKMETEADYIGLLL  345 (424)
T ss_pred             hhhhHHHHHHHHHHHHHHh--hc-------------------------cchHHHHHHHhcCcchhhhhhhhhHHHHHH
Confidence            7765554444333322110  00                         0000 1123455778999999999998765


No 34 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=97.71  E-value=0.00054  Score=72.60  Aligned_cols=106  Identities=25%  Similarity=0.289  Sum_probs=70.6

Q ss_pred             CCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHH
Q psy4798          88 HSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIV  167 (623)
Q Consensus        88 ~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~  167 (623)
                      -.|=-++|..+ .+||+-+++.+.                            +++||++-|++||++|.|++|.....+.
T Consensus       162 i~~P~v~gl~k-p~IvlP~d~~~r----------------------------~~~ee~~yIilHEl~Hlk~gD~i~n~i~  212 (337)
T COG4219         162 IDGPMVFGLVK-PCIVLPADFVER----------------------------LTDEELKYIILHELSHLKRGDAIINLIV  212 (337)
T ss_pred             CCCceeeccCc-ceEEccHHHHhh----------------------------cCHHhhhhhHhHHHhhhhcccHHHHHHH
Confidence            34455666653 578999998887                            8999999999999999999999987665


Q ss_pred             HHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCH---
Q psy4798         168 MQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKA---  244 (623)
Q Consensus       168 ~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~---  244 (623)
                      ..++.++.                                    |.|   ++.+....+--.+|-.||+.+...-.+   
T Consensus       213 ~~~~~l~W------------------------------------fNP---~v~l~~~~~~~D~E~aCDa~vL~~~~~~er  253 (337)
T COG4219         213 VVLGVLFW------------------------------------FNP---LVHLGKRKIRIDQEIACDAAVLARINPEER  253 (337)
T ss_pred             HHHhHHhh------------------------------------cCh---HHHHHHHHHHhhHHHHhhHHHHhccChHHH
Confidence            44333211                                    011   123445677778999999999877422   


Q ss_pred             HHHHHHHHHHhhCcCCC
Q psy4798         245 IFLRKALLKINKDNLGF  261 (623)
Q Consensus       245 ~~L~sAL~KLa~~~ls~  261 (623)
                      ..-.+++.++-.....+
T Consensus       254 r~YaEsil~~l~a~s~~  270 (337)
T COG4219         254 RTYAESILKLLLAFSNP  270 (337)
T ss_pred             HHHHHHHHHHHhcccCC
Confidence            22344444444433333


No 35 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.13  E-value=0.0046  Score=54.44  Aligned_cols=65  Identities=17%  Similarity=0.160  Sum_probs=43.4

Q ss_pred             CCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAK  598 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~a  598 (623)
                      +++.+-..++||||||+.++|.....                     ..........+.|.+||.+|...+=..+.+...
T Consensus        37 ~~~~~~~f~laHELgH~~~~~~~~~~---------------------~~~~~~~~~~~~E~~An~fA~~lL~p~~~~~~~   95 (122)
T PF06114_consen   37 LSPERQRFTLAHELGHILLHHGDETF---------------------NYYLNYFFNERQEREANAFAAALLMPEEDFRDA   95 (122)
T ss_dssp             S-HHHHHHHHHHHHHHHHHHH-HHHH---------------------HHHHHH--THHHHHHHHHHHHHHHS-HHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHHhhhccccc---------------------hhhccccchhhHHHHHHHHHHHHhCCHHHHHHH
Confidence            47788999999999999888875543                     000111126778999999999988666666666


Q ss_pred             HHHhhh
Q psy4798         599 LKVLST  604 (623)
Q Consensus       599 L~kL~~  604 (623)
                      +.+...
T Consensus        96 ~~~~~~  101 (122)
T PF06114_consen   96 LEKYRN  101 (122)
T ss_dssp             HHHH-S
T ss_pred             HHHcCC
Confidence            666554


No 36 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=96.63  E-value=0.011  Score=62.60  Aligned_cols=128  Identities=17%  Similarity=0.144  Sum_probs=66.1

Q ss_pred             HHHHHHHHHcCCC--CCeEEEEeCCCCCCccc------EEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798         440 SRIEQLSASVKFP--LKKLYVVEGSKRSEHSN------AYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST  511 (623)
Q Consensus       440 ~~I~~la~~~gfp--~~~v~vv~~s~rs~~~N------Ay~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~  511 (623)
                      +.-.+..++.|.+  .+++.+.+++.+|..-+      ||-  |+.++.|.+--+.++...                   
T Consensus       101 ~~W~~~~~~~g~~y~~P~lv~~~~~~~t~CG~a~s~~gpFY--Cp~D~tIYlD~~f~~~L~-------------------  159 (292)
T PF04228_consen  101 DVWTPQFPQAGLPYRPPKLVLFSGSVQTGCGTASSATGPFY--CPADQTIYLDLSFFDELQ-------------------  159 (292)
T ss_pred             HHHHHHHHHcCCCCCCCEEEEeCCCCCCCCCCCcCCCCCEe--CCCCCEEEechHHHHHHH-------------------
Confidence            3334444455655  57888888776654433      332  234566777655554310                   


Q ss_pred             cccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHcC
Q psy4798         512 EGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYV-TKESELITARQDREAAEKKYKEAMK  590 (623)
Q Consensus       512 ~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~-~~~~~~~~sR~~E~~AD~~a~~~~~  590 (623)
                       .+-|. -.+--...|||||.||...+.          .|+.......  ..-. ....+.. ||+.|.+||.+|=--++
T Consensus       160 -~~~ga-~G~~a~ayVlAHEyGHHVQ~l----------~Gil~~~~~~--~~~~~~~~~~~~-svr~ELQADC~AGvw~~  224 (292)
T PF04228_consen  160 -QRFGA-SGDFAQAYVLAHEYGHHVQNL----------LGILDAVRQA--QQGRSPAEANEL-SVRLELQADCFAGVWAG  224 (292)
T ss_pred             -HHhCC-ccHHHHHHHHHHHHHHHHHHH----------hhhHHHHHHH--hhccCccccchH-HHHHHHHHHHHHHHHhh
Confidence             00000 124456779999999984332          2332211100  0000 0112233 99999999998765442


Q ss_pred             --------CHHHHHHHHHHhh
Q psy4798         591 --------EKDNVMAKLKVLS  603 (623)
Q Consensus       591 --------~~~~l~~aL~kL~  603 (623)
                              ++..+-++|...+
T Consensus       225 ~~~~~~~l~~~di~~al~aa~  245 (292)
T PF04228_consen  225 HAAEDGSLTPGDIEEALNAAS  245 (292)
T ss_pred             hccccCCcCHHHHHHHHHHHH
Confidence                    3445555555443


No 37 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=95.42  E-value=0.034  Score=48.88  Aligned_cols=63  Identities=32%  Similarity=0.308  Sum_probs=41.4

Q ss_pred             CChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHH
Q psy4798         140 CDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLV  219 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~  219 (623)
                      +++++-..++||||||+..+|-.....                                                     
T Consensus        37 ~~~~~~~f~laHELgH~~~~~~~~~~~-----------------------------------------------------   63 (122)
T PF06114_consen   37 LSPERQRFTLAHELGHILLHHGDETFN-----------------------------------------------------   63 (122)
T ss_dssp             S-HHHHHHHHHHHHHHHHHHH-HHHHH-----------------------------------------------------
T ss_pred             CCHHHHHHHHHHHHHHHHhhhccccch-----------------------------------------------------
Confidence            577889999999999999888775430                                                     


Q ss_pred             HHHHHHHhhhhccccchhhhhc-CCHHHHHHHHHHHh
Q psy4798         220 QFLMTCMTRRFEFQADAFGKSL-GKAIFLRKALLKIN  255 (623)
Q Consensus       220 ~~l~~~~sR~~Ey~AD~fA~~l-~~~~~L~sAL~KLa  255 (623)
                      .-.......+.|++||.||..+ -+.+.+...+.+..
T Consensus        64 ~~~~~~~~~~~E~~An~fA~~lL~p~~~~~~~~~~~~  100 (122)
T PF06114_consen   64 YYLNYFFNERQEREANAFAAALLMPEEDFRDALEKYR  100 (122)
T ss_dssp             HHHHH--THHHHHHHHHHHHHHHS-HHHHHHHHHHH-
T ss_pred             hhccccchhhHHHHHHHHHHHHhCCHHHHHHHHHHcC
Confidence            0011445667899999999887 56666666655543


No 38 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=95.28  E-value=0.04  Score=54.31  Aligned_cols=69  Identities=23%  Similarity=0.285  Sum_probs=53.0

Q ss_pred             hhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCC-CceEEEehhHHhhccCCCccccCCCCCCCcccccccc
Q psy4798         436 GELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGA  514 (623)
Q Consensus       436 ~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~-~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~  514 (623)
                      ..+.+.++..++.+|.+++++.+-+...        .+|.|. .++|.+.-.|+.                         
T Consensus       112 ~~l~~~~~~~~~~~~~~~~~i~ir~~ks--------rWGsc~~~~~I~ln~~L~~-------------------------  158 (205)
T PF01863_consen  112 EYLPERLKKYAKKLGLPPPKIKIRDMKS--------RWGSCSSKGNITLNWRLVM-------------------------  158 (205)
T ss_pred             HHHHHHHHHHHHHcCCCcceEEEeehhh--------ccccCCCCCcEEeeccccc-------------------------
Confidence            4567778888999999999998866332        267663 456888888885                         


Q ss_pred             CCCCCCHHHHHHHHHHHhhhhhc-cCHH
Q psy4798         515 NKKGCDTEEVLAVLAHELGHWKY-NHVL  541 (623)
Q Consensus       515 ~g~gl~~~Ei~AVLAHELGH~k~-~H~~  541 (623)
                          ++++-+..|+.|||.|.++ +|..
T Consensus       159 ----~P~~~idYVvvHEL~Hl~~~nHs~  182 (205)
T PF01863_consen  159 ----APPEVIDYVVVHELCHLRHPNHSK  182 (205)
T ss_pred             ----CCccHHHHHHHHHHHHhccCCCCH
Confidence                5888999999999999964 5553


No 39 
>PRK04351 hypothetical protein; Provisional
Probab=94.79  E-value=0.11  Score=50.06  Aligned_cols=71  Identities=23%  Similarity=0.462  Sum_probs=50.2

Q ss_pred             CCchhHHHHHHHHHHH-cCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798         433 LPDGELKSRIEQLSAS-VKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST  511 (623)
Q Consensus       433 l~~~~L~~~I~~la~~-~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~  511 (623)
                      +.+.||.+.+++++.+ ++-|.+.-...+..-|++ .-+|.   .+...|.+...+++.                     
T Consensus         1 ~~~~~l~~l~~~~s~~~F~~~f~~~v~~n~Rlrtt-gG~~~---l~~~~I~lnp~ll~~---------------------   55 (149)
T PRK04351          1 MTNQELQRLVEEISLEYFGKPFRHQAYFNKRLRTT-GGRYL---LKDHHIEFNPKMLEE---------------------   55 (149)
T ss_pred             CCHHHHHHHHHHHHHHHhCCCCCcEEEEeccchhh-hheee---cCCCeEEeCHHHHhh---------------------
Confidence            3578999999999975 455544333344433322 23442   245689999999986                     


Q ss_pred             cccCCCCCCHHHHHHHHHHHhhhh
Q psy4798         512 EGANKKGCDTEEVLAVLAHELGHW  535 (623)
Q Consensus       512 ~~~~g~gl~~~Ei~AVLAHELGH~  535 (623)
                             -+++++..||+|||.|+
T Consensus        56 -------~~~~~l~~vv~HElcH~   72 (149)
T PRK04351         56 -------YGLEELIGIIKHELCHY   72 (149)
T ss_pred             -------ccHHHHHhhHHHHHHHH
Confidence                   57899999999999998


No 40 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=94.68  E-value=0.085  Score=53.97  Aligned_cols=73  Identities=19%  Similarity=0.234  Sum_probs=51.5

Q ss_pred             hhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccC
Q psy4798         436 GELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGAN  515 (623)
Q Consensus       436 ~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~  515 (623)
                      ..+...++..++.+|.+...+.+-+.+.        ..|.|..+..+.++..+..                         
T Consensus       123 ~~l~~~~~~~~~~l~~~~~~~~ik~~k~--------~WGScs~~~~i~~~~~l~~-------------------------  169 (223)
T COG1451         123 EILEIRLKEYAKKLGVPPRAIKLKNMKR--------RWGSCSKAGEIRFNWRLVM-------------------------  169 (223)
T ss_pred             HHHHHHHHHHHHHhCCCccceeeeeccc--------eeeeecCCCcEEeehhhhc-------------------------
Confidence            4566777788889999988888765443        3566533334555555544                         


Q ss_pred             CCCCCHHHHHHHHHHHhhhhhccCHHHHH
Q psy4798         516 KKGCDTEEVLAVLAHELGHWKYNHVLKSM  544 (623)
Q Consensus       516 g~gl~~~Ei~AVLAHELGH~k~~H~~k~~  544 (623)
                         .+++.+..|++|||+|.+..+=-+.|
T Consensus       170 ---~p~~~i~YVvvHELaHLke~nHs~~F  195 (223)
T COG1451         170 ---APEEVIDYVVVHELAHLKEKNHSKRF  195 (223)
T ss_pred             ---CCHHHHHHHHHHHHHHHhhhhccHHH
Confidence               69999999999999999976644444


No 41 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=94.15  E-value=0.061  Score=53.02  Aligned_cols=67  Identities=18%  Similarity=0.175  Sum_probs=51.5

Q ss_pred             cchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCC-CeEEEEecchhhhccCCccc
Q psy4798          40 IVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNAD  118 (623)
Q Consensus        40 ~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~  118 (623)
                      .+.+.++..++..|.+++++.+-++                         .. .+|.|. .+.|.+.-.|+.        
T Consensus       113 ~l~~~~~~~~~~~~~~~~~i~ir~~-------------------------ks-rWGsc~~~~~I~ln~~L~~--------  158 (205)
T PF01863_consen  113 YLPERLKKYAKKLGLPPPKIKIRDM-------------------------KS-RWGSCSSKGNITLNWRLVM--------  158 (205)
T ss_pred             HHHHHHHHHHHHcCCCcceEEEeeh-------------------------hh-ccccCCCCCcEEeeccccc--------
Confidence            4777788889999999888877431                         11 378765 567888888887        


Q ss_pred             cCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798         119 KKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  161 (623)
Q Consensus       119 ~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~  161 (623)
                                           ++++-++.|++|||+|.++.+-
T Consensus       159 ---------------------~P~~~idYVvvHEL~Hl~~~nH  180 (205)
T PF01863_consen  159 ---------------------APPEVIDYVVVHELCHLRHPNH  180 (205)
T ss_pred             ---------------------CCccHHHHHHHHHHHHhccCCC
Confidence                                 5888899999999999977553


No 42 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=93.34  E-value=0.15  Score=44.03  Aligned_cols=69  Identities=30%  Similarity=0.399  Sum_probs=44.6

Q ss_pred             cCCchhHHHHHHHHHHHcCCCCCeEEEEeC-----CCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCC
Q psy4798         432 PLPDGELKSRIEQLSASVKFPLKKLYVVEG-----SKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSE  506 (623)
Q Consensus       432 pl~~~~L~~~I~~la~~~gfp~~~v~vv~~-----s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~  506 (623)
                      |||+ .+++.   +-..+|.++.+|-|-.+     ..++....||..|    ..|++-.+=.+                 
T Consensus         1 PLp~-~~r~~---~e~~~G~dl~~Vrvh~~~~a~~~~~~~~A~A~T~G----~~I~f~~g~~~-----------------   55 (79)
T PF13699_consen    1 PLPE-SIRSR---LERAFGADLSDVRVHTGPAASRAAAALGARAFTVG----NDIYFAPGKYN-----------------   55 (79)
T ss_pred             CCCH-HHHHH---HHHHhCCCccceEEEeCCchhhhhhccCCeEEEEC----CEEEEcCCCcC-----------------
Confidence            5543 35555   45678888888887666     2345677888876    46777333221                 


Q ss_pred             ccccccccCCCCCCHHHHHHHHHHHhhhhhc
Q psy4798         507 PLISTEGANKKGCDTEEVLAVLAHELGHWKY  537 (623)
Q Consensus       507 ~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~  537 (623)
                                  .+..+=.++|||||.|...
T Consensus        56 ------------~~s~~~~~llaHEl~Hv~Q   74 (79)
T PF13699_consen   56 ------------PDSPEGRALLAHELAHVVQ   74 (79)
T ss_pred             ------------CCCCCcchhHhHHHHHHHh
Confidence                        2334456889999999864


No 43 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=92.14  E-value=7.1  Score=40.16  Aligned_cols=86  Identities=19%  Similarity=0.234  Sum_probs=55.3

Q ss_pred             chHhhhhhhccCCcCCch---hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCC
Q psy4798         419 YPEFIAPLFDKYTPLPDG---ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLN  495 (623)
Q Consensus       419 ~P~~I~plf~k~~pl~~~---~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~  495 (623)
                      .-..+..-|+||...+..   .=.+.=+.+.++.|+.--+|..++|.- +.|-|      +.+|.|-|++.-...     
T Consensus        17 aq~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~L-tDHYd------P~~k~vrLS~~vy~~-----   84 (222)
T PF04298_consen   17 AQWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGEL-TDHYD------PRNKVVRLSEDVYNG-----   84 (222)
T ss_pred             HHHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCC-CCCcC------CCCCEEEeCCccCCC-----
Confidence            334566778888665541   122333446888998777777777632 33422      356778888876653     


Q ss_pred             ccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798         496 ADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       496 ~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~  540 (623)
                                              +.=--.||-|||.||--+.+.
T Consensus        85 ------------------------~SiaAvaVAAHEvGHAiQ~a~  105 (222)
T PF04298_consen   85 ------------------------RSIAAVAVAAHEVGHAIQHAE  105 (222)
T ss_pred             ------------------------CCHHHHHHHHHHHhHHHhccc
Confidence                                    334567999999999877664


No 44 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=90.91  E-value=0.3  Score=50.04  Aligned_cols=25  Identities=32%  Similarity=0.327  Sum_probs=21.5

Q ss_pred             CChHHHHHHHHHhhhhhhcchHHHH
Q psy4798         140 CDTEEVLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h~~k~  164 (623)
                      .+++.+..|++|||+|.++.+--+.
T Consensus       170 ~p~~~i~YVvvHELaHLke~nHs~~  194 (223)
T COG1451         170 APEEVIDYVVVHELAHLKEKNHSKR  194 (223)
T ss_pred             CCHHHHHHHHHHHHHHHhhhhccHH
Confidence            6899999999999999998775444


No 45 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=90.65  E-value=0.75  Score=45.84  Aligned_cols=74  Identities=20%  Similarity=0.082  Sum_probs=44.8

Q ss_pred             hHHHHHHHHHHHcCCCCCeEEEEe--CCCCCCc-ccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccc
Q psy4798         437 ELKSRIEQLSASVKFPLKKLYVVE--GSKRSEH-SNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEG  513 (623)
Q Consensus       437 ~L~~~I~~la~~~gfp~~~v~vv~--~s~rs~~-~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~  513 (623)
                      .+++.++++.+.+.-|--+|++..  ...+... ...+..|.+-..+|.++= +-..                       
T Consensus         4 ~i~~~~~~~~~~~~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~-~~~~-----------------------   59 (195)
T PF10026_consen    4 IIEEALEKSIELLPGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFL-LPND-----------------------   59 (195)
T ss_pred             HHHHHHHHHHHHcCCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEe-cCCc-----------------------
Confidence            456667777777766655565544  3332122 223444444455677764 2222                       


Q ss_pred             cCCCCCCHHHHHHHHHHHhhhhhccC
Q psy4798         514 ANKKGCDTEEVLAVLAHELGHWKYNH  539 (623)
Q Consensus       514 ~~g~gl~~~Ei~AVLAHELGH~k~~H  539 (623)
                           .+.+++.+++|||+.|..+..
T Consensus        60 -----~~~~~l~~~iaHE~hH~~r~~   80 (195)
T PF10026_consen   60 -----YSLEELPALIAHEYHHNCRYE   80 (195)
T ss_pred             -----ccHHHHHHHHHHHHHHHHHHh
Confidence                 688999999999999996543


No 46 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=90.21  E-value=0.23  Score=50.33  Aligned_cols=41  Identities=24%  Similarity=0.264  Sum_probs=32.8

Q ss_pred             HHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Q psy4798         523 EVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAM  589 (623)
Q Consensus       523 Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~  589 (623)
                      -+.-+++||++|..++|....                          ..-.+++.|.+||.+|.+.+
T Consensus       100 A~~fil~HE~~Hv~~~h~~~~--------------------------~~~~~~~eE~~AD~~A~~~i  140 (206)
T PF10463_consen  100 AIAFILLHELAHVVLGHEGDS--------------------------SPSQSIQEEKEADSYATEMI  140 (206)
T ss_pred             HHHHHHHHHHHHHHHcCcccc--------------------------ccchhHHHHHhhhHHHHHHH
Confidence            355689999999999999664                          11128899999999999876


No 47 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=89.71  E-value=0.65  Score=44.15  Aligned_cols=66  Identities=29%  Similarity=0.311  Sum_probs=42.7

Q ss_pred             hHHHHHHHHHH-HcC--CCCCeEEEEeCCCCCCcccEEEeccC--CCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798         437 ELKSRIEQLSA-SVK--FPLKKLYVVEGSKRSEHSNAYFYGFF--KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST  511 (623)
Q Consensus       437 ~L~~~I~~la~-~~g--fp~~~v~vv~~s~rs~~~NAy~~G~~--~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~  511 (623)
                      +|.+.++++-. -++  +|.+.|. .+...++.      .|.+  +...|.|...|++.                     
T Consensus         2 ~L~~~~~~~n~~~F~~~l~~~~i~-w~~r~~~~------~G~~~~~~~~I~ln~~l~~~---------------------   53 (146)
T smart00731        2 ELTQRLEDASLRVFGRKLPHPKVV-WNKRLRKT------GGRCLLKSAEIRLNPKLLTE---------------------   53 (146)
T ss_pred             hHHHHHHHHHHHHHCCCCCCCEEE-Eehhhhhh------hHHhhcCCCEEEeCHHHHhh---------------------
Confidence            56667777653 233  4444343 33222211      2332  36679999999986                     


Q ss_pred             cccCCCCCCHHHHHHHHHHHhhhhhc
Q psy4798         512 EGANKKGCDTEEVLAVLAHELGHWKY  537 (623)
Q Consensus       512 ~~~~g~gl~~~Ei~AVLAHELGH~k~  537 (623)
                             .+++++..||.|||.|...
T Consensus        54 -------~~~~~l~~~l~HEm~H~~~   72 (146)
T smart00731       54 -------NGRDRLRETLLHELCHAAL   72 (146)
T ss_pred             -------ccHHHHHhhHHHHHHHHHH
Confidence                   5789999999999999976


No 48 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=88.00  E-value=2.1  Score=40.47  Aligned_cols=33  Identities=30%  Similarity=0.328  Sum_probs=28.6

Q ss_pred             eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798         480 RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       480 rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~  540 (623)
                      .|.|...+++.                            .+.+|+..+|.|||.|....+.
T Consensus        44 ~I~ls~~~~~~----------------------------~~~~~~~~tL~HEm~H~~~~~~   76 (157)
T PF10263_consen   44 EIRLSPKLLDR----------------------------NPEEELIDTLLHEMAHAAAYVF   76 (157)
T ss_pred             EEEECHHHHHh----------------------------hHHHHHHHHHHHHHHHHHhhhc
Confidence            79999999986                            5889999999999999966444


No 49 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=85.66  E-value=1.8  Score=43.87  Aligned_cols=82  Identities=20%  Similarity=0.230  Sum_probs=52.9

Q ss_pred             cCCchhHHHHHHHHHHHcCC------CCCeEE-EEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798         432 PLPDGELKSRIEQLSASVKF------PLKKLY-VVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD  504 (623)
Q Consensus       432 pl~~~~L~~~I~~la~~~gf------p~~~v~-vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~  504 (623)
                      |-++.-|++....+.+-+-.      |.++|- +++.    ...=||..|-...+.|-++...+++.       .++   
T Consensus        24 ~~a~~~L~~a~~~V~~~ly~~~~~~~~v~~Vt~~~~~----~~gVA~t~gd~~~~~I~~S~~~i~~~-------~~~---   89 (205)
T PF04450_consen   24 PDAEQVLRDASRFVWRLLYQSPADRKPVRSVTLILDD----MDGVAYTSGDDDHKEIHFSARYIAKY-------PAD---   89 (205)
T ss_pred             cCHHHHHHHHHHHHHHHhCCCCCCCCcccEEEEEEEC----CCeeEEEecCCCccEEEEeHHHHhhc-------ccc---
Confidence            33334455544444444322      234554 4442    35789999977788999999999873       000   


Q ss_pred             CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798         505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~  540 (623)
                                   |-..+|+.+||-||+.|.-++..
T Consensus        90 -------------~~~~~Ei~Gvl~HE~~H~~Q~~~  112 (205)
T PF04450_consen   90 -------------GDVRDEIIGVLYHEMVHCWQWDG  112 (205)
T ss_pred             -------------cchHHHHHHHHHHHHHHHhhcCC
Confidence                         13458999999999999977654


No 50 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=85.24  E-value=0.69  Score=49.26  Aligned_cols=18  Identities=33%  Similarity=0.503  Sum_probs=15.4

Q ss_pred             HHHhhhhccccchhhhhc
Q psy4798         224 TCMTRRFEFQADAFGKSL  241 (623)
Q Consensus       224 ~~~sR~~Ey~AD~fA~~l  241 (623)
                      +.++|+.|.|||.||.--
T Consensus       205 ~~~svr~ELQADC~AGvw  222 (292)
T PF04228_consen  205 NELSVRLELQADCFAGVW  222 (292)
T ss_pred             chHHHHHHHHHHHHHHHH
Confidence            578999999999998653


No 51 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=82.18  E-value=66  Score=33.27  Aligned_cols=17  Identities=41%  Similarity=0.471  Sum_probs=14.0

Q ss_pred             HHHHHHHhhhhhhcchH
Q psy4798         145 VLAVLAHELGHWKYNHV  161 (623)
Q Consensus       145 l~AVlAHELGH~k~~h~  161 (623)
                      -.||-|||.||--.+..
T Consensus        89 AvaVAAHEvGHAiQ~a~  105 (222)
T PF04298_consen   89 AVAVAAHEVGHAIQHAE  105 (222)
T ss_pred             HHHHHHHHHhHHHhccc
Confidence            46899999999877763


No 52 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=80.61  E-value=2  Score=45.11  Aligned_cols=27  Identities=26%  Similarity=0.241  Sum_probs=24.9

Q ss_pred             CCHHHHHHHHHHHhhhhhccCHHHHHH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHVLKSMI  545 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~~k~~~  545 (623)
                      ++.+|+.+|++||+-|...+|..+.--
T Consensus        55 l~~~~~~~~l~HevlH~~~~H~~r~~~   81 (292)
T PF13203_consen   55 LSPEERVGLLLHEVLHCLLRHPWRRGG   81 (292)
T ss_pred             CCHHHHHHHHHHHHHHHHccchhhhcc
Confidence            899999999999999999999987654


No 53 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=80.12  E-value=2.4  Score=42.36  Aligned_cols=23  Identities=26%  Similarity=0.208  Sum_probs=19.8

Q ss_pred             CCCChHHHHHHHHHhhhhhhcch
Q psy4798         138 KGCDTEEVLAVLAHELGHWKYNH  160 (623)
Q Consensus       138 ~~l~~~El~AVlAHELGH~k~~h  160 (623)
                      .-++-++|..|+.|||+|..++.
T Consensus        75 ~fl~~~~i~~t~lHELaH~~~~~   97 (186)
T PF08325_consen   75 GFLPYETILGTMLHELAHNVHGP   97 (186)
T ss_pred             CEeeHHHHHHHHHHHHHhcccCC
Confidence            34899999999999999998654


No 54 
>PRK04351 hypothetical protein; Provisional
Probab=76.50  E-value=2  Score=41.42  Aligned_cols=31  Identities=29%  Similarity=0.597  Sum_probs=26.6

Q ss_pred             CCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhh
Q psy4798          98 KNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHW  156 (623)
Q Consensus        98 ~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~  156 (623)
                      ....|.+...+++.                            -+++++..||+|||.|+
T Consensus        42 ~~~~I~lnp~ll~~----------------------------~~~~~l~~vv~HElcH~   72 (149)
T PRK04351         42 KDHHIEFNPKMLEE----------------------------YGLEELIGIIKHELCHY   72 (149)
T ss_pred             CCCeEEeCHHHHhh----------------------------ccHHHHHhhHHHHHHHH
Confidence            35689999888885                            47899999999999997


No 55 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=74.73  E-value=5.8  Score=40.49  Aligned_cols=22  Identities=27%  Similarity=0.585  Sum_probs=17.4

Q ss_pred             CCHHHHHHHHHHHhhhhh-ccCH
Q psy4798         519 CDTEEVLAVLAHELGHWK-YNHV  540 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k-~~H~  540 (623)
                      -+.+.+..|++|||||-- .+|+
T Consensus       128 ~~~~~~~hvi~HEiGH~IGfRHT  150 (211)
T PF12388_consen  128 YSVNVIEHVITHEIGHCIGFRHT  150 (211)
T ss_pred             CchhHHHHHHHHHhhhhcccccc
Confidence            477889999999999963 4554


No 56 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=73.68  E-value=2.7  Score=36.25  Aligned_cols=19  Identities=47%  Similarity=0.513  Sum_probs=13.8

Q ss_pred             ChHHHHHHHHHhhhhhhcc
Q psy4798         141 DTEEVLAVLAHELGHWKYN  159 (623)
Q Consensus       141 ~~~El~AVlAHELGH~k~~  159 (623)
                      +..+=.++|||||+|+..+
T Consensus        57 ~s~~~~~llaHEl~Hv~Qq   75 (79)
T PF13699_consen   57 DSPEGRALLAHELAHVVQQ   75 (79)
T ss_pred             CCCCcchhHhHHHHHHHhh
Confidence            3344567899999998653


No 57 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=72.96  E-value=3.9  Score=41.51  Aligned_cols=53  Identities=25%  Similarity=0.249  Sum_probs=40.9

Q ss_pred             CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798          86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  161 (623)
Q Consensus        86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~  161 (623)
                      +...-||.+|-+..+.|-++...++...+                  +     +-..+|+..||-||+.|+-..+-
T Consensus        60 ~~~gVA~t~gd~~~~~I~~S~~~i~~~~~------------------~-----~~~~~Ei~Gvl~HE~~H~~Q~~~  112 (205)
T PF04450_consen   60 DMDGVAYTSGDDDHKEIHFSARYIAKYPA------------------D-----GDVRDEIIGVLYHEMVHCWQWDG  112 (205)
T ss_pred             CCCeeEEEecCCCccEEEEeHHHHhhccc------------------c-----cchHHHHHHHHHHHHHHHhhcCC
Confidence            45677999997778999999999997410                  0     12348999999999999977654


No 58 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=72.78  E-value=7.4  Score=38.89  Aligned_cols=21  Identities=29%  Similarity=0.327  Sum_probs=18.8

Q ss_pred             CCHHHHHHHHHHHhhhhhccC
Q psy4798         519 CDTEEVLAVLAHELGHWKYNH  539 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H  539 (623)
                      ++-++|..|+.|||+|..++.
T Consensus        77 l~~~~i~~t~lHELaH~~~~~   97 (186)
T PF08325_consen   77 LPYETILGTMLHELAHNVHGP   97 (186)
T ss_pred             eeHHHHHHHHHHHHHhcccCC
Confidence            688999999999999998654


No 59 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=72.28  E-value=3.6  Score=43.20  Aligned_cols=26  Identities=27%  Similarity=0.308  Sum_probs=24.1

Q ss_pred             CChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798         140 CDTEEVLAVLAHELGHWKYNHVLKSM  165 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h~~k~~  165 (623)
                      |+.+|+..|++||+-|+..+|..+.-
T Consensus        55 l~~~~~~~~l~HevlH~~~~H~~r~~   80 (292)
T PF13203_consen   55 LSPEERVGLLLHEVLHCLLRHPWRRG   80 (292)
T ss_pred             CCHHHHHHHHHHHHHHHHccchhhhc
Confidence            89999999999999999999998764


No 60 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=71.92  E-value=1.7  Score=44.18  Aligned_cols=19  Identities=32%  Similarity=0.385  Sum_probs=16.0

Q ss_pred             HHHHHHHhhhhhhcchHHH
Q psy4798         145 VLAVLAHELGHWKYNHVLK  163 (623)
Q Consensus       145 l~AVlAHELGH~k~~h~~k  163 (623)
                      +.-+++||++|+..+|...
T Consensus       101 ~~fil~HE~~Hv~~~h~~~  119 (206)
T PF10463_consen  101 IAFILLHELAHVVLGHEGD  119 (206)
T ss_pred             HHHHHHHHHHHHHHcCccc
Confidence            4568999999999999764


No 61 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=71.55  E-value=3.6  Score=43.94  Aligned_cols=41  Identities=20%  Similarity=0.264  Sum_probs=31.8

Q ss_pred             HHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Q psy4798         524 VLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMK  590 (623)
Q Consensus       524 i~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~  590 (623)
                      +.-|+.||++|..++|....                         ..- .+++.|.+||.+|.+.+-
T Consensus       165 ~a~i~~HEiaHv~~~h~~~~-------------------------~~e-~s~~eE~eaDs~At~~il  205 (305)
T PRK09672        165 LAWILLHEIAHVEFQHSSLE-------------------------SNE-DSIQEEKEADSYATNWLL  205 (305)
T ss_pred             HHHHHHHHHHHHHhcccccc-------------------------Cch-HHHHHHHHHHHHHHHHHH
Confidence            45689999999999998540                         011 278999999999998773


No 62 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=71.53  E-value=9.2  Score=37.49  Aligned_cols=16  Identities=38%  Similarity=0.372  Sum_probs=13.5

Q ss_pred             HHHHHHHHHHHhhhhh
Q psy4798         521 TEEVLAVLAHELGHWK  536 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~k  536 (623)
                      ......++|||+||--
T Consensus       128 ~~~~a~~~AHElGH~l  143 (194)
T cd04269         128 LLLFAVTMAHELGHNL  143 (194)
T ss_pred             hHHHHHHHHHHHHhhc
Confidence            4677899999999974


No 63 
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=71.38  E-value=5.7  Score=42.02  Aligned_cols=103  Identities=19%  Similarity=0.173  Sum_probs=52.0

Q ss_pred             CCCCeEEEEeCCCCCCcccEE-Eecc--CCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHH
Q psy4798         451 FPLKKLYVVEGSKRSEHSNAY-FYGF--FKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAV  527 (623)
Q Consensus       451 fp~~~v~vv~~s~rs~~~NAy-~~G~--~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AV  527 (623)
                      ...+++..+.++.++..-.|= .+|-  |+..+=|..|.-+-+.+                  +.+--..|  +=.-.-|
T Consensus       111 Y~~Ptlvlf~~~v~t~CG~assasGPFYCP~D~kvYlDlsFf~~m------------------~~~fga~G--dfAqaYV  170 (295)
T COG2321         111 YQKPTLVLFSGQVRTGCGFASSASGPFYCPADTKVYLDLSFFDEM------------------KTKFGASG--DFAQAYV  170 (295)
T ss_pred             ccCCeEEEecCccccCcCCCCcCCCCeecCCCceEEEehhHHHHH------------------HHHhcCCc--cHHHHHH
Confidence            345788888888877663332 2232  34444455554443311                  11111122  4456789


Q ss_pred             HHHHhhhhhccCHHHHHHHHHHHHHhhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4798         528 LAHELGHWKYNHVLKSMILKKEFGVANKEREA-SVMRYVTKESELITARQDREAAEKKYK  586 (623)
Q Consensus       528 LAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~-~v~~~~~~~~~~~~sR~~E~~AD~~a~  586 (623)
                      +|||.||...          .++|++...... .-+.  ....+-. |=+.|.+||.+|=
T Consensus       171 iAHEVGHHVQ----------nllGi~~kv~~~~q~~s--~~~an~~-sVr~ELQADCfAG  217 (295)
T COG2321         171 IAHEVGHHVQ----------NLLGILPKVNQNQQGMS--QAEANQL-SVRRELQADCFAG  217 (295)
T ss_pred             HHhhhhHHHH----------HHhhhhHHHHHHHhhhh--hhhccCc-ceeeeehhhHHHH
Confidence            9999999743          233433322110 0000  0112233 6678999999864


No 64 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=70.88  E-value=3.4  Score=39.24  Aligned_cols=32  Identities=28%  Similarity=0.326  Sum_probs=27.5

Q ss_pred             CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798          99 NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKY  158 (623)
Q Consensus        99 ~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~  158 (623)
                      ...|.+...|++.                            -+++++..||.|||.|...
T Consensus        41 ~~~I~ln~~l~~~----------------------------~~~~~l~~~l~HEm~H~~~   72 (146)
T smart00731       41 SAEIRLNPKLLTE----------------------------NGRDRLRETLLHELCHAAL   72 (146)
T ss_pred             CCEEEeCHHHHhh----------------------------ccHHHHHhhHHHHHHHHHH
Confidence            6789999998885                            4778999999999999875


No 65 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=70.84  E-value=8.9  Score=38.23  Aligned_cols=21  Identities=33%  Similarity=0.316  Sum_probs=17.9

Q ss_pred             CChHHHHHHHHHhhhhhhcch
Q psy4798         140 CDTEEVLAVLAHELGHWKYNH  160 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h  160 (623)
                      .+.+++.|++|||+-|....+
T Consensus        60 ~~~~~l~~~iaHE~hH~~r~~   80 (195)
T PF10026_consen   60 YSLEELPALIAHEYHHNCRYE   80 (195)
T ss_pred             ccHHHHHHHHHHHHHHHHHHh
Confidence            578899999999999996544


No 66 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=70.78  E-value=2.9  Score=39.59  Aligned_cols=32  Identities=31%  Similarity=0.329  Sum_probs=27.2

Q ss_pred             eEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798         100 KRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN  159 (623)
Q Consensus       100 krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~  159 (623)
                      ..|.+...+++.                            -+++|+..+|.|||.|....
T Consensus        43 ~~I~ls~~~~~~----------------------------~~~~~~~~tL~HEm~H~~~~   74 (157)
T PF10263_consen   43 CEIRLSPKLLDR----------------------------NPEEELIDTLLHEMAHAAAY   74 (157)
T ss_pred             eEEEECHHHHHh----------------------------hHHHHHHHHHHHHHHHHHhh
Confidence            479999999885                            47899999999999997653


No 67 
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=70.57  E-value=13  Score=34.50  Aligned_cols=18  Identities=33%  Similarity=0.372  Sum_probs=16.6

Q ss_pred             CCHHHHHHHHHHHhhhhh
Q psy4798         519 CDTEEVLAVLAHELGHWK  536 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k  536 (623)
                      ++-+|=.-|++||+.|+-
T Consensus        75 Ls~~ekvKviiHEllHIP   92 (133)
T COG4900          75 LSCAEKVKVIIHELLHIP   92 (133)
T ss_pred             CChHHHHHHHHHHHhcCc
Confidence            899999999999999983


No 68 
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=69.07  E-value=2.9  Score=44.16  Aligned_cols=18  Identities=33%  Similarity=0.418  Sum_probs=14.8

Q ss_pred             HHHhhhhccccchhhhhc
Q psy4798         224 TCMTRRFEFQADAFGKSL  241 (623)
Q Consensus       224 ~~~sR~~Ey~AD~fA~~l  241 (623)
                      +.+|=+.|.|||.||.--
T Consensus       202 n~~sVr~ELQADCfAGVW  219 (295)
T COG2321         202 NQLSVRRELQADCFAGVW  219 (295)
T ss_pred             cCcceeeeehhhHHHHHH
Confidence            667788999999999643


No 69 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=68.31  E-value=3.4  Score=42.66  Aligned_cols=22  Identities=45%  Similarity=0.324  Sum_probs=16.4

Q ss_pred             CCCCChHHHHHHHHHhhhhhhc
Q psy4798         137 KKGCDTEEVLAVLAHELGHWKY  158 (623)
Q Consensus       137 ~~~l~~~El~AVlAHELGH~k~  158 (623)
                      ++.+...+...++||||||--.
T Consensus       159 ~~~~~~~~~a~t~AHElGHnlG  180 (244)
T cd04270         159 GKRVPTKESDLVTAHELGHNFG  180 (244)
T ss_pred             CCccchhHHHHHHHHHHHHhcC
Confidence            4445566788999999999643


No 70 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=68.06  E-value=9.4  Score=37.11  Aligned_cols=84  Identities=24%  Similarity=0.307  Sum_probs=53.0

Q ss_pred             chhHHHHHHHHHHHcC--CCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCcccccc
Q psy4798         435 DGELKSRIEQLSASVK--FPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTE  512 (623)
Q Consensus       435 ~~~L~~~I~~la~~~g--fp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~  512 (623)
                      +.+|++.+++......  ++.+++.-=... + +..-||+-   ++ .|-+.-.|++.                      
T Consensus         4 ~~~L~~~~~~as~~~~r~~~~p~~~~n~Rg-~-taG~ayL~---~~-~I~lNP~ll~e----------------------   55 (156)
T COG3091           4 NRKLQQCVEQASLKFFRKFFRPKASYNQRG-R-TAGGAYLL---KS-EIRLNPKLLEE----------------------   55 (156)
T ss_pred             HHHHHHHHHHHHHHhcCCCCCcceehhhhh-h-hcchhhcc---cc-ccccCHHHHHH----------------------
Confidence            4567777777666553  344666432211 1 23455543   44 78899998887                      


Q ss_pred             ccCCCCCCHHHHHHHHHHHhhhhh---------ccCHHHHHHHHHHHHH
Q psy4798         513 GANKKGCDTEEVLAVLAHELGHWK---------YNHVLKSMILKKEFGV  552 (623)
Q Consensus       513 ~~~g~gl~~~Ei~AVLAHELGH~k---------~~H~~k~~~~~~~~g~  552 (623)
                            -.+|.+..|+.|||+|+.         +++--+..++.+++|+
T Consensus        56 ------n~~~f~~~vV~HELaHl~ly~~~gr~~phg~ewk~lm~qV~~l   98 (156)
T COG3091          56 ------NGEDFIEQVVPHELAHLHLYQEFGRYKPHGKEWKLLMQQVLGL   98 (156)
T ss_pred             ------ccHHHHHHHHHHHHHHHHHHHHcCCCCCCchhHHHHHHHhCCC
Confidence                  588999999999999973         2333344555666554


No 71 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=64.01  E-value=4  Score=38.77  Aligned_cols=15  Identities=47%  Similarity=0.689  Sum_probs=13.5

Q ss_pred             hHHHHHHHHHhhhhh
Q psy4798         142 TEEVLAVLAHELGHW  156 (623)
Q Consensus       142 ~~El~AVlAHELGH~  156 (623)
                      ..++.+|+.||+||.
T Consensus       101 ~~~~~~~~~HEiGHa  115 (156)
T cd04279         101 AENLQAIALHELGHA  115 (156)
T ss_pred             chHHHHHHHHHhhhh
Confidence            568999999999996


No 72 
>PRK04860 hypothetical protein; Provisional
Probab=63.99  E-value=18  Score=35.38  Aligned_cols=64  Identities=22%  Similarity=0.128  Sum_probs=39.3

Q ss_pred             HHHHHHHHcCCCCC--eEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCC
Q psy4798         441 RIEQLSASVKFPLK--KLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKG  518 (623)
Q Consensus       441 ~I~~la~~~gfp~~--~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~g  518 (623)
                      .++.-.+.+|-|.+  .+. .+...|+    |-.+.+ ....|-+.-.++..                            
T Consensus        12 ~~~~a~~~f~~~f~~p~~~-f~~R~rt----aG~~~l-~~~~I~~Np~ll~~----------------------------   57 (160)
T PRK04860         12 CLAQANLYFKRTFPEPKVS-YTQRGTS----AGTAWL-QSNEIRLNPVLLLE----------------------------   57 (160)
T ss_pred             HHHHHHHHhCCCCCCCEEE-Eeecchh----hcchhH-hcCCeeeCHHHHhh----------------------------
Confidence            33333344565543  443 3433332    333332 34568888888876                            


Q ss_pred             CCHHHHHHHHHHHhhhhhcc
Q psy4798         519 CDTEEVLAVLAHELGHWKYN  538 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~  538 (623)
                      -+.+++..|++||+.|+...
T Consensus        58 ~~~~~l~~~v~HEl~H~~~~   77 (160)
T PRK04860         58 NQQAFIDEVVPHELAHLLVY   77 (160)
T ss_pred             CcHHHHHhHHHHHHHHHHHH
Confidence            57899999999999998543


No 73 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=63.98  E-value=1.8  Score=46.26  Aligned_cols=18  Identities=28%  Similarity=0.540  Sum_probs=15.0

Q ss_pred             HHHHHHHhhhhhhcchHH
Q psy4798         145 VLAVLAHELGHWKYNHVL  162 (623)
Q Consensus       145 l~AVlAHELGH~k~~h~~  162 (623)
                      +.-|+.||++|+.++|..
T Consensus       165 ~a~i~~HEiaHv~~~h~~  182 (305)
T PRK09672        165 LAWILLHEIAHVEFQHSS  182 (305)
T ss_pred             HHHHHHHHHHHHHhcccc
Confidence            445789999999999976


No 74 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=63.96  E-value=6  Score=45.73  Aligned_cols=22  Identities=27%  Similarity=0.484  Sum_probs=15.4

Q ss_pred             CCCCCCCceeeecCCC-CeEEEE
Q psy4798          83 SKRSEHSNAYFYGFFK-NKRIVL  104 (623)
Q Consensus        83 s~~~~~~NA~~~G~~~-~krIvl  104 (623)
                      +++.+.+-||.+|+.+ ..+|++
T Consensus       348 ~r~gK~~Ga~~~~~~~~~p~il~  370 (591)
T TIGR00181       348 ENKGKRSGAYSIGGYKVKPYILM  370 (591)
T ss_pred             CCCCCCCCcccCCCCCCCCeEEE
Confidence            4567788899999966 344544


No 75 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=63.90  E-value=6.6  Score=38.51  Aligned_cols=16  Identities=38%  Similarity=0.372  Sum_probs=13.0

Q ss_pred             hHHHHHHHHHhhhhhh
Q psy4798         142 TEEVLAVLAHELGHWK  157 (623)
Q Consensus       142 ~~El~AVlAHELGH~k  157 (623)
                      .....-++|||+||--
T Consensus       128 ~~~~a~~~AHElGH~l  143 (194)
T cd04269         128 LLLFAVTMAHELGHNL  143 (194)
T ss_pred             hHHHHHHHHHHHHhhc
Confidence            3567889999999964


No 76 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=63.87  E-value=4  Score=38.57  Aligned_cols=16  Identities=31%  Similarity=0.428  Sum_probs=13.8

Q ss_pred             CHHHHHHHHHHHhhhh
Q psy4798         520 DTEEVLAVLAHELGHW  535 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~  535 (623)
                      +.++..+|++|||||-
T Consensus        90 ~~~~~~~~~~HEiGHa  105 (165)
T cd04268          90 SGARLRNTAEHELGHA  105 (165)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4468999999999996


No 77 
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=63.80  E-value=6.9  Score=43.65  Aligned_cols=55  Identities=16%  Similarity=0.248  Sum_probs=29.8

Q ss_pred             CCCCCCCceeeecCCCC--------eEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhh
Q psy4798          83 SKRSEHSNAYFYGFFKN--------KRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELG  154 (623)
Q Consensus        83 s~~~~~~NA~~~G~~~~--------krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELG  154 (623)
                      +++.+.+.||.+++.++        ...++.-+.-..                     ...+.+.|+-+++ -.|+||+|
T Consensus       194 ~R~gK~~ga~~~~~~~~~~~~~~~~~~~~i~~n~~~~---------------------~~~~~~ll~~~~v-~tLfHE~G  251 (458)
T PF01432_consen  194 PRPGKRSGAFCFTLRPSRSDGERQLPVPYIFCNFTGP---------------------SAGKPSLLSHDDV-ETLFHEFG  251 (458)
T ss_dssp             --TTS-SS-EEEEEEC-BTTSTCECEEEEEEEEE-S----------------------BTTC--B-SHHHH-HHHHHHHH
T ss_pred             hcCCCCCCceeCCccCccccccCCCCceEEEecCCCC---------------------CCCCCCccChhhH-HHHHHHHh
Confidence            45567788999998765        444444333221                     1122344788898 67999999


Q ss_pred             hhhcc
Q psy4798         155 HWKYN  159 (623)
Q Consensus       155 H~k~~  159 (623)
                      |--|.
T Consensus       252 Ha~H~  256 (458)
T PF01432_consen  252 HAMHS  256 (458)
T ss_dssp             HHHHH
T ss_pred             HHHHH
Confidence            97653


No 78 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=63.49  E-value=6  Score=40.42  Aligned_cols=17  Identities=29%  Similarity=0.575  Sum_probs=14.5

Q ss_pred             CChHHHHHHHHHhhhhh
Q psy4798         140 CDTEEVLAVLAHELGHW  156 (623)
Q Consensus       140 l~~~El~AVlAHELGH~  156 (623)
                      .+.+.++.|++|||||.
T Consensus       128 ~~~~~~~hvi~HEiGH~  144 (211)
T PF12388_consen  128 YSVNVIEHVITHEIGHC  144 (211)
T ss_pred             CchhHHHHHHHHHhhhh
Confidence            46677899999999995


No 79 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=63.38  E-value=3.6  Score=39.71  Aligned_cols=23  Identities=30%  Similarity=0.531  Sum_probs=13.1

Q ss_pred             HHHHHHHHHHhhhhh-ccCHHHHH
Q psy4798         522 EEVLAVLAHELGHWK-YNHVLKSM  544 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k-~~H~~k~~  544 (623)
                      ...-.+|+||+|||- +.|++..-
T Consensus        67 ~~~g~TltHEvGH~LGL~HtF~~~   90 (154)
T PF05572_consen   67 YNFGKTLTHEVGHWLGLYHTFGGG   90 (154)
T ss_dssp             S-SSHHHHHHHHHHTT---TT---
T ss_pred             cccccchhhhhhhhhcccccccCC
Confidence            344589999999996 67776544


No 80 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=63.05  E-value=5.2  Score=41.32  Aligned_cols=18  Identities=50%  Similarity=0.449  Sum_probs=14.5

Q ss_pred             CCHHHHHHHHHHHhhhhh
Q psy4798         519 CDTEEVLAVLAHELGHWK  536 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k  536 (623)
                      +...+...++||||||--
T Consensus       162 ~~~~~~a~t~AHElGHnl  179 (244)
T cd04270         162 VPTKESDLVTAHELGHNF  179 (244)
T ss_pred             cchhHHHHHHHHHHHHhc
Confidence            455668899999999973


No 81 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=63.05  E-value=4.5  Score=38.41  Aligned_cols=16  Identities=44%  Similarity=0.603  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHhhhhh
Q psy4798         521 TEEVLAVLAHELGHWK  536 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~k  536 (623)
                      ..++.+|+.||+||.-
T Consensus       101 ~~~~~~~~~HEiGHaL  116 (156)
T cd04279         101 AENLQAIALHELGHAL  116 (156)
T ss_pred             chHHHHHHHHHhhhhh
Confidence            4689999999999973


No 82 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=62.90  E-value=3.3  Score=38.63  Aligned_cols=17  Identities=29%  Similarity=0.470  Sum_probs=14.6

Q ss_pred             CChHHHHHHHHHhhhhh
Q psy4798         140 CDTEEVLAVLAHELGHW  156 (623)
Q Consensus       140 l~~~El~AVlAHELGH~  156 (623)
                      .+..++..|+.||+||.
T Consensus       100 ~~~~~~~~v~~HEiGHa  116 (154)
T PF00413_consen  100 DSGNDLQSVAIHEIGHA  116 (154)
T ss_dssp             SSSEEHHHHHHHHHHHH
T ss_pred             hhhhhhhhhhhhccccc
Confidence            45568999999999997


No 83 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=61.51  E-value=4.3  Score=38.35  Aligned_cols=16  Identities=31%  Similarity=0.428  Sum_probs=13.6

Q ss_pred             ChHHHHHHHHHhhhhh
Q psy4798         141 DTEEVLAVLAHELGHW  156 (623)
Q Consensus       141 ~~~El~AVlAHELGH~  156 (623)
                      +.++..+|++|||||-
T Consensus        90 ~~~~~~~~~~HEiGHa  105 (165)
T cd04268          90 SGARLRNTAEHELGHA  105 (165)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4467999999999995


No 84 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=60.30  E-value=6.6  Score=34.53  Aligned_cols=22  Identities=36%  Similarity=0.450  Sum_probs=18.0

Q ss_pred             CCHHHHHHHHHHHhhhhhccCH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~  540 (623)
                      .+++.+..|++||++|.-+.+.
T Consensus        20 ~~~~~~~~~l~HE~~H~~~~~~   41 (128)
T PF13485_consen   20 SDEDWLDRVLAHELAHQWFGNY   41 (128)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHH
Confidence            4777888999999999965555


No 85 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=58.74  E-value=5.7  Score=37.01  Aligned_cols=22  Identities=27%  Similarity=0.456  Sum_probs=16.5

Q ss_pred             CCHHHHHHHHHHHhhhh-hccCH
Q psy4798         519 CDTEEVLAVLAHELGHW-KYNHV  540 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~-k~~H~  540 (623)
                      .+..++..|+.||+||. =+.|.
T Consensus       100 ~~~~~~~~v~~HEiGHaLGL~H~  122 (154)
T PF00413_consen  100 DSGNDLQSVAIHEIGHALGLDHS  122 (154)
T ss_dssp             SSSEEHHHHHHHHHHHHTTBESS
T ss_pred             hhhhhhhhhhhhccccccCcCcC
Confidence            34567999999999998 34443


No 86 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=58.41  E-value=8.6  Score=44.23  Aligned_cols=19  Identities=21%  Similarity=0.263  Sum_probs=14.7

Q ss_pred             cCCCCCCCceeeecCCCCe
Q psy4798          82 SSKRSEHSNAYFYGFFKNK  100 (623)
Q Consensus        82 ~s~~~~~~NA~~~G~~~~k  100 (623)
                      .+++.+.+.||.+|+...+
T Consensus       305 ~~r~gK~~Gayc~~~~~~~  323 (549)
T TIGR02289       305 ESRKGKAAGGYCTYLPKYK  323 (549)
T ss_pred             CCCCCCCCCcccCCCCCCC
Confidence            3667788999999987643


No 87 
>KOG2765|consensus
Probab=57.74  E-value=47  Score=36.93  Aligned_cols=103  Identities=22%  Similarity=0.249  Sum_probs=73.4

Q ss_pred             hHHHHHHHHHHHHHHHHhhhccHHHhhhcccccccCCCcccH----------HHHHHHHHHHHHHHHhhccccccchhhh
Q psy4798         291 EIVTSCLFIVLFNIFSTVIGLPISFYHHFVLEEKHGFNKQTY----------GFFVKDQIKSFIVSLILSIPLTGAVVYI  360 (623)
Q Consensus       291 ~~~~~~~f~~~~~l~~~li~lP~~~Y~~f~le~~~Gfn~~T~----------~~f~~d~vk~~~l~~~~s~~~~~~~~~~  360 (623)
                      +-.-.-+|+..+.+++-++-.|.-+--+|..+|+|.+.+++.          +.++.|.+......+  -.|+.+.+-+ 
T Consensus       275 ~rvdi~lffGfvGLfnllllwP~l~iL~~~~~e~F~lP~~~q~~~vv~~~ligtvvSDylW~~a~~l--Ts~Lv~Tlgm-  351 (416)
T KOG2765|consen  275 ERVDIQLFFGFVGLFNLLLLWPPLIILDFFGEERFELPSSTQFSLVVFNNLIGTVVSDYLWAKAVLL--TSPLVVTLGM-  351 (416)
T ss_pred             ccccHHHHHHHHHHHHHHHHhHHHHHHHHhccCcccCCCCceeEeeeHhhHHHHHHHHHHHHHHHHh--ccchhheeee-
Confidence            345556888999999999999988888888999999999983          778888765444333  3354443322 


Q ss_pred             hhccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHH
Q psy4798         361 IQTYGFFVKDQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFL  415 (623)
Q Consensus       361 ~~~~~~~~~~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~  415 (623)
                                         -+.+|+....=.+++.-...++.++....+.+++++
T Consensus       352 -------------------SltIPLA~~aD~l~k~~~~S~~~iiGsi~Ifv~Fv~  387 (416)
T KOG2765|consen  352 -------------------SLTIPLAMFADVLIKGKHPSALYIIGSIPIFVGFVI  387 (416)
T ss_pred             -------------------eEeeeHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhh
Confidence                               456788877777787666777777666666665554


No 88 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=57.59  E-value=5.1  Score=39.21  Aligned_cols=16  Identities=38%  Similarity=0.428  Sum_probs=13.3

Q ss_pred             CHHHHHHHHHHHhhhh
Q psy4798         520 DTEEVLAVLAHELGHW  535 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~  535 (623)
                      ....-.-|+|||+||.
T Consensus       138 ~~~~~~~~~AHEiGH~  153 (196)
T PF13688_consen  138 PTYNGAITFAHEIGHN  153 (196)
T ss_dssp             -HHHHHHHHHHHHHHH
T ss_pred             CCCceehhhHHhHHHh
Confidence            4577888999999997


No 89 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=57.37  E-value=22  Score=40.24  Aligned_cols=20  Identities=30%  Similarity=0.399  Sum_probs=15.8

Q ss_pred             CCHHHHHHHHHHHhhhhhccC
Q psy4798         519 CDTEEVLAVLAHELGHWKYNH  539 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H  539 (623)
                      ++-+|+ -.|+||+||--|+.
T Consensus       259 l~~~~V-~TLfHEfGHalH~~  278 (472)
T cd06455         259 LRHDEV-ETFFHEFGHVIHHL  278 (472)
T ss_pred             CCHHHH-HHHHHHHHHHHHHH
Confidence            677886 46899999997743


No 90 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=57.00  E-value=7.8  Score=34.03  Aligned_cols=22  Identities=36%  Similarity=0.450  Sum_probs=17.8

Q ss_pred             CChHHHHHHHHHhhhhhhcchH
Q psy4798         140 CDTEEVLAVLAHELGHWKYNHV  161 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h~  161 (623)
                      .+.+.+..|++||++|.-+.+.
T Consensus        20 ~~~~~~~~~l~HE~~H~~~~~~   41 (128)
T PF13485_consen   20 SDEDWLDRVLAHELAHQWFGNY   41 (128)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHH
Confidence            4667788999999999866554


No 91 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=55.83  E-value=7.8  Score=39.60  Aligned_cols=20  Identities=40%  Similarity=0.390  Sum_probs=16.7

Q ss_pred             CCHHHHHHHHHHHhhhhhcc
Q psy4798         519 CDTEEVLAVLAHELGHWKYN  538 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~  538 (623)
                      .+++.-.-++||||||.-.+
T Consensus        67 ~~~~r~rFtlAHELGH~llH   86 (213)
T COG2856          67 NSLERKRFTLAHELGHALLH   86 (213)
T ss_pred             CCHHHHHHHHHHHHhHHHhc
Confidence            47788999999999998543


No 92 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=55.69  E-value=5.5  Score=38.49  Aligned_cols=14  Identities=43%  Similarity=0.878  Sum_probs=11.1

Q ss_pred             HHHHHHHHhhhhhh
Q psy4798         144 EVLAVLAHELGHWK  157 (623)
Q Consensus       144 El~AVlAHELGH~k  157 (623)
                      ...-+|+||+|||-
T Consensus        68 ~~g~TltHEvGH~L   81 (154)
T PF05572_consen   68 NFGKTLTHEVGHWL   81 (154)
T ss_dssp             -SSHHHHHHHHHHT
T ss_pred             ccccchhhhhhhhh
Confidence            34579999999985


No 93 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=54.50  E-value=5.5  Score=38.95  Aligned_cols=15  Identities=40%  Similarity=0.395  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHhhhhh
Q psy4798         522 EEVLAVLAHELGHWK  536 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k  536 (623)
                      -+...++|||+||+-
T Consensus       131 ~~~~~~~aHElGH~l  145 (192)
T cd04267         131 LLTALTMAHELGHNL  145 (192)
T ss_pred             eeehhhhhhhHHhhc
Confidence            456779999999995


No 94 
>PF15061 DUF4538:  Domain of unknown function (DUF4538)
Probab=54.39  E-value=17  Score=29.94  Aligned_cols=43  Identities=12%  Similarity=0.240  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHHHHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCC
Q psy4798         404 LWVFIILMSLFLMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLK  454 (623)
Q Consensus       404 ~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~  454 (623)
                      ..++...++++...+||+++.|+.+       +|=+++.++. .+.|++..
T Consensus         8 ~~~~ggfVg~iG~a~Ypi~~~Pmm~-------~eeYk~~Q~~-nR~gI~qe   50 (58)
T PF15061_consen    8 ALFVGGFVGLIGAALYPIYFRPMMN-------PEEYKKEQKI-NRAGIKQE   50 (58)
T ss_pred             hhhHHHHHHHHHHHHhhhhcccccC-------hHHHHHHHHH-HHhcccHh
Confidence            3455666777888899999999887       2334444443 44555443


No 95 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=54.36  E-value=8.4  Score=39.37  Aligned_cols=21  Identities=38%  Similarity=0.366  Sum_probs=17.1

Q ss_pred             CChHHHHHHHHHhhhhhhcch
Q psy4798         140 CDTEEVLAVLAHELGHWKYNH  160 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h  160 (623)
                      .+++.-.-++||||||.-.+.
T Consensus        67 ~~~~r~rFtlAHELGH~llH~   87 (213)
T COG2856          67 NSLERKRFTLAHELGHALLHT   87 (213)
T ss_pred             CCHHHHHHHHHHHHhHHHhcc
Confidence            577788999999999976544


No 96 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=54.25  E-value=6.4  Score=35.49  Aligned_cols=10  Identities=50%  Similarity=0.850  Sum_probs=9.5

Q ss_pred             HHHHHHhhhh
Q psy4798         526 AVLAHELGHW  535 (623)
Q Consensus       526 AVLAHELGH~  535 (623)
                      .+++||+||.
T Consensus       109 ~~~~HEiGH~  118 (124)
T PF13582_consen  109 DTFAHEIGHN  118 (124)
T ss_dssp             THHHHHHHHH
T ss_pred             eEeeehhhHh
Confidence            8999999997


No 97 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=53.95  E-value=5.9  Score=38.73  Aligned_cols=17  Identities=35%  Similarity=0.343  Sum_probs=13.0

Q ss_pred             CChHHHHHHHHHhhhhh
Q psy4798         140 CDTEEVLAVLAHELGHW  156 (623)
Q Consensus       140 l~~~El~AVlAHELGH~  156 (623)
                      .+...-.-|+|||+||.
T Consensus       137 ~~~~~~~~~~AHEiGH~  153 (196)
T PF13688_consen  137 PPTYNGAITFAHEIGHN  153 (196)
T ss_dssp             --HHHHHHHHHHHHHHH
T ss_pred             CCCCceehhhHHhHHHh
Confidence            34567788999999995


No 98 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=53.04  E-value=18  Score=40.82  Aligned_cols=21  Identities=29%  Similarity=0.298  Sum_probs=16.2

Q ss_pred             CCCChHHHHHHHHHhhhhhhcc
Q psy4798         138 KGCDTEEVLAVLAHELGHWKYN  159 (623)
Q Consensus       138 ~~l~~~El~AVlAHELGH~k~~  159 (623)
                      ..|+-+|+. .|+||+||--|+
T Consensus       257 ~ll~~~~V~-TLfHEfGHalH~  277 (472)
T cd06455         257 SLLRHDEVE-TFFHEFGHVIHH  277 (472)
T ss_pred             CCCCHHHHH-HHHHHHHHHHHH
Confidence            347778864 589999998873


No 99 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=52.14  E-value=7.1  Score=38.50  Aligned_cols=16  Identities=38%  Similarity=0.557  Sum_probs=13.5

Q ss_pred             CHHHHHHHHHHHhhhh
Q psy4798         520 DTEEVLAVLAHELGHW  535 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~  535 (623)
                      +......++|||+||.
T Consensus       127 ~~~~~a~~~AHelGH~  142 (199)
T PF01421_consen  127 SGLSFAVIIAHELGHN  142 (199)
T ss_dssp             SHHHHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHHHHHHh
Confidence            4567888999999996


No 100
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=51.50  E-value=12  Score=37.84  Aligned_cols=14  Identities=29%  Similarity=0.342  Sum_probs=11.6

Q ss_pred             HHHHHHHHHhhhhh
Q psy4798         523 EVLAVLAHELGHWK  536 (623)
Q Consensus       523 Ei~AVLAHELGH~k  536 (623)
                      .-..++|||+||--
T Consensus       144 ~~~~~~AHElGH~l  157 (220)
T cd04272         144 YGVYTMTHELAHLL  157 (220)
T ss_pred             ccHHHHHHHHHHHh
Confidence            44799999999974


No 101
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=51.31  E-value=7.1  Score=35.19  Aligned_cols=11  Identities=45%  Similarity=0.709  Sum_probs=9.7

Q ss_pred             HHHHHhhhhhh
Q psy4798         147 AVLAHELGHWK  157 (623)
Q Consensus       147 AVlAHELGH~k  157 (623)
                      .+++||+||.-
T Consensus       109 ~~~~HEiGH~l  119 (124)
T PF13582_consen  109 DTFAHEIGHNL  119 (124)
T ss_dssp             THHHHHHHHHT
T ss_pred             eEeeehhhHhc
Confidence            89999999963


No 102
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=51.07  E-value=12  Score=35.11  Aligned_cols=14  Identities=50%  Similarity=0.869  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHhhhh
Q psy4798         522 EEVLAVLAHELGHW  535 (623)
Q Consensus       522 ~Ei~AVLAHELGH~  535 (623)
                      |+|.-|+-||+||.
T Consensus       107 d~vthvliHEIgHh  120 (136)
T COG3824         107 DQVTHVLIHEIGHH  120 (136)
T ss_pred             hHhhhhhhhhhhhh
Confidence            56778999999996


No 103
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=50.24  E-value=14  Score=43.17  Aligned_cols=44  Identities=39%  Similarity=0.602  Sum_probs=30.0

Q ss_pred             CCCCCCCceeeecCCC--CeEEEEe-cchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798          83 SKRSEHSNAYFYGFFK--NKRIVLF-DTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN  159 (623)
Q Consensus        83 s~~~~~~NA~~~G~~~--~krIvl~-d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~  159 (623)
                      .++.+.+=||+.|...  ..+|.+. ++-+.+                                  ..+||||+||-.|.
T Consensus       349 ~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~d----------------------------------V~TLaHElGHs~Hs  394 (598)
T COG1164         349 PRKGKRSGAYSIGFYKGDHPFILMNYDGTLRD----------------------------------VFTLAHELGHSVHS  394 (598)
T ss_pred             CCCCCCCCcccCCCCCCCCCeEEEeCCCchhH----------------------------------HHHHHHHccHHHHH
Confidence            4456788999999976  3355554 332222                                  46899999998876


Q ss_pred             h
Q psy4798         160 H  160 (623)
Q Consensus       160 h  160 (623)
                      -
T Consensus       395 ~  395 (598)
T COG1164         395 Y  395 (598)
T ss_pred             H
Confidence            3


No 104
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=50.21  E-value=7.9  Score=37.82  Aligned_cols=13  Identities=46%  Similarity=0.641  Sum_probs=9.9

Q ss_pred             HHHHHHHHhhhhh
Q psy4798         524 VLAVLAHELGHWK  536 (623)
Q Consensus       524 i~AVLAHELGH~k  536 (623)
                      -.-++|||+||--
T Consensus       111 ~~~~~aHElGH~l  123 (173)
T PF13574_consen  111 GIDTFAHELGHQL  123 (173)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             eeeeehhhhHhhc
Confidence            3445999999973


No 105
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=49.67  E-value=11  Score=43.29  Aligned_cols=21  Identities=29%  Similarity=0.282  Sum_probs=14.7

Q ss_pred             CHHHHHHHHHHHhhhhhccCHH
Q psy4798         520 DTEEVLAVLAHELGHWKYNHVL  541 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~k~~H~~  541 (623)
                      +.+++ .+|+||+||--|....
T Consensus       334 t~~dv-~TL~HElGHa~H~~~s  354 (549)
T TIGR02289       334 TSGDI-DVLTHEAGHAFHVYES  354 (549)
T ss_pred             ChhHH-HHHHHHhhHHHHHHHh
Confidence            33444 5799999999876543


No 106
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=49.47  E-value=9.7  Score=37.56  Aligned_cols=16  Identities=38%  Similarity=0.557  Sum_probs=12.8

Q ss_pred             ChHHHHHHHHHhhhhh
Q psy4798         141 DTEEVLAVLAHELGHW  156 (623)
Q Consensus       141 ~~~El~AVlAHELGH~  156 (623)
                      +......++|||+||.
T Consensus       127 ~~~~~a~~~AHelGH~  142 (199)
T PF01421_consen  127 SGLSFAVIIAHELGHN  142 (199)
T ss_dssp             SHHHHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHHHHHHh
Confidence            3456778999999995


No 107
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=49.31  E-value=16  Score=39.97  Aligned_cols=18  Identities=28%  Similarity=0.481  Sum_probs=12.9

Q ss_pred             CCCCCCCceeeecCCCCe
Q psy4798          83 SKRSEHSNAYFYGFFKNK  100 (623)
Q Consensus        83 s~~~~~~NA~~~G~~~~k  100 (623)
                      +++.+.+-||.+|+.+.+
T Consensus       191 ~r~gK~~gaf~~~~~~~~  208 (427)
T cd06459         191 PRKGKRSGAYCTGLPPGK  208 (427)
T ss_pred             CCCCCCCCeecCCCCCCC
Confidence            556677789999987433


No 108
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=49.29  E-value=7.1  Score=38.16  Aligned_cols=15  Identities=40%  Similarity=0.395  Sum_probs=12.2

Q ss_pred             HHHHHHHHHhhhhhh
Q psy4798         143 EEVLAVLAHELGHWK  157 (623)
Q Consensus       143 ~El~AVlAHELGH~k  157 (623)
                      -+...++|||+||+-
T Consensus       131 ~~~~~~~aHElGH~l  145 (192)
T cd04267         131 LLTALTMAHELGHNL  145 (192)
T ss_pred             eeehhhhhhhHHhhc
Confidence            355679999999985


No 109
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=49.21  E-value=8.7  Score=36.53  Aligned_cols=15  Identities=40%  Similarity=0.720  Sum_probs=13.1

Q ss_pred             HHHHHHHHHHHhhhh
Q psy4798         521 TEEVLAVLAHELGHW  535 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~  535 (623)
                      ...+.+|+.|||||.
T Consensus       104 ~~~~~~~~~HEiGHa  118 (157)
T cd04278         104 GTDLFSVAAHEIGHA  118 (157)
T ss_pred             cchHHHHHHHHhccc
Confidence            356999999999997


No 110
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=48.43  E-value=23  Score=39.57  Aligned_cols=62  Identities=18%  Similarity=0.223  Sum_probs=32.2

Q ss_pred             CeEEEEeCCCCCCcccEEEeccCCC--------ceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798         454 KKLYVVEGSKRSEHSNAYFYGFFKN--------KRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL  525 (623)
Q Consensus       454 ~~v~vv~~s~rs~~~NAy~~G~~~~--------krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~  525 (623)
                      ..+|+=--++..+.+.||.+++.+.        ...++.-+.-..          ..           ..-..++.+++ 
T Consensus       186 g~~ylDl~~R~gK~~ga~~~~~~~~~~~~~~~~~~~~i~~n~~~~----------~~-----------~~~~ll~~~~v-  243 (458)
T PF01432_consen  186 GYIYLDLYPRPGKRSGAFCFTLRPSRSDGERQLPVPYIFCNFTGP----------SA-----------GKPSLLSHDDV-  243 (458)
T ss_dssp             EEEEEEES--TTS-SS-EEEEEEC-BTTSTCECEEEEEEEEE-S-----------BT-----------TC--B-SHHHH-
T ss_pred             hcccccchhcCCCCCCceeCCccCccccccCCCCceEEEecCCCC----------CC-----------CCCCccChhhH-
Confidence            5555422333345678999988765        444454443321          00           11122688998 


Q ss_pred             HHHHHHhhhhhc
Q psy4798         526 AVLAHELGHWKY  537 (623)
Q Consensus       526 AVLAHELGH~k~  537 (623)
                      -.|+||+||--|
T Consensus       244 ~tLfHE~GHa~H  255 (458)
T PF01432_consen  244 ETLFHEFGHAMH  255 (458)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHhHHHH
Confidence            779999999855


No 111
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=48.04  E-value=9.5  Score=36.25  Aligned_cols=16  Identities=38%  Similarity=0.677  Sum_probs=13.4

Q ss_pred             ChHHHHHHHHHhhhhh
Q psy4798         141 DTEEVLAVLAHELGHW  156 (623)
Q Consensus       141 ~~~El~AVlAHELGH~  156 (623)
                      ....+.+|+.|||||.
T Consensus       103 ~~~~~~~~~~HEiGHa  118 (157)
T cd04278         103 GGTDLFSVAAHEIGHA  118 (157)
T ss_pred             ccchHHHHHHHHhccc
Confidence            3456999999999996


No 112
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=47.93  E-value=15  Score=37.10  Aligned_cols=20  Identities=15%  Similarity=-0.038  Sum_probs=14.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHH
Q psy4798           6 EFQIFYGIIGFSWIVFLFEF   25 (623)
Q Consensus         6 ~~~i~~~~~~~~~~~~~~~~   25 (623)
                      ++.+-|.+..++.+...+..
T Consensus        21 ~~~~~yv~~~~n~vn~~Y~~   40 (220)
T cd04272          21 EQLIRYLAVMVNAANLRYRD   40 (220)
T ss_pred             HHHHHHHHHHHHHHHHHhhh
Confidence            45677778888877776664


No 113
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=47.30  E-value=1.6e+02  Score=30.23  Aligned_cols=83  Identities=16%  Similarity=0.153  Sum_probs=44.2

Q ss_pred             hhhhhccCCcCCchh--H-HHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EEEehhHHhhccCCCccc
Q psy4798         423 IAPLFDKYTPLPDGE--L-KSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADK  498 (623)
Q Consensus       423 I~plf~k~~pl~~~~--L-~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IVL~dtLl~~~~~~~~~~  498 (623)
                      +..-|+||...+...  - .+--..+.+..|+---+|-.++|.- +.|-       -+++| +-|++.-...        
T Consensus        24 vks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~L-TDHY-------DP~~kvvrLSe~~y~g--------   87 (226)
T COG2738          24 VKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTL-TDHY-------DPRRKVVRLSEANYYG--------   87 (226)
T ss_pred             HHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCc-cccc-------ChhhheeeccccccCC--------
Confidence            344567777665321  1 1222234667777655666666532 3331       23334 4455544321        


Q ss_pred             cCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHH
Q psy4798         499 KDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK  542 (623)
Q Consensus       499 ~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k  542 (623)
                                          -+ =.=.||-|||.||--+.+.-.
T Consensus        88 --------------------~S-ia~~aVAAHEVGHAiQd~~~Y  110 (226)
T COG2738          88 --------------------PS-IAAIAVAAHEVGHAIQDQEDY  110 (226)
T ss_pred             --------------------cc-HHHHHHHHHHhhHHHhhhccc
Confidence                                23 345799999999976655433


No 114
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=47.29  E-value=9.1  Score=36.26  Aligned_cols=10  Identities=60%  Similarity=0.936  Sum_probs=8.9

Q ss_pred             HHHHHhhhhh
Q psy4798         147 AVLAHELGHW  156 (623)
Q Consensus       147 AVlAHELGH~  156 (623)
                      -|.|||+||+
T Consensus        79 RIaaHE~GHi   88 (132)
T PF02031_consen   79 RIAAHELGHI   88 (132)
T ss_dssp             HHHHHHHHHH
T ss_pred             eeeeehhccc
Confidence            3999999996


No 115
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=46.66  E-value=9.1  Score=37.40  Aligned_cols=11  Identities=55%  Similarity=0.921  Sum_probs=8.9

Q ss_pred             HHHHHHhhhhh
Q psy4798         146 LAVLAHELGHW  156 (623)
Q Consensus       146 ~AVlAHELGH~  156 (623)
                      .-++|||+||-
T Consensus       112 ~~~~aHElGH~  122 (173)
T PF13574_consen  112 IDTFAHELGHQ  122 (173)
T ss_dssp             HHHHHHHHHHH
T ss_pred             eeeehhhhHhh
Confidence            34599999995


No 116
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=46.65  E-value=14  Score=40.44  Aligned_cols=20  Identities=35%  Similarity=0.426  Sum_probs=13.9

Q ss_pred             HHHHHHHHHHHhhhhhccCHH
Q psy4798         521 TEEVLAVLAHELGHWKYNHVL  541 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~k~~H~~  541 (623)
                      .++ +-.|+||+||--|....
T Consensus       220 ~~~-v~tl~HE~GHa~h~~~~  239 (427)
T cd06459         220 LDD-VFTLAHELGHAFHSYLS  239 (427)
T ss_pred             hhh-HHHHHHHhhHHHHHHHH
Confidence            344 56799999997665443


No 117
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=46.26  E-value=16  Score=42.22  Aligned_cols=17  Identities=41%  Similarity=0.430  Sum_probs=13.3

Q ss_pred             HHHHHHHhhhhhccCHH
Q psy4798         525 LAVLAHELGHWKYNHVL  541 (623)
Q Consensus       525 ~AVLAHELGH~k~~H~~  541 (623)
                      +.+||||+||--|+...
T Consensus       379 v~TLaHElGHa~H~~~~  395 (591)
T TIGR00181       379 VFTLAHELGHSMHSYFS  395 (591)
T ss_pred             HHHHHHHhhhHHHHHHH
Confidence            56899999999766543


No 118
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=45.46  E-value=15  Score=42.52  Aligned_cols=16  Identities=19%  Similarity=0.401  Sum_probs=12.4

Q ss_pred             CCCCCCCceeeecCCC
Q psy4798          83 SKRSEHSNAYFYGFFK   98 (623)
Q Consensus        83 s~~~~~~NA~~~G~~~   98 (623)
                      +++.+.+-||.+|+.+
T Consensus       344 ~r~gK~~Ga~~~~~~~  359 (587)
T TIGR02290       344 PRPGKRGGAFCTGFPP  359 (587)
T ss_pred             CCCCCCCCcccCCCCC
Confidence            5566788899999765


No 119
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=44.50  E-value=22  Score=35.34  Aligned_cols=40  Identities=28%  Similarity=0.225  Sum_probs=31.3

Q ss_pred             eeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798          92 YFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV  161 (623)
Q Consensus        92 ~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~  161 (623)
                      ..-|+.+ .+.|+|..+-+.                              +..+++-+|+|||-|...+=.
T Consensus        47 ~~Ggf~p~~~~I~lC~N~~~------------------------------~~~~l~~~l~HELIHayD~cr   87 (173)
T PF09768_consen   47 VSGGFDPSKKGIVLCQNRIR------------------------------SQGHLEDTLTHELIHAYDHCR   87 (173)
T ss_pred             CcCCccCCCCCEEEeeCCCC------------------------------CHHHHHHHHHHHHHHHHHHHh
Confidence            5567766 788999977665                              678999999999999865433


No 120
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=44.09  E-value=11  Score=36.51  Aligned_cols=11  Identities=64%  Similarity=0.782  Sum_probs=9.0

Q ss_pred             HHHHHHhhhhh
Q psy4798         146 LAVLAHELGHW  156 (623)
Q Consensus       146 ~AVlAHELGH~  156 (623)
                      .=|+||||+|=
T Consensus       136 lDVvaHEltHG  146 (150)
T PF01447_consen  136 LDVVAHELTHG  146 (150)
T ss_dssp             HHHHHHHHHHH
T ss_pred             cceeeeccccc
Confidence            34999999994


No 121
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=43.78  E-value=15  Score=43.05  Aligned_cols=45  Identities=29%  Similarity=0.338  Sum_probs=29.3

Q ss_pred             CcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHH
Q psy4798         466 EHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVL  541 (623)
Q Consensus       466 ~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~  541 (623)
                      +.+=||..|+.+.....++-+--..                               -.=+.+||||+||-.|....
T Consensus       353 KrsGaYs~~~~~~~~p~IlmN~~gt-------------------------------~~dV~TLaHElGHs~Hs~~s  397 (598)
T COG1164         353 KRSGAYSIGFYKGDHPFILMNYDGT-------------------------------LRDVFTLAHELGHSVHSYFS  397 (598)
T ss_pred             CCCCcccCCCCCCCCCeEEEeCCCc-------------------------------hhHHHHHHHHccHHHHHHHH
Confidence            4588999998764444443332221                               22367899999999887544


No 122
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=43.77  E-value=38  Score=34.45  Aligned_cols=20  Identities=35%  Similarity=0.385  Sum_probs=14.8

Q ss_pred             HHHHHHHhhhhhhcchHHHH
Q psy4798         145 VLAVLAHELGHWKYNHVLKS  164 (623)
Q Consensus       145 l~AVlAHELGH~k~~h~~k~  164 (623)
                      =.||-|||.||--+.+--..
T Consensus        92 ~~aVAAHEVGHAiQd~~~Y~  111 (226)
T COG2738          92 AIAVAAHEVGHAIQDQEDYA  111 (226)
T ss_pred             HHHHHHHHhhHHHhhhcccH
Confidence            46899999999766554433


No 123
>PRK04860 hypothetical protein; Provisional
Probab=43.77  E-value=16  Score=35.83  Aligned_cols=19  Identities=26%  Similarity=0.282  Sum_probs=16.1

Q ss_pred             CChHHHHHHHHHhhhhhhc
Q psy4798         140 CDTEEVLAVLAHELGHWKY  158 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~  158 (623)
                      -+.+++..||+||++|+.-
T Consensus        58 ~~~~~l~~~v~HEl~H~~~   76 (160)
T PRK04860         58 NQQAFIDEVVPHELAHLLV   76 (160)
T ss_pred             CcHHHHHhHHHHHHHHHHH
Confidence            3678999999999999743


No 124
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=43.32  E-value=12  Score=36.19  Aligned_cols=17  Identities=29%  Similarity=0.503  Sum_probs=9.6

Q ss_pred             CcccEEEeccCCCceEEEehh
Q psy4798         466 EHSNAYFYGFFKNKRIVLFDT  486 (623)
Q Consensus       466 ~~~NAy~~G~~~~krIVL~dt  486 (623)
                      ...|||--|    .+++.-|+
T Consensus       108 ~~~NAfW~g----~~m~yGdG  124 (150)
T PF01447_consen  108 NYNNAFWNG----SQMVYGDG  124 (150)
T ss_dssp             STT-EEE-S----SSEEEE--
T ss_pred             CccCccccC----CEEEEECC
Confidence            468999654    56888775


No 125
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=42.44  E-value=13  Score=35.23  Aligned_cols=9  Identities=67%  Similarity=1.062  Sum_probs=8.6

Q ss_pred             HHHHHhhhh
Q psy4798         527 VLAHELGHW  535 (623)
Q Consensus       527 VLAHELGH~  535 (623)
                      |.+||+||.
T Consensus        80 IaaHE~GHi   88 (132)
T PF02031_consen   80 IAAHELGHI   88 (132)
T ss_dssp             HHHHHHHHH
T ss_pred             eeeehhccc
Confidence            899999998


No 126
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=41.08  E-value=12  Score=35.38  Aligned_cols=16  Identities=50%  Similarity=0.590  Sum_probs=13.5

Q ss_pred             HHHHHHHHHHHhhhhh
Q psy4798         521 TEEVLAVLAHELGHWK  536 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~k  536 (623)
                      ..+...+++||+||.-
T Consensus        93 ~~~~~~~~~HElGH~L  108 (167)
T cd00203          93 TKEGAQTIAHELGHAL  108 (167)
T ss_pred             cccchhhHHHHHHHHh
Confidence            3578999999999984


No 127
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=40.82  E-value=17  Score=35.32  Aligned_cols=17  Identities=29%  Similarity=0.433  Sum_probs=15.4

Q ss_pred             CChHHHHHHHHHhhhhh
Q psy4798         140 CDTEEVLAVLAHELGHW  156 (623)
Q Consensus       140 l~~~El~AVlAHELGH~  156 (623)
                      ..+|.+.-|+.|||+|.
T Consensus        56 n~~~f~~~vV~HELaHl   72 (156)
T COG3091          56 NGEDFIEQVVPHELAHL   72 (156)
T ss_pred             ccHHHHHHHHHHHHHHH
Confidence            57889999999999996


No 128
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=40.65  E-value=84  Score=31.27  Aligned_cols=19  Identities=42%  Similarity=0.592  Sum_probs=13.6

Q ss_pred             HHHHHHHHHHHhhhh-hccC
Q psy4798         521 TEEVLAVLAHELGHW-KYNH  539 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~-k~~H  539 (623)
                      .+-+.-...||+||. -..|
T Consensus       122 ~~R~~k~~~HElGH~lGL~H  141 (179)
T PRK13267        122 EERVRKEVTHELGHTLGLEH  141 (179)
T ss_pred             HHHHHHHHHHHHHHHcCCcc
Confidence            345666689999998 4455


No 129
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=40.62  E-value=46  Score=34.26  Aligned_cols=57  Identities=25%  Similarity=0.366  Sum_probs=32.7

Q ss_pred             CcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhh
Q psy4798         466 EHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHW  535 (623)
Q Consensus       466 ~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~  535 (623)
                      +..|||.-.  .++.|.+.-.++....  +.-++.+.-++        +.++ -..+-++.++-||+||-
T Consensus        47 Ge~nA~ydP--e~~~I~iCYEf~~~~~--~~f~~~~~~~~--------~~~~-~~~~~~~~~l~HE~GHA  103 (220)
T PF14247_consen   47 GEDNAFYDP--ENRSITICYEFVDEIL--DRFAKANDPDE--------EYGQ-AAIGNVLFTLYHELGHA  103 (220)
T ss_pred             CCCCCccCC--CCCEEEECHHHHHHHH--HHHHhCCcCcc--------hHHH-HHHHHHHHHHHHHHHHH
Confidence            458999652  4567998887776531  11111111000        0000 15567899999999997


No 130
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=39.84  E-value=10  Score=38.81  Aligned_cols=11  Identities=55%  Similarity=0.782  Sum_probs=9.7

Q ss_pred             HHHHHHhhhhh
Q psy4798         526 AVLAHELGHWK  536 (623)
Q Consensus       526 AVLAHELGH~k  536 (623)
                      .++||||||--
T Consensus       147 ~t~AHElGHnL  157 (228)
T cd04271         147 QVFAHEIGHTF  157 (228)
T ss_pred             eehhhhhhhhc
Confidence            59999999974


No 131
>TIGR00782 ccoP cytochrome c oxidase, cbb3-type, subunit III. This model describes a di-heme subunit of approximately 26 kDa of the cbb3 type copper and heme-containing cytochrome oxidase.
Probab=39.70  E-value=44  Score=35.28  Aligned_cols=33  Identities=18%  Similarity=0.311  Sum_probs=25.9

Q ss_pred             cchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccC
Q psy4798         396 GGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKY  430 (623)
Q Consensus       396 ~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~  430 (623)
                      .-+.||+|++...++++++.++.||.+  |.++.+
T Consensus        25 ~~P~ww~~~f~~~i~~~~~y~~~yp~~--~~~~~~   57 (285)
T TIGR00782        25 PLPRWWLWTFYATIVWGFGYLVAYPAW--PLVSGA   57 (285)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHhccc--CCcccc
Confidence            346799999999999999998888854  555444


No 132
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=39.35  E-value=12  Score=35.23  Aligned_cols=16  Identities=50%  Similarity=0.590  Sum_probs=13.2

Q ss_pred             hHHHHHHHHHhhhhhh
Q psy4798         142 TEEVLAVLAHELGHWK  157 (623)
Q Consensus       142 ~~El~AVlAHELGH~k  157 (623)
                      ..+...+++|||||.-
T Consensus        93 ~~~~~~~~~HElGH~L  108 (167)
T cd00203          93 TKEGAQTIAHELGHAL  108 (167)
T ss_pred             cccchhhHHHHHHHHh
Confidence            4568899999999963


No 133
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=38.36  E-value=42  Score=37.45  Aligned_cols=73  Identities=21%  Similarity=0.263  Sum_probs=44.1

Q ss_pred             CcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccC-CCceEEEehhHHhhccCCCccccCCCCCCCccc
Q psy4798         431 TPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFF-KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLI  509 (623)
Q Consensus       431 ~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~-~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~  509 (623)
                      .+.+...-++.-+++++.+|++..+-. +|.|     ++.|..|++ ...||  +..-=+                    
T Consensus       102 ~~~~~~~Q~~~~~~~~~~~g~df~~gr-iD~s-----~hpF~~~~~~~dvRI--tt~y~~--------------------  153 (396)
T cd06460         102 GPFPEEKQEALGRELLEALGFDFDRGR-LDVS-----AHPFTGGLGPGDVRI--TTRYDE--------------------  153 (396)
T ss_pred             CCCCHHHHHHHHHHHHHHhCCcccCCe-eecC-----CCCCCCCCCCCCceE--EeeeCC--------------------
Confidence            455556666667788999999875543 4544     345887874 33444  221111                    


Q ss_pred             cccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798         510 STEGANKKGCDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       510 ~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~  540 (623)
                               -+.-.=+--++||+||-.|.+.
T Consensus       154 ---------~d~~~~l~t~iHE~GHalye~~  175 (396)
T cd06460         154 ---------NDFRSALFSTIHETGHALYEQG  175 (396)
T ss_pred             ---------cchHHHHHHHHHHhhHHHHHhc
Confidence                     1223334567999999998884


No 134
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=38.29  E-value=44  Score=36.00  Aligned_cols=60  Identities=13%  Similarity=0.165  Sum_probs=34.1

Q ss_pred             hccc-CCCCCCCceeeecCCC-----CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHh
Q psy4798          79 NTVS-SKRSEHSNAYFYGFFK-----NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHE  152 (623)
Q Consensus        79 ~~~~-s~~~~~~NA~~~G~~~-----~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHE  152 (623)
                      -+|- +++.+.+.||.+++.+     .+.+.++-+.-..    ..|.                 -.-++.+++. .+.||
T Consensus       104 ~lD~~~R~gK~~~a~~~~~~~~~~~~~~~~~i~~n~~~~----~~~~-----------------~~ll~~~~v~-tl~HE  161 (365)
T cd06258         104 YLDLYDRKGKYPHGFCTGLDPGFNRQDKDVRILANFTSP----AAPD-----------------PVLLGHDDIN-TLFHE  161 (365)
T ss_pred             hccCCCCCCCCCCCeeccccCCCCCCCCeEEEEccCCCC----CCCC-----------------CCcCCHHHHH-HHHHH
Confidence            4444 4445778999999863     3444444443321    1111                 1225666664 58999


Q ss_pred             hhhhhcch
Q psy4798         153 LGHWKYNH  160 (623)
Q Consensus       153 LGH~k~~h  160 (623)
                      +||.-|.-
T Consensus       162 ~GHa~h~~  169 (365)
T cd06258         162 FGHAVHFL  169 (365)
T ss_pred             HhHHHHHH
Confidence            99987753


No 135
>KOG1047|consensus
Probab=38.08  E-value=41  Score=39.01  Aligned_cols=60  Identities=23%  Similarity=0.201  Sum_probs=38.6

Q ss_pred             HHHHHHHcCCC----CCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCC
Q psy4798         442 IEQLSASVKFP----LKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKK  517 (623)
Q Consensus       442 I~~la~~~gfp----~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~  517 (623)
                      +.+.|+++.-|    .-++.|+..|..       +-|+-+..--.++-|||.                            
T Consensus       239 ~L~~Ae~l~GpY~WgryDllvlPpSFP-------~gGMENPcltF~TpTlla----------------------------  283 (613)
T KOG1047|consen  239 FLKAAEKLFGPYVWGRYDLLVLPPSFP-------FGGMENPCLTFVTPTLLA----------------------------  283 (613)
T ss_pred             HHHHHHHHcCCcccccceEEEecCCCC-------cccccCcceeeecchhhc----------------------------
Confidence            33445566555    357888887653       234433333568889984                            


Q ss_pred             CCCHHHHHHHHHHHhhhhhcc
Q psy4798         518 GCDTEEVLAVLAHELGHWKYN  538 (623)
Q Consensus       518 gl~~~Ei~AVLAHELGH~k~~  538 (623)
                        .+.-+.-|+|||++|---|
T Consensus       284 --GDrsl~~vIaHEIAHSWtG  302 (613)
T KOG1047|consen  284 --GDRSLVDVIAHEIAHSWTG  302 (613)
T ss_pred             --CCcchhhHHHHHhhhhhcc
Confidence              3456689999999996544


No 136
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=37.52  E-value=12  Score=38.44  Aligned_cols=19  Identities=16%  Similarity=-0.096  Sum_probs=12.4

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q psy4798           6 EFQIFYGIIGFSWIVFLFE   24 (623)
Q Consensus         6 ~~~i~~~~~~~~~~~~~~~   24 (623)
                      +...-+.+..++.+.-++.
T Consensus        21 ~~~~~~v~~ivN~vn~~Y~   39 (228)
T cd04271          21 EEARRNILNNVNSASQLYE   39 (228)
T ss_pred             HHHHHHHHHHHHHHHHHhh
Confidence            4555566677777776666


No 137
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=37.03  E-value=15  Score=34.04  Aligned_cols=10  Identities=70%  Similarity=0.923  Sum_probs=9.1

Q ss_pred             HHHHHHhhhh
Q psy4798         526 AVLAHELGHW  535 (623)
Q Consensus       526 AVLAHELGH~  535 (623)
                      +|++||+||.
T Consensus        88 ~~~~HEigHa   97 (140)
T smart00235       88 GVAAHELGHA   97 (140)
T ss_pred             ccHHHHHHHH
Confidence            5999999997


No 138
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=36.86  E-value=92  Score=30.96  Aligned_cols=16  Identities=31%  Similarity=0.312  Sum_probs=15.1

Q ss_pred             CHHHHHHHHHHHhhhh
Q psy4798         520 DTEEVLAVLAHELGHW  535 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~  535 (623)
                      +.+++.-+|+|||=|.
T Consensus        67 ~~~~l~~~l~HELIHa   82 (173)
T PF09768_consen   67 SQGHLEDTLTHELIHA   82 (173)
T ss_pred             CHHHHHHHHHHHHHHH
Confidence            6799999999999998


No 139
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=36.79  E-value=16  Score=36.77  Aligned_cols=16  Identities=38%  Similarity=0.603  Sum_probs=11.2

Q ss_pred             HHHHHHhhhhh-ccCHH
Q psy4798         526 AVLAHELGHWK-YNHVL  541 (623)
Q Consensus       526 AVLAHELGH~k-~~H~~  541 (623)
                      -++|||+||-- .+|..
T Consensus       139 ~~~aHEiGH~lGl~H~~  155 (206)
T PF13583_consen  139 QTFAHEIGHNLGLRHDF  155 (206)
T ss_pred             hHHHHHHHHHhcCCCCc
Confidence            45999999974 34443


No 140
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=36.76  E-value=20  Score=41.50  Aligned_cols=15  Identities=47%  Similarity=0.643  Sum_probs=12.0

Q ss_pred             HHHHHHHhhhhhccC
Q psy4798         525 LAVLAHELGHWKYNH  539 (623)
Q Consensus       525 ~AVLAHELGH~k~~H  539 (623)
                      +..|+||+||--|..
T Consensus       376 v~TL~HE~GHa~H~~  390 (587)
T TIGR02290       376 VSTLAHELGHAYHSE  390 (587)
T ss_pred             HHHHHHHhhHHHHHH
Confidence            356999999998754


No 141
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=36.59  E-value=46  Score=37.26  Aligned_cols=22  Identities=32%  Similarity=0.300  Sum_probs=16.7

Q ss_pred             CCCChHHHHHHHHHhhhhhhcch
Q psy4798         138 KGCDTEEVLAVLAHELGHWKYNH  160 (623)
Q Consensus       138 ~~l~~~El~AVlAHELGH~k~~h  160 (623)
                      +.|+-+|+.. |.||+||--|+-
T Consensus       202 ~lL~~~~v~t-LfHEfGHalH~~  223 (422)
T cd06456         202 ALLTHDEVTT-LFHEFGHALHHL  223 (422)
T ss_pred             CccCHHHHHH-HHHHHHHHHHHH
Confidence            3478888764 889999987753


No 142
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=35.56  E-value=26  Score=32.63  Aligned_cols=21  Identities=33%  Similarity=0.301  Sum_probs=18.3

Q ss_pred             CCCCChHHHHHHHHHhhhhhh
Q psy4798         137 KKGCDTEEVLAVLAHELGHWK  157 (623)
Q Consensus       137 ~~~l~~~El~AVlAHELGH~k  157 (623)
                      -++|+-+|-.-|++||+.|+-
T Consensus        72 F~rLs~~ekvKviiHEllHIP   92 (133)
T COG4900          72 FKRLSCAEKVKVIIHELLHIP   92 (133)
T ss_pred             cCCCChHHHHHHHHHHHhcCc
Confidence            346899999999999999985


No 143
>COG1054 Predicted sulfurtransferase [General function prediction only]
Probab=34.52  E-value=74  Score=34.33  Aligned_cols=45  Identities=16%  Similarity=0.285  Sum_probs=37.4

Q ss_pred             hhhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798         425 PLFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF  475 (623)
Q Consensus       425 plf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~  475 (623)
                      -.+.+|+|++++ ++++.+.++++++|+. .+|++.+.     ..|+.+.|-
T Consensus         8 la~Y~f~~i~dp~~~~~~l~~~~~~~~vk-GrillA~E-----GINgtvsG~   53 (308)
T COG1054           8 LAYYKFVPIEDPEALRDPLLALCKALGVK-GRILLAHE-----GINGTVSGS   53 (308)
T ss_pred             EEEEEEEecCCHHHHHHHHHHHHHHcCce-eEEEEccC-----CcceeEecC
Confidence            356789999985 5999999999999986 78998774     479999884


No 144
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=34.35  E-value=23  Score=34.48  Aligned_cols=12  Identities=42%  Similarity=0.888  Sum_probs=10.6

Q ss_pred             HHHHHHHHhhhh
Q psy4798         524 VLAVLAHELGHW  535 (623)
Q Consensus       524 i~AVLAHELGH~  535 (623)
                      +.+++.||+||.
T Consensus         7 ~i~i~~HE~gH~   18 (192)
T PF02163_consen    7 LISIVLHELGHA   18 (192)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cccccccccccc
Confidence            568999999997


No 145
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=34.16  E-value=18  Score=36.44  Aligned_cols=11  Identities=45%  Similarity=0.712  Sum_probs=8.8

Q ss_pred             HHHHHhhhhhh
Q psy4798         147 AVLAHELGHWK  157 (623)
Q Consensus       147 AVlAHELGH~k  157 (623)
                      -++|||+||.-
T Consensus       139 ~~~aHEiGH~l  149 (206)
T PF13583_consen  139 QTFAHEIGHNL  149 (206)
T ss_pred             hHHHHHHHHHh
Confidence            45999999963


No 146
>KOG3314|consensus
Probab=33.17  E-value=42  Score=33.23  Aligned_cols=39  Identities=31%  Similarity=0.301  Sum_probs=31.8

Q ss_pred             cCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHH
Q psy4798          95 GFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK  163 (623)
Q Consensus        95 G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k  163 (623)
                      |+.+.+.|||..+-|.                              +.+++.-|++|||-|.-..+-.+
T Consensus        71 Gy~~~~gIvlCqN~l~------------------------------~q~h~n~vv~HElIH~fDd~r~~  109 (194)
T KOG3314|consen   71 GYTPGRGIVLCQNRLT------------------------------IQDHVNQVVIHELIHAFDDCRAK  109 (194)
T ss_pred             CccCCCceEEeccccc------------------------------hHHHHHHHHHHHHHHHHHhhhhh
Confidence            5677788999988665                              56899999999999987776655


No 147
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=32.79  E-value=23  Score=33.30  Aligned_cols=15  Identities=47%  Similarity=0.738  Sum_probs=12.3

Q ss_pred             HHHHHHHHHhhhhhh
Q psy4798         143 EEVLAVLAHELGHWK  157 (623)
Q Consensus       143 ~El~AVlAHELGH~k  157 (623)
                      |++.-|+-||+||.-
T Consensus       107 d~vthvliHEIgHhF  121 (136)
T COG3824         107 DQVTHVLIHEIGHHF  121 (136)
T ss_pred             hHhhhhhhhhhhhhc
Confidence            567889999999963


No 148
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=32.43  E-value=82  Score=33.91  Aligned_cols=21  Identities=24%  Similarity=0.303  Sum_probs=15.3

Q ss_pred             CCHHHHHHHHHHHhhhhhccCH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~  540 (623)
                      ++.+++. .+.||+||..|...
T Consensus       150 l~~~~v~-tl~HE~GHa~h~~l  170 (365)
T cd06258         150 LGHDDIN-TLFHEFGHAVHFLL  170 (365)
T ss_pred             CCHHHHH-HHHHHHhHHHHHHH
Confidence            4666664 58899999977543


No 149
>KOG3607|consensus
Probab=32.32  E-value=36  Score=40.85  Aligned_cols=16  Identities=50%  Similarity=0.491  Sum_probs=13.9

Q ss_pred             CHHHHHHHHHHHhhhh
Q psy4798         520 DTEEVLAVLAHELGHW  535 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~  535 (623)
                      +......|+||||||.
T Consensus       319 ~~~~~a~v~AhelgH~  334 (716)
T KOG3607|consen  319 ILLAFAVVLAHELGHN  334 (716)
T ss_pred             cchhHHHHHHHHHHhh
Confidence            4577899999999997


No 150
>PF14891 Peptidase_M91:  Effector protein
Probab=31.66  E-value=24  Score=34.54  Aligned_cols=18  Identities=33%  Similarity=0.235  Sum_probs=14.3

Q ss_pred             ChHHHHHHHHHhhhhhhc
Q psy4798         141 DTEEVLAVLAHELGHWKY  158 (623)
Q Consensus       141 ~~~El~AVlAHELGH~k~  158 (623)
                      .+.+=..||+|||+|..+
T Consensus        99 ~~~~p~v~L~HEL~HA~~  116 (174)
T PF14891_consen   99 SPRPPFVVLYHELIHAYD  116 (174)
T ss_pred             hhHHHHHHHHHHHHHHHH
Confidence            345667899999999865


No 151
>COG2928 Uncharacterized conserved protein [Function unknown]
Probab=31.64  E-value=5.2e+02  Score=26.88  Aligned_cols=35  Identities=20%  Similarity=0.254  Sum_probs=19.1

Q ss_pred             hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798         437 ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF  475 (623)
Q Consensus       437 ~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~  475 (623)
                      ..++.++.+.++-+=.-+++..++=..    .+.|..||
T Consensus       103 s~kqi~etll~~~~~sfk~vvlVefP~----~G~~~i~f  137 (222)
T COG2928         103 SAKQVVETLLSDQSGSFKQVVLVEFPR----RGIWAIAF  137 (222)
T ss_pred             HHHHHHHHHHhcCCccceeeEEEECCC----CCcEEEEE
Confidence            345555555555443557787787443    34444444


No 152
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=31.41  E-value=20  Score=33.08  Aligned_cols=10  Identities=70%  Similarity=0.923  Sum_probs=9.0

Q ss_pred             HHHHHhhhhh
Q psy4798         147 AVLAHELGHW  156 (623)
Q Consensus       147 AVlAHELGH~  156 (623)
                      +|++||+||.
T Consensus        88 ~~~~HEigHa   97 (140)
T smart00235       88 GVAAHELGHA   97 (140)
T ss_pred             ccHHHHHHHH
Confidence            4999999996


No 153
>PF14891 Peptidase_M91:  Effector protein
Probab=31.18  E-value=33  Score=33.54  Aligned_cols=18  Identities=33%  Similarity=0.235  Sum_probs=14.5

Q ss_pred             CHHHHHHHHHHHhhhhhc
Q psy4798         520 DTEEVLAVLAHELGHWKY  537 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~k~  537 (623)
                      .+.+=..||+|||+|..+
T Consensus        99 ~~~~p~v~L~HEL~HA~~  116 (174)
T PF14891_consen   99 SPRPPFVVLYHELIHAYD  116 (174)
T ss_pred             hhHHHHHHHHHHHHHHHH
Confidence            456778899999999853


No 154
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=30.69  E-value=61  Score=36.33  Aligned_cols=21  Identities=33%  Similarity=0.409  Sum_probs=16.1

Q ss_pred             CCHHHHHHHHHHHhhhhhccCH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~  540 (623)
                      ++.+|+. .|.||+||--|...
T Consensus       204 L~~~~v~-tLfHEfGHalH~~l  224 (422)
T cd06456         204 LTHDEVT-TLFHEFGHALHHLL  224 (422)
T ss_pred             cCHHHHH-HHHHHHHHHHHHHH
Confidence            6778876 58899999976543


No 155
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=30.58  E-value=26  Score=37.87  Aligned_cols=26  Identities=19%  Similarity=0.128  Sum_probs=0.0

Q ss_pred             ccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798         133 EGANKKGCDTEEVLAVLAHELGHWKY  158 (623)
Q Consensus       133 ~~~~~~~l~~~El~AVlAHELGH~k~  158 (623)
                      |-++-..+.+-++-+++|||+||.+.
T Consensus       184 EA~vN~~~p~~~~P~T~~HElAHq~G  209 (318)
T PF12725_consen  184 EANVNTDLPPYSLPFTICHELAHQLG  209 (318)
T ss_pred             eeecCCCCCcccccHHHHHHHHHHhC


No 156
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=30.42  E-value=14  Score=42.86  Aligned_cols=15  Identities=40%  Similarity=0.532  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHhhhhh
Q psy4798         522 EEVLAVLAHELGHWK  536 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k  536 (623)
                      .-|-||+|||+||--
T Consensus       485 GgIGaVIgHEI~HgF  499 (654)
T COG3590         485 GGIGAVIGHEIGHGF  499 (654)
T ss_pred             cCccceehhhhcccc
Confidence            458899999999963


No 157
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=29.78  E-value=15  Score=42.65  Aligned_cols=17  Identities=35%  Similarity=0.475  Sum_probs=13.1

Q ss_pred             ChHHHHHHHHHhhhhhh
Q psy4798         141 DTEEVLAVLAHELGHWK  157 (623)
Q Consensus       141 ~~~El~AVlAHELGH~k  157 (623)
                      |=--|-||||||+||--
T Consensus       483 NYGgIGaVIgHEI~HgF  499 (654)
T COG3590         483 NYGGIGAVIGHEIGHGF  499 (654)
T ss_pred             cccCccceehhhhcccc
Confidence            33448899999999953


No 158
>PF14715 FixP_N:  N-terminal domain of cytochrome oxidase-cbb3, FixP 
Probab=29.08  E-value=61  Score=25.94  Aligned_cols=26  Identities=19%  Similarity=0.341  Sum_probs=21.9

Q ss_pred             cchhHHHHHHHHHHHHHHHHHhhchH
Q psy4798         396 GGNMVFLYLWVFIILMSLFLMTIYPE  421 (623)
Q Consensus       396 ~g~~~~l~~w~~~~~~~l~~~~i~P~  421 (623)
                      .-+.||++++...++++++-.+.||.
T Consensus        19 plP~ww~~~f~~tivfa~~Y~~~yp~   44 (51)
T PF14715_consen   19 PLPRWWLWLFYGTIVFAVGYLVLYPG   44 (51)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHhc
Confidence            34678999999999999998888874


No 159
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=29.08  E-value=30  Score=33.68  Aligned_cols=12  Identities=42%  Similarity=0.888  Sum_probs=10.2

Q ss_pred             HHHHHHHhhhhh
Q psy4798         145 VLAVLAHELGHW  156 (623)
Q Consensus       145 l~AVlAHELGH~  156 (623)
                      +.+++.||+||.
T Consensus         7 ~i~i~~HE~gH~   18 (192)
T PF02163_consen    7 LISIVLHELGHA   18 (192)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             cccccccccccc
Confidence            468999999996


No 160
>KOG1047|consensus
Probab=29.01  E-value=34  Score=39.71  Aligned_cols=27  Identities=19%  Similarity=0.304  Sum_probs=17.1

Q ss_pred             ccccccCCCcccHHHHHHHHHHHHHHHH
Q psy4798         320 VLEEKHGFNKQTYGFFVKDQIKSFIVSL  347 (623)
Q Consensus       320 ~le~~~Gfn~~T~~~f~~d~vk~~~l~~  347 (623)
                      .||++.|= ++-+..|+++.++.+.-..
T Consensus       391 ~Le~~lG~-~~~Fd~FLr~Yv~kfa~ks  417 (613)
T KOG1047|consen  391 YLEQLLGD-PTRFDPFLRAYVHKFAFKS  417 (613)
T ss_pred             HHHHHhCC-hhhHHHHHHHHHHHhccce
Confidence            46666662 2345568888888776555


No 161
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=28.83  E-value=1e+02  Score=33.06  Aligned_cols=20  Identities=30%  Similarity=0.631  Sum_probs=16.5

Q ss_pred             HHHHHHHHHHHhhhhhccCH
Q psy4798         521 TEEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~k~~H~  540 (623)
                      ..++..++|||++|---++.
T Consensus       292 ~~~~~~~iahElahqWfGn~  311 (390)
T PF01433_consen  292 KQEIASLIAHELAHQWFGNL  311 (390)
T ss_dssp             HHHHHHHHHHHHHTTTBTTT
T ss_pred             hhhhHHHHHHHHHHHHhccC
Confidence            46899999999999876653


No 162
>PRK05320 rhodanese superfamily protein; Provisional
Probab=28.52  E-value=69  Score=33.54  Aligned_cols=57  Identities=16%  Similarity=0.219  Sum_probs=43.6

Q ss_pred             hhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhh
Q psy4798         426 LFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKD  490 (623)
Q Consensus       426 lf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~  490 (623)
                      .|.||+|++|+ ++++.+.++|++.|+. .+|++-+.     ..|+-+.|-  ..+|-.+-+.+..
T Consensus         7 ~~Y~f~~i~~~~~~~~~~~~~~~~~~~~-G~i~ia~e-----GiN~t~~g~--~~~id~~~~~l~~   64 (257)
T PRK05320          7 AAYKFVSLDDPETLRPLVLARCEALGLK-GTILLAPE-----GINLFLAGT--REAIDAFYAWLRA   64 (257)
T ss_pred             EEEceeecCCHHHHHHHHHHHHHHCCCe-EEEEEcCC-----CceEEEEee--HHHHHHHHHHHhh
Confidence            57899999985 5899999999999985 88998663     479999874  4556555555543


No 163
>COG0475 KefB Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]
Probab=28.51  E-value=1.7e+02  Score=32.46  Aligned_cols=97  Identities=21%  Similarity=0.294  Sum_probs=51.8

Q ss_pred             HHHHHHhhccccccchhhhhhccccchhhhhhh----HHHHHHHHHHHHHHHHHHHHHcchhH--HHHHHHHHHHHHHHH
Q psy4798         342 SFIVSLILSIPLTGAVVYIIQTYGFFVKDQIKS----FIVSLILSIPLTGAVVYIIQVGGNMV--FLYLWVFIILMSLFL  415 (623)
Q Consensus       342 ~~~l~~~~s~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~pl~~~~~~li~~~g~~~--~l~~w~~~~~~~l~~  415 (623)
                      +.+++.+++.+.++.++-++.=.+....+.-+.    .+...+.+.|+++.+..+.+..+...  |++......++.++.
T Consensus       120 al~lg~~l~~sS~~i~~~iL~e~~~~~~~~g~~~l~~~i~~Di~~i~lLai~~~l~~~g~~~~~~~~~~~~~~~~f~~~~  199 (397)
T COG0475         120 ALFLGAALALSSTAIVLKILMELGLLKTREGQLILGALVFDDIAAILLLAIVPALAGGGSGSVGFILGLLLAILAFLALL  199 (397)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCccHhHHHHHHHHHHHHHHHH
Confidence            444444445555555555554444333222221    24444677777776666654333222  444445555555555


Q ss_pred             HhhchHhhhhhhccCCcCCchhH
Q psy4798         416 MTIYPEFIAPLFDKYTPLPDGEL  438 (623)
Q Consensus       416 ~~i~P~~I~plf~k~~pl~~~~L  438 (623)
                      ..+.+.+..|++++....+++|+
T Consensus       200 l~~g~~l~~~~~r~~~~~~~~e~  222 (397)
T COG0475         200 LLLGRYLLPPLFRRVAKTESSEL  222 (397)
T ss_pred             HHHHHHHHHHHHHHHHhccchHH
Confidence            55567778888887766555553


No 164
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=28.39  E-value=33  Score=33.19  Aligned_cols=13  Identities=46%  Similarity=0.728  Sum_probs=11.0

Q ss_pred             HHHHHHHHhhhhh
Q psy4798         524 VLAVLAHELGHWK  536 (623)
Q Consensus       524 i~AVLAHELGH~k  536 (623)
                      +.+|..||+||..
T Consensus         8 ~i~i~iHE~gH~~   20 (180)
T cd05709           8 LISVTVHELGHAL   20 (180)
T ss_pred             HHHHHHHHHHHHH
Confidence            4689999999974


No 165
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.91  E-value=63  Score=35.15  Aligned_cols=24  Identities=38%  Similarity=0.625  Sum_probs=22.8

Q ss_pred             CCHHHHHHHHHHHhhhhhccCHHH
Q psy4798         519 CDTEEVLAVLAHELGHWKYNHVLK  542 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k~~H~~k  542 (623)
                      |..+|+.|.|-||.=|+.++|+.+
T Consensus        64 ~p~~~~~aLl~HEV~Hi~l~Hi~r   87 (396)
T COG3864          64 CPISEMKALLKHEVYHIMLNHIKR   87 (396)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHHh
Confidence            999999999999999999999964


No 166
>PRK01415 hypothetical protein; Validated
Probab=27.60  E-value=74  Score=33.29  Aligned_cols=45  Identities=9%  Similarity=0.087  Sum_probs=37.1

Q ss_pred             hhhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798         425 PLFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF  475 (623)
Q Consensus       425 plf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~  475 (623)
                      -.|.+|+|++|+ ++++.+.++|++.|+. .+|++-+.     ..|+-+.|-
T Consensus         8 ~~fY~f~~i~~~~~~~~~l~~~~~~~~~~-G~i~la~E-----GIN~tisg~   53 (247)
T PRK01415          8 LSAYSFVNIEEPANLIPKLLLIGKRKYVR-GTILLANE-----GFNGSFSGS   53 (247)
T ss_pred             EEEEccccCCCHHHHHHHHHHHHHHcCCe-eEEEEccC-----ccceEeeCC
Confidence            368899999985 4899999999999975 88998663     479998885


No 167
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=27.40  E-value=47  Score=34.05  Aligned_cols=20  Identities=35%  Similarity=0.232  Sum_probs=16.5

Q ss_pred             CCCHHHHHHHHHHHhhhhhc
Q psy4798         518 GCDTEEVLAVLAHELGHWKY  537 (623)
Q Consensus       518 gl~~~Ei~AVLAHELGH~k~  537 (623)
                      |+...=.-+|||||++|.-.
T Consensus        87 GLPrll~gsiLAHE~mHa~L  106 (212)
T PF12315_consen   87 GLPRLLTGSILAHELMHAWL  106 (212)
T ss_pred             CCCHHHHhhHHHHHHHHHHh
Confidence            47777779999999999754


No 168
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=27.39  E-value=74  Score=38.74  Aligned_cols=18  Identities=28%  Similarity=0.503  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHhhhhhccC
Q psy4798         522 EEVLAVLAHELGHWKYNH  539 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k~~H  539 (623)
                      +.+..|+|||++|---++
T Consensus       285 ~~~~~viaHElAHqWFGn  302 (831)
T TIGR02412       285 ENRAGVILHEMAHMWFGD  302 (831)
T ss_pred             HHHHHHHHHHHHHHHhCC
Confidence            457889999999966653


No 169
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=27.36  E-value=38  Score=33.66  Aligned_cols=14  Identities=43%  Similarity=0.598  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHhhhh
Q psy4798         522 EEVLAVLAHELGHW  535 (623)
Q Consensus       522 ~Ei~AVLAHELGH~  535 (623)
                      .+..+++.|||||.
T Consensus        90 ~~~~~~i~HElgHa  103 (198)
T cd04327          90 PEFSRVVLHEFGHA  103 (198)
T ss_pred             hhHHHHHHHHHHHH
Confidence            35679999999997


No 170
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=27.31  E-value=14  Score=39.00  Aligned_cols=11  Identities=64%  Similarity=0.966  Sum_probs=10.1

Q ss_pred             HHHHHHHhhhh
Q psy4798         525 LAVLAHELGHW  535 (623)
Q Consensus       525 ~AVLAHELGH~  535 (623)
                      .+|+|||+||.
T Consensus       166 igv~~HE~gH~  176 (286)
T TIGR03296       166 VGVIAHELGHD  176 (286)
T ss_pred             eeeeehhhhcc
Confidence            79999999995


No 171
>PF11694 DUF3290:  Protein of unknown function (DUF3290);  InterPro: IPR021707  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=27.28  E-value=4.3e+02  Score=25.63  Aligned_cols=24  Identities=17%  Similarity=0.319  Sum_probs=21.0

Q ss_pred             hhHHHHHHHHHHHcCCCCCeEEEE
Q psy4798         436 GELKSRIEQLSASVKFPLKKLYVV  459 (623)
Q Consensus       436 ~~L~~~I~~la~~~gfp~~~v~vv  459 (623)
                      +.+-.-|+.+|++.|.|..+|+|=
T Consensus        79 ~~~~~fi~~vA~~~~V~~~~v~VN  102 (149)
T PF11694_consen   79 SQMVHFIESVAKDLGVSKEEVYVN  102 (149)
T ss_pred             HHHHHHHHHHHHHhCCChheEEEe
Confidence            557788999999999999999973


No 172
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=27.23  E-value=11  Score=38.68  Aligned_cols=20  Identities=30%  Similarity=0.694  Sum_probs=14.4

Q ss_pred             HHHHHHHHHhhhhh-ccCHHH
Q psy4798         523 EVLAVLAHELGHWK-YNHVLK  542 (623)
Q Consensus       523 Ei~AVLAHELGH~k-~~H~~k  542 (623)
                      ..--++.||+|||- +.|+..
T Consensus       136 n~g~t~~HEvGH~lGL~HtF~  156 (225)
T cd04275         136 NLGDTATHEVGHWLGLYHTFQ  156 (225)
T ss_pred             cccceeEEeccceeeeeeeec
Confidence            34468999999995 566654


No 173
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=27.12  E-value=34  Score=36.24  Aligned_cols=21  Identities=48%  Similarity=0.816  Sum_probs=16.8

Q ss_pred             CHHHHHHHHHHHhhhhhccCHH
Q psy4798         520 DTEEVLAVLAHELGHWKYNHVL  541 (623)
Q Consensus       520 ~~~Ei~AVLAHELGH~k~~H~~  541 (623)
                      +..--.|.+.||+|||. +|..
T Consensus       199 da~~yyaTl~HElghwt-gh~~  219 (316)
T COG4227         199 DAINYYATLLHELGHWT-GHEA  219 (316)
T ss_pred             chHhHHHHHHHHhcccc-Cchh
Confidence            66788999999999994 4443


No 174
>KOG3733|consensus
Probab=27.10  E-value=1.5e+02  Score=33.64  Aligned_cols=61  Identities=25%  Similarity=0.420  Sum_probs=36.4

Q ss_pred             HHHHHHHhhchHhhhhhhccCC--------cCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce
Q psy4798         410 LMSLFLMTIYPEFIAPLFDKYT--------PLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR  480 (623)
Q Consensus       410 ~~~l~~~~i~P~~I~plf~k~~--------pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr  480 (623)
                      .+++++..+...||+-+-+.|.        -.||.||++-|.+-+++   |.+..|.-+       +.+++.-++|++|
T Consensus       487 FISLfIYmvLSLFIAlItdaYDTIK~~qq~GfpeTdLqafi~ec~d~---~~sg~f~~e-------s~~f~s~iCCc~R  555 (566)
T KOG3733|consen  487 FISLFIYMVLSLFIALITDAYDTIKAYQQDGFPETDLQAFITECADD---PSSGKFRTE-------SDDFVSLICCCGR  555 (566)
T ss_pred             HHHHHHHHHHHHHHHHHhchHHHHHHhhhCCCChhHHHHHHHHhccC---CCccceecc-------CCCceeeEEecCc
Confidence            4455666667777777766542        24678899888554443   445555544       4455665555543


No 175
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=26.69  E-value=44  Score=34.27  Aligned_cols=20  Identities=35%  Similarity=0.197  Sum_probs=16.6

Q ss_pred             CCCChHHHHHHHHHhhhhhh
Q psy4798         138 KGCDTEEVLAVLAHELGHWK  157 (623)
Q Consensus       138 ~~l~~~El~AVlAHELGH~k  157 (623)
                      .||+..-.-+|||||++|.-
T Consensus        86 ~GLPrll~gsiLAHE~mHa~  105 (212)
T PF12315_consen   86 YGLPRLLTGSILAHELMHAW  105 (212)
T ss_pred             CCCCHHHHhhHHHHHHHHHH
Confidence            35778888999999999963


No 176
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=26.58  E-value=84  Score=35.16  Aligned_cols=14  Identities=36%  Similarity=0.470  Sum_probs=11.3

Q ss_pred             HHHHHhhhhhhcch
Q psy4798         147 AVLAHELGHWKYNH  160 (623)
Q Consensus       147 AVlAHELGH~k~~h  160 (623)
                      .-++||+||-.+..
T Consensus       161 ~t~iHE~GHalye~  174 (396)
T cd06460         161 FSTIHETGHALYEQ  174 (396)
T ss_pred             HHHHHHhhHHHHHh
Confidence            45799999987766


No 177
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=26.57  E-value=14  Score=36.78  Aligned_cols=12  Identities=50%  Similarity=0.645  Sum_probs=10.1

Q ss_pred             HHHHHHhhhhhh
Q psy4798         146 LAVLAHELGHWK  157 (623)
Q Consensus       146 ~AVlAHELGH~k  157 (623)
                      ..++||||||--
T Consensus       141 a~~~aHElGH~L  152 (207)
T cd04273         141 AFTIAHELGHVL  152 (207)
T ss_pred             EEeeeeechhhc
Confidence            578999999964


No 178
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=26.41  E-value=37  Score=33.10  Aligned_cols=19  Identities=26%  Similarity=0.359  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHhhhhh-ccCH
Q psy4798         522 EEVLAVLAHELGHWK-YNHV  540 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k-~~H~  540 (623)
                      .....++.||+||.- +.|.
T Consensus       111 ~~~~~t~~HEiGHaLGL~H~  130 (186)
T cd04277         111 SYGYQTIIHEIGHALGLEHP  130 (186)
T ss_pred             hhhHHHHHHHHHHHhcCCCC
Confidence            457899999999973 3444


No 179
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=26.35  E-value=35  Score=33.26  Aligned_cols=15  Identities=27%  Similarity=0.364  Sum_probs=12.5

Q ss_pred             hHHHHHHHHHhhhhh
Q psy4798         142 TEEVLAVLAHELGHW  156 (623)
Q Consensus       142 ~~El~AVlAHELGH~  156 (623)
                      ......++.||+||.
T Consensus       110 g~~~~~t~~HEiGHa  124 (186)
T cd04277         110 GSYGYQTIIHEIGHA  124 (186)
T ss_pred             ChhhHHHHHHHHHHH
Confidence            355789999999996


No 180
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.27  E-value=48  Score=36.04  Aligned_cols=25  Identities=36%  Similarity=0.606  Sum_probs=23.0

Q ss_pred             CCChHHHHHHHHHhhhhhhcchHHH
Q psy4798         139 GCDTEEVLAVLAHELGHWKYNHVLK  163 (623)
Q Consensus       139 ~l~~~El~AVlAHELGH~k~~h~~k  163 (623)
                      .|+.+|+.|.|-||+-|+..+|+.+
T Consensus        63 ~~p~~~~~aLl~HEV~Hi~l~Hi~r   87 (396)
T COG3864          63 NCPISEMKALLKHEVYHIMLNHIKR   87 (396)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHHHh
Confidence            3999999999999999999999864


No 181
>TIGR03513 GldL_gliding gliding motility-associated protein GldL. This protein family, GldL, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile
Probab=26.22  E-value=2.3e+02  Score=29.01  Aligned_cols=46  Identities=9%  Similarity=0.099  Sum_probs=29.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhch
Q psy4798         375 FIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYP  420 (623)
Q Consensus       375 ~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P  420 (623)
                      +++++...+.++.++..|..|.+..+++.+..+.=++.+++.-+-|
T Consensus         6 ~~~~~ga~~vi~GalfKi~h~~~~~~~l~~g~~te~lvFfiSAFe~   51 (202)
T TIGR03513         6 IVYGWGAAVVILGALFKIMHWPMGNPMLFVGLITEALIFAISAFEK   51 (202)
T ss_pred             HHHHHhHHHHHHHHHHHHhccCCchHHHHHHHHHHHHHHHHhccCC
Confidence            3444455555666777888888888777777666666666554433


No 182
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=25.85  E-value=39  Score=34.03  Aligned_cols=13  Identities=62%  Similarity=0.800  Sum_probs=10.9

Q ss_pred             HHHHHHHHHhhhh
Q psy4798         523 EVLAVLAHELGHW  535 (623)
Q Consensus       523 Ei~AVLAHELGH~  535 (623)
                      =..+|+.||+||.
T Consensus        37 l~~~v~iHElgH~   49 (208)
T cd06161          37 LFLSVLLHELGHA   49 (208)
T ss_pred             HHHHHHHHHHHHH
Confidence            3568999999997


No 183
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=25.20  E-value=41  Score=33.75  Aligned_cols=15  Identities=40%  Similarity=0.605  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHhhhh
Q psy4798         521 TEEVLAVLAHELGHW  535 (623)
Q Consensus       521 ~~Ei~AVLAHELGH~  535 (623)
                      +-.+..++.||+||.
T Consensus        19 ~~~~l~t~~HE~gHa   33 (200)
T PF13398_consen   19 PFRLLVTFVHELGHA   33 (200)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            346788999999996


No 184
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=25.15  E-value=41  Score=33.41  Aligned_cols=14  Identities=43%  Similarity=0.598  Sum_probs=11.6

Q ss_pred             HHHHHHHHHhhhhh
Q psy4798         143 EEVLAVLAHELGHW  156 (623)
Q Consensus       143 ~El~AVlAHELGH~  156 (623)
                      .+..+++.||+||.
T Consensus        90 ~~~~~~i~HElgHa  103 (198)
T cd04327          90 PEFSRVVLHEFGHA  103 (198)
T ss_pred             hhHHHHHHHHHHHH
Confidence            35678999999995


No 185
>KOG3607|consensus
Probab=24.78  E-value=49  Score=39.77  Aligned_cols=16  Identities=50%  Similarity=0.526  Sum_probs=13.4

Q ss_pred             hHHHHHHHHHhhhhhh
Q psy4798         142 TEEVLAVLAHELGHWK  157 (623)
Q Consensus       142 ~~El~AVlAHELGH~k  157 (623)
                      ......|+||||||.-
T Consensus       320 ~~~~a~v~AhelgH~l  335 (716)
T KOG3607|consen  320 LLAFAVVLAHELGHNL  335 (716)
T ss_pred             chhHHHHHHHHHHhhc
Confidence            5668899999999964


No 186
>COG3402 Uncharacterized conserved protein [Function unknown]
Probab=24.50  E-value=1.9e+02  Score=28.47  Aligned_cols=44  Identities=20%  Similarity=0.221  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHH-HHHHhhch
Q psy4798         377 VSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMS-LFLMTIYP  420 (623)
Q Consensus       377 ~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~-l~~~~i~P  420 (623)
                      .++++.+|+..++++..-..-++|+.+..++.+++. ++..+++|
T Consensus        25 ~~l~Ll~av~~~~~~~~~~~~~~w~~~a~~av~l~~~vv~l~iiP   69 (161)
T COG3402          25 IALVLLIAVAAGVLLYFVGLDPNWSSVAAVAVILLAAVVTLFIIP   69 (161)
T ss_pred             HHHHHHHHHHHHHHHheeccCCccHHHHHHHHHHHHHHHHhhhhh
Confidence            344667777777666665555667776665554433 33344444


No 187
>TIGR02140 permease_CysW sulfate ABC transporter, permease protein CysW. This model represents CysW, one of two homologous, tandem permeases in the sulfate ABC transporter system; the other is CysT (TIGR02139). The sulfate transporter has been described in E. coli as transporting sulfate, thiosulfate, selenate, and selenite. Sulfate transporters may also transport molybdate ion if a specific molybdate transporter is not present.
Probab=24.05  E-value=7.7e+02  Score=25.05  Aligned_cols=36  Identities=8%  Similarity=0.047  Sum_probs=22.3

Q ss_pred             hhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCCeEE
Q psy4798         417 TIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLKKLY  457 (623)
Q Consensus       417 ~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v~  457 (623)
                      ...|.....+...++..+     +.+++.|+..|...-+.+
T Consensus       139 ~~~p~~~~~~~~~l~~i~-----~~~~eAA~~~Gas~~~~~  174 (261)
T TIGR02140       139 VTCPFVARELIPVMEEQG-----TEQEEAALTLGASWWQTF  174 (261)
T ss_pred             HHHHHHHHHHHHHHHhCC-----ccHHHHHHhcCCCcceee
Confidence            456666655555455543     456777999998765443


No 188
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=24.01  E-value=44  Score=35.46  Aligned_cols=24  Identities=38%  Similarity=0.547  Sum_probs=19.5

Q ss_pred             CChHHHHHHHHHhhhhhhcchHHH
Q psy4798         140 CDTEEVLAVLAHELGHWKYNHVLK  163 (623)
Q Consensus       140 l~~~El~AVlAHELGH~k~~h~~k  163 (623)
                      -+...-.|.+.||+|||..+-...
T Consensus       198 ~da~~yyaTl~HElghwtgh~~rl  221 (316)
T COG4227         198 RDAINYYATLLHELGHWTGHEARL  221 (316)
T ss_pred             cchHhHHHHHHHHhccccCchhhh
Confidence            566778999999999998776553


No 189
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=23.81  E-value=46  Score=33.59  Aligned_cols=14  Identities=43%  Similarity=0.627  Sum_probs=12.5

Q ss_pred             HHHHHHHHHhhhhh
Q psy4798         143 EEVLAVLAHELGHW  156 (623)
Q Consensus       143 ~El~AVlAHELGH~  156 (623)
                      +-+.++++||+||.
T Consensus       114 ~~~~~~~~he~gh~  127 (197)
T cd04276         114 ASLRYLLAHEVGHT  127 (197)
T ss_pred             HHHHHHHHHHHHHH
Confidence            56889999999996


No 190
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=23.75  E-value=50  Score=33.37  Aligned_cols=19  Identities=37%  Similarity=0.497  Sum_probs=14.8

Q ss_pred             HHHHHHHHHHhhhhh-ccCH
Q psy4798         522 EEVLAVLAHELGHWK-YNHV  540 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k-~~H~  540 (623)
                      +-+..+++||+||.- +.|.
T Consensus       114 ~~~~~~~~he~gh~lGl~hn  133 (197)
T cd04276         114 ASLRYLLAHEVGHTLGLRHN  133 (197)
T ss_pred             HHHHHHHHHHHHHHhcCccc
Confidence            568899999999973 4454


No 191
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=23.73  E-value=57  Score=34.73  Aligned_cols=26  Identities=27%  Similarity=0.288  Sum_probs=20.8

Q ss_pred             CCChHHHHHHHHHhhhhhh---cchHHHH
Q psy4798         139 GCDTEEVLAVLAHELGHWK---YNHVLKS  164 (623)
Q Consensus       139 ~l~~~El~AVlAHELGH~k---~~h~~k~  164 (623)
                      +.++++|...|-|||+|-|   ++|+...
T Consensus       191 r~dd~~lA~LIFHELAHQk~Y~~~DtAFN  219 (376)
T COG4324         191 RQDDTYLASLIFHELAHQKIYVNNDTAFN  219 (376)
T ss_pred             cCChHHHHHHHHHHHhhheEeecCcchHh
Confidence            4789999999999999976   5565543


No 192
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=23.58  E-value=91  Score=38.33  Aligned_cols=18  Identities=33%  Similarity=0.316  Sum_probs=14.1

Q ss_pred             HHHHHHHHHHhhhhhccC
Q psy4798         522 EEVLAVLAHELGHWKYNH  539 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k~~H  539 (623)
                      ..+..|+|||++|--.|+
T Consensus       281 ~~i~~VIaHElaHqWfGN  298 (863)
T TIGR02414       281 ERIESVIAHEYFHNWTGN  298 (863)
T ss_pred             HHHHHHHHHHHHHHHhcc
Confidence            458899999999955543


No 193
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=23.57  E-value=48  Score=32.99  Aligned_cols=11  Identities=45%  Similarity=1.108  Sum_probs=9.6

Q ss_pred             HHHHHHHhhhh
Q psy4798         525 LAVLAHELGHW  535 (623)
Q Consensus       525 ~AVLAHELGH~  535 (623)
                      ..|+-||+||.
T Consensus        10 ~~v~iHElGH~   20 (182)
T cd06163          10 ILIFVHELGHF   20 (182)
T ss_pred             HHHHHHHHHHH
Confidence            46899999997


No 194
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=23.57  E-value=43  Score=32.38  Aligned_cols=13  Identities=46%  Similarity=0.728  Sum_probs=10.6

Q ss_pred             HHHHHHHhhhhhh
Q psy4798         145 VLAVLAHELGHWK  157 (623)
Q Consensus       145 l~AVlAHELGH~k  157 (623)
                      +.+|..||+||..
T Consensus         8 ~i~i~iHE~gH~~   20 (180)
T cd05709           8 LISVTVHELGHAL   20 (180)
T ss_pred             HHHHHHHHHHHHH
Confidence            4578999999964


No 195
>KOG3714|consensus
Probab=23.31  E-value=34  Score=38.23  Aligned_cols=18  Identities=39%  Similarity=0.684  Sum_probs=15.0

Q ss_pred             CCCCCCHHHHHHHHHHHhhhh
Q psy4798         515 NKKGCDTEEVLAVLAHELGHW  535 (623)
Q Consensus       515 ~g~gl~~~Ei~AVLAHELGH~  535 (623)
                      .|.||..   .++++|||+|.
T Consensus       153 l~~~C~~---~G~i~HEl~Ha  170 (411)
T KOG3714|consen  153 LGDGCDR---FGTIVHELMHA  170 (411)
T ss_pred             cCCCcCc---CchhHHHHHHH
Confidence            3556877   99999999997


No 196
>COG2274 SunT ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]
Probab=23.20  E-value=3.3e+02  Score=32.91  Aligned_cols=20  Identities=20%  Similarity=0.509  Sum_probs=15.8

Q ss_pred             HHHHHHHHhhhccHHHhhhc
Q psy4798         300 VLFNIFSTVIGLPISFYHHF  319 (623)
Q Consensus       300 ~~~~l~~~li~lP~~~Y~~f  319 (623)
                      +...++..++++|.++|++.
T Consensus       229 l~~~~~~hll~Lpl~~f~~r  248 (709)
T COG2274         229 LSGRFFRHLLRLPLSYFEKR  248 (709)
T ss_pred             HHHHHHHHHHcCcHHHccCC
Confidence            44567889999999999654


No 197
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=22.78  E-value=48  Score=34.01  Aligned_cols=13  Identities=54%  Similarity=0.674  Sum_probs=11.1

Q ss_pred             HHHHHHHHHhhhh
Q psy4798         523 EVLAVLAHELGHW  535 (623)
Q Consensus       523 Ei~AVLAHELGH~  535 (623)
                      =..+|+.||+||.
T Consensus        52 l~~~v~iHElgH~   64 (227)
T cd06164          52 LFASVLLHELGHS   64 (227)
T ss_pred             HHHHHHHHHHHHH
Confidence            3578999999996


No 198
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=22.37  E-value=47  Score=33.45  Aligned_cols=12  Identities=67%  Similarity=0.944  Sum_probs=10.3

Q ss_pred             HHHHHHHhhhhh
Q psy4798         145 VLAVLAHELGHW  156 (623)
Q Consensus       145 l~AVlAHELGH~  156 (623)
                      ..+|+.||+||.
T Consensus        38 ~~~v~iHElgH~   49 (208)
T cd06161          38 FLSVLLHELGHA   49 (208)
T ss_pred             HHHHHHHHHHHH
Confidence            458999999995


No 199
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=22.21  E-value=50  Score=34.85  Aligned_cols=13  Identities=46%  Similarity=0.621  Sum_probs=11.1

Q ss_pred             HHHHHHHHHhhhh
Q psy4798         523 EVLAVLAHELGHW  535 (623)
Q Consensus       523 Ei~AVLAHELGH~  535 (623)
                      =..+|+.||+||.
T Consensus       117 L~isv~iHElgHa  129 (263)
T cd06159         117 LVVGVVVHELSHG  129 (263)
T ss_pred             HHHHHHHHHHHHH
Confidence            3678999999996


No 200
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=21.68  E-value=52  Score=35.09  Aligned_cols=14  Identities=29%  Similarity=0.496  Sum_probs=11.8

Q ss_pred             HHHHHHHHHhhhhh
Q psy4798         523 EVLAVLAHELGHWK  536 (623)
Q Consensus       523 Ei~AVLAHELGH~k  536 (623)
                      =+.+|+.||+||--
T Consensus       134 l~isvvvHElgHal  147 (277)
T cd06162         134 LLISGVVHEMGHGV  147 (277)
T ss_pred             HHHHHHHHHHHHHH
Confidence            37799999999973


No 201
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=21.58  E-value=61  Score=29.23  Aligned_cols=19  Identities=26%  Similarity=0.246  Sum_probs=13.6

Q ss_pred             hHHHHHHHHHhhhhhhcch
Q psy4798         142 TEEVLAVLAHELGHWKYNH  160 (623)
Q Consensus       142 ~~El~AVlAHELGH~k~~h  160 (623)
                      .+++.-|+-||+||.-..+
T Consensus        70 ~~~I~~tlvhEiah~fG~~   88 (97)
T PF06262_consen   70 AELIRDTLVHEIAHHFGIS   88 (97)
T ss_dssp             HHHHHHHHHHHHHHHTT--
T ss_pred             HHHHHHHHHHHHHHHcCCC
Confidence            3567889999999986543


No 202
>PRK14015 pepN aminopeptidase N; Provisional
Probab=21.34  E-value=64  Score=39.67  Aligned_cols=19  Identities=32%  Similarity=0.265  Sum_probs=14.6

Q ss_pred             HHHHHHHHHHhhhhhccCH
Q psy4798         522 EEVLAVLAHELGHWKYNHV  540 (623)
Q Consensus       522 ~Ei~AVLAHELGH~k~~H~  540 (623)
                      ..+..|+|||++|--.|..
T Consensus       294 ~~i~~vIaHElaHqWFGNl  312 (875)
T PRK14015        294 ERIESVIAHEYFHNWTGNR  312 (875)
T ss_pred             HHHHHHHHHHHHHHHHhCc
Confidence            4588999999999655543


No 203
>smart00793 AgrB Accessory gene regulator B. The accessory gene regulator (agr) of Staphylococcus aureus is the central regulatory system that controls the gene expression for a large set of virulence factors. The arg locus consists of two transcripts: RNAII and RNAIII. RNAII encodes four genes (agrA, B, C, and D) whose gene products assemble a quorum sensing system. At low cell density, the agr genes are continuously expressed at basal levels. A signal molecule, autoinducing peptide (AIP), produced and secreted by the bacteria, accumulates outside of the cells. When the cell density increases and the AIP concentration reaches a threshold, it activates the agr response, i.e. activation of secreted protein gene expression and subsequent repression of cell wall-associated protein genes. AgrB and AgrD are essential for the production of the autoinducing peptide which functions as a signal for quorum sensing. AgrB is a transmembrane protein PUBMED:11195102. AgrB is involved in the proteolyt
Probab=21.09  E-value=4.3e+02  Score=26.07  Aligned_cols=24  Identities=25%  Similarity=0.346  Sum_probs=19.1

Q ss_pred             hhhhhccCCcCCchhHHHHHHHHH
Q psy4798         423 IAPLFDKYTPLPDGELKSRIEQLS  446 (623)
Q Consensus       423 I~plf~k~~pl~~~~L~~~I~~la  446 (623)
                      .+|.-++-+|+.++|.+.+....+
T Consensus       118 ~APv~~~~kpi~~~e~~k~lk~~s  141 (184)
T smart00793      118 YAPADTEKQPVIPEKLKKKLKKKS  141 (184)
T ss_pred             cCCcccccCCCCCHHHHHHHHHHH
Confidence            478989999999998877765544


No 204
>COG4662 TupA ABC-type tungstate transport system, periplasmic component [Coenzyme metabolism]
Probab=20.93  E-value=5.4e+02  Score=26.48  Aligned_cols=63  Identities=21%  Similarity=0.248  Sum_probs=37.1

Q ss_pred             HHHHHHHcchhHHHHHHHHHHHHHHH-HHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCCeE
Q psy4798         389 VVYIIQVGGNMVFLYLWVFIILMSLF-LMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLKKL  456 (623)
Q Consensus       389 ~~~li~~~g~~~~l~~w~~~~~~~l~-~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v  456 (623)
                      +|.++++.|+.-|+-+....-++.+- ...+.|.+|+-..+..+..     +.++.++++.+|..+.|+
T Consensus        80 LylLlSr~GPlG~f~LLfT~~amILGq~iL~lPlvia~~l~ale~~-----dpr~~ela~~lgas~~kl  143 (227)
T COG4662          80 LYLLLSRSGPLGWFNLLFTQDAMILGQAILILPLVIAFVLTALESV-----DPRLKELARSLGASPLKL  143 (227)
T ss_pred             HHHHHhccCCCccchhHhhhhhHHHHHHHHHHHHHHHHHHHHHHhc-----CHHHHHHHHHcCCcHHHH
Confidence            45677888888776544333332222 1235677776655543332     346677889988876444


No 205
>PF11677 DUF3273:  Protein of unknown function (DUF3273);  InterPro: IPR021691  Some members in this family of proteins are annotated as multi-transmembrane proteins however this cannot be confirmed. Currently this family has no known function. 
Probab=20.83  E-value=2.9e+02  Score=29.39  Aligned_cols=48  Identities=25%  Similarity=0.519  Sum_probs=30.9

Q ss_pred             HHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCc-CCchhHHHHHHHH
Q psy4798         393 IQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTP-LPDGELKSRIEQL  445 (623)
Q Consensus       393 i~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~p-l~~~~L~~~I~~l  445 (623)
                      +.-.|..+.+..|+     ++....+.-.+..|+.+|+.| |.+.++++.+++-
T Consensus       207 f~~~g~~~tL~~l~-----al~~~t~WA~~FEP~~~~~sp~L~~s~l~ndv~~~  255 (265)
T PF11677_consen  207 FTGFGAFFTLFLLL-----ALMAATLWAFLFEPLLNKWSPKLHQSALRNDVEPQ  255 (265)
T ss_pred             HcCCchHHHHHHHH-----HHHHHHHHHHHhhHHhhhcCchhhHHHhhhhhhHH
Confidence            34445555555443     333344455577899999999 6788898886654


No 206
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=20.79  E-value=44  Score=33.81  Aligned_cols=17  Identities=47%  Similarity=0.978  Sum_probs=12.8

Q ss_pred             CCCCCHHHHHHHHHHHhhhh
Q psy4798         516 KKGCDTEEVLAVLAHELGHW  535 (623)
Q Consensus       516 g~gl~~~Ei~AVLAHELGH~  535 (623)
                      |.||..   .+++.|||||.
T Consensus        82 ~~~C~~---~Gti~HEl~Ha   98 (200)
T cd04281          82 GKNCDK---FGIVVHELGHV   98 (200)
T ss_pred             CCCcCc---CchHHHHHHHH
Confidence            345652   58999999996


No 207
>KOG3314|consensus
Probab=20.61  E-value=1e+02  Score=30.70  Aligned_cols=44  Identities=27%  Similarity=0.231  Sum_probs=31.7

Q ss_pred             cEEEe-ccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHH
Q psy4798         469 NAYFY-GFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK  542 (623)
Q Consensus       469 NAy~~-G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k  542 (623)
                      |+... |+-+.+-|||..+=|                              -+.+++.-|++|||=|.--.|-++
T Consensus        65 ~~~~~GGy~~~~gIvlCqN~l------------------------------~~q~h~n~vv~HElIH~fDd~r~~  109 (194)
T KOG3314|consen   65 TGPVAGGYTPGRGIVLCQNRL------------------------------TIQDHVNQVVIHELIHAFDDCRAK  109 (194)
T ss_pred             CCCccCCccCCCceEEecccc------------------------------chHHHHHHHHHHHHHHHHHhhhhh
Confidence            34444 444666799998765                              356899999999999986665544


No 208
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=20.50  E-value=76  Score=33.85  Aligned_cols=28  Identities=25%  Similarity=0.258  Sum_probs=21.9

Q ss_pred             CCHHHHHHHHHHHhhhhh---ccCHHHHHHH
Q psy4798         519 CDTEEVLAVLAHELGHWK---YNHVLKSMIL  546 (623)
Q Consensus       519 l~~~Ei~AVLAHELGH~k---~~H~~k~~~~  546 (623)
                      .+++++...+-|||+|-|   ++++..+-.+
T Consensus       192 ~dd~~lA~LIFHELAHQk~Y~~~DtAFNEsF  222 (376)
T COG4324         192 QDDTYLASLIFHELAHQKIYVNNDTAFNESF  222 (376)
T ss_pred             CChHHHHHHHHHHHhhheEeecCcchHhHHH
Confidence            588999999999999976   5666555443


No 209
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=20.29  E-value=51  Score=32.81  Aligned_cols=11  Identities=45%  Similarity=1.108  Sum_probs=9.3

Q ss_pred             HHHHHHhhhhh
Q psy4798         146 LAVLAHELGHW  156 (623)
Q Consensus       146 ~AVlAHELGH~  156 (623)
                      ..|+.||+||.
T Consensus        10 ~~v~iHElGH~   20 (182)
T cd06163          10 ILIFVHELGHF   20 (182)
T ss_pred             HHHHHHHHHHH
Confidence            46899999996


Done!