Query psy4798
Match_columns 623
No_of_seqs 395 out of 2642
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 17:54:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4798.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4798hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2719|consensus 100.0 7.8E-62 1.7E-66 518.0 33.7 285 264-614 82-399 (428)
2 KOG2719|consensus 100.0 1.2E-38 2.6E-43 340.4 20.3 188 37-267 215-402 (428)
3 PRK03001 M48 family peptidase; 100.0 7.1E-35 1.5E-39 302.4 24.3 201 370-606 2-229 (283)
4 PRK03982 heat shock protein Ht 100.0 2.9E-33 6.4E-38 291.0 24.7 200 370-605 3-233 (288)
5 PRK01265 heat shock protein Ht 100.0 1.2E-29 2.5E-34 267.5 27.2 170 405-607 48-252 (324)
6 PRK01265 heat shock protein Ht 100.0 1.2E-30 2.6E-35 275.0 19.3 194 12-259 50-252 (324)
7 PRK01345 heat shock protein Ht 100.0 2.6E-28 5.6E-33 257.3 25.8 172 402-605 31-229 (317)
8 PRK04897 heat shock protein Ht 100.0 2.8E-29 6E-34 262.6 18.3 196 11-260 48-256 (298)
9 PRK02391 heat shock protein Ht 100.0 4.3E-28 9.4E-33 253.4 25.6 169 405-605 43-240 (296)
10 PRK03982 heat shock protein Ht 100.0 4.5E-29 9.8E-34 259.7 18.1 198 10-263 35-239 (288)
11 PRK02391 heat shock protein Ht 100.0 1E-28 2.2E-33 258.1 20.4 197 10-262 43-245 (296)
12 PRK03072 heat shock protein Ht 100.0 5.3E-28 1.1E-32 251.9 24.2 173 401-605 33-233 (288)
13 PRK04897 heat shock protein Ht 100.0 5.4E-28 1.2E-32 252.9 24.2 167 407-606 49-254 (298)
14 PRK02870 heat shock protein Ht 100.0 1.1E-28 2.3E-33 261.5 18.4 192 11-262 78-284 (336)
15 PRK03072 heat shock protein Ht 100.0 1.8E-28 3.8E-33 255.5 19.2 193 11-262 38-238 (288)
16 PRK03001 M48 family peptidase; 100.0 5.6E-28 1.2E-32 250.9 20.2 191 10-262 34-233 (283)
17 PRK05457 heat shock protein Ht 100.0 8.4E-28 1.8E-32 250.0 20.1 191 13-260 44-246 (284)
18 PRK02870 heat shock protein Ht 100.0 8.4E-27 1.8E-31 247.1 26.8 169 406-606 78-280 (336)
19 PRK05457 heat shock protein Ht 99.9 3.9E-26 8.4E-31 237.5 23.9 167 407-607 43-245 (284)
20 PRK01345 heat shock protein Ht 99.9 1.6E-26 3.5E-31 243.7 20.7 191 10-260 34-232 (317)
21 PF01435 Peptidase_M48: Peptid 99.9 8.4E-25 1.8E-29 216.2 4.7 183 408-622 3-224 (226)
22 COG0501 HtpX Zn-dependent prot 99.9 4.4E-22 9.6E-27 205.8 18.8 187 11-259 61-259 (302)
23 COG0501 HtpX Zn-dependent prot 99.9 3.2E-20 6.8E-25 192.0 22.6 170 404-606 59-258 (302)
24 PF01435 Peptidase_M48: Peptid 99.8 1.8E-21 3.9E-26 192.4 -0.1 156 38-256 30-188 (226)
25 COG4783 Putative Zn-dependent 99.3 1.6E-11 3.5E-16 133.7 15.6 158 431-622 65-253 (484)
26 PF05569 Peptidase_M56: BlaR1 99.3 1.3E-10 2.8E-15 121.5 19.0 142 426-608 131-275 (299)
27 PF05569 Peptidase_M56: BlaR1 99.0 5.7E-09 1.2E-13 109.2 16.1 131 39-260 141-275 (299)
28 KOG2661|consensus 99.0 2.2E-09 4.8E-14 112.3 9.5 135 455-623 240-390 (424)
29 COG4783 Putative Zn-dependent 98.8 2.5E-08 5.4E-13 109.2 12.2 138 86-271 101-249 (484)
30 COG4784 Putative Zn-dependent 98.7 1.7E-07 3.7E-12 98.8 12.8 138 434-605 62-217 (479)
31 COG4219 MecR1 Antirepressor re 98.5 7.8E-06 1.7E-10 86.2 17.9 128 435-603 135-264 (337)
32 COG4784 Putative Zn-dependent 98.1 1.5E-05 3.2E-10 84.5 10.4 117 86-256 95-216 (479)
33 KOG2661|consensus 98.0 1.1E-05 2.5E-10 85.0 7.7 100 85-241 245-345 (424)
34 COG4219 MecR1 Antirepressor re 97.7 0.00054 1.2E-08 72.6 13.7 106 88-261 162-270 (337)
35 PF06114 DUF955: Domain of unk 97.1 0.0046 1E-07 54.4 10.5 65 519-604 37-101 (122)
36 PF04228 Zn_peptidase: Putativ 96.6 0.011 2.4E-07 62.6 10.2 128 440-603 101-245 (292)
37 PF06114 DUF955: Domain of unk 95.4 0.034 7.4E-07 48.9 5.9 63 140-255 37-100 (122)
38 PF01863 DUF45: Protein of unk 95.3 0.04 8.7E-07 54.3 6.6 69 436-541 112-182 (205)
39 PRK04351 hypothetical protein; 94.8 0.11 2.4E-06 50.1 7.8 71 433-535 1-72 (149)
40 COG1451 Predicted metal-depend 94.7 0.085 1.8E-06 54.0 7.1 73 436-544 123-195 (223)
41 PF01863 DUF45: Protein of unk 94.2 0.061 1.3E-06 53.0 4.6 67 40-161 113-180 (205)
42 PF13699 DUF4157: Domain of un 93.3 0.15 3.2E-06 44.0 4.9 69 432-537 1-74 (79)
43 PF04298 Zn_peptidase_2: Putat 92.1 7.1 0.00015 40.2 15.9 86 419-540 17-105 (222)
44 COG1451 Predicted metal-depend 90.9 0.3 6.4E-06 50.0 4.6 25 140-164 170-194 (223)
45 PF10026 DUF2268: Predicted Zn 90.7 0.75 1.6E-05 45.8 7.2 74 437-539 4-80 (195)
46 PF10463 Peptidase_U49: Peptid 90.2 0.23 5E-06 50.3 3.1 41 523-589 100-140 (206)
47 smart00731 SprT SprT homologue 89.7 0.65 1.4E-05 44.2 5.5 66 437-537 2-72 (146)
48 PF10263 SprT-like: SprT-like 88.0 2.1 4.6E-05 40.5 7.8 33 480-540 44-76 (157)
49 PF04450 BSP: Peptidase of pla 85.7 1.8 3.9E-05 43.9 6.2 82 432-540 24-112 (205)
50 PF04228 Zn_peptidase: Putativ 85.2 0.69 1.5E-05 49.3 3.1 18 224-241 205-222 (292)
51 PF04298 Zn_peptidase_2: Putat 82.2 66 0.0014 33.3 15.7 17 145-161 89-105 (222)
52 PF13203 DUF2201_N: Putative m 80.6 2 4.3E-05 45.1 4.4 27 519-545 55-81 (292)
53 PF08325 WLM: WLM domain; Int 80.1 2.4 5.2E-05 42.4 4.5 23 138-160 75-97 (186)
54 PRK04351 hypothetical protein; 76.5 2 4.4E-05 41.4 2.7 31 98-156 42-72 (149)
55 PF12388 Peptidase_M57: Dual-a 74.7 5.8 0.00013 40.5 5.5 22 519-540 128-150 (211)
56 PF13699 DUF4157: Domain of un 73.7 2.7 5.8E-05 36.3 2.5 19 141-159 57-75 (79)
57 PF04450 BSP: Peptidase of pla 73.0 3.9 8.4E-05 41.5 3.8 53 86-161 60-112 (205)
58 PF08325 WLM: WLM domain; Int 72.8 7.4 0.00016 38.9 5.7 21 519-539 77-97 (186)
59 PF13203 DUF2201_N: Putative m 72.3 3.6 7.8E-05 43.2 3.6 26 140-165 55-80 (292)
60 PF10463 Peptidase_U49: Peptid 71.9 1.7 3.7E-05 44.2 0.9 19 145-163 101-119 (206)
61 PRK09672 phage exclusion prote 71.5 3.6 7.9E-05 43.9 3.3 41 524-590 165-205 (305)
62 cd04269 ZnMc_adamalysin_II_lik 71.5 9.2 0.0002 37.5 6.1 16 521-536 128-143 (194)
63 COG2321 Predicted metalloprote 71.4 5.7 0.00012 42.0 4.6 103 451-586 111-217 (295)
64 smart00731 SprT SprT homologue 70.9 3.4 7.4E-05 39.2 2.7 32 99-158 41-72 (146)
65 PF10026 DUF2268: Predicted Zn 70.8 8.9 0.00019 38.2 5.8 21 140-160 60-80 (195)
66 PF10263 SprT-like: SprT-like 70.8 2.9 6.2E-05 39.6 2.2 32 100-159 43-74 (157)
67 COG4900 Predicted metallopepti 70.6 13 0.00028 34.5 6.2 18 519-536 75-92 (133)
68 COG2321 Predicted metalloprote 69.1 2.9 6.2E-05 44.2 1.9 18 224-241 202-219 (295)
69 cd04270 ZnMc_TACE_like Zinc-de 68.3 3.4 7.4E-05 42.7 2.3 22 137-158 159-180 (244)
70 COG3091 SprT Zn-dependent meta 68.1 9.4 0.0002 37.1 5.0 84 435-552 4-98 (156)
71 cd04279 ZnMc_MMP_like_1 Zinc-d 64.0 4 8.6E-05 38.8 1.7 15 142-156 101-115 (156)
72 PRK04860 hypothetical protein; 64.0 18 0.00039 35.4 6.2 64 441-538 12-77 (160)
73 PRK09672 phage exclusion prote 64.0 1.8 3.8E-05 46.3 -0.8 18 145-162 165-182 (305)
74 TIGR00181 pepF oligoendopeptid 64.0 6 0.00013 45.7 3.5 22 83-104 348-370 (591)
75 cd04269 ZnMc_adamalysin_II_lik 63.9 6.6 0.00014 38.5 3.3 16 142-157 128-143 (194)
76 cd04268 ZnMc_MMP_like Zinc-dep 63.9 4 8.7E-05 38.6 1.7 16 520-535 90-105 (165)
77 PF01432 Peptidase_M3: Peptida 63.8 6.9 0.00015 43.7 3.8 55 83-159 194-256 (458)
78 PF12388 Peptidase_M57: Dual-a 63.5 6 0.00013 40.4 2.9 17 140-156 128-144 (211)
79 PF05572 Peptidase_M43: Pregna 63.4 3.6 7.9E-05 39.7 1.3 23 522-544 67-90 (154)
80 cd04270 ZnMc_TACE_like Zinc-de 63.1 5.2 0.00011 41.3 2.5 18 519-536 162-179 (244)
81 cd04279 ZnMc_MMP_like_1 Zinc-d 63.1 4.5 9.7E-05 38.4 1.8 16 521-536 101-116 (156)
82 PF00413 Peptidase_M10: Matrix 62.9 3.3 7.1E-05 38.6 0.9 17 140-156 100-116 (154)
83 cd04268 ZnMc_MMP_like Zinc-dep 61.5 4.3 9.4E-05 38.3 1.5 16 141-156 90-105 (165)
84 PF13485 Peptidase_MA_2: Pepti 60.3 6.6 0.00014 34.5 2.3 22 519-540 20-41 (128)
85 PF00413 Peptidase_M10: Matrix 58.7 5.7 0.00012 37.0 1.7 22 519-540 100-122 (154)
86 TIGR02289 M3_not_pepF oligoend 58.4 8.6 0.00019 44.2 3.4 19 82-100 305-323 (549)
87 KOG2765|consensus 57.7 47 0.001 36.9 8.6 103 291-415 275-387 (416)
88 PF13688 Reprolysin_5: Metallo 57.6 5.1 0.00011 39.2 1.2 16 520-535 138-153 (196)
89 cd06455 M3A_TOP Peptidase M3 T 57.4 22 0.00047 40.2 6.3 20 519-539 259-278 (472)
90 PF13485 Peptidase_MA_2: Pepti 57.0 7.8 0.00017 34.0 2.2 22 140-161 20-41 (128)
91 COG2856 Predicted Zn peptidase 55.8 7.8 0.00017 39.6 2.2 20 519-538 67-86 (213)
92 PF05572 Peptidase_M43: Pregna 55.7 5.5 0.00012 38.5 1.1 14 144-157 68-81 (154)
93 cd04267 ZnMc_ADAM_like Zinc-de 54.5 5.5 0.00012 38.9 0.9 15 522-536 131-145 (192)
94 PF15061 DUF4538: Domain of un 54.4 17 0.00036 29.9 3.4 43 404-454 8-50 (58)
95 COG2856 Predicted Zn peptidase 54.4 8.4 0.00018 39.4 2.2 21 140-160 67-87 (213)
96 PF13582 Reprolysin_3: Metallo 54.2 6.4 0.00014 35.5 1.2 10 526-535 109-118 (124)
97 PF13688 Reprolysin_5: Metallo 53.9 5.9 0.00013 38.7 1.0 17 140-156 137-153 (196)
98 cd06455 M3A_TOP Peptidase M3 T 53.0 18 0.0004 40.8 4.8 21 138-159 257-277 (472)
99 PF01421 Reprolysin: Reprolysi 52.1 7.1 0.00015 38.5 1.3 16 520-535 127-142 (199)
100 cd04272 ZnMc_salivary_gland_MP 51.5 12 0.00025 37.8 2.7 14 523-536 144-157 (220)
101 PF13582 Reprolysin_3: Metallo 51.3 7.1 0.00015 35.2 1.0 11 147-157 109-119 (124)
102 COG3824 Predicted Zn-dependent 51.1 12 0.00026 35.1 2.5 14 522-535 107-120 (136)
103 COG1164 Oligoendopeptidase F [ 50.2 14 0.00031 43.2 3.5 44 83-160 349-395 (598)
104 PF13574 Reprolysin_2: Metallo 50.2 7.9 0.00017 37.8 1.2 13 524-536 111-123 (173)
105 TIGR02289 M3_not_pepF oligoend 49.7 11 0.00024 43.3 2.5 21 520-541 334-354 (549)
106 PF01421 Reprolysin: Reprolysi 49.5 9.7 0.00021 37.6 1.7 16 141-156 127-142 (199)
107 cd06459 M3B_Oligoendopeptidase 49.3 16 0.00034 40.0 3.5 18 83-100 191-208 (427)
108 cd04267 ZnMc_ADAM_like Zinc-de 49.3 7.1 0.00015 38.2 0.7 15 143-157 131-145 (192)
109 cd04278 ZnMc_MMP Zinc-dependen 49.2 8.7 0.00019 36.5 1.3 15 521-535 104-118 (157)
110 PF01432 Peptidase_M3: Peptida 48.4 23 0.00049 39.6 4.6 62 454-537 186-255 (458)
111 cd04278 ZnMc_MMP Zinc-dependen 48.0 9.5 0.00021 36.3 1.4 16 141-156 103-118 (157)
112 cd04272 ZnMc_salivary_gland_MP 47.9 15 0.00032 37.1 2.7 20 6-25 21-40 (220)
113 COG2738 Predicted Zn-dependent 47.3 1.6E+02 0.0034 30.2 9.6 83 423-542 24-110 (226)
114 PF02031 Peptidase_M7: Strepto 47.3 9.1 0.0002 36.3 1.1 10 147-156 79-88 (132)
115 PF13574 Reprolysin_2: Metallo 46.7 9.1 0.0002 37.4 1.0 11 146-156 112-122 (173)
116 cd06459 M3B_Oligoendopeptidase 46.7 14 0.0003 40.4 2.5 20 521-541 220-239 (427)
117 TIGR00181 pepF oligoendopeptid 46.3 16 0.00035 42.2 3.2 17 525-541 379-395 (591)
118 TIGR02290 M3_fam_3 oligoendope 45.5 15 0.00033 42.5 2.8 16 83-98 344-359 (587)
119 PF09768 Peptidase_M76: Peptid 44.5 22 0.00047 35.3 3.2 40 92-161 47-87 (173)
120 PF01447 Peptidase_M4: Thermol 44.1 11 0.00023 36.5 1.0 11 146-156 136-146 (150)
121 COG1164 Oligoendopeptidase F [ 43.8 15 0.00032 43.1 2.3 45 466-541 353-397 (598)
122 COG2738 Predicted Zn-dependent 43.8 38 0.00083 34.4 4.8 20 145-164 92-111 (226)
123 PRK04860 hypothetical protein; 43.8 16 0.00034 35.8 2.1 19 140-158 58-76 (160)
124 PF01447 Peptidase_M4: Thermol 43.3 12 0.00026 36.2 1.2 17 466-486 108-124 (150)
125 PF02031 Peptidase_M7: Strepto 42.4 13 0.00028 35.2 1.3 9 527-535 80-88 (132)
126 cd00203 ZnMc Zinc-dependent me 41.1 12 0.00025 35.4 0.8 16 521-536 93-108 (167)
127 COG3091 SprT Zn-dependent meta 40.8 17 0.00038 35.3 1.9 17 140-156 56-72 (156)
128 PRK13267 archaemetzincin-like 40.7 84 0.0018 31.3 6.8 19 521-539 122-141 (179)
129 PF14247 DUF4344: Domain of un 40.6 46 0.001 34.3 5.0 57 466-535 47-103 (220)
130 cd04271 ZnMc_ADAM_fungal Zinc- 39.8 10 0.00023 38.8 0.3 11 526-536 147-157 (228)
131 TIGR00782 ccoP cytochrome c ox 39.7 44 0.00095 35.3 4.9 33 396-430 25-57 (285)
132 cd00203 ZnMc Zinc-dependent me 39.4 12 0.00026 35.2 0.6 16 142-157 93-108 (167)
133 cd06460 M32_Taq Peptidase fami 38.4 42 0.00092 37.4 4.7 73 431-540 102-175 (396)
134 cd06258 Peptidase_M3_like The 38.3 44 0.00095 36.0 4.8 60 79-160 104-169 (365)
135 KOG1047|consensus 38.1 41 0.0009 39.0 4.6 60 442-538 239-302 (613)
136 cd04271 ZnMc_ADAM_fungal Zinc- 37.5 12 0.00025 38.4 0.2 19 6-24 21-39 (228)
137 smart00235 ZnMc Zinc-dependent 37.0 15 0.00032 34.0 0.8 10 526-535 88-97 (140)
138 PF09768 Peptidase_M76: Peptid 36.9 92 0.002 31.0 6.3 16 520-535 67-82 (173)
139 PF13583 Reprolysin_4: Metallo 36.8 16 0.00035 36.8 1.0 16 526-541 139-155 (206)
140 TIGR02290 M3_fam_3 oligoendope 36.8 20 0.00044 41.5 2.0 15 525-539 376-390 (587)
141 cd06456 M3A_DCP_Oligopeptidase 36.6 46 0.001 37.3 4.7 22 138-160 202-223 (422)
142 COG4900 Predicted metallopepti 35.6 26 0.00055 32.6 2.0 21 137-157 72-92 (133)
143 COG1054 Predicted sulfurtransf 34.5 74 0.0016 34.3 5.5 45 425-475 8-53 (308)
144 PF02163 Peptidase_M50: Peptid 34.3 23 0.0005 34.5 1.7 12 524-535 7-18 (192)
145 PF13583 Reprolysin_4: Metallo 34.2 18 0.00039 36.4 0.9 11 147-157 139-149 (206)
146 KOG3314|consensus 33.2 42 0.00092 33.2 3.2 39 95-163 71-109 (194)
147 COG3824 Predicted Zn-dependent 32.8 23 0.00051 33.3 1.3 15 143-157 107-121 (136)
148 cd06258 Peptidase_M3_like The 32.4 82 0.0018 33.9 5.7 21 519-540 150-170 (365)
149 KOG3607|consensus 32.3 36 0.00078 40.9 3.1 16 520-535 319-334 (716)
150 PF14891 Peptidase_M91: Effect 31.7 24 0.00052 34.5 1.3 18 141-158 99-116 (174)
151 COG2928 Uncharacterized conser 31.6 5.2E+02 0.011 26.9 10.8 35 437-475 103-137 (222)
152 smart00235 ZnMc Zinc-dependent 31.4 20 0.00044 33.1 0.7 10 147-156 88-97 (140)
153 PF14891 Peptidase_M91: Effect 31.2 33 0.00072 33.5 2.2 18 520-537 99-116 (174)
154 cd06456 M3A_DCP_Oligopeptidase 30.7 61 0.0013 36.3 4.4 21 519-540 204-224 (422)
155 PF12725 DUF3810: Protein of u 30.6 26 0.00056 37.9 1.4 26 133-158 184-209 (318)
156 COG3590 PepO Predicted metallo 30.4 14 0.0003 42.9 -0.7 15 522-536 485-499 (654)
157 COG3590 PepO Predicted metallo 29.8 15 0.00032 42.6 -0.6 17 141-157 483-499 (654)
158 PF14715 FixP_N: N-terminal do 29.1 61 0.0013 25.9 2.9 26 396-421 19-44 (51)
159 PF02163 Peptidase_M50: Peptid 29.1 30 0.00066 33.7 1.5 12 145-156 7-18 (192)
160 KOG1047|consensus 29.0 34 0.00073 39.7 2.0 27 320-347 391-417 (613)
161 PF01433 Peptidase_M1: Peptida 28.8 1E+02 0.0022 33.1 5.7 20 521-540 292-311 (390)
162 PRK05320 rhodanese superfamily 28.5 69 0.0015 33.5 4.1 57 426-490 7-64 (257)
163 COG0475 KefB Kef-type K+ trans 28.5 1.7E+02 0.0037 32.5 7.4 97 342-438 120-222 (397)
164 cd05709 S2P-M50 Site-2 proteas 28.4 33 0.00072 33.2 1.7 13 524-536 8-20 (180)
165 COG3864 Uncharacterized protei 27.9 63 0.0014 35.2 3.7 24 519-542 64-87 (396)
166 PRK01415 hypothetical protein; 27.6 74 0.0016 33.3 4.1 45 425-475 8-53 (247)
167 PF12315 DUF3633: Protein of u 27.4 47 0.001 34.1 2.5 20 518-537 87-106 (212)
168 TIGR02412 pepN_strep_liv amino 27.4 74 0.0016 38.7 4.7 18 522-539 285-302 (831)
169 cd04327 ZnMc_MMP_like_3 Zinc-d 27.4 38 0.00082 33.7 1.9 14 522-535 90-103 (198)
170 TIGR03296 M6dom_TIGR03296 M6 f 27.3 14 0.00031 39.0 -1.2 11 525-535 166-176 (286)
171 PF11694 DUF3290: Protein of u 27.3 4.3E+02 0.0094 25.6 9.0 24 436-459 79-102 (149)
172 cd04275 ZnMc_pappalysin_like Z 27.2 11 0.00024 38.7 -1.9 20 523-542 136-156 (225)
173 COG4227 Antirestriction protei 27.1 34 0.00074 36.2 1.5 21 520-541 199-219 (316)
174 KOG3733|consensus 27.1 1.5E+02 0.0033 33.6 6.4 61 410-480 487-555 (566)
175 PF12315 DUF3633: Protein of u 26.7 44 0.00094 34.3 2.2 20 138-157 86-105 (212)
176 cd06460 M32_Taq Peptidase fami 26.6 84 0.0018 35.2 4.6 14 147-160 161-174 (396)
177 cd04273 ZnMc_ADAMTS_like Zinc- 26.6 14 0.00031 36.8 -1.3 12 146-157 141-152 (207)
178 cd04277 ZnMc_serralysin_like Z 26.4 37 0.0008 33.1 1.6 19 522-540 111-130 (186)
179 cd04277 ZnMc_serralysin_like Z 26.3 35 0.00076 33.3 1.4 15 142-156 110-124 (186)
180 COG3864 Uncharacterized protei 26.3 48 0.001 36.0 2.5 25 139-163 63-87 (396)
181 TIGR03513 GldL_gliding gliding 26.2 2.3E+02 0.005 29.0 7.1 46 375-420 6-51 (202)
182 cd06161 S2P-M50_SpoIVFB SpoIVF 25.8 39 0.00084 34.0 1.7 13 523-535 37-49 (208)
183 PF13398 Peptidase_M50B: Pepti 25.2 41 0.00088 33.7 1.7 15 521-535 19-33 (200)
184 cd04327 ZnMc_MMP_like_3 Zinc-d 25.2 41 0.00089 33.4 1.7 14 143-156 90-103 (198)
185 KOG3607|consensus 24.8 49 0.0011 39.8 2.4 16 142-157 320-335 (716)
186 COG3402 Uncharacterized conser 24.5 1.9E+02 0.0042 28.5 6.0 44 377-420 25-69 (161)
187 TIGR02140 permease_CysW sulfat 24.1 7.7E+02 0.017 25.0 11.7 36 417-457 139-174 (261)
188 COG4227 Antirestriction protei 24.0 44 0.00095 35.5 1.6 24 140-163 198-221 (316)
189 cd04276 ZnMc_MMP_like_2 Zinc-d 23.8 46 0.001 33.6 1.7 14 143-156 114-127 (197)
190 cd04276 ZnMc_MMP_like_2 Zinc-d 23.7 50 0.0011 33.4 2.0 19 522-540 114-133 (197)
191 COG4324 Predicted aminopeptida 23.7 57 0.0012 34.7 2.4 26 139-164 191-219 (376)
192 TIGR02414 pepN_proteo aminopep 23.6 91 0.002 38.3 4.5 18 522-539 281-298 (863)
193 cd06163 S2P-M50_PDZ_RseP-like 23.6 48 0.001 33.0 1.8 11 525-535 10-20 (182)
194 cd05709 S2P-M50 Site-2 proteas 23.6 43 0.00094 32.4 1.5 13 145-157 8-20 (180)
195 KOG3714|consensus 23.3 34 0.00074 38.2 0.8 18 515-535 153-170 (411)
196 COG2274 SunT ABC-type bacterio 23.2 3.3E+02 0.0071 32.9 8.9 20 300-319 229-248 (709)
197 cd06164 S2P-M50_SpoIVFB_CBS Sp 22.8 48 0.001 34.0 1.7 13 523-535 52-64 (227)
198 cd06161 S2P-M50_SpoIVFB SpoIVF 22.4 47 0.001 33.4 1.5 12 145-156 38-49 (208)
199 cd06159 S2P-M50_PDZ_Arch Uncha 22.2 50 0.0011 34.8 1.7 13 523-535 117-129 (263)
200 cd06162 S2P-M50_PDZ_SREBP Ster 21.7 52 0.0011 35.1 1.7 14 523-536 134-147 (277)
201 PF06262 DUF1025: Possibl zinc 21.6 61 0.0013 29.2 1.9 19 142-160 70-88 (97)
202 PRK14015 pepN aminopeptidase N 21.3 64 0.0014 39.7 2.6 19 522-540 294-312 (875)
203 smart00793 AgrB Accessory gene 21.1 4.3E+02 0.0093 26.1 8.0 24 423-446 118-141 (184)
204 COG4662 TupA ABC-type tungstat 20.9 5.4E+02 0.012 26.5 8.4 63 389-456 80-143 (227)
205 PF11677 DUF3273: Protein of u 20.8 2.9E+02 0.0063 29.4 6.9 48 393-445 207-255 (265)
206 cd04281 ZnMc_BMP1_TLD Zinc-dep 20.8 44 0.00095 33.8 0.9 17 516-535 82-98 (200)
207 KOG3314|consensus 20.6 1E+02 0.0022 30.7 3.2 44 469-542 65-109 (194)
208 COG4324 Predicted aminopeptida 20.5 76 0.0016 33.8 2.6 28 519-546 192-222 (376)
209 cd06163 S2P-M50_PDZ_RseP-like 20.3 51 0.0011 32.8 1.2 11 146-156 10-20 (182)
No 1
>KOG2719|consensus
Probab=100.00 E-value=7.8e-62 Score=517.95 Aligned_cols=285 Identities=45% Similarity=0.762 Sum_probs=249.6
Q ss_pred hhccchhhhhHHhhhhhhhhccCCccchHHHHHHHHHHHHHHHHhhhccHHHhhhcccccccCCCcccHHHHHHHHHHHH
Q psy4798 264 LLVCNGLPYFWSKSEELGETYFGFHKNEIVTSCLFIVLFNIFSTVIGLPISFYHHFVLEEKHGFNKQTYGFFVKDQIKSF 343 (623)
Q Consensus 264 iL~~g~~~~lw~~s~~~~~~~~~~~~~~~~~~~~f~~~~~l~~~li~lP~~~Y~~f~le~~~Gfn~~T~~~f~~d~vk~~ 343 (623)
.++++.+|++|.+++++... ..|++++++|+.+.+.+++++++|+++|++|++|+|||||+||.+.|++|.+|+.
T Consensus 82 ~L~~g~~~~lw~lt~~~~~~-----~~~~~~s~vF~~~~s~~~~l~~LP~~iys~fvie~~~gfnk~t~~~~~~d~lk~~ 156 (428)
T KOG2719|consen 82 ELYLGALPFLWKLTGKFLGK-----AGEIWLSLVFLLLLSIYSTLLDLPSSIYSTFVLEEKHGFNKQTLSLFIIDGLKSL 156 (428)
T ss_pred HHHHhHHHHHHHHHHHHhhh-----hhhhhhhhHHHHHHHHHHHHHhccHHHHHHHHHHhhcCCCcCcchhHHHHHHHHH
Confidence 46678999999999987541 2389999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhccccccchhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhh
Q psy4798 344 IVSLILSIPLTGAVVYIIQTYGFFVKDQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFI 423 (623)
Q Consensus 344 ~l~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I 423 (623)
++++ ++++|+++++.+++..+|++|.+|+|++..++++++++++|.+|
T Consensus 157 ~v~~--------------------------------~l~~Pi~~~iv~i~~~~g~yF~i~~w~f~~v~~l~l~ti~p~~i 204 (428)
T KOG2719|consen 157 LVGV--------------------------------VLTIPIVAAIVMILKKFGPYFFIYLWLFYFVLSLLLLTIYPGFI 204 (428)
T ss_pred HHHH--------------------------------HHHHHHHHHHHHHHHHcCccHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 9999 89999999999999999999999999999999999999999999
Q ss_pred hhhhccCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCC
Q psy4798 424 APLFDKYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAG 503 (623)
Q Consensus 424 ~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~ 503 (623)
+|+|+|++|+++++||++||++|++.|||.++++|++||+||+|+|||++|+|++||||+||||+.+
T Consensus 205 ~P~f~K~~PLe~g~l~~~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~------------- 271 (428)
T KOG2719|consen 205 APLFGKFTPLEEGDLKEKIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE------------- 271 (428)
T ss_pred hhhhcCCCCCCCCchHHHHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEehhhhhh-------------
Confidence 9999999999999999999999999999999999999999999999999999999999999999941
Q ss_pred CCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHH--------HHHhhhhhh---------------H-
Q psy4798 504 DSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKE--------FGVANKERE---------------A- 559 (623)
Q Consensus 504 ~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~--------~g~~~~~~~---------------~- 559 (623)
+++ |++||++||+|||+|||++||+.|++++.++ ||++++... +
T Consensus 272 ---------~~~---~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~~~P~~ig~ 339 (428)
T KOG2719|consen 272 ---------EEH---LNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFIDEQPSLIGF 339 (428)
T ss_pred ---------hhc---cccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCCCCcchhHH
Confidence 011 7899999999999999999999999998875 455444211 0
Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcccccccccc
Q psy4798 560 ---------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDKARLTQLYD 614 (623)
Q Consensus 560 ---------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~~~~~~l~~ 614 (623)
+....+....+.. ||+.|++||++|.+.. -.+++.+||.||.. +|++.+.+
T Consensus 340 livf~~~l~py~~l~~~~~n~~-sR~fEyqAD~fA~klG-Yg~~L~~AL~KL~~--dnlsf~~~ 399 (428)
T KOG2719|consen 340 LIVFQFVLAPYRALLNFLMNLI-SRRFEYQADAFAKKLG-YGKDLRQALIKLFV--DNLSFPVS 399 (428)
T ss_pred HHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHHcC-CchhHHHHHHHHhh--hhcCCCCC
Confidence 1111122233444 9999999999999986 67789999999998 44444433
No 2
>KOG2719|consensus
Probab=100.00 E-value=1.2e-38 Score=340.41 Aligned_cols=188 Identities=52% Similarity=0.850 Sum_probs=176.8
Q ss_pred cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCCc
Q psy4798 37 ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLN 116 (623)
Q Consensus 37 ~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~ 116 (623)
+.+++++.+|+++++.|||.+++++. ++|+|++|+|||++|+|++||||++|||+..
T Consensus 215 e~g~l~~~Ie~la~s~gfp~~k~~vi------------------~~s~rs~hsNAyfyG~~~~KRIvIyDtLl~~----- 271 (428)
T KOG2719|consen 215 EEGDLKEKIERLADSVGFPLSKYRVI------------------DGSKRSSHSNAYFYGLCKNKRIVIYDTLLLE----- 271 (428)
T ss_pred CCCchHHHHHHHHHhcCCCceEEEEE------------------ecCCCCCCCCeeeeeccccceEEEehhhhhh-----
Confidence 45689999999999999999998874 5899999999999999999999999999951
Q ss_pred cccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCC
Q psy4798 117 ADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD 196 (623)
Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~ 196 (623)
|+ .||+||+.||+|||+|||+++|+.|+++++++..++.+++|+.+++++.+|.++||.+
T Consensus 272 ----------~~----------~~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~llF~~~~~~~~ly~a~Gf~~ 331 (428)
T KOG2719|consen 272 ----------EE----------HLNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLLFGFLYRNPKLYAAFGFID 331 (428)
T ss_pred ----------hh----------ccccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHcCcchheeecCCC
Confidence 00 1799999999999999999999999999999999999999999999999999999999
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCHHHHHHHHHHHhhCcCCCchhhcc
Q psy4798 197 SQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKAIFLRKALLKINKDNLGFPALLVC 267 (623)
Q Consensus 197 ~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~~~L~sAL~KLa~~~ls~P~iL~~ 267 (623)
.+|..+|+++++.++++|+..+++++++.+||++|||||+||.++|+++.|++||.|+..+|.++|....+
T Consensus 332 ~~P~~ig~livf~~~l~py~~l~~~~~n~~sR~fEyqAD~fA~klGYg~~L~~AL~KL~~dnlsf~~~D~L 402 (428)
T KOG2719|consen 332 EQPSLIGFLIVFQFVLAPYRALLNFLMNLISRRFEYQADAFAKKLGYGKDLRQALIKLFVDNLSFPVSDPL 402 (428)
T ss_pred CCcchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchhHHHHHHHHhhhhcCCCCCcHH
Confidence 89999999999999999999999999999999999999999999999999999999999999999998644
No 3
>PRK03001 M48 family peptidase; Provisional
Probab=100.00 E-value=7.1e-35 Score=302.38 Aligned_cols=201 Identities=15% Similarity=0.231 Sum_probs=163.3
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCcCCc---hhHHHHHHHHH
Q psy4798 370 DQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTPLPD---GELKSRIEQLS 446 (623)
Q Consensus 370 ~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~~---~~L~~~I~~la 446 (623)
|.+|++++.++++++++.+.+++. |..+|+|+|++.++++++.++++|.++.|++++ +|+++ ++|++.++++|
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~l~~~~~p~L~~~v~~la 77 (283)
T PRK03001 2 NWVKTAMLMAAITALFIVIGGMIG---GSQGMLIALLFALGMNFFSYWFSDKMVLKMYNA-QEVDENTAPQFYRMVRELA 77 (283)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHh---chhhHHHHHHHHHHHHHHHHHHhHHHHHHHcCC-EECCccccHHHHHHHHHHH
Confidence 456667767788888877777665 566799999999999999999999999999998 88876 58999999999
Q ss_pred HHcCCCCCeEEEEeCCCCCCcccEEEeccCCCc-eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798 447 ASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL 525 (623)
Q Consensus 447 ~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~k-rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~ 525 (623)
++.|+|.+++|++|+ +++|||++|.+++| +||++|+|++. +++||+.
T Consensus 78 ~~~g~~~p~v~v~~~----~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~ 125 (283)
T PRK03001 78 QRAGLPMPKVYLINE----DQPNAFATGRNPEHAAVAATTGILRV----------------------------LSEREIR 125 (283)
T ss_pred HHcCCCCCeEEEecC----CCcceEEecCCCCCeEEEecHHHHhh----------------------------CCHHHHH
Confidence 999999999999996 36999999998765 69999999998 8999999
Q ss_pred HHHHHHhhhhhccCHHHHHHHHHHHHHhhhh--------------h--hH-------HHHHHHHHHHHHHHHHHHHHHHH
Q psy4798 526 AVLAHELGHWKYNHVLKSMILKKEFGVANKE--------------R--EA-------SVMRYVTKESELITARQDREAAE 582 (623)
Q Consensus 526 AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~--------------~--~~-------~v~~~~~~~~~~~~sR~~E~~AD 582 (623)
||+|||+|||+++|+.++++.+.+.++.... . .. .+......+....+||++|++||
T Consensus 126 aVlAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD 205 (283)
T PRK03001 126 GVMAHELAHVKHRDILISTISATMAGAISALANFAMFFGGRDENGRPVNPIAGIAVAILAPLAASLIQMAISRAREFEAD 205 (283)
T ss_pred HHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccchHHHHHHHHHHHHHHHHHHHHHhHHHHHHHh
Confidence 9999999999999999998765332211100 0 00 01111122333446999999999
Q ss_pred HHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798 583 KKYKEAMKEKDNVMAKLKVLSTDK 606 (623)
Q Consensus 583 ~~a~~~~~~~~~l~~aL~kL~~~~ 606 (623)
++|++.+++|++++++|+||+.+.
T Consensus 206 ~~a~~l~~~p~~l~~AL~Kl~~~~ 229 (283)
T PRK03001 206 RGGARISGDPQALASALDKIHRYA 229 (283)
T ss_pred HHHHHHhCCHHHHHHHHHHHHhhh
Confidence 999999999999999999999853
No 4
>PRK03982 heat shock protein HtpX; Provisional
Probab=100.00 E-value=2.9e-33 Score=290.98 Aligned_cols=200 Identities=19% Similarity=0.180 Sum_probs=157.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCcCC---chhHHHHHHHHH
Q psy4798 370 DQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTPLP---DGELKSRIEQLS 446 (623)
Q Consensus 370 ~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~---~~~L~~~I~~la 446 (623)
|.+|+++..++++.+++++.+ ..+|..||+++|++.+++.++ .+++|..|.|++++++|++ +++|++.++++|
T Consensus 3 ~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~l~~~~~p~L~~~v~~la 78 (288)
T PRK03982 3 NQLKTGLLMALLTGLLYAIGY---LLGGSIGPIIAILLALIPNLI-SYYYSDKIVLASYNARIVSEEEAPELYRIVERLA 78 (288)
T ss_pred hHHHHHHHHHHHHHHHHHHHH---HHhchhHHHHHHHHHHHHHHH-HHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHH
Confidence 455666666666666655544 235788999999998888776 7899999999999999994 567999999999
Q ss_pred HHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798 447 ASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL 525 (623)
Q Consensus 447 ~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~ 525 (623)
++.|+|.+++|++|++ ++|||++|++++++ |+++|+|++. +++||+.
T Consensus 79 ~~~g~~~p~v~v~~~~----~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------l~~~El~ 126 (288)
T PRK03982 79 ERANIPKPKVAIVPTQ----TPNAFATGRDPKHAVVAVTEGILNL----------------------------LNEDELE 126 (288)
T ss_pred HHcCCCCCeEEEEeCC----CcceEEeccCCCCeEEEeehHHHhh----------------------------CCHHHHH
Confidence 9999999999999854 69999999987554 7799999998 8999999
Q ss_pred HHHHHHhhhhhccCHHHHHHHHHHHHHhh--------hhh--------------hHH-----HHHHHHHHHHHHHHHHHH
Q psy4798 526 AVLAHELGHWKYNHVLKSMILKKEFGVAN--------KER--------------EAS-----VMRYVTKESELITARQDR 578 (623)
Q Consensus 526 AVLAHELGH~k~~H~~k~~~~~~~~g~~~--------~~~--------------~~~-----v~~~~~~~~~~~~sR~~E 578 (623)
||+|||+||+|++|+.++++.+.+.+... ... .+. +......+....+||++|
T Consensus 127 AVlAHElgHi~~~h~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~SR~~E 206 (288)
T PRK03982 127 GVIAHELTHIKNRDTLIQTIAATLAGAIMYLAQWLSWGLWFGGGGRDDRNGGNPIGSLLLIILAPIAATLIQFAISRQRE 206 (288)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 99999999999999999987654322111 000 000 111112233344699999
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798 579 EAAEKKYKEAMKEKDNVMAKLKVLSTD 605 (623)
Q Consensus 579 ~~AD~~a~~~~~~~~~l~~aL~kL~~~ 605 (623)
++||++|++.+++|++++++|.||+.+
T Consensus 207 ~~AD~~A~~~~~~p~~l~~aL~kL~~~ 233 (288)
T PRK03982 207 FSADEGGARLTGNPLALANALQKLEKG 233 (288)
T ss_pred HHHhHHHHHHhCCHHHHHHHHHHHHhh
Confidence 999999999999999999999999875
No 5
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.97 E-value=1.2e-29 Score=267.47 Aligned_cols=170 Identities=21% Similarity=0.233 Sum_probs=137.6
Q ss_pred HHHHHHHHHHHHhhchHhhhhhhc--cCCcCC--chhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccC-CCc
Q psy4798 405 WVFIILMSLFLMTIYPEFIAPLFD--KYTPLP--DGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFF-KNK 479 (623)
Q Consensus 405 w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~--~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~-~~k 479 (623)
.++.+++.++..+++|.++.|+|+ +++|.+ +++|++.++++|++.|+|.+++|++|++ .+|||++|.+ ++|
T Consensus 48 ~~~~~~~~~~~~~~sp~li~~~~~a~~~~p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~~~----~~NAfa~G~~~~~~ 123 (324)
T PRK01265 48 LIFVFFLNIIQWLFGPYMINAAYRTVEVTPTDPVYGWLYSIVAEVAKYNGIRVPKVYIADVP----FPNAFAYGSPIAGK 123 (324)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHcCCeeCCCCCcccHHHHHHHHHHHHHcCCCCCeEEEecCC----CCCeEEeccCCCCC
Confidence 345567788999999999999997 899987 7899999999999999999999999975 4999999996 579
Q ss_pred eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh----
Q psy4798 480 RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK---- 555 (623)
Q Consensus 480 rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~---- 555 (623)
+|+++|+|++. +++||+.||+|||+|||||+|+.++++++.+.+++..
T Consensus 124 ~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHik~~d~~~~~~~~~i~~~~~~~~~~ 175 (324)
T PRK01265 124 RIAITLPLLKI----------------------------LNRDEIKAVAGHELGHLKHRDVELLMAIGLIPTLIYYLGYS 175 (324)
T ss_pred EEEEehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHH
Confidence 99999999998 8999999999999999999999998876543111110
Q ss_pred ----hh-----------------hHH---HHHH-HHHHHHHHHHHHHHHHHHHHHHH-HcCCHHHHHHHHHHhhhccc
Q psy4798 556 ----ER-----------------EAS---VMRY-VTKESELITARQDREAAEKKYKE-AMKEKDNVMAKLKVLSTDKA 607 (623)
Q Consensus 556 ----~~-----------------~~~---v~~~-~~~~~~~~~sR~~E~~AD~~a~~-~~~~~~~l~~aL~kL~~~~~ 607 (623)
.. ... +... ...+..+ .||++||+||++|++ .+++|+++++||+||+...+
T Consensus 176 ~~~~~~~~~~~~~~~~~~~~l~l~~ll~~~~~~i~~~l~~a-iSR~rEy~AD~~aa~~~tg~p~~LasAL~KL~~~~~ 252 (324)
T PRK01265 176 LFWGGMFGGGGGGRGNNGGLLFLIGIALMAVSFVFNLLVLS-INRMREAYADVNSALTVPGGAENLQTALAKITLSMD 252 (324)
T ss_pred HHHHHHhcccccCCCccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhHHHHHhhcCChHHHHHHHHHHHhccc
Confidence 00 000 1111 2223344 499999999999998 55999999999999998544
No 6
>PRK01265 heat shock protein HtpX; Provisional
Probab=99.97 E-value=1.2e-30 Score=274.99 Aligned_cols=194 Identities=23% Similarity=0.235 Sum_probs=153.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccc----c--CCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 12 GIIGFSWIVFLFEFYLSIRQRRVYH----E--TTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 12 ~~~~~~~~~~~~~~~l~~~~~~~~~----~--~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
..+++++++|+.++++..+..++++ + .|++++.++++|++.|+|+|++|++|
T Consensus 50 ~~~~~~~~~~~~sp~li~~~~~a~~~~p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~---------------------- 107 (324)
T PRK01265 50 FVFFLNIIQWLFGPYMINAAYRTVEVTPTDPVYGWLYSIVAEVAKYNGIRVPKVYIAD---------------------- 107 (324)
T ss_pred HHHHHHHHHHHHhHHHHHHHcCCeeCCCCCcccHHHHHHHHHHHHHcCCCCCeEEEec----------------------
Confidence 3467789999999999999887553 1 34799999999999999999999987
Q ss_pred CCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
++.+|||++|.+. +++|+++|+|++. ||+||++||+|||+|||||+|+.++
T Consensus 108 ~~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHik~~d~~~~ 159 (324)
T PRK01265 108 VPFPNAFAYGSPIAGKRIAITLPLLKI----------------------------LNRDEIKAVAGHELGHLKHRDVELL 159 (324)
T ss_pred CCCCCeEEeccCCCCCEEEEehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHH
Confidence 4569999999974 7999999999997 8999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc--C
Q psy4798 165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL--G 242 (623)
Q Consensus 165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l--~ 242 (623)
++++.+..++.++.. ...+..+++.+|..+... .+.+.++..++.++..++.++++++||+|||+||++|++. +
T Consensus 160 ~~~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~l~l~~ll~~~~~~i~~~l~~aiSR~rEy~AD~~aa~~~tg 235 (324)
T PRK01265 160 MAIGLIPTLIYYLGY--SLFWGGMFGGGGGGRGNN--GGLLFLIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPG 235 (324)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHhcccccCCCcc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhcC
Confidence 988888776555432 222333444444321111 1122334555566677788899999999999999999995 8
Q ss_pred CHHHHHHHHHHHhhCcC
Q psy4798 243 KAIFLRKALLKINKDNL 259 (623)
Q Consensus 243 ~~~~L~sAL~KLa~~~l 259 (623)
+++.+++||.|++....
T Consensus 236 ~p~~LasAL~KL~~~~~ 252 (324)
T PRK01265 236 GAENLQTALAKITLSMD 252 (324)
T ss_pred ChHHHHHHHHHHHhccc
Confidence 89999999999998543
No 7
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.96 E-value=2.6e-28 Score=257.28 Aligned_cols=172 Identities=20% Similarity=0.239 Sum_probs=137.9
Q ss_pred HHHHHHHHHHHHHHHhhchHhhhhhhcc--CCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCC-C
Q psy4798 402 LYLWVFIILMSLFLMTIYPEFIAPLFDK--YTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFK-N 478 (623)
Q Consensus 402 l~~w~~~~~~~l~~~~i~P~~I~plf~k--~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~-~ 478 (623)
+...++.+++.++..+..|..+.+.++. .+|.++++|++.++++|++.|+|.++||++|+ .++|||++|.++ +
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~La~~agi~~p~v~vid~----~~~NAFa~G~~~~~ 106 (317)
T PRK01345 31 MIALVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPELYRMVRDLARRAGLPMPKVYIIDN----PQPNAFATGRNPEN 106 (317)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHcCCeECCcccCHHHHHHHHHHHHHcCCCCCcEEEEcC----CCcceEEecCCCCC
Confidence 3344666777888889999999999985 78888899999999999999999999999995 469999999975 5
Q ss_pred ceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh--
Q psy4798 479 KRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE-- 556 (623)
Q Consensus 479 krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~-- 556 (623)
++|+++|+|++. +++||+.||+|||+||++++|+.++++.+.+.++....
T Consensus 107 ~~V~vt~gLL~~----------------------------L~~dEL~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~l~~ 158 (317)
T PRK01345 107 AAVAATTGLLQR----------------------------LSPEEVAGVMAHELAHVKNRDTLTMTITATLAGAISMLAN 158 (317)
T ss_pred eEEEechHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHH
Confidence 579999999998 89999999999999999999999988765432211100
Q ss_pred -----------------hhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798 557 -----------------REAS-----VMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD 605 (623)
Q Consensus 557 -----------------~~~~-----v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~ 605 (623)
..+. +......+....+||++|++||++|++++++|++++++|+||+.+
T Consensus 159 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~~~SR~rE~~AD~~A~~ltg~p~~L~~AL~KL~~~ 229 (317)
T PRK01345 159 FAFFFGGNRENNNGPLGLVGTLAAMIVAPLAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERG 229 (317)
T ss_pred HHHHhcCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhh
Confidence 0000 011111233444699999999999999999999999999999974
No 8
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.96 E-value=2.8e-29 Score=262.63 Aligned_cols=196 Identities=19% Similarity=0.190 Sum_probs=147.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCC
Q psy4798 11 YGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRS 86 (623)
Q Consensus 11 ~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~ 86 (623)
...+++++++|+.++++.+++.++.+ +.|++++.++++|++.|+|+|++|++| +
T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~la~~~gip~p~v~v~~----------------------~ 105 (298)
T PRK04897 48 IIGVIYALIMIFQSTNVVMSMNHAREVTEEEAPELWHIVEDMAMVAQIPMPRVFIID----------------------D 105 (298)
T ss_pred HHHHHHHHHHHHhhHHHHHHhCCCEECChhhhHHHHHHHHHHHHHcCCCCCcEEEec----------------------C
Confidence 44567788999999999999988765 347899999999999999999999987 5
Q ss_pred CCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798 87 EHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM 165 (623)
Q Consensus 87 ~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~ 165 (623)
..+|||++|+++++ .|+++++|++. ||+||++||+|||+|||+|+|+.+++
T Consensus 106 ~~~NAfa~G~~~~~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~ 157 (298)
T PRK04897 106 PSPNAFATGSSPKNAAVAVTTGLLAI----------------------------MNREELEGVIGHEISHIRNYDIRLST 157 (298)
T ss_pred CCCceEEeccCCCCcEEEeehHHHhh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHH
Confidence 67999999998865 57888889997 89999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccCCCC---CC--C--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhh
Q psy4798 166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGF---YD--S--QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFG 238 (623)
Q Consensus 166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~---~~--~--~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA 238 (623)
+...+..+++++ ...+.. ..++..++. .+ . .+++..++.++..++.|+. ..++++++||++||+||++|
T Consensus 158 ~~~~~~~~~~~l-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~--~~ll~~~~SR~rE~~AD~~A 233 (298)
T PRK04897 158 IAVALASAITLL-SDIAGR-MMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLILAPLA--ATLIQLAISRQREYLADASS 233 (298)
T ss_pred HHHHHHHHHHHH-HHHHHH-HHHhcccccccccccccchhhHHHHHHHHHHHHHHHHH--HHHHHHhhhHHHHHHHHHHH
Confidence 887665554432 111111 111111111 01 1 1122222333444555543 46778999999999999999
Q ss_pred hhcC-CHHHHHHHHHHHhhCcCC
Q psy4798 239 KSLG-KAIFLRKALLKINKDNLG 260 (623)
Q Consensus 239 ~~l~-~~~~L~sAL~KLa~~~ls 260 (623)
++++ +|+.+++||.|+++++..
T Consensus 234 ~~lt~~p~~La~AL~KL~~~~~~ 256 (298)
T PRK04897 234 VELTRNPQGLISALEKISNSQPM 256 (298)
T ss_pred HHHhCCHHHHHHHHHHHHhcccc
Confidence 9995 699999999999988764
No 9
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.96 E-value=4.3e-28 Score=253.43 Aligned_cols=169 Identities=20% Similarity=0.109 Sum_probs=129.7
Q ss_pred HHHHHHHHHHHHhhchHhhhhhhc--cCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-E
Q psy4798 405 WVFIILMSLFLMTIYPEFIAPLFD--KYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-I 481 (623)
Q Consensus 405 w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-I 481 (623)
.++.+++.++..++.|..+....+ +.+|.++++|++.++++|++.|+|.+++|++|. +++|||++|++++++ |
T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~L~~~v~~la~~~~~~~p~v~v~~~----~~~NAfa~G~~~~~~~V 118 (296)
T PRK02391 43 VVIAGGFLLAQYFFSDKLALWSMGARIVSEDEYPELHAMVERLCALADLPKPRVAVADS----DVPNAFATGRSPKNAVV 118 (296)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHcCCEECChhhCHHHHHHHHHHHHHcCCCCCcEEEEeC----CCCceEEecCCCCCcEE
Confidence 344555666666677777665543 578888999999999999999999999999993 579999999987665 6
Q ss_pred EEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh-----
Q psy4798 482 VLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE----- 556 (623)
Q Consensus 482 VL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~----- 556 (623)
+++|+|++. +++||+.||+|||+||||++|+.++++.+.+..++...
T Consensus 119 ~vt~gLl~~----------------------------L~~~El~aVlaHElgHi~~~di~~~~i~~~~~~~~~~l~~~~~ 170 (296)
T PRK02391 119 CVTTGLMRR----------------------------LDPDELEAVLAHELSHVKNRDVAVMTIASFLSTIAFLIVRWGF 170 (296)
T ss_pred EecHHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 788999998 89999999999999999999999988665431111100
Q ss_pred ----------hh--H-----H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798 557 ----------RE--A-----S----VMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD 605 (623)
Q Consensus 557 ----------~~--~-----~----v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~ 605 (623)
.. . . +...+........||++|++||++|++++++|+++++||.||+.+
T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~p~~LasAL~KL~~~ 240 (296)
T PRK02391 171 YFGGFGGRGGGGGGGGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGRPSALASALMKISGR 240 (296)
T ss_pred HhccccCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHcc
Confidence 00 0 0 011111222233499999999999999999999999999999975
No 10
>PRK03982 heat shock protein HtpX; Provisional
Probab=99.96 E-value=4.5e-29 Score=259.66 Aligned_cols=198 Identities=19% Similarity=0.229 Sum_probs=148.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
+.+.+++.+++|++++++..+.++.++ +.|++++.++++|++.|+|+|++|++|
T Consensus 35 ~~~~~~~~~~~~~~~~~i~~~~~~~~~l~~~~~p~L~~~v~~la~~~g~~~p~v~v~~---------------------- 92 (288)
T PRK03982 35 ILLALIPNLISYYYSDKIVLASYNARIVSEEEAPELYRIVERLAERANIPKPKVAIVP---------------------- 92 (288)
T ss_pred HHHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCeEEEEe----------------------
Confidence 345677788899999999988887764 346799999999999999999999976
Q ss_pred CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
+..+|||++|+++++ .|+++|+|++. ||+||++||+|||+||++|+|+.++
T Consensus 93 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------l~~~El~AVlAHElgHi~~~h~~~~ 144 (288)
T PRK03982 93 TQTPNAFATGRDPKHAVVAVTEGILNL----------------------------LNEDELEGVIAHELTHIKNRDTLIQ 144 (288)
T ss_pred CCCcceEEeccCCCCeEEEeehHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence 568999999998854 57799999997 8999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcC-
Q psy4798 165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQ-PILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLG- 242 (623)
Q Consensus 165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~-p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~- 242 (623)
++.+.+...+.++. .+..+..++..+|+.... +..++.+ ++. ++.|+ ...++++++||++||+||++|++++
T Consensus 145 ~~~~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~--~~~l~~~~~SR~~E~~AD~~A~~~~~ 218 (288)
T PRK03982 145 TIAATLAGAIMYLA--QWLSWGLWFGGGGRDDRNGGNPIGSL-LLI-ILAPI--AATLIQFAISRQREFSADEGGARLTG 218 (288)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHhcccCccccccchHHHHH-HHH-HHHHH--HHHHHHHHHhHHHHHHHhHHHHHHhC
Confidence 98876543332221 111122344444443222 2223322 122 22332 2466778999999999999999995
Q ss_pred CHHHHHHHHHHHhhCcCCCch
Q psy4798 243 KAIFLRKALLKINKDNLGFPA 263 (623)
Q Consensus 243 ~~~~L~sAL~KLa~~~ls~P~ 263 (623)
+++.+++||.|+++.+.+.|.
T Consensus 219 ~p~~l~~aL~kL~~~~~~~~~ 239 (288)
T PRK03982 219 NPLALANALQKLEKGVRYIPL 239 (288)
T ss_pred CHHHHHHHHHHHHhhhccCCC
Confidence 899999999999988665543
No 11
>PRK02391 heat shock protein HtpX; Provisional
Probab=99.96 E-value=1e-28 Score=258.14 Aligned_cols=197 Identities=24% Similarity=0.242 Sum_probs=145.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
+...+++++++|+.++.+.++.++..+ +.|++++.++++|++.|+|+|++|++|
T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~L~~~v~~la~~~~~~~p~v~v~~---------------------- 100 (296)
T PRK02391 43 VVIAGGFLLAQYFFSDKLALWSMGARIVSEDEYPELHAMVERLCALADLPKPRVAVAD---------------------- 100 (296)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHcCCEECChhhCHHHHHHHHHHHHHcCCCCCcEEEEe----------------------
Confidence 445677889999999999999888765 347899999999999999999999987
Q ss_pred CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
++++|||++|+++++ .|+++|+|++. ||+||++||+|||+|||+|+|+.++
T Consensus 101 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------L~~~El~aVlaHElgHi~~~di~~~ 152 (296)
T PRK02391 101 SDVPNAFATGRSPKNAVVCVTTGLMRR----------------------------LDPDELEAVLAHELSHVKNRDVAVM 152 (296)
T ss_pred CCCCceEEecCCCCCcEEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence 678999999998755 57778999997 8999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-CC
Q psy4798 165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-GK 243 (623)
Q Consensus 165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-~~ 243 (623)
++++.+..++.+.. .... ++..++..+..+.. +.+++...+..+...+..++++++||++||+||++|+++ ++
T Consensus 153 ~i~~~~~~~~~~l~-~~~~----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~ 226 (296)
T PRK02391 153 TIASFLSTIAFLIV-RWGF----YFGGFGGRGGGGGG-GGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGR 226 (296)
T ss_pred HHHHHHHHHHHHHH-HHHH----HhccccCCCCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCC
Confidence 98877765544321 1110 11122211111100 111111111112233456778999999999999999998 57
Q ss_pred HHHHHHHHHHHhhCcCCCc
Q psy4798 244 AIFLRKALLKINKDNLGFP 262 (623)
Q Consensus 244 ~~~L~sAL~KLa~~~ls~P 262 (623)
|+.+++||.|++......|
T Consensus 227 p~~LasAL~KL~~~~~~~~ 245 (296)
T PRK02391 227 PSALASALMKISGRMDRVP 245 (296)
T ss_pred HHHHHHHHHHHHcccccCC
Confidence 9999999999998754433
No 12
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.96 E-value=5.3e-28 Score=251.89 Aligned_cols=173 Identities=16% Similarity=0.167 Sum_probs=131.9
Q ss_pred HHHHHHHHHHHHHHHHhhchHhhhhhhc--cCCcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCC
Q psy4798 401 FLYLWVFIILMSLFLMTIYPEFIAPLFD--KYTPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKN 478 (623)
Q Consensus 401 ~l~~w~~~~~~~l~~~~i~P~~I~plf~--k~~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~ 478 (623)
.+++.++.+++.++..+..|..+....+ ..+|.++++|++.++++|++.|+|.+++|++|.+ .+|||++|..++
T Consensus 33 ~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~~~~~p~L~~~v~~la~~~g~p~p~vyv~~~~----~~NAFa~G~~~~ 108 (288)
T PRK03072 33 LGIAVLIAVGMNAYVYWNSDKLALRAMHAQPVSEVQAPAMYRIVRELSTAARQPMPRLYISPTA----APNAFATGRNPR 108 (288)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCCEEEecCC----CCceEEecCCCC
Confidence 3444455666666777777776666554 3577888999999999999999999999999954 599999997544
Q ss_pred c-eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh--
Q psy4798 479 K-RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK-- 555 (623)
Q Consensus 479 k-rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~-- 555 (623)
+ .|+++|+|++. +++||++||+|||+||++++|+.++.+.+.+.++...
T Consensus 109 ~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~~~i~~l~ 160 (288)
T PRK03072 109 NAAVCCTEGILQI----------------------------LNERELRGVLGHELSHVYNRDILISSVAGALASVITYLA 160 (288)
T ss_pred CcEEEecHHHHHh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHH
Confidence 3 48899999988 8999999999999999999999998876543221110
Q ss_pred ----h-------h----hH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhc
Q psy4798 556 ----E-------R----EA--------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTD 605 (623)
Q Consensus 556 ----~-------~----~~--------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~ 605 (623)
. . .. .+......+....+||++|++||++|++++++|++++++|+||+.+
T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~SR~rE~~AD~~A~~l~~~p~~La~AL~KL~~~ 233 (288)
T PRK03072 161 NMAMFAGMFGGRRDNDGPNPLALLLVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGG 233 (288)
T ss_pred HHHHHHHHhccccccccchHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHhCCHHHHHHHHHHHHhc
Confidence 0 0 00 0111112233344699999999999999999999999999999874
No 13
>PRK04897 heat shock protein HtpX; Provisional
Probab=99.96 E-value=5.4e-28 Score=252.90 Aligned_cols=167 Identities=16% Similarity=0.234 Sum_probs=125.5
Q ss_pred HHHHHHHHHHhhchHhhhhhhccCCcC---CchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EE
Q psy4798 407 FIILMSLFLMTIYPEFIAPLFDKYTPL---PDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IV 482 (623)
Q Consensus 407 ~~~~~~l~~~~i~P~~I~plf~k~~pl---~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IV 482 (623)
+.+++.+ +.+.++..+...+++.+|+ ++++|++.++++|++.|+|.++||+++++ ++|||++|++++|+ |+
T Consensus 49 ~~~~~~~-~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~la~~~gip~p~v~v~~~~----~~NAfa~G~~~~~~~v~ 123 (298)
T PRK04897 49 IGVIYAL-IMIFQSTNVVMSMNHAREVTEEEAPELWHIVEDMAMVAQIPMPRVFIIDDP----SPNAFATGSSPKNAAVA 123 (298)
T ss_pred HHHHHHH-HHHHhhHHHHHHhCCCEECChhhhHHHHHHHHHHHHHcCCCCCcEEEecCC----CCceEEeccCCCCcEEE
Confidence 3333333 4444555555666678887 46789999999999999999999999954 59999999987665 88
Q ss_pred EehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhh-------
Q psy4798 483 LFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANK------- 555 (623)
Q Consensus 483 L~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~------- 555 (623)
++++|++. +++||+.||+|||+|||||+|+.++++...+.++...
T Consensus 124 vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~~~~~~l~~~~~~ 175 (298)
T PRK04897 124 VTTGLLAI----------------------------MNREELEGVIGHEISHIRNYDIRLSTIAVALASAITLLSDIAGR 175 (298)
T ss_pred eehHHHhh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 88899998 8999999999999999999999998876443211100
Q ss_pred --hh------------hH------H----HHHHH----HHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798 556 --ER------------EA------S----VMRYV----TKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK 606 (623)
Q Consensus 556 --~~------------~~------~----v~~~~----~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~ 606 (623)
.. .+ . +...+ ..+....+||++|++||++|++++++|+++++||+||+.+.
T Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~ll~~~~SR~rE~~AD~~A~~lt~~p~~La~AL~KL~~~~ 254 (298)
T PRK04897 176 MMWWGGGSRRRDDDRDGGGLQIILLIVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQ 254 (298)
T ss_pred HHHhcccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhcc
Confidence 00 00 0 01111 11223345999999999999999999999999999999753
No 14
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.96 E-value=1.1e-28 Score=261.49 Aligned_cols=192 Identities=22% Similarity=0.285 Sum_probs=147.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccC---------CcchHHHHhhhhhcCCC-CccccccccchhhhHHHHHHhHhhc
Q psy4798 11 YGIIGFSWIVFLFEFYLSIRQRRVYHET---------TIVPHQIAHGMDAESFE-KSRRYSLDKNVFSMFKETVSNVMNT 80 (623)
Q Consensus 11 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~---------~~l~~~l~~l~~~~g~~-~~~ly~~d~~~~~~~~~~v~~v~~~ 80 (623)
...+++++++|++++.+.++++++++.+ |++++.++++|++.|+| +|++|++|
T Consensus 78 ~~~~~~~~~~y~~~~~~~l~~~~a~~I~~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~----------------- 140 (336)
T PRK02870 78 LVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGLRFMPKVYIID----------------- 140 (336)
T ss_pred HHHHHHHHHHHHhhHHHHHHHcCCEEcCCCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEc-----------------
Confidence 4567789999999999999999986532 36999999999999999 89999987
Q ss_pred ccCCCCCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798 81 VSSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN 159 (623)
Q Consensus 81 ~~s~~~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~ 159 (623)
++++|||++|+++ +++|+++|+|++. ||+||++||+|||+||++|+
T Consensus 141 -----~~~~NAFA~G~~~~~~~Ivvt~GLL~~----------------------------L~~dEL~aVlAHELgHik~~ 187 (336)
T PRK02870 141 -----APYMNAFASGYSEKSAMVAITTGLLEK----------------------------LDRDELQAVMAHELSHIRHG 187 (336)
T ss_pred -----CCCCceEEecCCCCCcEEEEehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcc
Confidence 5689999999986 7899999999997 89999999999999999999
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCC---CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccch
Q psy4798 160 HVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD---SQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADA 236 (623)
Q Consensus 160 h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~---~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~ 236 (623)
|+.+++++..+..++.+.. ... +|..+|..+ ..+..++ ++++.+++.+ +..++++++||++||+||+
T Consensus 188 di~~~~~~~~l~~~~~~~~-~~~-----~~~~~g~~~~~~~~~~~~~-~l~l~~~~~~---~~~ll~~~iSR~rEy~AD~ 257 (336)
T PRK02870 188 DIRLTLCVGVLSNIMLIVA-DFL-----FYSFMGNRRNSGANRARMI-ILILRYVLPI---LTVLLMLFLSRTREYMADA 257 (336)
T ss_pred cHHHHHHHHHHHHHHHHHH-HHH-----HHHHhcCCcccccchhHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHHHhH
Confidence 9999887766654333221 111 111122211 1111112 2223333322 3467889999999999999
Q ss_pred hhhhc-CCHHHHHHHHHHHhhCcCCCc
Q psy4798 237 FGKSL-GKAIFLRKALLKINKDNLGFP 262 (623)
Q Consensus 237 fA~~l-~~~~~L~sAL~KLa~~~ls~P 262 (623)
+|+++ ++|+++++||.|++.++...|
T Consensus 258 ~Aa~ltg~p~aLasAL~KL~~~~~~~~ 284 (336)
T PRK02870 258 GAVELMRDNEPMARALQKISNDHAQND 284 (336)
T ss_pred HHHHHhCCHHHHHHHHHHHHhccccCc
Confidence 99999 579999999999999887765
No 15
>PRK03072 heat shock protein HtpX; Provisional
Probab=99.96 E-value=1.8e-28 Score=255.47 Aligned_cols=193 Identities=21% Similarity=0.257 Sum_probs=147.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCC
Q psy4798 11 YGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRS 86 (623)
Q Consensus 11 ~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~ 86 (623)
...+++++++|+.++++.+++++..+ +.|++++.++++|++.|+|+|++|++| +
T Consensus 38 ~~~~~~~~~~~~~s~~~~~~~~~~~~v~~~~~p~L~~~v~~la~~~g~p~p~vyv~~----------------------~ 95 (288)
T PRK03072 38 LIAVGMNAYVYWNSDKLALRAMHAQPVSEVQAPAMYRIVRELSTAARQPMPRLYISP----------------------T 95 (288)
T ss_pred HHHHHHHHHHHHHhHHHHHHhcCCEECChhhhHHHHHHHHHHHHHcCCCCCCEEEec----------------------C
Confidence 44567788999999999999988764 347899999999999999999999987 5
Q ss_pred CCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798 87 EHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM 165 (623)
Q Consensus 87 ~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~ 165 (623)
..+|||++|..+.+ .|+++++|++. +|+||++||+|||+|||+|+|+++++
T Consensus 96 ~~~NAFa~G~~~~~~~v~vt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~d~~~~~ 147 (288)
T PRK03072 96 AAPNAFATGRNPRNAAVCCTEGILQI----------------------------LNERELRGVLGHELSHVYNRDILISS 147 (288)
T ss_pred CCCceEEecCCCCCcEEEecHHHHHh----------------------------CCHHHHHHHHHHHHHHHhcCCHHHHH
Confidence 67999999986644 58889999997 89999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccCCCCCC-C-chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-C
Q psy4798 166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGFYD-S-QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL-G 242 (623)
Q Consensus 166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~-~-~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l-~ 242 (623)
+++.+..++.+.. .+. .++..+|... . .+..++++ ++. ++.|+ ...++++++||++||+||++|+++ +
T Consensus 148 ~~~~~~~~i~~l~--~~~---~~~~~~~~~~~~~~~~~~~~~-~~~-l~~~~--~~~~~~~~~SR~rE~~AD~~A~~l~~ 218 (288)
T PRK03072 148 VAGALASVITYLA--NMA---MFAGMFGGRRDNDGPNPLALL-LVS-LLGPI--AATVIQLAISRSREYQADESGAELTG 218 (288)
T ss_pred HHHHHHHHHHHHH--HHH---HHHHHhccccccccchHHHHH-HHH-HHHHH--HHHHHHHHHHhHHHHHHhHHHHHHhC
Confidence 8877665544321 110 0112233221 1 12222222 222 23332 356778999999999999999999 5
Q ss_pred CHHHHHHHHHHHhhCcCCCc
Q psy4798 243 KAIFLRKALLKINKDNLGFP 262 (623)
Q Consensus 243 ~~~~L~sAL~KLa~~~ls~P 262 (623)
+++.+++||.|+++.+.+.|
T Consensus 219 ~p~~La~AL~KL~~~~~~~~ 238 (288)
T PRK03072 219 DPLALASALRKISGGVQAAP 238 (288)
T ss_pred CHHHHHHHHHHHHhccccCC
Confidence 79999999999999876554
No 16
>PRK03001 M48 family peptidase; Provisional
Probab=99.96 E-value=5.6e-28 Score=250.90 Aligned_cols=191 Identities=19% Similarity=0.284 Sum_probs=144.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
+...+++++++++.++.+..+..+..+ +.|++++.++++|++.|+|+|++|++|
T Consensus 34 ~~~~~~~~~~~~~~~~~~i~~~~~~~~l~~~~~p~L~~~v~~la~~~g~~~p~v~v~~---------------------- 91 (283)
T PRK03001 34 LLFALGMNFFSYWFSDKMVLKMYNAQEVDENTAPQFYRMVRELAQRAGLPMPKVYLIN---------------------- 91 (283)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHcCCEECCccccHHHHHHHHHHHHHcCCCCCeEEEec----------------------
Confidence 455677788899999988888776644 236899999999999999999999987
Q ss_pred CCCCceeeecCCCCe-EEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFKNK-RIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~~k-rIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
++++|||++|++++| +|+++|+|++. +|+||++||+|||+|||+|+|+.++
T Consensus 92 ~~~~NAfa~G~~~~~~~Ivvt~gLl~~----------------------------l~~~El~aVlAHElgHi~~~h~~~~ 143 (283)
T PRK03001 92 EDQPNAFATGRNPEHAAVAATTGILRV----------------------------LSEREIRGVMAHELAHVKHRDILIS 143 (283)
T ss_pred CCCcceEEecCCCCCeEEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhCCChHHH
Confidence 568999999998865 79999999997 8999999999999999999999999
Q ss_pred HHHHHHHHHHHH-HHHHHHhcCCCcccCCCCCCC--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc
Q psy4798 165 MIVMQLNLLFML-YSFQYLFQYPPLYSAFGFYDS--QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL 241 (623)
Q Consensus 165 ~~~~~~~~~~~~-~lf~~~~~~~~l~~~~G~~~~--~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l 241 (623)
++...+..++.. ..++.+ +++++..+. .+.. .+++.++ .|+ ...++++++||++||+||++|.++
T Consensus 144 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~-~~~--~~~~~~~~~SR~~E~~AD~~a~~l 211 (283)
T PRK03001 144 TISATMAGAISALANFAMF------FGGRDENGRPVNPIA---GIAVAIL-APL--AASLIQMAISRAREFEADRGGARI 211 (283)
T ss_pred HHHHHHHHHHHHHHHHHHH------hcCCCccccccchHH---HHHHHHH-HHH--HHHHHHHHHhHHHHHHHhHHHHHH
Confidence 876543333222 112222 222332111 1211 1122222 222 245677899999999999999998
Q ss_pred C-CHHHHHHHHHHHhhCcCCCc
Q psy4798 242 G-KAIFLRKALLKINKDNLGFP 262 (623)
Q Consensus 242 ~-~~~~L~sAL~KLa~~~ls~P 262 (623)
+ +++.+++||.|+++++.+.|
T Consensus 212 ~~~p~~l~~AL~Kl~~~~~~~p 233 (283)
T PRK03001 212 SGDPQALASALDKIHRYASGIP 233 (283)
T ss_pred hCCHHHHHHHHHHHHhhhccCC
Confidence 5 79999999999999887755
No 17
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.96 E-value=8.4e-28 Score=249.97 Aligned_cols=191 Identities=22% Similarity=0.272 Sum_probs=139.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhccccC-------CcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 13 IIGFSWIVFLFEFYLSIRQRRVYHET-------TIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 13 ~~~~~~~~~~~~~~l~~~~~~~~~~~-------~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
.++.++++|+.++++..+++++++.+ |.+++.++++|++.|+|+|++|++|
T Consensus 44 ~~~~~~~~~~~~~~i~~~~~~a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~---------------------- 101 (284)
T PRK05457 44 GFGGSFISLLMSKWMAKRSTGAEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYH---------------------- 101 (284)
T ss_pred HHHHHHHHHHHHHHHHHHhcCCeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEe----------------------
Confidence 44567899999999999998876532 1388999999999999999999987
Q ss_pred CCCCceeeecCCCCeE-EEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~~kr-Ivl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
++++|||++|.+++++ |+++++|++. ||+||++||+|||+||++|+|+.++
T Consensus 102 ~~~~NAfa~G~~~~~~~V~vt~gLl~~----------------------------L~~~El~aVlAHElgHi~~~d~~~~ 153 (284)
T PRK05457 102 SPEINAFATGASKNNSLVAVSTGLLQN----------------------------MSRDEVEAVLAHEISHIANGDMVTM 153 (284)
T ss_pred CCCceEEEecCCCCCeEEEeehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence 5789999999988776 5566668886 8999999999999999999999998
Q ss_pred HHHHHH-HHHHHHH--HHHHHhcCCCcccCCCCCCCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhh
Q psy4798 165 MIVMQL-NLLFMLY--SFQYLFQYPPLYSAFGFYDSQPI-LLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKS 240 (623)
Q Consensus 165 ~~~~~~-~~~~~~~--lf~~~~~~~~l~~~~G~~~~~p~-~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~ 240 (623)
+++..+ ..+++++ +++.+... ... +..+.... .....+++..+ +.++.+++.+++||++||+||++|++
T Consensus 154 ~l~~~~~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~l~~~~---~~~~~~ll~~~~SR~rEy~AD~~Aa~ 226 (284)
T PRK05457 154 TLIQGVVNTFVIFLSRIIAQIVDR--FVS--GNEEGNGIGYFIVSIVLEIV---FGILASIIVMWFSRHREFRADAGGAK 226 (284)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh--hcc--cCcccccHHHHHHHHHHHHH---HHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 766443 3333332 12222111 000 11111111 11111122222 23456788899999999999999999
Q ss_pred cCCHHHHHHHHHHHhhCcCC
Q psy4798 241 LGKAIFLRKALLKINKDNLG 260 (623)
Q Consensus 241 l~~~~~L~sAL~KLa~~~ls 260 (623)
+++|+.+++||.|+++.+..
T Consensus 227 ltgp~~L~~AL~KL~~~~~~ 246 (284)
T PRK05457 227 LAGREKMIAALQRLKTSYEP 246 (284)
T ss_pred hhCHHHHHHHHHHHHhhCcc
Confidence 98899999999999998773
No 18
>PRK02870 heat shock protein HtpX; Provisional
Probab=99.95 E-value=8.4e-27 Score=247.11 Aligned_cols=169 Identities=18% Similarity=0.291 Sum_probs=128.7
Q ss_pred HHHHHHHHHHHhhchHhhhhhh-------ccCCcCCchhHHHHHHHHHHHcCCC-CCeEEEEeCCCCCCcccEEEeccCC
Q psy4798 406 VFIILMSLFLMTIYPEFIAPLF-------DKYTPLPDGELKSRIEQLSASVKFP-LKKLYVVEGSKRSEHSNAYFYGFFK 477 (623)
Q Consensus 406 ~~~~~~~l~~~~i~P~~I~plf-------~k~~pl~~~~L~~~I~~la~~~gfp-~~~v~vv~~s~rs~~~NAy~~G~~~ 477 (623)
++.+++.++..+..+..+.... +++.|+++++|++.++++|++.|+| .++||++|++ ++|||++|+++
T Consensus 78 ~~~~~~~~~~y~~~~~~~l~~~~a~~I~~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~~~----~~NAFA~G~~~ 153 (336)
T PRK02870 78 LVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGLRFMPKVYIIDAP----YMNAFASGYSE 153 (336)
T ss_pred HHHHHHHHHHHHhhHHHHHHHcCCEEcCCCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEcCC----CCceEEecCCC
Confidence 4445555555556665554443 3567777899999999999999999 7999999964 69999999985
Q ss_pred -CceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHh---
Q psy4798 478 -NKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVA--- 553 (623)
Q Consensus 478 -~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~--- 553 (623)
+++|+++|+|++. +++||+.||+|||+||++|+|+.++++.+.+.++.
T Consensus 154 ~~~~Ivvt~GLL~~----------------------------L~~dEL~aVlAHELgHik~~di~~~~~~~~l~~~~~~~ 205 (336)
T PRK02870 154 KSAMVAITTGLLEK----------------------------LDRDELQAVMAHELSHIRHGDIRLTLCVGVLSNIMLIV 205 (336)
T ss_pred CCcEEEEehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHH
Confidence 6899999999997 89999999999999999999998887654331110
Q ss_pred ---------hhh--h------hH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798 554 ---------NKE--R------EA-----SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK 606 (623)
Q Consensus 554 ---------~~~--~------~~-----~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~ 606 (623)
... . .. .+...+..+....+||++||+||++|++++++|+++++||+||+.+.
T Consensus 206 ~~~~~~~~~g~~~~~~~~~~~~~~l~l~~~~~~~~~ll~~~iSR~rEy~AD~~Aa~ltg~p~aLasAL~KL~~~~ 280 (336)
T PRK02870 206 ADFLFYSFMGNRRNSGANRARMIILILRYVLPILTVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDH 280 (336)
T ss_pred HHHHHHHHhcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhcc
Confidence 000 0 00 11122223333445999999999999999999999999999999754
No 19
>PRK05457 heat shock protein HtpX; Provisional
Probab=99.95 E-value=3.9e-26 Score=237.51 Aligned_cols=167 Identities=17% Similarity=0.179 Sum_probs=120.1
Q ss_pred HHHHHHHHHHhhchHhhhhhhccCC----cCCchh--HHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce
Q psy4798 407 FIILMSLFLMTIYPEFIAPLFDKYT----PLPDGE--LKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR 480 (623)
Q Consensus 407 ~~~~~~l~~~~i~P~~I~plf~k~~----pl~~~~--L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr 480 (623)
+..+..++..++.|..+....+ .+ |.++++ |++.++++|++.|+|.+++|++|. .++|||++|.+++++
T Consensus 43 ~~~~~~~~~~~~~~~i~~~~~~-a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~~----~~~NAfa~G~~~~~~ 117 (284)
T PRK05457 43 FGFGGSFISLLMSKWMAKRSTG-AEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYHS----PEINAFATGASKNNS 117 (284)
T ss_pred HHHHHHHHHHHHHHHHHHHhcC-CeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEeC----CCceEEEecCCCCCe
Confidence 3334455566666666654443 33 433433 999999999999999999999994 469999999988777
Q ss_pred -EEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHH-----HH--
Q psy4798 481 -IVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEF-----GV-- 552 (623)
Q Consensus 481 -IVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~-----g~-- 552 (623)
|+++++|++. +++||+.||+|||+||++++|+.++.++..+. .+
T Consensus 118 ~V~vt~gLl~~----------------------------L~~~El~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~~~~~ 169 (284)
T PRK05457 118 LVAVSTGLLQN----------------------------MSRDEVEAVLAHEISHIANGDMVTMTLIQGVVNTFVIFLSR 169 (284)
T ss_pred EEEeehHHhhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHH
Confidence 5566679987 89999999999999999999999875543221 00
Q ss_pred -----hhhhh-----h-H-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhccc
Q psy4798 553 -----ANKER-----E-A-----------SVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDKA 607 (623)
Q Consensus 553 -----~~~~~-----~-~-----------~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~~ 607 (623)
..... . + .+...+..+....+||++||+||++|++.++ |+++++||+||+.+..
T Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ll~~~~SR~rEy~AD~~Aa~ltg-p~~L~~AL~KL~~~~~ 245 (284)
T PRK05457 170 IIAQIVDRFVSGNEEGNGIGYFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE 245 (284)
T ss_pred HHHHHHHhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhC-HHHHHHHHHHHHhhCc
Confidence 00000 0 0 0111122223334599999999999999995 9999999999997544
No 20
>PRK01345 heat shock protein HtpX; Provisional
Probab=99.95 E-value=1.6e-26 Score=243.75 Aligned_cols=191 Identities=22% Similarity=0.317 Sum_probs=143.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccc----cCCcchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCC
Q psy4798 10 FYGIIGFSWIVFLFEFYLSIRQRRVYH----ETTIVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKR 85 (623)
Q Consensus 10 ~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~ 85 (623)
+...+++++++++.++++..++++..+ +.|++++.++++|++.|+|+|++|++|
T Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~p~L~~~v~~La~~agi~~p~v~vid---------------------- 91 (317)
T PRK01345 34 LVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPELYRMVRDLARRAGLPMPKVYIID---------------------- 91 (317)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHcCCeECCcccCHHHHHHHHHHHHHcCCCCCcEEEEc----------------------
Confidence 345566788999999999998877643 357899999999999999999999987
Q ss_pred CCCCceeeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 86 SEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 86 ~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
++++|||++|.++ +++|+++++|++. ||+||++||+|||+|||+|+|+.++
T Consensus 92 ~~~~NAFa~G~~~~~~~V~vt~gLL~~----------------------------L~~dEL~aVlAHElgHi~~~d~~~~ 143 (317)
T PRK01345 92 NPQPNAFATGRNPENAAVAATTGLLQR----------------------------LSPEEVAGVMAHELAHVKNRDTLTM 143 (317)
T ss_pred CCCcceEEecCCCCCeEEEechHHHhh----------------------------CCHHHHHHHHHHHHHHHHcCCHHHH
Confidence 6789999999976 5689999999997 8999999999999999999999999
Q ss_pred HHHHHHHHHHHHH-HHHHHhcCCCcccCCCCCCCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-
Q psy4798 165 MIVMQLNLLFMLY-SFQYLFQYPPLYSAFGFYDSQPI-LLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL- 241 (623)
Q Consensus 165 ~~~~~~~~~~~~~-lf~~~~~~~~l~~~~G~~~~~p~-~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l- 241 (623)
++.+.+..++..+ .+..++.. .......|. .++. ++..++.|+. ..+++..+||++||+||++|+++
T Consensus 144 ~l~~~~~~~~~~l~~~~~~~~~------~~~~~~~~~~~i~~--~~~~~~~~~~--~~l~~~~~SR~rE~~AD~~A~~lt 213 (317)
T PRK01345 144 TITATLAGAISMLANFAFFFGG------NRENNNGPLGLVGT--LAAMIVAPLA--AMLVQMAISRTREYAADRRGAEIC 213 (317)
T ss_pred HHHHHHHHHHHHHHHHHHHhcC------CcccccchHHHHHH--HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 8876654333221 12222211 011111221 2221 2223444542 34677899999999999999998
Q ss_pred CCHHHHHHHHHHHhhCcCC
Q psy4798 242 GKAIFLRKALLKINKDNLG 260 (623)
Q Consensus 242 ~~~~~L~sAL~KLa~~~ls 260 (623)
++++.+.+||.|+++.+..
T Consensus 214 g~p~~L~~AL~KL~~~~~~ 232 (317)
T PRK01345 214 GNPLWLASALGKIERGAHG 232 (317)
T ss_pred CCHHHHHHHHHHHHhhhcc
Confidence 6899999999999986544
No 21
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.90 E-value=8.4e-25 Score=216.17 Aligned_cols=183 Identities=28% Similarity=0.429 Sum_probs=112.6
Q ss_pred HHHHHHHHHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHc--CCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEeh
Q psy4798 408 IILMSLFLMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASV--KFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFD 485 (623)
Q Consensus 408 ~~~~~l~~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~--gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~d 485 (623)
.++++.++..++|.+++|++++++|.++++|++.++++|+++ +.|.+++|+++++. .|||++|.+++++|++++
T Consensus 3 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~L~~~v~~l~~~~~~~~~~~~v~v~~~~~----~NA~~~g~~~~~~I~v~~ 78 (226)
T PF01435_consen 3 MLVVSLLLAFIGPPLIAPLFNKFTPLEDPELRRIVEELARRAGLGIPPPRVYVIDSPS----PNAFATGGGPRKRIVVTS 78 (226)
T ss_dssp --TTTSTTGHHCCCCHCCHCTC--B-HHHHHHHHHHHHHHHHHCTSS--EEEEE--SS----EEEEEETTTC--EEEEEH
T ss_pred eeeHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHhcCCCCCCeEEEEcCCC----CcEEEEccCCCcEEEEeC
Confidence 345566777889999999999999999999999999999999 88899999999653 999999999999999999
Q ss_pred hHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhh----hhH--
Q psy4798 486 TLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKE----REA-- 559 (623)
Q Consensus 486 tLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~----~~~-- 559 (623)
+|++. +++||+.||||||+||++++|..+++....+..+.... ...
T Consensus 79 ~ll~~----------------------------~~~~el~aVlaHElgH~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (226)
T PF01435_consen 79 GLLES----------------------------LSEDELAAVLAHELGHIKHRHILKSLLISLLLSILFFALLALLIGSM 130 (226)
T ss_dssp HHHHH----------------------------SSHHHHHHHHHHHHHHHHTTHCCCCCCHHH-HHHHHHHHHT-----H
T ss_pred hhhhc----------------------------ccHHHHHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHHHHHHHhhhh
Confidence 99977 89999999999999999999998875443322211100 000
Q ss_pred ------H-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHH----------HHHHHHhhh------ccc-cc
Q psy4798 560 ------S-------VMRYVTKESELITARQDREAAEKKYKEAMKEKDNV----------MAKLKVLST------DKA-RL 609 (623)
Q Consensus 560 ------~-------v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l----------~~aL~kL~~------~~~-~~ 609 (623)
. +...........+||++|++||++|++.+++|..+ .+.+.++.. ... ..
T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~sr~~E~~AD~~a~~~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~~~~~~ 210 (226)
T PF01435_consen 131 SLFSAFGFIDILGILIAFLFQLLTNAFSRRQEYEADRYAARLGGDPALLARALYKPAAAISALEKLAEANSMRPDSDWRY 210 (226)
T ss_dssp HHHHHHH----------HHSTT------HHHHHHHHHHHHHH------HHHTTS-TTHHHHHHHHHT-------------
T ss_pred hhhhhhhhcccchhhHHHHHHHhhcchhHHHHHHHHHHHHHhcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccCCcccc
Confidence 0 00011122333459999999999999998776653 667777765 111 12
Q ss_pred -cccccCccccccc
Q psy4798 610 -TQLYDDKVRMYNV 622 (623)
Q Consensus 610 -~~l~~~~~~e~~~ 622 (623)
.-...||..+-|+
T Consensus 211 ~~~~~tHP~~~~Ri 224 (226)
T PF01435_consen 211 SSLFSTHPSTEERI 224 (226)
T ss_dssp ----------HHHH
T ss_pred chhcCCCcCHHHHh
Confidence 2346677766554
No 22
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.89 E-value=4.4e-22 Score=205.79 Aligned_cols=187 Identities=24% Similarity=0.238 Sum_probs=137.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccc------CCcchH----HHHhhhhhcCCC-CccccccccchhhhHHHHHHhHhh
Q psy4798 11 YGIIGFSWIVFLFEFYLSIRQRRVYHE------TTIVPH----QIAHGMDAESFE-KSRRYSLDKNVFSMFKETVSNVMN 79 (623)
Q Consensus 11 ~~~~~~~~~~~~~~~~l~~~~~~~~~~------~~~l~~----~l~~l~~~~g~~-~~~ly~~d~~~~~~~~~~v~~v~~ 79 (623)
...+....+.+++++.+..+..+.... .+..+. .+++++.+.+.+ +|++|+.+
T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~---------------- 124 (302)
T COG0501 61 LLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILE---------------- 124 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEec----------------
Confidence 345667778888888888877775321 123444 889999999999 89999976
Q ss_pred cccCCCCCCCceeeecCC-CCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798 80 TVSSKRSEHSNAYFYGFF-KNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 80 ~~~s~~~~~~NA~~~G~~-~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~ 158 (623)
++.||||++|.+ ++++|++|++|++. +|+||++||+|||+||++|
T Consensus 125 ------~~~~NAFa~g~~~~~~~V~vt~gLl~~----------------------------l~~dEl~aVlaHElgHi~~ 170 (302)
T COG0501 125 ------TPQPNAFALGGGPKNGRVVVTTGLLDL----------------------------LNDDELEAVLAHELGHIKN 170 (302)
T ss_pred ------CCCccceecCCCCCCeeEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhc
Confidence 789999999997 57999999999996 8999999999999999999
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhh
Q psy4798 159 NHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFG 238 (623)
Q Consensus 159 ~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA 238 (623)
+|+.+++.+..+.......++.........+ + +...+...+.....++++.+.+.+||++||+||++|
T Consensus 171 rd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a 238 (302)
T COG0501 171 RHTLVRLTLRGLLASAFVLLATLALAAGLLG------E------AALALLLLLLLLALFLATLLVLAFSRKREYEADRFA 238 (302)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------c------hHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhch
Confidence 9999966554443333333322222111110 0 011111222233334567788999999999999999
Q ss_pred hhcCCHHHHHHHHHHHhhCcC
Q psy4798 239 KSLGKAIFLRKALLKINKDNL 259 (623)
Q Consensus 239 ~~l~~~~~L~sAL~KLa~~~l 259 (623)
+++.+++.+.++|.|+...+.
T Consensus 239 ~~l~~~~~l~~aL~kl~~~~~ 259 (302)
T COG0501 239 AKLTGPEKLASALQKLARLSG 259 (302)
T ss_pred hhccChHHHHHHHHHHHhhhc
Confidence 999779999999999998654
No 23
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.86 E-value=3.2e-20 Score=191.99 Aligned_cols=170 Identities=21% Similarity=0.258 Sum_probs=120.7
Q ss_pred HHHHHHHHHHHHHhhchHhhhhhhcc--CCcCC------chhHHHHHHHHHHHcCCC-CCeEEEEeCCCCCCcccEEEec
Q psy4798 404 LWVFIILMSLFLMTIYPEFIAPLFDK--YTPLP------DGELKSRIEQLSASVKFP-LKKLYVVEGSKRSEHSNAYFYG 474 (623)
Q Consensus 404 ~w~~~~~~~l~~~~i~P~~I~plf~k--~~pl~------~~~L~~~I~~la~~~gfp-~~~v~vv~~s~rs~~~NAy~~G 474 (623)
.+.+.....++..+..+.++.-.... ..+.. ...+..++++++++.+.| .+++++++ +..+|||++|
T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~----~~~~NAFa~g 134 (302)
T COG0501 59 ALLLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILE----TPQPNAFALG 134 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEec----CCCccceecC
Confidence 34444555555555666555544443 22211 233455899999999999 79999999 4569999999
Q ss_pred cC-CCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHH---
Q psy4798 475 FF-KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEF--- 550 (623)
Q Consensus 475 ~~-~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~--- 550 (623)
.+ ++++|+++++|++. +|+||+.||+|||+||++++|+.++..+....
T Consensus 135 ~~~~~~~V~vt~gLl~~----------------------------l~~dEl~aVlaHElgHi~~rd~~~~~~~~~~~~~~ 186 (302)
T COG0501 135 GGPKNGRVVVTTGLLDL----------------------------LNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASA 186 (302)
T ss_pred CCCCCeeEEecHHHHhh----------------------------CCHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHH
Confidence 87 67999999999997 89999999999999999999999855433221
Q ss_pred -HH-hhhhh---------------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHhhhcc
Q psy4798 551 -GV-ANKER---------------EASVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAKLKVLSTDK 606 (623)
Q Consensus 551 -g~-~~~~~---------------~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~aL~kL~~~~ 606 (623)
.. ..... ...+...+.......+||.+|++||+.|++.++ |+.+.++|.|+....
T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~-~~~l~~aL~kl~~~~ 258 (302)
T COG0501 187 FVLLATLALAAGLLGEAALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQKLARLS 258 (302)
T ss_pred HHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhchhhccC-hHHHHHHHHHHHhhh
Confidence 10 00000 001122222233444699999999999999997 999999999999843
No 24
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.81 E-value=1.8e-21 Score=192.43 Aligned_cols=156 Identities=33% Similarity=0.433 Sum_probs=95.5
Q ss_pred CCcchHHHHhhhhhc--CCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCC
Q psy4798 38 TTIVPHQIAHGMDAE--SFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPL 115 (623)
Q Consensus 38 ~~~l~~~l~~l~~~~--g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~ 115 (623)
.+.+++.++++|++. +.|++++|+.+ ++.+|||++|.+++++|+++++|++.
T Consensus 30 ~~~L~~~v~~l~~~~~~~~~~~~v~v~~----------------------~~~~NA~~~g~~~~~~I~v~~~ll~~---- 83 (226)
T PF01435_consen 30 DPELRRIVEELARRAGLGIPPPRVYVID----------------------SPSPNAFATGGGPRKRIVVTSGLLES---- 83 (226)
T ss_dssp HHHHHHHHHHHHHHHHCTSS--EEEEE------------------------SSEEEEEETTTC--EEEEEHHHHHH----
T ss_pred HHHHHHHHHHHHHHhcCCCCCCeEEEEc----------------------CCCCcEEEEccCCCcEEEEeChhhhc----
Confidence 467999999999999 99999999976 55699999999999999999999976
Q ss_pred ccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCC
Q psy4798 116 NADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFY 195 (623)
Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~ 195 (623)
+++||+.||||||+||++++|..+.+....+..++..+....+.........+++.
T Consensus 84 ------------------------~~~~el~aVlaHElgH~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (226)
T PF01435_consen 84 ------------------------LSEDELAAVLAHELGHIKHRHILKSLLISLLLSILFFALLALLIGSMSLFSAFGFI 139 (226)
T ss_dssp ------------------------SSHHHHHHHHHHHHHHHHTTHCCCCCCHHH-HHHHHHHHHT-----HHHHHHHH--
T ss_pred ------------------------ccHHHHHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc
Confidence 89999999999999999999999986666655554444322211111100000110
Q ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCHHH-HHHHHHHHhh
Q psy4798 196 DSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKAIF-LRKALLKINK 256 (623)
Q Consensus 196 ~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~~~-L~sAL~KLa~ 256 (623)
...+ ..+..+...+.+.+||++||+||++|+++++... +.++++++..
T Consensus 140 ~~~~-------------~~~~~~~~~~~~~~sr~~E~~AD~~a~~~~~~~~~l~~a~~~~~~ 188 (226)
T PF01435_consen 140 DILG-------------ILIAFLFQLLTNAFSRRQEYEADRYAARLGGDPALLARALYKPAA 188 (226)
T ss_dssp ---------------------HHSTT------HHHHHHHHHHHHHH------HHHTTS-TTH
T ss_pred ccch-------------hhHHHHHHHhhcchhHHHHHHHHHHHHHhcCCcHHHHHhCCCHHH
Confidence 0000 0000334557799999999999999999965544 5555555443
No 25
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.34 E-value=1.6e-11 Score=133.72 Aligned_cols=158 Identities=18% Similarity=0.206 Sum_probs=107.9
Q ss_pred CcCCchhHHHHHHHHHHH----cCCCC--CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798 431 TPLPDGELKSRIEQLSAS----VKFPL--KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD 504 (623)
Q Consensus 431 ~pl~~~~L~~~I~~la~~----~gfp~--~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~ 504 (623)
...+|++|.+-|+++..+ .|-+. -.+++|+. ...|||++ .+.+|++..+|+...
T Consensus 65 ~~i~D~el~~yv~~~g~rL~~~a~~~~~~f~f~lV~d----~~iNAFA~---~Gg~v~vntGLll~a------------- 124 (484)
T COG4783 65 PLIRDPELEEYVNSLGQRLAAAADLVKTPFTFFLVND----DSINAFAT---PGGYVVVNTGLLLTA------------- 124 (484)
T ss_pred CCcCCHHHHHHHHHHHHHHHHhcCCCCCCeEEEEecC----Cccchhhc---CCceEEEehHHHHhc-------------
Confidence 556899988888776654 45553 35677774 35899998 236899999999873
Q ss_pred CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHH----------HHHHHHhhhhhh--H----HHHHHHHHH
Q psy4798 505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIL----------KKEFGVANKERE--A----SVMRYVTKE 568 (623)
Q Consensus 505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~----------~~~~g~~~~~~~--~----~v~~~~~~~ 568 (623)
-+++|+.+|||||+||+..+|..+.+=- +.+.|+....-. + .........
T Consensus 125 --------------e~esElagViAHEigHv~qrH~aR~~e~~~r~~~~~i~~ml~gi~aa~a~~~ag~a~iag~~a~~~ 190 (484)
T COG4783 125 --------------ENESELAGVIAHEIGHVAQRHLARSMEQQQRAAPMAIAGMLLGILAALAGADAGMAGIAGALAGAA 190 (484)
T ss_pred --------------CCHHHHHHHHHHHHHHHhhhhHHHHHHHHhhhchhHHHHHHHHHHHHHhCccccHHHHHHHHHHhh
Confidence 4899999999999999999999987632 222222221110 0 111111111
Q ss_pred H-HHHHHHHHHHHHHHHHHHHc----CCHHHHHHHHHHhh---hccccc-cccccCccccccc
Q psy4798 569 S-ELITARQDREAAEKKYKEAM----KEKDNVMAKLKVLS---TDKARL-TQLYDDKVRMYNV 622 (623)
Q Consensus 569 ~-~~~~sR~~E~~AD~~a~~~~----~~~~~l~~aL~kL~---~~~~~~-~~l~~~~~~e~~~ 622 (623)
. .+-|||+.|.+||+++...+ .||.+|.+.++|+. .++++. .-+..||.++=|+
T Consensus 191 ~g~L~~sR~~E~eADr~Gi~~L~raGydp~gM~~ff~rl~~~~~~~~~~p~yl~THPlp~~RI 253 (484)
T COG4783 191 QGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQLRYGGQPPEYLLTHPLPEERI 253 (484)
T ss_pred hhhhhcchhhHHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHHhcCCCCChHHhcCCCchhHH
Confidence 1 22269999999999988765 38999999999999 455554 3557787766443
No 26
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=99.29 E-value=1.3e-10 Score=121.48 Aligned_cols=142 Identities=23% Similarity=0.347 Sum_probs=108.8
Q ss_pred hhccCCcCCchhHHHHHHHHHHHcCCCC-CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798 426 LFDKYTPLPDGELKSRIEQLSASVKFPL-KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD 504 (623)
Q Consensus 426 lf~k~~pl~~~~L~~~I~~la~~~gfp~-~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~ 504 (623)
+..+.++.+++++.+.+++.+++.|++. .++++.++ -...+.+|+.+ .+|++-+.+.+.
T Consensus 131 l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~i~~s~~-----i~sP~~~G~~~-p~I~lP~~~~~~-------------- 190 (299)
T PF05569_consen 131 LLRKARPVEDEELQALLEECKEELGIKRPIRIRVSSG-----ISSPFVFGFLR-PVIVLPESLLED-------------- 190 (299)
T ss_pred HHHhccccCcHHHHHHHHHHHHHhCCCCceEEEEcCC-----CCCCeeecCcc-eEEEecCccccc--------------
Confidence 5677899999999999999999999763 44544432 35778999875 589999999977
Q ss_pred CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4798 505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKK 584 (623)
Q Consensus 505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~ 584 (623)
++++|+.+|++||+.|+|++|.+.+++...+..+.. ..++..+ ....+ .+.+|+.||+.
T Consensus 191 --------------~~~~el~~il~HEl~Hikr~D~~~~~l~~l~~~l~W---fnP~~~~---~~~~~-~~~~E~~cD~~ 249 (299)
T PF05569_consen 191 --------------LSEEELRAILLHELAHIKRRDLLWKLLAELLCALHW---FNPLVWL---LRRRI-RRDRELACDEA 249 (299)
T ss_pred --------------cCHHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHHH---hhHHHHH---HHHHH-HHHHHHhhhHH
Confidence 899999999999999999999999887654332211 1122222 22333 88899999999
Q ss_pred HHHHc--CCHHHHHHHHHHhhhcccc
Q psy4798 585 YKEAM--KEKDNVMAKLKVLSTDKAR 608 (623)
Q Consensus 585 a~~~~--~~~~~l~~aL~kL~~~~~~ 608 (623)
+.+.+ +++.....+|.+++.....
T Consensus 250 vl~~l~~~~~~~Y~~~Ll~~~~~~~~ 275 (299)
T PF05569_consen 250 VLRNLGKEERKAYAETLLKVAKRSQQ 275 (299)
T ss_pred HHHhcCchhHHHHHHHHHHHHHhhcC
Confidence 99994 5778899999999875443
No 27
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=99.03 E-value=5.7e-09 Score=109.17 Aligned_cols=131 Identities=27% Similarity=0.322 Sum_probs=100.9
Q ss_pred CcchHHHHhhhhhcCCCC-ccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCCCeEEEEecchhhhccCCcc
Q psy4798 39 TIVPHQIAHGMDAESFEK-SRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNA 117 (623)
Q Consensus 39 ~~l~~~l~~l~~~~g~~~-~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~ 117 (623)
++..+.+++.+.+.|+++ .++++- +.-...+++|+. ..+|++.+++.+.
T Consensus 141 ~~~~~~l~~~~~~~~~~~~~~i~~s-----------------------~~i~sP~~~G~~-~p~I~lP~~~~~~------ 190 (299)
T PF05569_consen 141 EELQALLEECKEELGIKRPIRIRVS-----------------------SGISSPFVFGFL-RPVIVLPESLLED------ 190 (299)
T ss_pred HHHHHHHHHHHHHhCCCCceEEEEc-----------------------CCCCCCeeecCc-ceEEEecCccccc------
Confidence 357788899999999764 555553 234567889986 4799999999876
Q ss_pred ccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCC
Q psy4798 118 DKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDS 197 (623)
Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~ 197 (623)
++++|+.+|++||+.|+|++|.+..++...+..+.
T Consensus 191 ----------------------~~~~el~~il~HEl~Hikr~D~~~~~l~~l~~~l~----------------------- 225 (299)
T PF05569_consen 191 ----------------------LSEEELRAILLHELAHIKRRDLLWKLLAELLCALH----------------------- 225 (299)
T ss_pred ----------------------cCHHHHHHHHHHHHHHHHCCChHHHHHHHHHHHHH-----------------------
Confidence 79999999999999999999999987755443221
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc---CCHHHHHHHHHHHhhCcCC
Q psy4798 198 QPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL---GKAIFLRKALLKINKDNLG 260 (623)
Q Consensus 198 ~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l---~~~~~L~sAL~KLa~~~ls 260 (623)
-+.| +..++...+.+.+|+.||+.+.+. +......++|.+++.....
T Consensus 226 -------------WfnP---~~~~~~~~~~~~~E~~cD~~vl~~l~~~~~~~Y~~~Ll~~~~~~~~ 275 (299)
T PF05569_consen 226 -------------WFNP---LVWLLRRRIRRDRELACDEAVLRNLGKEERKAYAETLLKVAKRSQQ 275 (299)
T ss_pred -------------HhhH---HHHHHHHHHHHHHHHhhhHHHHHhcCchhHHHHHHHHHHHHHhhcC
Confidence 1122 345667899999999999999888 3466788888888877766
No 28
>KOG2661|consensus
Probab=98.95 E-value=2.2e-09 Score=112.28 Aligned_cols=135 Identities=13% Similarity=0.093 Sum_probs=91.2
Q ss_pred eEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhh
Q psy4798 455 KLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGH 534 (623)
Q Consensus 455 ~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH 534 (623)
.|.||+.. .+|||+. ++++|++|.++|... -++||+.+|||||+||
T Consensus 240 eihVVndP----ipNAFvL---PgGKvfVFtgiLn~c---------------------------k~ddglAtvLgHE~aH 285 (424)
T KOG2661|consen 240 EIHVVNDP----IPNAFVL---PGGKVFVFTGILNSC---------------------------KDDDGLATVLGHEIAH 285 (424)
T ss_pred EEEEecCC----CCceeec---cCCeEEEEechhhcc---------------------------cChHHHHHHHHHHHHH
Confidence 56788854 4999988 567899999999872 3789999999999999
Q ss_pred hhccCHHHHHHHHHHHHHhhhhhhHHHH----H-H-HHHHHHHHHHHHHHHHHHHHHHHHc----CCHHHHHHHHHHhhh
Q psy4798 535 WKYNHVLKSMILKKEFGVANKEREASVM----R-Y-VTKESELITARQDREAAEKKYKEAM----KEKDNVMAKLKVLST 604 (623)
Q Consensus 535 ~k~~H~~k~~~~~~~~g~~~~~~~~~v~----~-~-~~~~~~~~~sR~~E~~AD~~a~~~~----~~~~~l~~aL~kL~~ 604 (623)
-..+|+.-.+.....+.+........++ + . ....+++-+||++|.+||.++.-+| -||.+...-.++++.
T Consensus 286 aVarH~AEki~k~~~~siLgLvlyt~~~a~~~n~~Ll~~flrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~ 365 (424)
T KOG2661|consen 286 AVARHAAEKIGKVHLLSILGLVLYTMIWAICPNDKLLEYFLRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEF 365 (424)
T ss_pred HHHHHHHHhhhhhHHHHHHHHHHHHHHhhccchHHHHHHHhcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHH
Confidence 9999998766544333222211111111 1 1 1122334479999999999876554 478777666666654
Q ss_pred cc------ccccccccCcccccccC
Q psy4798 605 DK------ARLTQLYDDKVRMYNVR 623 (623)
Q Consensus 605 ~~------~~~~~l~~~~~~e~~~~ 623 (623)
-+ +++.=|+-||.+..|++
T Consensus 366 ~egqmg~~~~~eflSTHPSskkRie 390 (424)
T KOG2661|consen 366 VEGQMGQPKMPEFLSTHPSSKKRIE 390 (424)
T ss_pred hhhhcCCCCCchhhhcCCCccchhH
Confidence 22 23445678888877764
No 29
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=98.82 E-value=2.5e-08 Score=109.25 Aligned_cols=138 Identities=21% Similarity=0.194 Sum_probs=88.4
Q ss_pred CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798 86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM 165 (623)
Q Consensus 86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~ 165 (623)
++.-|||++ ...+|+|+++|+... -|+.|+.+|||||+||+..+|..+.+
T Consensus 101 d~~iNAFA~---~Gg~v~vntGLll~a---------------------------e~esElagViAHEigHv~qrH~aR~~ 150 (484)
T COG4783 101 DDSINAFAT---PGGYVVVNTGLLLTA---------------------------ENESELAGVIAHEIGHVAQRHLARSM 150 (484)
T ss_pred CCccchhhc---CCceEEEehHHHHhc---------------------------CCHHHHHHHHHHHHHHHhhhhHHHHH
Confidence 678999999 137999999999963 58999999999999999999999887
Q ss_pred HHHHHH---HHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc-
Q psy4798 166 IVMQLN---LLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL- 241 (623)
Q Consensus 166 ~~~~~~---~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l- 241 (623)
=-.+=. .++.+. .+.+... .|-..+...+.+ .+... ..-+..+||..|.+||+.+...
T Consensus 151 e~~~r~~~~~i~~ml-~gi~aa~------a~~~ag~a~iag----~~a~~-------~~g~L~~sR~~E~eADr~Gi~~L 212 (484)
T COG4783 151 EQQQRAAPMAIAGML-LGILAAL------AGADAGMAGIAG----ALAGA-------AQGQLNFSRQNEQEADRIGITTL 212 (484)
T ss_pred HHHhhhchhHHHHHH-HHHHHHH------hCccccHHHHHH----HHHHh-------hhhhhhcchhhHHHHHHHHHHHH
Confidence 542111 111111 1111000 011111111111 00000 0114679999999999999766
Q ss_pred ---C-CHHHHHHHHHHHh---hCcCCCchhhccchhh
Q psy4798 242 ---G-KAIFLRKALLKIN---KDNLGFPALLVCNGLP 271 (623)
Q Consensus 242 ---~-~~~~L~sAL~KLa---~~~ls~P~iL~~g~~~ 271 (623)
| ++..|.+.+.|+. .+.-.+|.++..--+|
T Consensus 213 ~raGydp~gM~~ff~rl~~~~~~~~~~p~yl~THPlp 249 (484)
T COG4783 213 VRAGYDPQGMPEFFERLADQLRYGGQPPEYLLTHPLP 249 (484)
T ss_pred HHcCCCchhHHHHHHHHHHHHhcCCCCChHHhcCCCc
Confidence 4 4899999999999 5666677776665554
No 30
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=98.69 E-value=1.7e-07 Score=98.76 Aligned_cols=138 Identities=19% Similarity=0.175 Sum_probs=94.7
Q ss_pred CchhHHHHHHHHHHHcC----CCC--CeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCc
Q psy4798 434 PDGELKSRIEQLSASVK----FPL--KKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEP 507 (623)
Q Consensus 434 ~~~~L~~~I~~la~~~g----fp~--~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~ 507 (623)
.|+.|...+.++..++- =|- -++-|+| |...|||+. +..-+.++.+|+...
T Consensus 62 ~D~Kler~Vari~g~lt~~S~~p~q~YriTiln----SP~INAFAL---PGGYlYitRGLlAla---------------- 118 (479)
T COG4784 62 RDPKLERMVARIVGALTAVSENPQQTYRITILN----SPNINAFAL---PGGYLYITRGLLALA---------------- 118 (479)
T ss_pred CCHHHHHHHHHHHhHhhhhccCCCceEEEEEec----CCCcccccc---CCceEEEehhHHHHc----------------
Confidence 35555555555444432 121 3566677 456899987 466899999999862
Q ss_pred cccccccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhH--------HHHHHHHHHHHHHHHHHHHH
Q psy4798 508 LISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREA--------SVMRYVTKESELITARQDRE 579 (623)
Q Consensus 508 ~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~--------~v~~~~~~~~~~~~sR~~E~ 579 (623)
-+..|+.|||+||+||.--+|.+.+--.-....++...... -......++.-..|||++|.
T Consensus 119 -----------nd~sEvAAVl~HEmgHVtAnHgi~rQ~~e~a~~ia~rvva~vl~~~~agk~A~~rGklrla~fsRnqEL 187 (479)
T COG4784 119 -----------NDSSEVAAVLAHEMGHVTANHGIQRQQREAAEVIASRVVAEVLGSDAAGKQALIRGKLRLAQFSRNQEL 187 (479)
T ss_pred -----------CCHHHHHHHHHhhhhheecchhHHHHHHHHHHHHHHHHHHHHhCCcchhhHHHhhhhHHHhhhccchhh
Confidence 37799999999999999999998765443334444333211 11222233333446999999
Q ss_pred HHHHHHHHHcC----CHHHHHHHHHHhhhc
Q psy4798 580 AAEKKYKEAMK----EKDNVMAKLKVLSTD 605 (623)
Q Consensus 580 ~AD~~a~~~~~----~~~~l~~aL~kL~~~ 605 (623)
+||.++.+..+ ||.+..+.|..++..
T Consensus 188 qAD~iG~~~lgeAGYDP~A~~rfl~sm~ay 217 (479)
T COG4784 188 QADAIGIKMLGEAGYDPYAAARFLQSMAAY 217 (479)
T ss_pred hhhhhhHHHHHhcCCChHHHHHHHHHHHhh
Confidence 99999998876 799999999999864
No 31
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=98.47 E-value=7.8e-06 Score=86.19 Aligned_cols=128 Identities=17% Similarity=0.177 Sum_probs=85.4
Q ss_pred chhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCcccccccc
Q psy4798 435 DGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGA 514 (623)
Q Consensus 435 ~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (623)
+.|.+..++++.++.. .+++ ++.+|+ ...+=-++|..+. +||+-+.+.++
T Consensus 135 e~d~~~~~~~~~~~~~--~k~i-~ir~s~--~i~~P~v~gl~kp-~IvlP~d~~~r------------------------ 184 (337)
T COG4219 135 EVDKRKIVTILKNHQY--KKHI-LIRKSK--AIDGPMVFGLVKP-CIVLPADFVER------------------------ 184 (337)
T ss_pred cccHHHHHHHHHHhhh--ccCe-eEeecc--cCCCceeeccCcc-eEEccHHHHhh------------------------
Confidence 4566777777766653 3444 345554 3456677788754 79999999988
Q ss_pred CCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC--CH
Q psy4798 515 NKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMK--EK 592 (623)
Q Consensus 515 ~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~--~~ 592 (623)
++++|+.-|++||++|.|++|.+.+.+...+-+++. .-..+......+ .-++|.+||+.+..... +.
T Consensus 185 ----~~~ee~~yIilHEl~Hlk~gD~i~n~i~~~~~~l~W------fNP~v~l~~~~~-~~D~E~aCDa~vL~~~~~~er 253 (337)
T COG4219 185 ----LTDEELKYIILHELSHLKRGDAIINLIVVVLGVLFW------FNPLVHLGKRKI-RIDQEIACDAAVLARINPEER 253 (337)
T ss_pred ----cCHHhhhhhHhHHHhhhhcccHHHHHHHHHHhHHhh------cChHHHHHHHHH-HhhHHHHhhHHHHhccChHHH
Confidence 899999999999999999999998876533222221 111122222223 66799999999998772 23
Q ss_pred HHHHHHHHHhh
Q psy4798 593 DNVMAKLKVLS 603 (623)
Q Consensus 593 ~~l~~aL~kL~ 603 (623)
.....++.|+-
T Consensus 254 r~YaEsil~~l 264 (337)
T COG4219 254 RTYAESILKLL 264 (337)
T ss_pred HHHHHHHHHHH
Confidence 44556665553
No 32
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=98.11 E-value=1.5e-05 Score=84.47 Aligned_cols=117 Identities=26% Similarity=0.272 Sum_probs=74.7
Q ss_pred CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798 86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSM 165 (623)
Q Consensus 86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~ 165 (623)
|+..|||+. +..++.++.+||... -+..|+.|||+||+||+.-+|-...-
T Consensus 95 SP~INAFAL---PGGYlYitRGLlAla---------------------------nd~sEvAAVl~HEmgHVtAnHgi~rQ 144 (479)
T COG4784 95 SPNINAFAL---PGGYLYITRGLLALA---------------------------NDSSEVAAVLAHEMGHVTANHGIQRQ 144 (479)
T ss_pred CCCcccccc---CCceEEEehhHHHHc---------------------------CCHHHHHHHHHhhhhheecchhHHHH
Confidence 578999997 478999999999952 47789999999999999999987664
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhc----
Q psy4798 166 IVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSL---- 241 (623)
Q Consensus 166 ~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l---- 241 (623)
---....+..-..- ..+... ..|. ...+. ...-...+||..|.+||..+.+.
T Consensus 145 ~~e~a~~ia~rvva-~vl~~~----~agk---~A~~r----------------Gklrla~fsRnqELqAD~iG~~~lgeA 200 (479)
T COG4784 145 QREAAEVIASRVVA-EVLGSD----AAGK---QALIR----------------GKLRLAQFSRNQELQADAIGIKMLGEA 200 (479)
T ss_pred HHHHHHHHHHHHHH-HHhCCc----chhh---HHHhh----------------hhHHHhhhccchhhhhhhhhHHHHHhc
Confidence 32222222111110 111100 0111 11111 12234789999999999999877
Q ss_pred C-CHHHHHHHHHHHhh
Q psy4798 242 G-KAIFLRKALLKINK 256 (623)
Q Consensus 242 ~-~~~~L~sAL~KLa~ 256 (623)
| +|.++.+-|..++.
T Consensus 201 GYDP~A~~rfl~sm~a 216 (479)
T COG4784 201 GYDPYAAARFLQSMAA 216 (479)
T ss_pred CCChHHHHHHHHHHHh
Confidence 3 36666666666544
No 33
>KOG2661|consensus
Probab=98.03 E-value=1.1e-05 Score=84.98 Aligned_cols=100 Identities=23% Similarity=0.367 Sum_probs=70.5
Q ss_pred CCCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 85 RSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 85 ~~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
+++.+|||+. ++..|++++++|... -+++|+.+|||||+||-.-+|..-.
T Consensus 245 ndPipNAFvL---PgGKvfVFtgiLn~c---------------------------k~ddglAtvLgHE~aHaVarH~AEk 294 (424)
T KOG2661|consen 245 NDPIPNAFVL---PGGKVFVFTGILNSC---------------------------KDDDGLATVLGHEIAHAVARHAAEK 294 (424)
T ss_pred cCCCCceeec---cCCeEEEEechhhcc---------------------------cChHHHHHHHHHHHHHHHHHHHHHh
Confidence 3567999997 477899999999963 4789999999999999999999988
Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccccchhhhhc
Q psy4798 165 MIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYN-QLVQFLMTCMTRRFEFQADAFGKSL 241 (623)
Q Consensus 165 ~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~-~l~~~l~~~~sR~~Ey~AD~fA~~l 241 (623)
+....+..++.+.++.... +. .|=. .+-.++.+-+||+.|-+||-.+.-+
T Consensus 295 i~k~~~~siLgLvlyt~~~--a~-------------------------~~n~~Ll~~flrlPfSRKMEtEADyIGLlL 345 (424)
T KOG2661|consen 295 IGKVHLLSILGLVLYTMIW--AI-------------------------CPNDKLLEYFLRLPFSRKMETEADYIGLLL 345 (424)
T ss_pred hhhhHHHHHHHHHHHHHHh--hc-------------------------cchHHHHHHHhcCcchhhhhhhhhHHHHHH
Confidence 7765554444333322110 00 0000 1123455778999999999998765
No 34
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=97.71 E-value=0.00054 Score=72.60 Aligned_cols=106 Identities=25% Similarity=0.289 Sum_probs=70.6
Q ss_pred CCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHHHHHH
Q psy4798 88 HSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMIV 167 (623)
Q Consensus 88 ~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k~~~~ 167 (623)
-.|=-++|..+ .+||+-+++.+. +++||++-|++||++|.|++|.....+.
T Consensus 162 i~~P~v~gl~k-p~IvlP~d~~~r----------------------------~~~ee~~yIilHEl~Hlk~gD~i~n~i~ 212 (337)
T COG4219 162 IDGPMVFGLVK-PCIVLPADFVER----------------------------LTDEELKYIILHELSHLKRGDAIINLIV 212 (337)
T ss_pred CCCceeeccCc-ceEEccHHHHhh----------------------------cCHHhhhhhHhHHHhhhhcccHHHHHHH
Confidence 34455666653 578999998887 8999999999999999999999987665
Q ss_pred HHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccchhhhhcCCH---
Q psy4798 168 MQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLVQFLMTCMTRRFEFQADAFGKSLGKA--- 244 (623)
Q Consensus 168 ~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~~~l~~~~sR~~Ey~AD~fA~~l~~~--- 244 (623)
..++.++. |.| ++.+....+--.+|-.||+.+...-.+
T Consensus 213 ~~~~~l~W------------------------------------fNP---~v~l~~~~~~~D~E~aCDa~vL~~~~~~er 253 (337)
T COG4219 213 VVLGVLFW------------------------------------FNP---LVHLGKRKIRIDQEIACDAAVLARINPEER 253 (337)
T ss_pred HHHhHHhh------------------------------------cCh---HHHHHHHHHHhhHHHHhhHHHHhccChHHH
Confidence 44333211 011 123445677778999999999877422
Q ss_pred HHHHHHHHHHhhCcCCC
Q psy4798 245 IFLRKALLKINKDNLGF 261 (623)
Q Consensus 245 ~~L~sAL~KLa~~~ls~ 261 (623)
..-.+++.++-.....+
T Consensus 254 r~YaEsil~~l~a~s~~ 270 (337)
T COG4219 254 RTYAESILKLLLAFSNP 270 (337)
T ss_pred HHHHHHHHHHHhcccCC
Confidence 22344444444433333
No 35
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.13 E-value=0.0046 Score=54.44 Aligned_cols=65 Identities=17% Similarity=0.160 Sum_probs=43.4
Q ss_pred CCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMKEKDNVMAK 598 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~~~~~l~~a 598 (623)
+++.+-..++||||||+.++|..... ..........+.|.+||.+|...+=..+.+...
T Consensus 37 ~~~~~~~f~laHELgH~~~~~~~~~~---------------------~~~~~~~~~~~~E~~An~fA~~lL~p~~~~~~~ 95 (122)
T PF06114_consen 37 LSPERQRFTLAHELGHILLHHGDETF---------------------NYYLNYFFNERQEREANAFAAALLMPEEDFRDA 95 (122)
T ss_dssp S-HHHHHHHHHHHHHHHHHHH-HHHH---------------------HHHHHH--THHHHHHHHHHHHHHHS-HHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHhhhccccc---------------------hhhccccchhhHHHHHHHHHHHHhCCHHHHHHH
Confidence 47788999999999999888875543 000111126778999999999988666666666
Q ss_pred HHHhhh
Q psy4798 599 LKVLST 604 (623)
Q Consensus 599 L~kL~~ 604 (623)
+.+...
T Consensus 96 ~~~~~~ 101 (122)
T PF06114_consen 96 LEKYRN 101 (122)
T ss_dssp HHHH-S
T ss_pred HHHcCC
Confidence 666554
No 36
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=96.63 E-value=0.011 Score=62.60 Aligned_cols=128 Identities=17% Similarity=0.144 Sum_probs=66.1
Q ss_pred HHHHHHHHHcCCC--CCeEEEEeCCCCCCccc------EEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798 440 SRIEQLSASVKFP--LKKLYVVEGSKRSEHSN------AYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST 511 (623)
Q Consensus 440 ~~I~~la~~~gfp--~~~v~vv~~s~rs~~~N------Ay~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~ 511 (623)
+.-.+..++.|.+ .+++.+.+++.+|..-+ ||- |+.++.|.+--+.++...
T Consensus 101 ~~W~~~~~~~g~~y~~P~lv~~~~~~~t~CG~a~s~~gpFY--Cp~D~tIYlD~~f~~~L~------------------- 159 (292)
T PF04228_consen 101 DVWTPQFPQAGLPYRPPKLVLFSGSVQTGCGTASSATGPFY--CPADQTIYLDLSFFDELQ------------------- 159 (292)
T ss_pred HHHHHHHHHcCCCCCCCEEEEeCCCCCCCCCCCcCCCCCEe--CCCCCEEEechHHHHHHH-------------------
Confidence 3334444455655 57888888776654433 332 234566777655554310
Q ss_pred cccCCCCCCHHHHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHcC
Q psy4798 512 EGANKKGCDTEEVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYV-TKESELITARQDREAAEKKYKEAMK 590 (623)
Q Consensus 512 ~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~-~~~~~~~~sR~~E~~AD~~a~~~~~ 590 (623)
.+-|. -.+--...|||||.||...+. .|+....... ..-. ....+.. ||+.|.+||.+|=--++
T Consensus 160 -~~~ga-~G~~a~ayVlAHEyGHHVQ~l----------~Gil~~~~~~--~~~~~~~~~~~~-svr~ELQADC~AGvw~~ 224 (292)
T PF04228_consen 160 -QRFGA-SGDFAQAYVLAHEYGHHVQNL----------LGILDAVRQA--QQGRSPAEANEL-SVRLELQADCFAGVWAG 224 (292)
T ss_pred -HHhCC-ccHHHHHHHHHHHHHHHHHHH----------hhhHHHHHHH--hhccCccccchH-HHHHHHHHHHHHHHHhh
Confidence 00000 124456779999999984332 2332211100 0000 0112233 99999999998765442
Q ss_pred --------CHHHHHHHHHHhh
Q psy4798 591 --------EKDNVMAKLKVLS 603 (623)
Q Consensus 591 --------~~~~l~~aL~kL~ 603 (623)
++..+-++|...+
T Consensus 225 ~~~~~~~l~~~di~~al~aa~ 245 (292)
T PF04228_consen 225 HAAEDGSLTPGDIEEALNAAS 245 (292)
T ss_pred hccccCCcCHHHHHHHHHHHH
Confidence 3445555555443
No 37
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=95.42 E-value=0.034 Score=48.88 Aligned_cols=63 Identities=32% Similarity=0.308 Sum_probs=41.4
Q ss_pred CChHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHhcCCCcccCCCCCCCchhHHHHHHHHHHHHHHHHHHH
Q psy4798 140 CDTEEVLAVLAHELGHWKYNHVLKSMIVMQLNLLFMLYSFQYLFQYPPLYSAFGFYDSQPILLGLIIVLQYVFAPYNQLV 219 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h~~k~~~~~~~~~~~~~~lf~~~~~~~~l~~~~G~~~~~p~~i~l~~~~~~i~~p~~~l~ 219 (623)
+++++-..++||||||+..+|-.....
T Consensus 37 ~~~~~~~f~laHELgH~~~~~~~~~~~----------------------------------------------------- 63 (122)
T PF06114_consen 37 LSPERQRFTLAHELGHILLHHGDETFN----------------------------------------------------- 63 (122)
T ss_dssp S-HHHHHHHHHHHHHHHHHHH-HHHHH-----------------------------------------------------
T ss_pred CCHHHHHHHHHHHHHHHHhhhccccch-----------------------------------------------------
Confidence 577889999999999999888775430
Q ss_pred HHHHHHHhhhhccccchhhhhc-CCHHHHHHHHHHHh
Q psy4798 220 QFLMTCMTRRFEFQADAFGKSL-GKAIFLRKALLKIN 255 (623)
Q Consensus 220 ~~l~~~~sR~~Ey~AD~fA~~l-~~~~~L~sAL~KLa 255 (623)
.-.......+.|++||.||..+ -+.+.+...+.+..
T Consensus 64 ~~~~~~~~~~~E~~An~fA~~lL~p~~~~~~~~~~~~ 100 (122)
T PF06114_consen 64 YYLNYFFNERQEREANAFAAALLMPEEDFRDALEKYR 100 (122)
T ss_dssp HHHHH--THHHHHHHHHHHHHHHS-HHHHHHHHHHH-
T ss_pred hhccccchhhHHHHHHHHHHHHhCCHHHHHHHHHHcC
Confidence 0011445667899999999887 56666666655543
No 38
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=95.28 E-value=0.04 Score=54.31 Aligned_cols=69 Identities=23% Similarity=0.285 Sum_probs=53.0
Q ss_pred hhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCC-CceEEEehhHHhhccCCCccccCCCCCCCcccccccc
Q psy4798 436 GELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGA 514 (623)
Q Consensus 436 ~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~-~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (623)
..+.+.++..++.+|.+++++.+-+... .+|.|. .++|.+.-.|+.
T Consensus 112 ~~l~~~~~~~~~~~~~~~~~i~ir~~ks--------rWGsc~~~~~I~ln~~L~~------------------------- 158 (205)
T PF01863_consen 112 EYLPERLKKYAKKLGLPPPKIKIRDMKS--------RWGSCSSKGNITLNWRLVM------------------------- 158 (205)
T ss_pred HHHHHHHHHHHHHcCCCcceEEEeehhh--------ccccCCCCCcEEeeccccc-------------------------
Confidence 4567778888999999999998866332 267663 456888888885
Q ss_pred CCCCCCHHHHHHHHHHHhhhhhc-cCHH
Q psy4798 515 NKKGCDTEEVLAVLAHELGHWKY-NHVL 541 (623)
Q Consensus 515 ~g~gl~~~Ei~AVLAHELGH~k~-~H~~ 541 (623)
++++-+..|+.|||.|.++ +|..
T Consensus 159 ----~P~~~idYVvvHEL~Hl~~~nHs~ 182 (205)
T PF01863_consen 159 ----APPEVIDYVVVHELCHLRHPNHSK 182 (205)
T ss_pred ----CCccHHHHHHHHHHHHhccCCCCH
Confidence 5888999999999999964 5553
No 39
>PRK04351 hypothetical protein; Provisional
Probab=94.79 E-value=0.11 Score=50.06 Aligned_cols=71 Identities=23% Similarity=0.462 Sum_probs=50.2
Q ss_pred CCchhHHHHHHHHHHH-cCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798 433 LPDGELKSRIEQLSAS-VKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST 511 (623)
Q Consensus 433 l~~~~L~~~I~~la~~-~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~ 511 (623)
+.+.||.+.+++++.+ ++-|.+.-...+..-|++ .-+|. .+...|.+...+++.
T Consensus 1 ~~~~~l~~l~~~~s~~~F~~~f~~~v~~n~Rlrtt-gG~~~---l~~~~I~lnp~ll~~--------------------- 55 (149)
T PRK04351 1 MTNQELQRLVEEISLEYFGKPFRHQAYFNKRLRTT-GGRYL---LKDHHIEFNPKMLEE--------------------- 55 (149)
T ss_pred CCHHHHHHHHHHHHHHHhCCCCCcEEEEeccchhh-hheee---cCCCeEEeCHHHHhh---------------------
Confidence 3578999999999975 455544333344433322 23442 245689999999986
Q ss_pred cccCCCCCCHHHHHHHHHHHhhhh
Q psy4798 512 EGANKKGCDTEEVLAVLAHELGHW 535 (623)
Q Consensus 512 ~~~~g~gl~~~Ei~AVLAHELGH~ 535 (623)
-+++++..||+|||.|+
T Consensus 56 -------~~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 56 -------YGLEELIGIIKHELCHY 72 (149)
T ss_pred -------ccHHHHHhhHHHHHHHH
Confidence 57899999999999998
No 40
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=94.68 E-value=0.085 Score=53.97 Aligned_cols=73 Identities=19% Similarity=0.234 Sum_probs=51.5
Q ss_pred hhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccC
Q psy4798 436 GELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGAN 515 (623)
Q Consensus 436 ~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (623)
..+...++..++.+|.+...+.+-+.+. ..|.|..+..+.++..+..
T Consensus 123 ~~l~~~~~~~~~~l~~~~~~~~ik~~k~--------~WGScs~~~~i~~~~~l~~------------------------- 169 (223)
T COG1451 123 EILEIRLKEYAKKLGVPPRAIKLKNMKR--------RWGSCSKAGEIRFNWRLVM------------------------- 169 (223)
T ss_pred HHHHHHHHHHHHHhCCCccceeeeeccc--------eeeeecCCCcEEeehhhhc-------------------------
Confidence 4566777788889999988888765443 3566533334555555544
Q ss_pred CCCCCHHHHHHHHHHHhhhhhccCHHHHH
Q psy4798 516 KKGCDTEEVLAVLAHELGHWKYNHVLKSM 544 (623)
Q Consensus 516 g~gl~~~Ei~AVLAHELGH~k~~H~~k~~ 544 (623)
.+++.+..|++|||+|.+..+=-+.|
T Consensus 170 ---~p~~~i~YVvvHELaHLke~nHs~~F 195 (223)
T COG1451 170 ---APEEVIDYVVVHELAHLKEKNHSKRF 195 (223)
T ss_pred ---CCHHHHHHHHHHHHHHHhhhhccHHH
Confidence 69999999999999999976644444
No 41
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=94.15 E-value=0.061 Score=53.02 Aligned_cols=67 Identities=18% Similarity=0.175 Sum_probs=51.5
Q ss_pred cchHHHHhhhhhcCCCCccccccccchhhhHHHHHHhHhhcccCCCCCCCceeeecCCC-CeEEEEecchhhhccCCccc
Q psy4798 40 IVPHQIAHGMDAESFEKSRRYSLDKNVFSMFKETVSNVMNTVSSKRSEHSNAYFYGFFK-NKRIVLFDTLLKDYVPLNAD 118 (623)
Q Consensus 40 ~l~~~l~~l~~~~g~~~~~ly~~d~~~~~~~~~~v~~v~~~~~s~~~~~~NA~~~G~~~-~krIvl~d~Ll~~~~~~~~~ 118 (623)
.+.+.++..++..|.+++++.+-++ .. .+|.|. .+.|.+.-.|+.
T Consensus 113 ~l~~~~~~~~~~~~~~~~~i~ir~~-------------------------ks-rWGsc~~~~~I~ln~~L~~-------- 158 (205)
T PF01863_consen 113 YLPERLKKYAKKLGLPPPKIKIRDM-------------------------KS-RWGSCSSKGNITLNWRLVM-------- 158 (205)
T ss_pred HHHHHHHHHHHHcCCCcceEEEeeh-------------------------hh-ccccCCCCCcEEeeccccc--------
Confidence 4777788889999999888877431 11 378765 567888888887
Q ss_pred cCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798 119 KKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 161 (623)
Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~ 161 (623)
++++-++.|++|||+|.++.+-
T Consensus 159 ---------------------~P~~~idYVvvHEL~Hl~~~nH 180 (205)
T PF01863_consen 159 ---------------------APPEVIDYVVVHELCHLRHPNH 180 (205)
T ss_pred ---------------------CCccHHHHHHHHHHHHhccCCC
Confidence 5888899999999999977553
No 42
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=93.34 E-value=0.15 Score=44.03 Aligned_cols=69 Identities=30% Similarity=0.399 Sum_probs=44.6
Q ss_pred cCCchhHHHHHHHHHHHcCCCCCeEEEEeC-----CCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCC
Q psy4798 432 PLPDGELKSRIEQLSASVKFPLKKLYVVEG-----SKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSE 506 (623)
Q Consensus 432 pl~~~~L~~~I~~la~~~gfp~~~v~vv~~-----s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~ 506 (623)
|||+ .+++. +-..+|.++.+|-|-.+ ..++....||..| ..|++-.+=.+
T Consensus 1 PLp~-~~r~~---~e~~~G~dl~~Vrvh~~~~a~~~~~~~~A~A~T~G----~~I~f~~g~~~----------------- 55 (79)
T PF13699_consen 1 PLPE-SIRSR---LERAFGADLSDVRVHTGPAASRAAAALGARAFTVG----NDIYFAPGKYN----------------- 55 (79)
T ss_pred CCCH-HHHHH---HHHHhCCCccceEEEeCCchhhhhhccCCeEEEEC----CEEEEcCCCcC-----------------
Confidence 5543 35555 45678888888887666 2345677888876 46777333221
Q ss_pred ccccccccCCCCCCHHHHHHHHHHHhhhhhc
Q psy4798 507 PLISTEGANKKGCDTEEVLAVLAHELGHWKY 537 (623)
Q Consensus 507 ~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~ 537 (623)
.+..+=.++|||||.|...
T Consensus 56 ------------~~s~~~~~llaHEl~Hv~Q 74 (79)
T PF13699_consen 56 ------------PDSPEGRALLAHELAHVVQ 74 (79)
T ss_pred ------------CCCCCcchhHhHHHHHHHh
Confidence 2334456889999999864
No 43
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=92.14 E-value=7.1 Score=40.16 Aligned_cols=86 Identities=19% Similarity=0.234 Sum_probs=55.3
Q ss_pred chHhhhhhhccCCcCCch---hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCC
Q psy4798 419 YPEFIAPLFDKYTPLPDG---ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLN 495 (623)
Q Consensus 419 ~P~~I~plf~k~~pl~~~---~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~ 495 (623)
.-..+..-|+||...+.. .=.+.=+.+.++.|+.--+|..++|.- +.|-| +.+|.|-|++.-...
T Consensus 17 aq~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~L-tDHYd------P~~k~vrLS~~vy~~----- 84 (222)
T PF04298_consen 17 AQWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGEL-TDHYD------PRNKVVRLSEDVYNG----- 84 (222)
T ss_pred HHHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCC-CCCcC------CCCCEEEeCCccCCC-----
Confidence 334566778888665541 122333446888998777777777632 33422 356778888876653
Q ss_pred ccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798 496 ADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~ 540 (623)
+.=--.||-|||.||--+.+.
T Consensus 85 ------------------------~SiaAvaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 85 ------------------------RSIAAVAVAAHEVGHAIQHAE 105 (222)
T ss_pred ------------------------CCHHHHHHHHHHHhHHHhccc
Confidence 334567999999999877664
No 44
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=90.91 E-value=0.3 Score=50.04 Aligned_cols=25 Identities=32% Similarity=0.327 Sum_probs=21.5
Q ss_pred CChHHHHHHHHHhhhhhhcchHHHH
Q psy4798 140 CDTEEVLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h~~k~ 164 (623)
.+++.+..|++|||+|.++.+--+.
T Consensus 170 ~p~~~i~YVvvHELaHLke~nHs~~ 194 (223)
T COG1451 170 APEEVIDYVVVHELAHLKEKNHSKR 194 (223)
T ss_pred CCHHHHHHHHHHHHHHHhhhhccHH
Confidence 6899999999999999998775444
No 45
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=90.65 E-value=0.75 Score=45.84 Aligned_cols=74 Identities=20% Similarity=0.082 Sum_probs=44.8
Q ss_pred hHHHHHHHHHHHcCCCCCeEEEEe--CCCCCCc-ccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccc
Q psy4798 437 ELKSRIEQLSASVKFPLKKLYVVE--GSKRSEH-SNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEG 513 (623)
Q Consensus 437 ~L~~~I~~la~~~gfp~~~v~vv~--~s~rs~~-~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (623)
.+++.++++.+.+.-|--+|++.. ...+... ...+..|.+-..+|.++= +-..
T Consensus 4 ~i~~~~~~~~~~~~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~-~~~~----------------------- 59 (195)
T PF10026_consen 4 IIEEALEKSIELLPGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFL-LPND----------------------- 59 (195)
T ss_pred HHHHHHHHHHHHcCCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEe-cCCc-----------------------
Confidence 456667777777766655565544 3332122 223444444455677764 2222
Q ss_pred cCCCCCCHHHHHHHHHHHhhhhhccC
Q psy4798 514 ANKKGCDTEEVLAVLAHELGHWKYNH 539 (623)
Q Consensus 514 ~~g~gl~~~Ei~AVLAHELGH~k~~H 539 (623)
.+.+++.+++|||+.|..+..
T Consensus 60 -----~~~~~l~~~iaHE~hH~~r~~ 80 (195)
T PF10026_consen 60 -----YSLEELPALIAHEYHHNCRYE 80 (195)
T ss_pred -----ccHHHHHHHHHHHHHHHHHHh
Confidence 688999999999999996543
No 46
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=90.21 E-value=0.23 Score=50.33 Aligned_cols=41 Identities=24% Similarity=0.264 Sum_probs=32.8
Q ss_pred HHHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Q psy4798 523 EVLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAM 589 (623)
Q Consensus 523 Ei~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~ 589 (623)
-+.-+++||++|..++|.... ..-.+++.|.+||.+|.+.+
T Consensus 100 A~~fil~HE~~Hv~~~h~~~~--------------------------~~~~~~~eE~~AD~~A~~~i 140 (206)
T PF10463_consen 100 AIAFILLHELAHVVLGHEGDS--------------------------SPSQSIQEEKEADSYATEMI 140 (206)
T ss_pred HHHHHHHHHHHHHHHcCcccc--------------------------ccchhHHHHHhhhHHHHHHH
Confidence 355689999999999999664 11128899999999999876
No 47
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=89.71 E-value=0.65 Score=44.15 Aligned_cols=66 Identities=29% Similarity=0.311 Sum_probs=42.7
Q ss_pred hHHHHHHHHHH-HcC--CCCCeEEEEeCCCCCCcccEEEeccC--CCceEEEehhHHhhccCCCccccCCCCCCCccccc
Q psy4798 437 ELKSRIEQLSA-SVK--FPLKKLYVVEGSKRSEHSNAYFYGFF--KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLIST 511 (623)
Q Consensus 437 ~L~~~I~~la~-~~g--fp~~~v~vv~~s~rs~~~NAy~~G~~--~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~ 511 (623)
+|.+.++++-. -++ +|.+.|. .+...++. .|.+ +...|.|...|++.
T Consensus 2 ~L~~~~~~~n~~~F~~~l~~~~i~-w~~r~~~~------~G~~~~~~~~I~ln~~l~~~--------------------- 53 (146)
T smart00731 2 ELTQRLEDASLRVFGRKLPHPKVV-WNKRLRKT------GGRCLLKSAEIRLNPKLLTE--------------------- 53 (146)
T ss_pred hHHHHHHHHHHHHHCCCCCCCEEE-Eehhhhhh------hHHhhcCCCEEEeCHHHHhh---------------------
Confidence 56667777653 233 4444343 33222211 2332 36679999999986
Q ss_pred cccCCCCCCHHHHHHHHHHHhhhhhc
Q psy4798 512 EGANKKGCDTEEVLAVLAHELGHWKY 537 (623)
Q Consensus 512 ~~~~g~gl~~~Ei~AVLAHELGH~k~ 537 (623)
.+++++..||.|||.|...
T Consensus 54 -------~~~~~l~~~l~HEm~H~~~ 72 (146)
T smart00731 54 -------NGRDRLRETLLHELCHAAL 72 (146)
T ss_pred -------ccHHHHHhhHHHHHHHHHH
Confidence 5789999999999999976
No 48
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=88.00 E-value=2.1 Score=40.47 Aligned_cols=33 Identities=30% Similarity=0.328 Sum_probs=28.6
Q ss_pred eEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798 480 RIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 480 rIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~ 540 (623)
.|.|...+++. .+.+|+..+|.|||.|....+.
T Consensus 44 ~I~ls~~~~~~----------------------------~~~~~~~~tL~HEm~H~~~~~~ 76 (157)
T PF10263_consen 44 EIRLSPKLLDR----------------------------NPEEELIDTLLHEMAHAAAYVF 76 (157)
T ss_pred EEEECHHHHHh----------------------------hHHHHHHHHHHHHHHHHHhhhc
Confidence 79999999986 5889999999999999966444
No 49
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=85.66 E-value=1.8 Score=43.87 Aligned_cols=82 Identities=20% Similarity=0.230 Sum_probs=52.9
Q ss_pred cCCchhHHHHHHHHHHHcCC------CCCeEE-EEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCC
Q psy4798 432 PLPDGELKSRIEQLSASVKF------PLKKLY-VVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGD 504 (623)
Q Consensus 432 pl~~~~L~~~I~~la~~~gf------p~~~v~-vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~ 504 (623)
|-++.-|++....+.+-+-. |.++|- +++. ...=||..|-...+.|-++...+++. .++
T Consensus 24 ~~a~~~L~~a~~~V~~~ly~~~~~~~~v~~Vt~~~~~----~~gVA~t~gd~~~~~I~~S~~~i~~~-------~~~--- 89 (205)
T PF04450_consen 24 PDAEQVLRDASRFVWRLLYQSPADRKPVRSVTLILDD----MDGVAYTSGDDDHKEIHFSARYIAKY-------PAD--- 89 (205)
T ss_pred cCHHHHHHHHHHHHHHHhCCCCCCCCcccEEEEEEEC----CCeeEEEecCCCccEEEEeHHHHhhc-------ccc---
Confidence 33334455544444444322 234554 4442 35789999977788999999999873 000
Q ss_pred CCccccccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798 505 SEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 505 ~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~ 540 (623)
|-..+|+.+||-||+.|.-++..
T Consensus 90 -------------~~~~~Ei~Gvl~HE~~H~~Q~~~ 112 (205)
T PF04450_consen 90 -------------GDVRDEIIGVLYHEMVHCWQWDG 112 (205)
T ss_pred -------------cchHHHHHHHHHHHHHHHhhcCC
Confidence 13458999999999999977654
No 50
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=85.24 E-value=0.69 Score=49.26 Aligned_cols=18 Identities=33% Similarity=0.503 Sum_probs=15.4
Q ss_pred HHHhhhhccccchhhhhc
Q psy4798 224 TCMTRRFEFQADAFGKSL 241 (623)
Q Consensus 224 ~~~sR~~Ey~AD~fA~~l 241 (623)
+.++|+.|.|||.||.--
T Consensus 205 ~~~svr~ELQADC~AGvw 222 (292)
T PF04228_consen 205 NELSVRLELQADCFAGVW 222 (292)
T ss_pred chHHHHHHHHHHHHHHHH
Confidence 578999999999998653
No 51
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=82.18 E-value=66 Score=33.27 Aligned_cols=17 Identities=41% Similarity=0.471 Sum_probs=14.0
Q ss_pred HHHHHHHhhhhhhcchH
Q psy4798 145 VLAVLAHELGHWKYNHV 161 (623)
Q Consensus 145 l~AVlAHELGH~k~~h~ 161 (623)
-.||-|||.||--.+..
T Consensus 89 AvaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 89 AVAVAAHEVGHAIQHAE 105 (222)
T ss_pred HHHHHHHHHhHHHhccc
Confidence 46899999999877763
No 52
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=80.61 E-value=2 Score=45.11 Aligned_cols=27 Identities=26% Similarity=0.241 Sum_probs=24.9
Q ss_pred CCHHHHHHHHHHHhhhhhccCHHHHHH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHVLKSMI 545 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~~k~~~ 545 (623)
++.+|+.+|++||+-|...+|..+.--
T Consensus 55 l~~~~~~~~l~HevlH~~~~H~~r~~~ 81 (292)
T PF13203_consen 55 LSPEERVGLLLHEVLHCLLRHPWRRGG 81 (292)
T ss_pred CCHHHHHHHHHHHHHHHHccchhhhcc
Confidence 899999999999999999999987654
No 53
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=80.12 E-value=2.4 Score=42.36 Aligned_cols=23 Identities=26% Similarity=0.208 Sum_probs=19.8
Q ss_pred CCCChHHHHHHHHHhhhhhhcch
Q psy4798 138 KGCDTEEVLAVLAHELGHWKYNH 160 (623)
Q Consensus 138 ~~l~~~El~AVlAHELGH~k~~h 160 (623)
.-++-++|..|+.|||+|..++.
T Consensus 75 ~fl~~~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 75 GFLPYETILGTMLHELAHNVHGP 97 (186)
T ss_pred CEeeHHHHHHHHHHHHHhcccCC
Confidence 34899999999999999998654
No 54
>PRK04351 hypothetical protein; Provisional
Probab=76.50 E-value=2 Score=41.42 Aligned_cols=31 Identities=29% Similarity=0.597 Sum_probs=26.6
Q ss_pred CCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhh
Q psy4798 98 KNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHW 156 (623)
Q Consensus 98 ~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~ 156 (623)
....|.+...+++. -+++++..||+|||.|+
T Consensus 42 ~~~~I~lnp~ll~~----------------------------~~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 42 KDHHIEFNPKMLEE----------------------------YGLEELIGIIKHELCHY 72 (149)
T ss_pred CCCeEEeCHHHHhh----------------------------ccHHHHHhhHHHHHHHH
Confidence 35689999888885 47899999999999997
No 55
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=74.73 E-value=5.8 Score=40.49 Aligned_cols=22 Identities=27% Similarity=0.585 Sum_probs=17.4
Q ss_pred CCHHHHHHHHHHHhhhhh-ccCH
Q psy4798 519 CDTEEVLAVLAHELGHWK-YNHV 540 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k-~~H~ 540 (623)
-+.+.+..|++|||||-- .+|+
T Consensus 128 ~~~~~~~hvi~HEiGH~IGfRHT 150 (211)
T PF12388_consen 128 YSVNVIEHVITHEIGHCIGFRHT 150 (211)
T ss_pred CchhHHHHHHHHHhhhhcccccc
Confidence 477889999999999963 4554
No 56
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=73.68 E-value=2.7 Score=36.25 Aligned_cols=19 Identities=47% Similarity=0.513 Sum_probs=13.8
Q ss_pred ChHHHHHHHHHhhhhhhcc
Q psy4798 141 DTEEVLAVLAHELGHWKYN 159 (623)
Q Consensus 141 ~~~El~AVlAHELGH~k~~ 159 (623)
+..+=.++|||||+|+..+
T Consensus 57 ~s~~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 57 DSPEGRALLAHELAHVVQQ 75 (79)
T ss_pred CCCCcchhHhHHHHHHHhh
Confidence 3344567899999998653
No 57
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=72.96 E-value=3.9 Score=41.51 Aligned_cols=53 Identities=25% Similarity=0.249 Sum_probs=40.9
Q ss_pred CCCCceeeecCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798 86 SEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 161 (623)
Q Consensus 86 ~~~~NA~~~G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~ 161 (623)
+...-||.+|-+..+.|-++...++...+ + +-..+|+..||-||+.|+-..+-
T Consensus 60 ~~~gVA~t~gd~~~~~I~~S~~~i~~~~~------------------~-----~~~~~Ei~Gvl~HE~~H~~Q~~~ 112 (205)
T PF04450_consen 60 DMDGVAYTSGDDDHKEIHFSARYIAKYPA------------------D-----GDVRDEIIGVLYHEMVHCWQWDG 112 (205)
T ss_pred CCCeeEEEecCCCccEEEEeHHHHhhccc------------------c-----cchHHHHHHHHHHHHHHHhhcCC
Confidence 45677999997778999999999997410 0 12348999999999999977654
No 58
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=72.78 E-value=7.4 Score=38.89 Aligned_cols=21 Identities=29% Similarity=0.327 Sum_probs=18.8
Q ss_pred CCHHHHHHHHHHHhhhhhccC
Q psy4798 519 CDTEEVLAVLAHELGHWKYNH 539 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H 539 (623)
++-++|..|+.|||+|..++.
T Consensus 77 l~~~~i~~t~lHELaH~~~~~ 97 (186)
T PF08325_consen 77 LPYETILGTMLHELAHNVHGP 97 (186)
T ss_pred eeHHHHHHHHHHHHHhcccCC
Confidence 688999999999999998654
No 59
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=72.28 E-value=3.6 Score=43.20 Aligned_cols=26 Identities=27% Similarity=0.308 Sum_probs=24.1
Q ss_pred CChHHHHHHHHHhhhhhhcchHHHHH
Q psy4798 140 CDTEEVLAVLAHELGHWKYNHVLKSM 165 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h~~k~~ 165 (623)
|+.+|+..|++||+-|+..+|..+.-
T Consensus 55 l~~~~~~~~l~HevlH~~~~H~~r~~ 80 (292)
T PF13203_consen 55 LSPEERVGLLLHEVLHCLLRHPWRRG 80 (292)
T ss_pred CCHHHHHHHHHHHHHHHHccchhhhc
Confidence 89999999999999999999998764
No 60
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=71.92 E-value=1.7 Score=44.18 Aligned_cols=19 Identities=32% Similarity=0.385 Sum_probs=16.0
Q ss_pred HHHHHHHhhhhhhcchHHH
Q psy4798 145 VLAVLAHELGHWKYNHVLK 163 (623)
Q Consensus 145 l~AVlAHELGH~k~~h~~k 163 (623)
+.-+++||++|+..+|...
T Consensus 101 ~~fil~HE~~Hv~~~h~~~ 119 (206)
T PF10463_consen 101 IAFILLHELAHVVLGHEGD 119 (206)
T ss_pred HHHHHHHHHHHHHHcCccc
Confidence 4568999999999999764
No 61
>PRK09672 phage exclusion protein Lit; Provisional
Probab=71.55 E-value=3.6 Score=43.94 Aligned_cols=41 Identities=20% Similarity=0.264 Sum_probs=31.8
Q ss_pred HHHHHHHHhhhhhccCHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Q psy4798 524 VLAVLAHELGHWKYNHVLKSMILKKEFGVANKEREASVMRYVTKESELITARQDREAAEKKYKEAMK 590 (623)
Q Consensus 524 i~AVLAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~~v~~~~~~~~~~~~sR~~E~~AD~~a~~~~~ 590 (623)
+.-|+.||++|..++|.... ..- .+++.|.+||.+|.+.+-
T Consensus 165 ~a~i~~HEiaHv~~~h~~~~-------------------------~~e-~s~~eE~eaDs~At~~il 205 (305)
T PRK09672 165 LAWILLHEIAHVEFQHSSLE-------------------------SNE-DSIQEEKEADSYATNWLL 205 (305)
T ss_pred HHHHHHHHHHHHHhcccccc-------------------------Cch-HHHHHHHHHHHHHHHHHH
Confidence 45689999999999998540 011 278999999999998773
No 62
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=71.53 E-value=9.2 Score=37.49 Aligned_cols=16 Identities=38% Similarity=0.372 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHhhhhh
Q psy4798 521 TEEVLAVLAHELGHWK 536 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~k 536 (623)
......++|||+||--
T Consensus 128 ~~~~a~~~AHElGH~l 143 (194)
T cd04269 128 LLLFAVTMAHELGHNL 143 (194)
T ss_pred hHHHHHHHHHHHHhhc
Confidence 4677899999999974
No 63
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=71.38 E-value=5.7 Score=42.02 Aligned_cols=103 Identities=19% Similarity=0.173 Sum_probs=52.0
Q ss_pred CCCCeEEEEeCCCCCCcccEE-Eecc--CCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHH
Q psy4798 451 FPLKKLYVVEGSKRSEHSNAY-FYGF--FKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAV 527 (623)
Q Consensus 451 fp~~~v~vv~~s~rs~~~NAy-~~G~--~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AV 527 (623)
...+++..+.++.++..-.|= .+|- |+..+=|..|.-+-+.+ +.+--..| +=.-.-|
T Consensus 111 Y~~Ptlvlf~~~v~t~CG~assasGPFYCP~D~kvYlDlsFf~~m------------------~~~fga~G--dfAqaYV 170 (295)
T COG2321 111 YQKPTLVLFSGQVRTGCGFASSASGPFYCPADTKVYLDLSFFDEM------------------KTKFGASG--DFAQAYV 170 (295)
T ss_pred ccCCeEEEecCccccCcCCCCcCCCCeecCCCceEEEehhHHHHH------------------HHHhcCCc--cHHHHHH
Confidence 345788888888877663332 2232 34444455554443311 11111122 4456789
Q ss_pred HHHHhhhhhccCHHHHHHHHHHHHHhhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4798 528 LAHELGHWKYNHVLKSMILKKEFGVANKEREA-SVMRYVTKESELITARQDREAAEKKYK 586 (623)
Q Consensus 528 LAHELGH~k~~H~~k~~~~~~~~g~~~~~~~~-~v~~~~~~~~~~~~sR~~E~~AD~~a~ 586 (623)
+|||.||... .++|++...... .-+. ....+-. |=+.|.+||.+|=
T Consensus 171 iAHEVGHHVQ----------nllGi~~kv~~~~q~~s--~~~an~~-sVr~ELQADCfAG 217 (295)
T COG2321 171 IAHEVGHHVQ----------NLLGILPKVNQNQQGMS--QAEANQL-SVRRELQADCFAG 217 (295)
T ss_pred HHhhhhHHHH----------HHhhhhHHHHHHHhhhh--hhhccCc-ceeeeehhhHHHH
Confidence 9999999743 233433322110 0000 0112233 6678999999864
No 64
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=70.88 E-value=3.4 Score=39.24 Aligned_cols=32 Identities=28% Similarity=0.326 Sum_probs=27.5
Q ss_pred CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798 99 NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 99 ~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~ 158 (623)
...|.+...|++. -+++++..||.|||.|...
T Consensus 41 ~~~I~ln~~l~~~----------------------------~~~~~l~~~l~HEm~H~~~ 72 (146)
T smart00731 41 SAEIRLNPKLLTE----------------------------NGRDRLRETLLHELCHAAL 72 (146)
T ss_pred CCEEEeCHHHHhh----------------------------ccHHHHHhhHHHHHHHHHH
Confidence 6789999998885 4778999999999999875
No 65
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=70.84 E-value=8.9 Score=38.23 Aligned_cols=21 Identities=33% Similarity=0.316 Sum_probs=17.9
Q ss_pred CChHHHHHHHHHhhhhhhcch
Q psy4798 140 CDTEEVLAVLAHELGHWKYNH 160 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h 160 (623)
.+.+++.|++|||+-|....+
T Consensus 60 ~~~~~l~~~iaHE~hH~~r~~ 80 (195)
T PF10026_consen 60 YSLEELPALIAHEYHHNCRYE 80 (195)
T ss_pred ccHHHHHHHHHHHHHHHHHHh
Confidence 578899999999999996544
No 66
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=70.78 E-value=2.9 Score=39.59 Aligned_cols=32 Identities=31% Similarity=0.329 Sum_probs=27.2
Q ss_pred eEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798 100 KRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN 159 (623)
Q Consensus 100 krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~ 159 (623)
..|.+...+++. -+++|+..+|.|||.|....
T Consensus 43 ~~I~ls~~~~~~----------------------------~~~~~~~~tL~HEm~H~~~~ 74 (157)
T PF10263_consen 43 CEIRLSPKLLDR----------------------------NPEEELIDTLLHEMAHAAAY 74 (157)
T ss_pred eEEEECHHHHHh----------------------------hHHHHHHHHHHHHHHHHHhh
Confidence 479999999885 47899999999999997653
No 67
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=70.57 E-value=13 Score=34.50 Aligned_cols=18 Identities=33% Similarity=0.372 Sum_probs=16.6
Q ss_pred CCHHHHHHHHHHHhhhhh
Q psy4798 519 CDTEEVLAVLAHELGHWK 536 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k 536 (623)
++-+|=.-|++||+.|+-
T Consensus 75 Ls~~ekvKviiHEllHIP 92 (133)
T COG4900 75 LSCAEKVKVIIHELLHIP 92 (133)
T ss_pred CChHHHHHHHHHHHhcCc
Confidence 899999999999999983
No 68
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=69.07 E-value=2.9 Score=44.16 Aligned_cols=18 Identities=33% Similarity=0.418 Sum_probs=14.8
Q ss_pred HHHhhhhccccchhhhhc
Q psy4798 224 TCMTRRFEFQADAFGKSL 241 (623)
Q Consensus 224 ~~~sR~~Ey~AD~fA~~l 241 (623)
+.+|=+.|.|||.||.--
T Consensus 202 n~~sVr~ELQADCfAGVW 219 (295)
T COG2321 202 NQLSVRRELQADCFAGVW 219 (295)
T ss_pred cCcceeeeehhhHHHHHH
Confidence 667788999999999643
No 69
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=68.31 E-value=3.4 Score=42.66 Aligned_cols=22 Identities=45% Similarity=0.324 Sum_probs=16.4
Q ss_pred CCCCChHHHHHHHHHhhhhhhc
Q psy4798 137 KKGCDTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 137 ~~~l~~~El~AVlAHELGH~k~ 158 (623)
++.+...+...++||||||--.
T Consensus 159 ~~~~~~~~~a~t~AHElGHnlG 180 (244)
T cd04270 159 GKRVPTKESDLVTAHELGHNFG 180 (244)
T ss_pred CCccchhHHHHHHHHHHHHhcC
Confidence 4445566788999999999643
No 70
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=68.06 E-value=9.4 Score=37.11 Aligned_cols=84 Identities=24% Similarity=0.307 Sum_probs=53.0
Q ss_pred chhHHHHHHHHHHHcC--CCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCcccccc
Q psy4798 435 DGELKSRIEQLSASVK--FPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTE 512 (623)
Q Consensus 435 ~~~L~~~I~~la~~~g--fp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~ 512 (623)
+.+|++.+++...... ++.+++.-=... + +..-||+- ++ .|-+.-.|++.
T Consensus 4 ~~~L~~~~~~as~~~~r~~~~p~~~~n~Rg-~-taG~ayL~---~~-~I~lNP~ll~e---------------------- 55 (156)
T COG3091 4 NRKLQQCVEQASLKFFRKFFRPKASYNQRG-R-TAGGAYLL---KS-EIRLNPKLLEE---------------------- 55 (156)
T ss_pred HHHHHHHHHHHHHHhcCCCCCcceehhhhh-h-hcchhhcc---cc-ccccCHHHHHH----------------------
Confidence 4567777777666553 344666432211 1 23455543 44 78899998887
Q ss_pred ccCCCCCCHHHHHHHHHHHhhhhh---------ccCHHHHHHHHHHHHH
Q psy4798 513 GANKKGCDTEEVLAVLAHELGHWK---------YNHVLKSMILKKEFGV 552 (623)
Q Consensus 513 ~~~g~gl~~~Ei~AVLAHELGH~k---------~~H~~k~~~~~~~~g~ 552 (623)
-.+|.+..|+.|||+|+. +++--+..++.+++|+
T Consensus 56 ------n~~~f~~~vV~HELaHl~ly~~~gr~~phg~ewk~lm~qV~~l 98 (156)
T COG3091 56 ------NGEDFIEQVVPHELAHLHLYQEFGRYKPHGKEWKLLMQQVLGL 98 (156)
T ss_pred ------ccHHHHHHHHHHHHHHHHHHHHcCCCCCCchhHHHHHHHhCCC
Confidence 588999999999999973 2333344555666554
No 71
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=64.01 E-value=4 Score=38.77 Aligned_cols=15 Identities=47% Similarity=0.689 Sum_probs=13.5
Q ss_pred hHHHHHHHHHhhhhh
Q psy4798 142 TEEVLAVLAHELGHW 156 (623)
Q Consensus 142 ~~El~AVlAHELGH~ 156 (623)
..++.+|+.||+||.
T Consensus 101 ~~~~~~~~~HEiGHa 115 (156)
T cd04279 101 AENLQAIALHELGHA 115 (156)
T ss_pred chHHHHHHHHHhhhh
Confidence 568999999999996
No 72
>PRK04860 hypothetical protein; Provisional
Probab=63.99 E-value=18 Score=35.38 Aligned_cols=64 Identities=22% Similarity=0.128 Sum_probs=39.3
Q ss_pred HHHHHHHHcCCCCC--eEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCC
Q psy4798 441 RIEQLSASVKFPLK--KLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKG 518 (623)
Q Consensus 441 ~I~~la~~~gfp~~--~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~g 518 (623)
.++.-.+.+|-|.+ .+. .+...|+ |-.+.+ ....|-+.-.++..
T Consensus 12 ~~~~a~~~f~~~f~~p~~~-f~~R~rt----aG~~~l-~~~~I~~Np~ll~~---------------------------- 57 (160)
T PRK04860 12 CLAQANLYFKRTFPEPKVS-YTQRGTS----AGTAWL-QSNEIRLNPVLLLE---------------------------- 57 (160)
T ss_pred HHHHHHHHhCCCCCCCEEE-Eeecchh----hcchhH-hcCCeeeCHHHHhh----------------------------
Confidence 33333344565543 443 3433332 333332 34568888888876
Q ss_pred CCHHHHHHHHHHHhhhhhcc
Q psy4798 519 CDTEEVLAVLAHELGHWKYN 538 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~ 538 (623)
-+.+++..|++||+.|+...
T Consensus 58 ~~~~~l~~~v~HEl~H~~~~ 77 (160)
T PRK04860 58 NQQAFIDEVVPHELAHLLVY 77 (160)
T ss_pred CcHHHHHhHHHHHHHHHHHH
Confidence 57899999999999998543
No 73
>PRK09672 phage exclusion protein Lit; Provisional
Probab=63.98 E-value=1.8 Score=46.26 Aligned_cols=18 Identities=28% Similarity=0.540 Sum_probs=15.0
Q ss_pred HHHHHHHhhhhhhcchHH
Q psy4798 145 VLAVLAHELGHWKYNHVL 162 (623)
Q Consensus 145 l~AVlAHELGH~k~~h~~ 162 (623)
+.-|+.||++|+.++|..
T Consensus 165 ~a~i~~HEiaHv~~~h~~ 182 (305)
T PRK09672 165 LAWILLHEIAHVEFQHSS 182 (305)
T ss_pred HHHHHHHHHHHHHhcccc
Confidence 445789999999999976
No 74
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=63.96 E-value=6 Score=45.73 Aligned_cols=22 Identities=27% Similarity=0.484 Sum_probs=15.4
Q ss_pred CCCCCCCceeeecCCC-CeEEEE
Q psy4798 83 SKRSEHSNAYFYGFFK-NKRIVL 104 (623)
Q Consensus 83 s~~~~~~NA~~~G~~~-~krIvl 104 (623)
+++.+.+-||.+|+.+ ..+|++
T Consensus 348 ~r~gK~~Ga~~~~~~~~~p~il~ 370 (591)
T TIGR00181 348 ENKGKRSGAYSIGGYKVKPYILM 370 (591)
T ss_pred CCCCCCCCcccCCCCCCCCeEEE
Confidence 4567788899999966 344544
No 75
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=63.90 E-value=6.6 Score=38.51 Aligned_cols=16 Identities=38% Similarity=0.372 Sum_probs=13.0
Q ss_pred hHHHHHHHHHhhhhhh
Q psy4798 142 TEEVLAVLAHELGHWK 157 (623)
Q Consensus 142 ~~El~AVlAHELGH~k 157 (623)
.....-++|||+||--
T Consensus 128 ~~~~a~~~AHElGH~l 143 (194)
T cd04269 128 LLLFAVTMAHELGHNL 143 (194)
T ss_pred hHHHHHHHHHHHHhhc
Confidence 3567889999999964
No 76
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=63.87 E-value=4 Score=38.57 Aligned_cols=16 Identities=31% Similarity=0.428 Sum_probs=13.8
Q ss_pred CHHHHHHHHHHHhhhh
Q psy4798 520 DTEEVLAVLAHELGHW 535 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~ 535 (623)
+.++..+|++|||||-
T Consensus 90 ~~~~~~~~~~HEiGHa 105 (165)
T cd04268 90 SGARLRNTAEHELGHA 105 (165)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4468999999999996
No 77
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=63.80 E-value=6.9 Score=43.65 Aligned_cols=55 Identities=16% Similarity=0.248 Sum_probs=29.8
Q ss_pred CCCCCCCceeeecCCCC--------eEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhh
Q psy4798 83 SKRSEHSNAYFYGFFKN--------KRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELG 154 (623)
Q Consensus 83 s~~~~~~NA~~~G~~~~--------krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELG 154 (623)
+++.+.+.||.+++.++ ...++.-+.-.. ...+.+.|+-+++ -.|+||+|
T Consensus 194 ~R~gK~~ga~~~~~~~~~~~~~~~~~~~~i~~n~~~~---------------------~~~~~~ll~~~~v-~tLfHE~G 251 (458)
T PF01432_consen 194 PRPGKRSGAFCFTLRPSRSDGERQLPVPYIFCNFTGP---------------------SAGKPSLLSHDDV-ETLFHEFG 251 (458)
T ss_dssp --TTS-SS-EEEEEEC-BTTSTCECEEEEEEEEE-S----------------------BTTC--B-SHHHH-HHHHHHHH
T ss_pred hcCCCCCCceeCCccCccccccCCCCceEEEecCCCC---------------------CCCCCCccChhhH-HHHHHHHh
Confidence 45567788999998765 444444333221 1122344788898 67999999
Q ss_pred hhhcc
Q psy4798 155 HWKYN 159 (623)
Q Consensus 155 H~k~~ 159 (623)
|--|.
T Consensus 252 Ha~H~ 256 (458)
T PF01432_consen 252 HAMHS 256 (458)
T ss_dssp HHHHH
T ss_pred HHHHH
Confidence 97653
No 78
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=63.49 E-value=6 Score=40.42 Aligned_cols=17 Identities=29% Similarity=0.575 Sum_probs=14.5
Q ss_pred CChHHHHHHHHHhhhhh
Q psy4798 140 CDTEEVLAVLAHELGHW 156 (623)
Q Consensus 140 l~~~El~AVlAHELGH~ 156 (623)
.+.+.++.|++|||||.
T Consensus 128 ~~~~~~~hvi~HEiGH~ 144 (211)
T PF12388_consen 128 YSVNVIEHVITHEIGHC 144 (211)
T ss_pred CchhHHHHHHHHHhhhh
Confidence 46677899999999995
No 79
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=63.38 E-value=3.6 Score=39.71 Aligned_cols=23 Identities=30% Similarity=0.531 Sum_probs=13.1
Q ss_pred HHHHHHHHHHhhhhh-ccCHHHHH
Q psy4798 522 EEVLAVLAHELGHWK-YNHVLKSM 544 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k-~~H~~k~~ 544 (623)
...-.+|+||+|||- +.|++..-
T Consensus 67 ~~~g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 67 YNFGKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp S-SSHHHHHHHHHHTT---TT---
T ss_pred cccccchhhhhhhhhcccccccCC
Confidence 344589999999996 67776544
No 80
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=63.05 E-value=5.2 Score=41.32 Aligned_cols=18 Identities=50% Similarity=0.449 Sum_probs=14.5
Q ss_pred CCHHHHHHHHHHHhhhhh
Q psy4798 519 CDTEEVLAVLAHELGHWK 536 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k 536 (623)
+...+...++||||||--
T Consensus 162 ~~~~~~a~t~AHElGHnl 179 (244)
T cd04270 162 VPTKESDLVTAHELGHNF 179 (244)
T ss_pred cchhHHHHHHHHHHHHhc
Confidence 455668899999999973
No 81
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=63.05 E-value=4.5 Score=38.41 Aligned_cols=16 Identities=44% Similarity=0.603 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHhhhhh
Q psy4798 521 TEEVLAVLAHELGHWK 536 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~k 536 (623)
..++.+|+.||+||.-
T Consensus 101 ~~~~~~~~~HEiGHaL 116 (156)
T cd04279 101 AENLQAIALHELGHAL 116 (156)
T ss_pred chHHHHHHHHHhhhhh
Confidence 4689999999999973
No 82
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=62.90 E-value=3.3 Score=38.63 Aligned_cols=17 Identities=29% Similarity=0.470 Sum_probs=14.6
Q ss_pred CChHHHHHHHHHhhhhh
Q psy4798 140 CDTEEVLAVLAHELGHW 156 (623)
Q Consensus 140 l~~~El~AVlAHELGH~ 156 (623)
.+..++..|+.||+||.
T Consensus 100 ~~~~~~~~v~~HEiGHa 116 (154)
T PF00413_consen 100 DSGNDLQSVAIHEIGHA 116 (154)
T ss_dssp SSSEEHHHHHHHHHHHH
T ss_pred hhhhhhhhhhhhccccc
Confidence 45568999999999997
No 83
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=61.51 E-value=4.3 Score=38.35 Aligned_cols=16 Identities=31% Similarity=0.428 Sum_probs=13.6
Q ss_pred ChHHHHHHHHHhhhhh
Q psy4798 141 DTEEVLAVLAHELGHW 156 (623)
Q Consensus 141 ~~~El~AVlAHELGH~ 156 (623)
+.++..+|++|||||-
T Consensus 90 ~~~~~~~~~~HEiGHa 105 (165)
T cd04268 90 SGARLRNTAEHELGHA 105 (165)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4467999999999995
No 84
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=60.30 E-value=6.6 Score=34.53 Aligned_cols=22 Identities=36% Similarity=0.450 Sum_probs=18.0
Q ss_pred CCHHHHHHHHHHHhhhhhccCH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~ 540 (623)
.+++.+..|++||++|.-+.+.
T Consensus 20 ~~~~~~~~~l~HE~~H~~~~~~ 41 (128)
T PF13485_consen 20 SDEDWLDRVLAHELAHQWFGNY 41 (128)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH
Confidence 4777888999999999965555
No 85
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=58.74 E-value=5.7 Score=37.01 Aligned_cols=22 Identities=27% Similarity=0.456 Sum_probs=16.5
Q ss_pred CCHHHHHHHHHHHhhhh-hccCH
Q psy4798 519 CDTEEVLAVLAHELGHW-KYNHV 540 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~-k~~H~ 540 (623)
.+..++..|+.||+||. =+.|.
T Consensus 100 ~~~~~~~~v~~HEiGHaLGL~H~ 122 (154)
T PF00413_consen 100 DSGNDLQSVAIHEIGHALGLDHS 122 (154)
T ss_dssp SSSEEHHHHHHHHHHHHTTBESS
T ss_pred hhhhhhhhhhhhccccccCcCcC
Confidence 34567999999999998 34443
No 86
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=58.41 E-value=8.6 Score=44.23 Aligned_cols=19 Identities=21% Similarity=0.263 Sum_probs=14.7
Q ss_pred cCCCCCCCceeeecCCCCe
Q psy4798 82 SSKRSEHSNAYFYGFFKNK 100 (623)
Q Consensus 82 ~s~~~~~~NA~~~G~~~~k 100 (623)
.+++.+.+.||.+|+...+
T Consensus 305 ~~r~gK~~Gayc~~~~~~~ 323 (549)
T TIGR02289 305 ESRKGKAAGGYCTYLPKYK 323 (549)
T ss_pred CCCCCCCCCcccCCCCCCC
Confidence 3667788999999987643
No 87
>KOG2765|consensus
Probab=57.74 E-value=47 Score=36.93 Aligned_cols=103 Identities=22% Similarity=0.249 Sum_probs=73.4
Q ss_pred hHHHHHHHHHHHHHHHHhhhccHHHhhhcccccccCCCcccH----------HHHHHHHHHHHHHHHhhccccccchhhh
Q psy4798 291 EIVTSCLFIVLFNIFSTVIGLPISFYHHFVLEEKHGFNKQTY----------GFFVKDQIKSFIVSLILSIPLTGAVVYI 360 (623)
Q Consensus 291 ~~~~~~~f~~~~~l~~~li~lP~~~Y~~f~le~~~Gfn~~T~----------~~f~~d~vk~~~l~~~~s~~~~~~~~~~ 360 (623)
+-.-.-+|+..+.+++-++-.|.-+--+|..+|+|.+.+++. +.++.|.+......+ -.|+.+.+-+
T Consensus 275 ~rvdi~lffGfvGLfnllllwP~l~iL~~~~~e~F~lP~~~q~~~vv~~~ligtvvSDylW~~a~~l--Ts~Lv~Tlgm- 351 (416)
T KOG2765|consen 275 ERVDIQLFFGFVGLFNLLLLWPPLIILDFFGEERFELPSSTQFSLVVFNNLIGTVVSDYLWAKAVLL--TSPLVVTLGM- 351 (416)
T ss_pred ccccHHHHHHHHHHHHHHHHhHHHHHHHHhccCcccCCCCceeEeeeHhhHHHHHHHHHHHHHHHHh--ccchhheeee-
Confidence 345556888999999999999988888888999999999983 778888765444333 3354443322
Q ss_pred hhccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHH
Q psy4798 361 IQTYGFFVKDQIKSFIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFL 415 (623)
Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~ 415 (623)
-+.+|+....=.+++.-...++.++....+.+++++
T Consensus 352 -------------------SltIPLA~~aD~l~k~~~~S~~~iiGsi~Ifv~Fv~ 387 (416)
T KOG2765|consen 352 -------------------SLTIPLAMFADVLIKGKHPSALYIIGSIPIFVGFVI 387 (416)
T ss_pred -------------------eEeeeHHHHHHHHHcCCCCCHHHHHHHHHHHHHHhh
Confidence 456788877777787666777777666666665554
No 88
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=57.59 E-value=5.1 Score=39.21 Aligned_cols=16 Identities=38% Similarity=0.428 Sum_probs=13.3
Q ss_pred CHHHHHHHHHHHhhhh
Q psy4798 520 DTEEVLAVLAHELGHW 535 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~ 535 (623)
....-.-|+|||+||.
T Consensus 138 ~~~~~~~~~AHEiGH~ 153 (196)
T PF13688_consen 138 PTYNGAITFAHEIGHN 153 (196)
T ss_dssp -HHHHHHHHHHHHHHH
T ss_pred CCCceehhhHHhHHHh
Confidence 4577888999999997
No 89
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=57.37 E-value=22 Score=40.24 Aligned_cols=20 Identities=30% Similarity=0.399 Sum_probs=15.8
Q ss_pred CCHHHHHHHHHHHhhhhhccC
Q psy4798 519 CDTEEVLAVLAHELGHWKYNH 539 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H 539 (623)
++-+|+ -.|+||+||--|+.
T Consensus 259 l~~~~V-~TLfHEfGHalH~~ 278 (472)
T cd06455 259 LRHDEV-ETFFHEFGHVIHHL 278 (472)
T ss_pred CCHHHH-HHHHHHHHHHHHHH
Confidence 677886 46899999997743
No 90
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=57.00 E-value=7.8 Score=34.03 Aligned_cols=22 Identities=36% Similarity=0.450 Sum_probs=17.8
Q ss_pred CChHHHHHHHHHhhhhhhcchH
Q psy4798 140 CDTEEVLAVLAHELGHWKYNHV 161 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h~ 161 (623)
.+.+.+..|++||++|.-+.+.
T Consensus 20 ~~~~~~~~~l~HE~~H~~~~~~ 41 (128)
T PF13485_consen 20 SDEDWLDRVLAHELAHQWFGNY 41 (128)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH
Confidence 4667788999999999866554
No 91
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=55.83 E-value=7.8 Score=39.60 Aligned_cols=20 Identities=40% Similarity=0.390 Sum_probs=16.7
Q ss_pred CCHHHHHHHHHHHhhhhhcc
Q psy4798 519 CDTEEVLAVLAHELGHWKYN 538 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~ 538 (623)
.+++.-.-++||||||.-.+
T Consensus 67 ~~~~r~rFtlAHELGH~llH 86 (213)
T COG2856 67 NSLERKRFTLAHELGHALLH 86 (213)
T ss_pred CCHHHHHHHHHHHHhHHHhc
Confidence 47788999999999998543
No 92
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=55.69 E-value=5.5 Score=38.49 Aligned_cols=14 Identities=43% Similarity=0.878 Sum_probs=11.1
Q ss_pred HHHHHHHHhhhhhh
Q psy4798 144 EVLAVLAHELGHWK 157 (623)
Q Consensus 144 El~AVlAHELGH~k 157 (623)
...-+|+||+|||-
T Consensus 68 ~~g~TltHEvGH~L 81 (154)
T PF05572_consen 68 NFGKTLTHEVGHWL 81 (154)
T ss_dssp -SSHHHHHHHHHHT
T ss_pred ccccchhhhhhhhh
Confidence 34579999999985
No 93
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=54.50 E-value=5.5 Score=38.95 Aligned_cols=15 Identities=40% Similarity=0.395 Sum_probs=12.5
Q ss_pred HHHHHHHHHHhhhhh
Q psy4798 522 EEVLAVLAHELGHWK 536 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k 536 (623)
-+...++|||+||+-
T Consensus 131 ~~~~~~~aHElGH~l 145 (192)
T cd04267 131 LLTALTMAHELGHNL 145 (192)
T ss_pred eeehhhhhhhHHhhc
Confidence 456779999999995
No 94
>PF15061 DUF4538: Domain of unknown function (DUF4538)
Probab=54.39 E-value=17 Score=29.94 Aligned_cols=43 Identities=12% Similarity=0.240 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCC
Q psy4798 404 LWVFIILMSLFLMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLK 454 (623)
Q Consensus 404 ~w~~~~~~~l~~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~ 454 (623)
..++...++++...+||+++.|+.+ +|=+++.++. .+.|++..
T Consensus 8 ~~~~ggfVg~iG~a~Ypi~~~Pmm~-------~eeYk~~Q~~-nR~gI~qe 50 (58)
T PF15061_consen 8 ALFVGGFVGLIGAALYPIYFRPMMN-------PEEYKKEQKI-NRAGIKQE 50 (58)
T ss_pred hhhHHHHHHHHHHHHhhhhcccccC-------hHHHHHHHHH-HHhcccHh
Confidence 3455666777888899999999887 2334444443 44555443
No 95
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=54.36 E-value=8.4 Score=39.37 Aligned_cols=21 Identities=38% Similarity=0.366 Sum_probs=17.1
Q ss_pred CChHHHHHHHHHhhhhhhcch
Q psy4798 140 CDTEEVLAVLAHELGHWKYNH 160 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h 160 (623)
.+++.-.-++||||||.-.+.
T Consensus 67 ~~~~r~rFtlAHELGH~llH~ 87 (213)
T COG2856 67 NSLERKRFTLAHELGHALLHT 87 (213)
T ss_pred CCHHHHHHHHHHHHhHHHhcc
Confidence 577788999999999976544
No 96
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=54.25 E-value=6.4 Score=35.49 Aligned_cols=10 Identities=50% Similarity=0.850 Sum_probs=9.5
Q ss_pred HHHHHHhhhh
Q psy4798 526 AVLAHELGHW 535 (623)
Q Consensus 526 AVLAHELGH~ 535 (623)
.+++||+||.
T Consensus 109 ~~~~HEiGH~ 118 (124)
T PF13582_consen 109 DTFAHEIGHN 118 (124)
T ss_dssp THHHHHHHHH
T ss_pred eEeeehhhHh
Confidence 8999999997
No 97
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=53.95 E-value=5.9 Score=38.73 Aligned_cols=17 Identities=35% Similarity=0.343 Sum_probs=13.0
Q ss_pred CChHHHHHHHHHhhhhh
Q psy4798 140 CDTEEVLAVLAHELGHW 156 (623)
Q Consensus 140 l~~~El~AVlAHELGH~ 156 (623)
.+...-.-|+|||+||.
T Consensus 137 ~~~~~~~~~~AHEiGH~ 153 (196)
T PF13688_consen 137 PPTYNGAITFAHEIGHN 153 (196)
T ss_dssp --HHHHHHHHHHHHHHH
T ss_pred CCCCceehhhHHhHHHh
Confidence 34567788999999995
No 98
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=53.04 E-value=18 Score=40.82 Aligned_cols=21 Identities=29% Similarity=0.298 Sum_probs=16.2
Q ss_pred CCCChHHHHHHHHHhhhhhhcc
Q psy4798 138 KGCDTEEVLAVLAHELGHWKYN 159 (623)
Q Consensus 138 ~~l~~~El~AVlAHELGH~k~~ 159 (623)
..|+-+|+. .|+||+||--|+
T Consensus 257 ~ll~~~~V~-TLfHEfGHalH~ 277 (472)
T cd06455 257 SLLRHDEVE-TFFHEFGHVIHH 277 (472)
T ss_pred CCCCHHHHH-HHHHHHHHHHHH
Confidence 347778864 589999998873
No 99
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=52.14 E-value=7.1 Score=38.50 Aligned_cols=16 Identities=38% Similarity=0.557 Sum_probs=13.5
Q ss_pred CHHHHHHHHHHHhhhh
Q psy4798 520 DTEEVLAVLAHELGHW 535 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~ 535 (623)
+......++|||+||.
T Consensus 127 ~~~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 127 SGLSFAVIIAHELGHN 142 (199)
T ss_dssp SHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHh
Confidence 4567888999999996
No 100
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=51.50 E-value=12 Score=37.84 Aligned_cols=14 Identities=29% Similarity=0.342 Sum_probs=11.6
Q ss_pred HHHHHHHHHhhhhh
Q psy4798 523 EVLAVLAHELGHWK 536 (623)
Q Consensus 523 Ei~AVLAHELGH~k 536 (623)
.-..++|||+||--
T Consensus 144 ~~~~~~AHElGH~l 157 (220)
T cd04272 144 YGVYTMTHELAHLL 157 (220)
T ss_pred ccHHHHHHHHHHHh
Confidence 44799999999974
No 101
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=51.31 E-value=7.1 Score=35.19 Aligned_cols=11 Identities=45% Similarity=0.709 Sum_probs=9.7
Q ss_pred HHHHHhhhhhh
Q psy4798 147 AVLAHELGHWK 157 (623)
Q Consensus 147 AVlAHELGH~k 157 (623)
.+++||+||.-
T Consensus 109 ~~~~HEiGH~l 119 (124)
T PF13582_consen 109 DTFAHEIGHNL 119 (124)
T ss_dssp THHHHHHHHHT
T ss_pred eEeeehhhHhc
Confidence 89999999963
No 102
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=51.07 E-value=12 Score=35.11 Aligned_cols=14 Identities=50% Similarity=0.869 Sum_probs=12.0
Q ss_pred HHHHHHHHHHhhhh
Q psy4798 522 EEVLAVLAHELGHW 535 (623)
Q Consensus 522 ~Ei~AVLAHELGH~ 535 (623)
|+|.-|+-||+||.
T Consensus 107 d~vthvliHEIgHh 120 (136)
T COG3824 107 DQVTHVLIHEIGHH 120 (136)
T ss_pred hHhhhhhhhhhhhh
Confidence 56778999999996
No 103
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=50.24 E-value=14 Score=43.17 Aligned_cols=44 Identities=39% Similarity=0.602 Sum_probs=30.0
Q ss_pred CCCCCCCceeeecCCC--CeEEEEe-cchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcc
Q psy4798 83 SKRSEHSNAYFYGFFK--NKRIVLF-DTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYN 159 (623)
Q Consensus 83 s~~~~~~NA~~~G~~~--~krIvl~-d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~ 159 (623)
.++.+.+=||+.|... ..+|.+. ++-+.+ ..+||||+||-.|.
T Consensus 349 ~~~gKrsGaYs~~~~~~~~p~IlmN~~gt~~d----------------------------------V~TLaHElGHs~Hs 394 (598)
T COG1164 349 PRKGKRSGAYSIGFYKGDHPFILMNYDGTLRD----------------------------------VFTLAHELGHSVHS 394 (598)
T ss_pred CCCCCCCCcccCCCCCCCCCeEEEeCCCchhH----------------------------------HHHHHHHccHHHHH
Confidence 4456788999999976 3355554 332222 46899999998876
Q ss_pred h
Q psy4798 160 H 160 (623)
Q Consensus 160 h 160 (623)
-
T Consensus 395 ~ 395 (598)
T COG1164 395 Y 395 (598)
T ss_pred H
Confidence 3
No 104
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=50.21 E-value=7.9 Score=37.82 Aligned_cols=13 Identities=46% Similarity=0.641 Sum_probs=9.9
Q ss_pred HHHHHHHHhhhhh
Q psy4798 524 VLAVLAHELGHWK 536 (623)
Q Consensus 524 i~AVLAHELGH~k 536 (623)
-.-++|||+||--
T Consensus 111 ~~~~~aHElGH~l 123 (173)
T PF13574_consen 111 GIDTFAHELGHQL 123 (173)
T ss_dssp HHHHHHHHHHHHH
T ss_pred eeeeehhhhHhhc
Confidence 3445999999973
No 105
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=49.67 E-value=11 Score=43.29 Aligned_cols=21 Identities=29% Similarity=0.282 Sum_probs=14.7
Q ss_pred CHHHHHHHHHHHhhhhhccCHH
Q psy4798 520 DTEEVLAVLAHELGHWKYNHVL 541 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~k~~H~~ 541 (623)
+.+++ .+|+||+||--|....
T Consensus 334 t~~dv-~TL~HElGHa~H~~~s 354 (549)
T TIGR02289 334 TSGDI-DVLTHEAGHAFHVYES 354 (549)
T ss_pred ChhHH-HHHHHHhhHHHHHHHh
Confidence 33444 5799999999876543
No 106
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=49.47 E-value=9.7 Score=37.56 Aligned_cols=16 Identities=38% Similarity=0.557 Sum_probs=12.8
Q ss_pred ChHHHHHHHHHhhhhh
Q psy4798 141 DTEEVLAVLAHELGHW 156 (623)
Q Consensus 141 ~~~El~AVlAHELGH~ 156 (623)
+......++|||+||.
T Consensus 127 ~~~~~a~~~AHelGH~ 142 (199)
T PF01421_consen 127 SGLSFAVIIAHELGHN 142 (199)
T ss_dssp SHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHh
Confidence 3456778999999995
No 107
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=49.31 E-value=16 Score=39.97 Aligned_cols=18 Identities=28% Similarity=0.481 Sum_probs=12.9
Q ss_pred CCCCCCCceeeecCCCCe
Q psy4798 83 SKRSEHSNAYFYGFFKNK 100 (623)
Q Consensus 83 s~~~~~~NA~~~G~~~~k 100 (623)
+++.+.+-||.+|+.+.+
T Consensus 191 ~r~gK~~gaf~~~~~~~~ 208 (427)
T cd06459 191 PRKGKRSGAYCTGLPPGK 208 (427)
T ss_pred CCCCCCCCeecCCCCCCC
Confidence 556677789999987433
No 108
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=49.29 E-value=7.1 Score=38.16 Aligned_cols=15 Identities=40% Similarity=0.395 Sum_probs=12.2
Q ss_pred HHHHHHHHHhhhhhh
Q psy4798 143 EEVLAVLAHELGHWK 157 (623)
Q Consensus 143 ~El~AVlAHELGH~k 157 (623)
-+...++|||+||+-
T Consensus 131 ~~~~~~~aHElGH~l 145 (192)
T cd04267 131 LLTALTMAHELGHNL 145 (192)
T ss_pred eeehhhhhhhHHhhc
Confidence 355679999999985
No 109
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=49.21 E-value=8.7 Score=36.53 Aligned_cols=15 Identities=40% Similarity=0.720 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHhhhh
Q psy4798 521 TEEVLAVLAHELGHW 535 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~ 535 (623)
...+.+|+.|||||.
T Consensus 104 ~~~~~~~~~HEiGHa 118 (157)
T cd04278 104 GTDLFSVAAHEIGHA 118 (157)
T ss_pred cchHHHHHHHHhccc
Confidence 356999999999997
No 110
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=48.43 E-value=23 Score=39.57 Aligned_cols=62 Identities=18% Similarity=0.223 Sum_probs=32.2
Q ss_pred CeEEEEeCCCCCCcccEEEeccCCC--------ceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHH
Q psy4798 454 KKLYVVEGSKRSEHSNAYFYGFFKN--------KRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVL 525 (623)
Q Consensus 454 ~~v~vv~~s~rs~~~NAy~~G~~~~--------krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~ 525 (623)
..+|+=--++..+.+.||.+++.+. ...++.-+.-.. .. ..-..++.+++
T Consensus 186 g~~ylDl~~R~gK~~ga~~~~~~~~~~~~~~~~~~~~i~~n~~~~----------~~-----------~~~~ll~~~~v- 243 (458)
T PF01432_consen 186 GYIYLDLYPRPGKRSGAFCFTLRPSRSDGERQLPVPYIFCNFTGP----------SA-----------GKPSLLSHDDV- 243 (458)
T ss_dssp EEEEEEES--TTS-SS-EEEEEEC-BTTSTCECEEEEEEEEE-S-----------BT-----------TC--B-SHHHH-
T ss_pred hcccccchhcCCCCCCceeCCccCccccccCCCCceEEEecCCCC----------CC-----------CCCCccChhhH-
Confidence 5555422333345678999988765 444454443321 00 11122688998
Q ss_pred HHHHHHhhhhhc
Q psy4798 526 AVLAHELGHWKY 537 (623)
Q Consensus 526 AVLAHELGH~k~ 537 (623)
-.|+||+||--|
T Consensus 244 ~tLfHE~GHa~H 255 (458)
T PF01432_consen 244 ETLFHEFGHAMH 255 (458)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHhHHHH
Confidence 779999999855
No 111
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=48.04 E-value=9.5 Score=36.25 Aligned_cols=16 Identities=38% Similarity=0.677 Sum_probs=13.4
Q ss_pred ChHHHHHHHHHhhhhh
Q psy4798 141 DTEEVLAVLAHELGHW 156 (623)
Q Consensus 141 ~~~El~AVlAHELGH~ 156 (623)
....+.+|+.|||||.
T Consensus 103 ~~~~~~~~~~HEiGHa 118 (157)
T cd04278 103 GGTDLFSVAAHEIGHA 118 (157)
T ss_pred ccchHHHHHHHHhccc
Confidence 3456999999999996
No 112
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=47.93 E-value=15 Score=37.10 Aligned_cols=20 Identities=15% Similarity=-0.038 Sum_probs=14.3
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy4798 6 EFQIFYGIIGFSWIVFLFEF 25 (623)
Q Consensus 6 ~~~i~~~~~~~~~~~~~~~~ 25 (623)
++.+-|.+..++.+...+..
T Consensus 21 ~~~~~yv~~~~n~vn~~Y~~ 40 (220)
T cd04272 21 EQLIRYLAVMVNAANLRYRD 40 (220)
T ss_pred HHHHHHHHHHHHHHHHHhhh
Confidence 45677778888877776664
No 113
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=47.30 E-value=1.6e+02 Score=30.23 Aligned_cols=83 Identities=16% Similarity=0.153 Sum_probs=44.2
Q ss_pred hhhhhccCCcCCchh--H-HHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce-EEEehhHHhhccCCCccc
Q psy4798 423 IAPLFDKYTPLPDGE--L-KSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR-IVLFDTLLKDYVPLNADK 498 (623)
Q Consensus 423 I~plf~k~~pl~~~~--L-~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr-IVL~dtLl~~~~~~~~~~ 498 (623)
+..-|+||...+... - .+--..+.+..|+---+|-.++|.- +.|- -+++| +-|++.-...
T Consensus 24 vks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~L-TDHY-------DP~~kvvrLSe~~y~g-------- 87 (226)
T COG2738 24 VKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTL-TDHY-------DPRRKVVRLSEANYYG-------- 87 (226)
T ss_pred HHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCc-cccc-------ChhhheeeccccccCC--------
Confidence 344567777665321 1 1222234667777655666666532 3331 23334 4455544321
Q ss_pred cCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHH
Q psy4798 499 KDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK 542 (623)
Q Consensus 499 ~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k 542 (623)
-+ =.=.||-|||.||--+.+.-.
T Consensus 88 --------------------~S-ia~~aVAAHEVGHAiQd~~~Y 110 (226)
T COG2738 88 --------------------PS-IAAIAVAAHEVGHAIQDQEDY 110 (226)
T ss_pred --------------------cc-HHHHHHHHHHhhHHHhhhccc
Confidence 23 345799999999976655433
No 114
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=47.29 E-value=9.1 Score=36.26 Aligned_cols=10 Identities=60% Similarity=0.936 Sum_probs=8.9
Q ss_pred HHHHHhhhhh
Q psy4798 147 AVLAHELGHW 156 (623)
Q Consensus 147 AVlAHELGH~ 156 (623)
-|.|||+||+
T Consensus 79 RIaaHE~GHi 88 (132)
T PF02031_consen 79 RIAAHELGHI 88 (132)
T ss_dssp HHHHHHHHHH
T ss_pred eeeeehhccc
Confidence 3999999996
No 115
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=46.66 E-value=9.1 Score=37.40 Aligned_cols=11 Identities=55% Similarity=0.921 Sum_probs=8.9
Q ss_pred HHHHHHhhhhh
Q psy4798 146 LAVLAHELGHW 156 (623)
Q Consensus 146 ~AVlAHELGH~ 156 (623)
.-++|||+||-
T Consensus 112 ~~~~aHElGH~ 122 (173)
T PF13574_consen 112 IDTFAHELGHQ 122 (173)
T ss_dssp HHHHHHHHHHH
T ss_pred eeeehhhhHhh
Confidence 34599999995
No 116
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=46.65 E-value=14 Score=40.44 Aligned_cols=20 Identities=35% Similarity=0.426 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHhhhhhccCHH
Q psy4798 521 TEEVLAVLAHELGHWKYNHVL 541 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~k~~H~~ 541 (623)
.++ +-.|+||+||--|....
T Consensus 220 ~~~-v~tl~HE~GHa~h~~~~ 239 (427)
T cd06459 220 LDD-VFTLAHELGHAFHSYLS 239 (427)
T ss_pred hhh-HHHHHHHhhHHHHHHHH
Confidence 344 56799999997665443
No 117
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=46.26 E-value=16 Score=42.22 Aligned_cols=17 Identities=41% Similarity=0.430 Sum_probs=13.3
Q ss_pred HHHHHHHhhhhhccCHH
Q psy4798 525 LAVLAHELGHWKYNHVL 541 (623)
Q Consensus 525 ~AVLAHELGH~k~~H~~ 541 (623)
+.+||||+||--|+...
T Consensus 379 v~TLaHElGHa~H~~~~ 395 (591)
T TIGR00181 379 VFTLAHELGHSMHSYFS 395 (591)
T ss_pred HHHHHHHhhhHHHHHHH
Confidence 56899999999766543
No 118
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=45.46 E-value=15 Score=42.52 Aligned_cols=16 Identities=19% Similarity=0.401 Sum_probs=12.4
Q ss_pred CCCCCCCceeeecCCC
Q psy4798 83 SKRSEHSNAYFYGFFK 98 (623)
Q Consensus 83 s~~~~~~NA~~~G~~~ 98 (623)
+++.+.+-||.+|+.+
T Consensus 344 ~r~gK~~Ga~~~~~~~ 359 (587)
T TIGR02290 344 PRPGKRGGAFCTGFPP 359 (587)
T ss_pred CCCCCCCCcccCCCCC
Confidence 5566788899999765
No 119
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=44.50 E-value=22 Score=35.34 Aligned_cols=40 Identities=28% Similarity=0.225 Sum_probs=31.3
Q ss_pred eeecCCC-CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchH
Q psy4798 92 YFYGFFK-NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHV 161 (623)
Q Consensus 92 ~~~G~~~-~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~ 161 (623)
..-|+.+ .+.|+|..+-+. +..+++-+|+|||-|...+=.
T Consensus 47 ~~Ggf~p~~~~I~lC~N~~~------------------------------~~~~l~~~l~HELIHayD~cr 87 (173)
T PF09768_consen 47 VSGGFDPSKKGIVLCQNRIR------------------------------SQGHLEDTLTHELIHAYDHCR 87 (173)
T ss_pred CcCCccCCCCCEEEeeCCCC------------------------------CHHHHHHHHHHHHHHHHHHHh
Confidence 5567766 788999977665 678999999999999865433
No 120
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=44.09 E-value=11 Score=36.51 Aligned_cols=11 Identities=64% Similarity=0.782 Sum_probs=9.0
Q ss_pred HHHHHHhhhhh
Q psy4798 146 LAVLAHELGHW 156 (623)
Q Consensus 146 ~AVlAHELGH~ 156 (623)
.=|+||||+|=
T Consensus 136 lDVvaHEltHG 146 (150)
T PF01447_consen 136 LDVVAHELTHG 146 (150)
T ss_dssp HHHHHHHHHHH
T ss_pred cceeeeccccc
Confidence 34999999994
No 121
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=43.78 E-value=15 Score=43.05 Aligned_cols=45 Identities=29% Similarity=0.338 Sum_probs=29.3
Q ss_pred CcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHH
Q psy4798 466 EHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVL 541 (623)
Q Consensus 466 ~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~ 541 (623)
+.+=||..|+.+.....++-+--.. -.=+.+||||+||-.|....
T Consensus 353 KrsGaYs~~~~~~~~p~IlmN~~gt-------------------------------~~dV~TLaHElGHs~Hs~~s 397 (598)
T COG1164 353 KRSGAYSIGFYKGDHPFILMNYDGT-------------------------------LRDVFTLAHELGHSVHSYFS 397 (598)
T ss_pred CCCCcccCCCCCCCCCeEEEeCCCc-------------------------------hhHHHHHHHHccHHHHHHHH
Confidence 4588999998764444443332221 22367899999999887544
No 122
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=43.77 E-value=38 Score=34.45 Aligned_cols=20 Identities=35% Similarity=0.385 Sum_probs=14.8
Q ss_pred HHHHHHHhhhhhhcchHHHH
Q psy4798 145 VLAVLAHELGHWKYNHVLKS 164 (623)
Q Consensus 145 l~AVlAHELGH~k~~h~~k~ 164 (623)
=.||-|||.||--+.+--..
T Consensus 92 ~~aVAAHEVGHAiQd~~~Y~ 111 (226)
T COG2738 92 AIAVAAHEVGHAIQDQEDYA 111 (226)
T ss_pred HHHHHHHHhhHHHhhhcccH
Confidence 46899999999766554433
No 123
>PRK04860 hypothetical protein; Provisional
Probab=43.77 E-value=16 Score=35.83 Aligned_cols=19 Identities=26% Similarity=0.282 Sum_probs=16.1
Q ss_pred CChHHHHHHHHHhhhhhhc
Q psy4798 140 CDTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~ 158 (623)
-+.+++..||+||++|+.-
T Consensus 58 ~~~~~l~~~v~HEl~H~~~ 76 (160)
T PRK04860 58 NQQAFIDEVVPHELAHLLV 76 (160)
T ss_pred CcHHHHHhHHHHHHHHHHH
Confidence 3678999999999999743
No 124
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=43.32 E-value=12 Score=36.19 Aligned_cols=17 Identities=29% Similarity=0.503 Sum_probs=9.6
Q ss_pred CcccEEEeccCCCceEEEehh
Q psy4798 466 EHSNAYFYGFFKNKRIVLFDT 486 (623)
Q Consensus 466 ~~~NAy~~G~~~~krIVL~dt 486 (623)
...|||--| .+++.-|+
T Consensus 108 ~~~NAfW~g----~~m~yGdG 124 (150)
T PF01447_consen 108 NYNNAFWNG----SQMVYGDG 124 (150)
T ss_dssp STT-EEE-S----SSEEEE--
T ss_pred CccCccccC----CEEEEECC
Confidence 468999654 56888775
No 125
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=42.44 E-value=13 Score=35.23 Aligned_cols=9 Identities=67% Similarity=1.062 Sum_probs=8.6
Q ss_pred HHHHHhhhh
Q psy4798 527 VLAHELGHW 535 (623)
Q Consensus 527 VLAHELGH~ 535 (623)
|.+||+||.
T Consensus 80 IaaHE~GHi 88 (132)
T PF02031_consen 80 IAAHELGHI 88 (132)
T ss_dssp HHHHHHHHH
T ss_pred eeeehhccc
Confidence 899999998
No 126
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=41.08 E-value=12 Score=35.38 Aligned_cols=16 Identities=50% Similarity=0.590 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHhhhhh
Q psy4798 521 TEEVLAVLAHELGHWK 536 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~k 536 (623)
..+...+++||+||.-
T Consensus 93 ~~~~~~~~~HElGH~L 108 (167)
T cd00203 93 TKEGAQTIAHELGHAL 108 (167)
T ss_pred cccchhhHHHHHHHHh
Confidence 3578999999999984
No 127
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=40.82 E-value=17 Score=35.32 Aligned_cols=17 Identities=29% Similarity=0.433 Sum_probs=15.4
Q ss_pred CChHHHHHHHHHhhhhh
Q psy4798 140 CDTEEVLAVLAHELGHW 156 (623)
Q Consensus 140 l~~~El~AVlAHELGH~ 156 (623)
..+|.+.-|+.|||+|.
T Consensus 56 n~~~f~~~vV~HELaHl 72 (156)
T COG3091 56 NGEDFIEQVVPHELAHL 72 (156)
T ss_pred ccHHHHHHHHHHHHHHH
Confidence 57889999999999996
No 128
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=40.65 E-value=84 Score=31.27 Aligned_cols=19 Identities=42% Similarity=0.592 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHhhhh-hccC
Q psy4798 521 TEEVLAVLAHELGHW-KYNH 539 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~-k~~H 539 (623)
.+-+.-...||+||. -..|
T Consensus 122 ~~R~~k~~~HElGH~lGL~H 141 (179)
T PRK13267 122 EERVRKEVTHELGHTLGLEH 141 (179)
T ss_pred HHHHHHHHHHHHHHHcCCcc
Confidence 345666689999998 4455
No 129
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=40.62 E-value=46 Score=34.26 Aligned_cols=57 Identities=25% Similarity=0.366 Sum_probs=32.7
Q ss_pred CcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhh
Q psy4798 466 EHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHW 535 (623)
Q Consensus 466 ~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~ 535 (623)
+..|||.-. .++.|.+.-.++.... +.-++.+.-++ +.++ -..+-++.++-||+||-
T Consensus 47 Ge~nA~ydP--e~~~I~iCYEf~~~~~--~~f~~~~~~~~--------~~~~-~~~~~~~~~l~HE~GHA 103 (220)
T PF14247_consen 47 GEDNAFYDP--ENRSITICYEFVDEIL--DRFAKANDPDE--------EYGQ-AAIGNVLFTLYHELGHA 103 (220)
T ss_pred CCCCCccCC--CCCEEEECHHHHHHHH--HHHHhCCcCcc--------hHHH-HHHHHHHHHHHHHHHHH
Confidence 458999652 4567998887776531 11111111000 0000 15567899999999997
No 130
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=39.84 E-value=10 Score=38.81 Aligned_cols=11 Identities=55% Similarity=0.782 Sum_probs=9.7
Q ss_pred HHHHHHhhhhh
Q psy4798 526 AVLAHELGHWK 536 (623)
Q Consensus 526 AVLAHELGH~k 536 (623)
.++||||||--
T Consensus 147 ~t~AHElGHnL 157 (228)
T cd04271 147 QVFAHEIGHTF 157 (228)
T ss_pred eehhhhhhhhc
Confidence 59999999974
No 131
>TIGR00782 ccoP cytochrome c oxidase, cbb3-type, subunit III. This model describes a di-heme subunit of approximately 26 kDa of the cbb3 type copper and heme-containing cytochrome oxidase.
Probab=39.70 E-value=44 Score=35.28 Aligned_cols=33 Identities=18% Similarity=0.311 Sum_probs=25.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccC
Q psy4798 396 GGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKY 430 (623)
Q Consensus 396 ~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~ 430 (623)
.-+.||+|++...++++++.++.||.+ |.++.+
T Consensus 25 ~~P~ww~~~f~~~i~~~~~y~~~yp~~--~~~~~~ 57 (285)
T TIGR00782 25 PLPRWWLWTFYATIVWGFGYLVAYPAW--PLVSGA 57 (285)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHhccc--CCcccc
Confidence 346799999999999999998888854 555444
No 132
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=39.35 E-value=12 Score=35.23 Aligned_cols=16 Identities=50% Similarity=0.590 Sum_probs=13.2
Q ss_pred hHHHHHHHHHhhhhhh
Q psy4798 142 TEEVLAVLAHELGHWK 157 (623)
Q Consensus 142 ~~El~AVlAHELGH~k 157 (623)
..+...+++|||||.-
T Consensus 93 ~~~~~~~~~HElGH~L 108 (167)
T cd00203 93 TKEGAQTIAHELGHAL 108 (167)
T ss_pred cccchhhHHHHHHHHh
Confidence 4568899999999963
No 133
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=38.36 E-value=42 Score=37.45 Aligned_cols=73 Identities=21% Similarity=0.263 Sum_probs=44.1
Q ss_pred CcCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccC-CCceEEEehhHHhhccCCCccccCCCCCCCccc
Q psy4798 431 TPLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFF-KNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLI 509 (623)
Q Consensus 431 ~pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~-~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~ 509 (623)
.+.+...-++.-+++++.+|++..+-. +|.| ++.|..|++ ...|| +..-=+
T Consensus 102 ~~~~~~~Q~~~~~~~~~~~g~df~~gr-iD~s-----~hpF~~~~~~~dvRI--tt~y~~-------------------- 153 (396)
T cd06460 102 GPFPEEKQEALGRELLEALGFDFDRGR-LDVS-----AHPFTGGLGPGDVRI--TTRYDE-------------------- 153 (396)
T ss_pred CCCCHHHHHHHHHHHHHHhCCcccCCe-eecC-----CCCCCCCCCCCCceE--EeeeCC--------------------
Confidence 455556666667788999999875543 4544 345887874 33444 221111
Q ss_pred cccccCCCCCCHHHHHHHHHHHhhhhhccCH
Q psy4798 510 STEGANKKGCDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 510 ~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~ 540 (623)
-+.-.=+--++||+||-.|.+.
T Consensus 154 ---------~d~~~~l~t~iHE~GHalye~~ 175 (396)
T cd06460 154 ---------NDFRSALFSTIHETGHALYEQG 175 (396)
T ss_pred ---------cchHHHHHHHHHHhhHHHHHhc
Confidence 1223334567999999998884
No 134
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=38.29 E-value=44 Score=36.00 Aligned_cols=60 Identities=13% Similarity=0.165 Sum_probs=34.1
Q ss_pred hccc-CCCCCCCceeeecCCC-----CeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHh
Q psy4798 79 NTVS-SKRSEHSNAYFYGFFK-----NKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHE 152 (623)
Q Consensus 79 ~~~~-s~~~~~~NA~~~G~~~-----~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHE 152 (623)
-+|- +++.+.+.||.+++.+ .+.+.++-+.-.. ..|. -.-++.+++. .+.||
T Consensus 104 ~lD~~~R~gK~~~a~~~~~~~~~~~~~~~~~i~~n~~~~----~~~~-----------------~~ll~~~~v~-tl~HE 161 (365)
T cd06258 104 YLDLYDRKGKYPHGFCTGLDPGFNRQDKDVRILANFTSP----AAPD-----------------PVLLGHDDIN-TLFHE 161 (365)
T ss_pred hccCCCCCCCCCCCeeccccCCCCCCCCeEEEEccCCCC----CCCC-----------------CCcCCHHHHH-HHHHH
Confidence 4444 4445778999999863 3444444443321 1111 1225666664 58999
Q ss_pred hhhhhcch
Q psy4798 153 LGHWKYNH 160 (623)
Q Consensus 153 LGH~k~~h 160 (623)
+||.-|.-
T Consensus 162 ~GHa~h~~ 169 (365)
T cd06258 162 FGHAVHFL 169 (365)
T ss_pred HhHHHHHH
Confidence 99987753
No 135
>KOG1047|consensus
Probab=38.08 E-value=41 Score=39.01 Aligned_cols=60 Identities=23% Similarity=0.201 Sum_probs=38.6
Q ss_pred HHHHHHHcCCC----CCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCC
Q psy4798 442 IEQLSASVKFP----LKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKK 517 (623)
Q Consensus 442 I~~la~~~gfp----~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 517 (623)
+.+.|+++.-| .-++.|+..|.. +-|+-+..--.++-|||.
T Consensus 239 ~L~~Ae~l~GpY~WgryDllvlPpSFP-------~gGMENPcltF~TpTlla---------------------------- 283 (613)
T KOG1047|consen 239 FLKAAEKLFGPYVWGRYDLLVLPPSFP-------FGGMENPCLTFVTPTLLA---------------------------- 283 (613)
T ss_pred HHHHHHHHcCCcccccceEEEecCCCC-------cccccCcceeeecchhhc----------------------------
Confidence 33445566555 357888887653 234433333568889984
Q ss_pred CCCHHHHHHHHHHHhhhhhcc
Q psy4798 518 GCDTEEVLAVLAHELGHWKYN 538 (623)
Q Consensus 518 gl~~~Ei~AVLAHELGH~k~~ 538 (623)
.+.-+.-|+|||++|---|
T Consensus 284 --GDrsl~~vIaHEIAHSWtG 302 (613)
T KOG1047|consen 284 --GDRSLVDVIAHEIAHSWTG 302 (613)
T ss_pred --CCcchhhHHHHHhhhhhcc
Confidence 3456689999999996544
No 136
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=37.52 E-value=12 Score=38.44 Aligned_cols=19 Identities=16% Similarity=-0.096 Sum_probs=12.4
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy4798 6 EFQIFYGIIGFSWIVFLFE 24 (623)
Q Consensus 6 ~~~i~~~~~~~~~~~~~~~ 24 (623)
+...-+.+..++.+.-++.
T Consensus 21 ~~~~~~v~~ivN~vn~~Y~ 39 (228)
T cd04271 21 EEARRNILNNVNSASQLYE 39 (228)
T ss_pred HHHHHHHHHHHHHHHHHhh
Confidence 4555566677777776666
No 137
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=37.03 E-value=15 Score=34.04 Aligned_cols=10 Identities=70% Similarity=0.923 Sum_probs=9.1
Q ss_pred HHHHHHhhhh
Q psy4798 526 AVLAHELGHW 535 (623)
Q Consensus 526 AVLAHELGH~ 535 (623)
+|++||+||.
T Consensus 88 ~~~~HEigHa 97 (140)
T smart00235 88 GVAAHELGHA 97 (140)
T ss_pred ccHHHHHHHH
Confidence 5999999997
No 138
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=36.86 E-value=92 Score=30.96 Aligned_cols=16 Identities=31% Similarity=0.312 Sum_probs=15.1
Q ss_pred CHHHHHHHHHHHhhhh
Q psy4798 520 DTEEVLAVLAHELGHW 535 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~ 535 (623)
+.+++.-+|+|||=|.
T Consensus 67 ~~~~l~~~l~HELIHa 82 (173)
T PF09768_consen 67 SQGHLEDTLTHELIHA 82 (173)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 6799999999999998
No 139
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=36.79 E-value=16 Score=36.77 Aligned_cols=16 Identities=38% Similarity=0.603 Sum_probs=11.2
Q ss_pred HHHHHHhhhhh-ccCHH
Q psy4798 526 AVLAHELGHWK-YNHVL 541 (623)
Q Consensus 526 AVLAHELGH~k-~~H~~ 541 (623)
-++|||+||-- .+|..
T Consensus 139 ~~~aHEiGH~lGl~H~~ 155 (206)
T PF13583_consen 139 QTFAHEIGHNLGLRHDF 155 (206)
T ss_pred hHHHHHHHHHhcCCCCc
Confidence 45999999974 34443
No 140
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=36.76 E-value=20 Score=41.50 Aligned_cols=15 Identities=47% Similarity=0.643 Sum_probs=12.0
Q ss_pred HHHHHHHhhhhhccC
Q psy4798 525 LAVLAHELGHWKYNH 539 (623)
Q Consensus 525 ~AVLAHELGH~k~~H 539 (623)
+..|+||+||--|..
T Consensus 376 v~TL~HE~GHa~H~~ 390 (587)
T TIGR02290 376 VSTLAHELGHAYHSE 390 (587)
T ss_pred HHHHHHHhhHHHHHH
Confidence 356999999998754
No 141
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=36.59 E-value=46 Score=37.26 Aligned_cols=22 Identities=32% Similarity=0.300 Sum_probs=16.7
Q ss_pred CCCChHHHHHHHHHhhhhhhcch
Q psy4798 138 KGCDTEEVLAVLAHELGHWKYNH 160 (623)
Q Consensus 138 ~~l~~~El~AVlAHELGH~k~~h 160 (623)
+.|+-+|+.. |.||+||--|+-
T Consensus 202 ~lL~~~~v~t-LfHEfGHalH~~ 223 (422)
T cd06456 202 ALLTHDEVTT-LFHEFGHALHHL 223 (422)
T ss_pred CccCHHHHHH-HHHHHHHHHHHH
Confidence 3478888764 889999987753
No 142
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=35.56 E-value=26 Score=32.63 Aligned_cols=21 Identities=33% Similarity=0.301 Sum_probs=18.3
Q ss_pred CCCCChHHHHHHHHHhhhhhh
Q psy4798 137 KKGCDTEEVLAVLAHELGHWK 157 (623)
Q Consensus 137 ~~~l~~~El~AVlAHELGH~k 157 (623)
-++|+-+|-.-|++||+.|+-
T Consensus 72 F~rLs~~ekvKviiHEllHIP 92 (133)
T COG4900 72 FKRLSCAEKVKVIIHELLHIP 92 (133)
T ss_pred cCCCChHHHHHHHHHHHhcCc
Confidence 346899999999999999985
No 143
>COG1054 Predicted sulfurtransferase [General function prediction only]
Probab=34.52 E-value=74 Score=34.33 Aligned_cols=45 Identities=16% Similarity=0.285 Sum_probs=37.4
Q ss_pred hhhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798 425 PLFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF 475 (623)
Q Consensus 425 plf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~ 475 (623)
-.+.+|+|++++ ++++.+.++++++|+. .+|++.+. ..|+.+.|-
T Consensus 8 la~Y~f~~i~dp~~~~~~l~~~~~~~~vk-GrillA~E-----GINgtvsG~ 53 (308)
T COG1054 8 LAYYKFVPIEDPEALRDPLLALCKALGVK-GRILLAHE-----GINGTVSGS 53 (308)
T ss_pred EEEEEEEecCCHHHHHHHHHHHHHHcCce-eEEEEccC-----CcceeEecC
Confidence 356789999985 5999999999999986 78998774 479999884
No 144
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=34.35 E-value=23 Score=34.48 Aligned_cols=12 Identities=42% Similarity=0.888 Sum_probs=10.6
Q ss_pred HHHHHHHHhhhh
Q psy4798 524 VLAVLAHELGHW 535 (623)
Q Consensus 524 i~AVLAHELGH~ 535 (623)
+.+++.||+||.
T Consensus 7 ~i~i~~HE~gH~ 18 (192)
T PF02163_consen 7 LISIVLHELGHA 18 (192)
T ss_dssp HHHHHHHHHHHH
T ss_pred cccccccccccc
Confidence 568999999997
No 145
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=34.16 E-value=18 Score=36.44 Aligned_cols=11 Identities=45% Similarity=0.712 Sum_probs=8.8
Q ss_pred HHHHHhhhhhh
Q psy4798 147 AVLAHELGHWK 157 (623)
Q Consensus 147 AVlAHELGH~k 157 (623)
-++|||+||.-
T Consensus 139 ~~~aHEiGH~l 149 (206)
T PF13583_consen 139 QTFAHEIGHNL 149 (206)
T ss_pred hHHHHHHHHHh
Confidence 45999999963
No 146
>KOG3314|consensus
Probab=33.17 E-value=42 Score=33.23 Aligned_cols=39 Identities=31% Similarity=0.301 Sum_probs=31.8
Q ss_pred cCCCCeEEEEecchhhhccCCccccCCCCCCCCcccccccccCCCCChHHHHHHHHHhhhhhhcchHHH
Q psy4798 95 GFFKNKRIVLFDTLLKDYVPLNADKKDKSGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK 163 (623)
Q Consensus 95 G~~~~krIvl~d~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~El~AVlAHELGH~k~~h~~k 163 (623)
|+.+.+.|||..+-|. +.+++.-|++|||-|.-..+-.+
T Consensus 71 Gy~~~~gIvlCqN~l~------------------------------~q~h~n~vv~HElIH~fDd~r~~ 109 (194)
T KOG3314|consen 71 GYTPGRGIVLCQNRLT------------------------------IQDHVNQVVIHELIHAFDDCRAK 109 (194)
T ss_pred CccCCCceEEeccccc------------------------------hHHHHHHHHHHHHHHHHHhhhhh
Confidence 5677788999988665 56899999999999987776655
No 147
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=32.79 E-value=23 Score=33.30 Aligned_cols=15 Identities=47% Similarity=0.738 Sum_probs=12.3
Q ss_pred HHHHHHHHHhhhhhh
Q psy4798 143 EEVLAVLAHELGHWK 157 (623)
Q Consensus 143 ~El~AVlAHELGH~k 157 (623)
|++.-|+-||+||.-
T Consensus 107 d~vthvliHEIgHhF 121 (136)
T COG3824 107 DQVTHVLIHEIGHHF 121 (136)
T ss_pred hHhhhhhhhhhhhhc
Confidence 567889999999963
No 148
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=32.43 E-value=82 Score=33.91 Aligned_cols=21 Identities=24% Similarity=0.303 Sum_probs=15.3
Q ss_pred CCHHHHHHHHHHHhhhhhccCH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~ 540 (623)
++.+++. .+.||+||..|...
T Consensus 150 l~~~~v~-tl~HE~GHa~h~~l 170 (365)
T cd06258 150 LGHDDIN-TLFHEFGHAVHFLL 170 (365)
T ss_pred CCHHHHH-HHHHHHhHHHHHHH
Confidence 4666664 58899999977543
No 149
>KOG3607|consensus
Probab=32.32 E-value=36 Score=40.85 Aligned_cols=16 Identities=50% Similarity=0.491 Sum_probs=13.9
Q ss_pred CHHHHHHHHHHHhhhh
Q psy4798 520 DTEEVLAVLAHELGHW 535 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~ 535 (623)
+......|+||||||.
T Consensus 319 ~~~~~a~v~AhelgH~ 334 (716)
T KOG3607|consen 319 ILLAFAVVLAHELGHN 334 (716)
T ss_pred cchhHHHHHHHHHHhh
Confidence 4577899999999997
No 150
>PF14891 Peptidase_M91: Effector protein
Probab=31.66 E-value=24 Score=34.54 Aligned_cols=18 Identities=33% Similarity=0.235 Sum_probs=14.3
Q ss_pred ChHHHHHHHHHhhhhhhc
Q psy4798 141 DTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 141 ~~~El~AVlAHELGH~k~ 158 (623)
.+.+=..||+|||+|..+
T Consensus 99 ~~~~p~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 99 SPRPPFVVLYHELIHAYD 116 (174)
T ss_pred hhHHHHHHHHHHHHHHHH
Confidence 345667899999999865
No 151
>COG2928 Uncharacterized conserved protein [Function unknown]
Probab=31.64 E-value=5.2e+02 Score=26.88 Aligned_cols=35 Identities=20% Similarity=0.254 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798 437 ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF 475 (623)
Q Consensus 437 ~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~ 475 (623)
..++.++.+.++-+=.-+++..++=.. .+.|..||
T Consensus 103 s~kqi~etll~~~~~sfk~vvlVefP~----~G~~~i~f 137 (222)
T COG2928 103 SAKQVVETLLSDQSGSFKQVVLVEFPR----RGIWAIAF 137 (222)
T ss_pred HHHHHHHHHHhcCCccceeeEEEECCC----CCcEEEEE
Confidence 345555555555443557787787443 34444444
No 152
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=31.41 E-value=20 Score=33.08 Aligned_cols=10 Identities=70% Similarity=0.923 Sum_probs=9.0
Q ss_pred HHHHHhhhhh
Q psy4798 147 AVLAHELGHW 156 (623)
Q Consensus 147 AVlAHELGH~ 156 (623)
+|++||+||.
T Consensus 88 ~~~~HEigHa 97 (140)
T smart00235 88 GVAAHELGHA 97 (140)
T ss_pred ccHHHHHHHH
Confidence 4999999996
No 153
>PF14891 Peptidase_M91: Effector protein
Probab=31.18 E-value=33 Score=33.54 Aligned_cols=18 Identities=33% Similarity=0.235 Sum_probs=14.5
Q ss_pred CHHHHHHHHHHHhhhhhc
Q psy4798 520 DTEEVLAVLAHELGHWKY 537 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~k~ 537 (623)
.+.+=..||+|||+|..+
T Consensus 99 ~~~~p~v~L~HEL~HA~~ 116 (174)
T PF14891_consen 99 SPRPPFVVLYHELIHAYD 116 (174)
T ss_pred hhHHHHHHHHHHHHHHHH
Confidence 456778899999999853
No 154
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=30.69 E-value=61 Score=36.33 Aligned_cols=21 Identities=33% Similarity=0.409 Sum_probs=16.1
Q ss_pred CCHHHHHHHHHHHhhhhhccCH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~ 540 (623)
++.+|+. .|.||+||--|...
T Consensus 204 L~~~~v~-tLfHEfGHalH~~l 224 (422)
T cd06456 204 LTHDEVT-TLFHEFGHALHHLL 224 (422)
T ss_pred cCHHHHH-HHHHHHHHHHHHHH
Confidence 6778876 58899999976543
No 155
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=30.58 E-value=26 Score=37.87 Aligned_cols=26 Identities=19% Similarity=0.128 Sum_probs=0.0
Q ss_pred ccccCCCCChHHHHHHHHHhhhhhhc
Q psy4798 133 EGANKKGCDTEEVLAVLAHELGHWKY 158 (623)
Q Consensus 133 ~~~~~~~l~~~El~AVlAHELGH~k~ 158 (623)
|-++-..+.+-++-+++|||+||.+.
T Consensus 184 EA~vN~~~p~~~~P~T~~HElAHq~G 209 (318)
T PF12725_consen 184 EANVNTDLPPYSLPFTICHELAHQLG 209 (318)
T ss_pred eeecCCCCCcccccHHHHHHHHHHhC
No 156
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=30.42 E-value=14 Score=42.86 Aligned_cols=15 Identities=40% Similarity=0.532 Sum_probs=12.6
Q ss_pred HHHHHHHHHHhhhhh
Q psy4798 522 EEVLAVLAHELGHWK 536 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k 536 (623)
.-|-||+|||+||--
T Consensus 485 GgIGaVIgHEI~HgF 499 (654)
T COG3590 485 GGIGAVIGHEIGHGF 499 (654)
T ss_pred cCccceehhhhcccc
Confidence 458899999999963
No 157
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=29.78 E-value=15 Score=42.65 Aligned_cols=17 Identities=35% Similarity=0.475 Sum_probs=13.1
Q ss_pred ChHHHHHHHHHhhhhhh
Q psy4798 141 DTEEVLAVLAHELGHWK 157 (623)
Q Consensus 141 ~~~El~AVlAHELGH~k 157 (623)
|=--|-||||||+||--
T Consensus 483 NYGgIGaVIgHEI~HgF 499 (654)
T COG3590 483 NYGGIGAVIGHEIGHGF 499 (654)
T ss_pred cccCccceehhhhcccc
Confidence 33448899999999953
No 158
>PF14715 FixP_N: N-terminal domain of cytochrome oxidase-cbb3, FixP
Probab=29.08 E-value=61 Score=25.94 Aligned_cols=26 Identities=19% Similarity=0.341 Sum_probs=21.9
Q ss_pred cchhHHHHHHHHHHHHHHHHHhhchH
Q psy4798 396 GGNMVFLYLWVFIILMSLFLMTIYPE 421 (623)
Q Consensus 396 ~g~~~~l~~w~~~~~~~l~~~~i~P~ 421 (623)
.-+.||++++...++++++-.+.||.
T Consensus 19 plP~ww~~~f~~tivfa~~Y~~~yp~ 44 (51)
T PF14715_consen 19 PLPRWWLWLFYGTIVFAVGYLVLYPG 44 (51)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34678999999999999998888874
No 159
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=29.08 E-value=30 Score=33.68 Aligned_cols=12 Identities=42% Similarity=0.888 Sum_probs=10.2
Q ss_pred HHHHHHHhhhhh
Q psy4798 145 VLAVLAHELGHW 156 (623)
Q Consensus 145 l~AVlAHELGH~ 156 (623)
+.+++.||+||.
T Consensus 7 ~i~i~~HE~gH~ 18 (192)
T PF02163_consen 7 LISIVLHELGHA 18 (192)
T ss_dssp HHHHHHHHHHHH
T ss_pred cccccccccccc
Confidence 468999999996
No 160
>KOG1047|consensus
Probab=29.01 E-value=34 Score=39.71 Aligned_cols=27 Identities=19% Similarity=0.304 Sum_probs=17.1
Q ss_pred ccccccCCCcccHHHHHHHHHHHHHHHH
Q psy4798 320 VLEEKHGFNKQTYGFFVKDQIKSFIVSL 347 (623)
Q Consensus 320 ~le~~~Gfn~~T~~~f~~d~vk~~~l~~ 347 (623)
.||++.|= ++-+..|+++.++.+.-..
T Consensus 391 ~Le~~lG~-~~~Fd~FLr~Yv~kfa~ks 417 (613)
T KOG1047|consen 391 YLEQLLGD-PTRFDPFLRAYVHKFAFKS 417 (613)
T ss_pred HHHHHhCC-hhhHHHHHHHHHHHhccce
Confidence 46666662 2345568888888776555
No 161
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=28.83 E-value=1e+02 Score=33.06 Aligned_cols=20 Identities=30% Similarity=0.631 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHhhhhhccCH
Q psy4798 521 TEEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~k~~H~ 540 (623)
..++..++|||++|---++.
T Consensus 292 ~~~~~~~iahElahqWfGn~ 311 (390)
T PF01433_consen 292 KQEIASLIAHELAHQWFGNL 311 (390)
T ss_dssp HHHHHHHHHHHHHTTTBTTT
T ss_pred hhhhHHHHHHHHHHHHhccC
Confidence 46899999999999876653
No 162
>PRK05320 rhodanese superfamily protein; Provisional
Probab=28.52 E-value=69 Score=33.54 Aligned_cols=57 Identities=16% Similarity=0.219 Sum_probs=43.6
Q ss_pred hhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCceEEEehhHHhh
Q psy4798 426 LFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKRIVLFDTLLKD 490 (623)
Q Consensus 426 lf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~krIVL~dtLl~~ 490 (623)
.|.||+|++|+ ++++.+.++|++.|+. .+|++-+. ..|+-+.|- ..+|-.+-+.+..
T Consensus 7 ~~Y~f~~i~~~~~~~~~~~~~~~~~~~~-G~i~ia~e-----GiN~t~~g~--~~~id~~~~~l~~ 64 (257)
T PRK05320 7 AAYKFVSLDDPETLRPLVLARCEALGLK-GTILLAPE-----GINLFLAGT--REAIDAFYAWLRA 64 (257)
T ss_pred EEEceeecCCHHHHHHHHHHHHHHCCCe-EEEEEcCC-----CceEEEEee--HHHHHHHHHHHhh
Confidence 57899999985 5899999999999985 88998663 479999874 4556555555543
No 163
>COG0475 KefB Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]
Probab=28.51 E-value=1.7e+02 Score=32.46 Aligned_cols=97 Identities=21% Similarity=0.294 Sum_probs=51.8
Q ss_pred HHHHHHhhccccccchhhhhhccccchhhhhhh----HHHHHHHHHHHHHHHHHHHHHcchhH--HHHHHHHHHHHHHHH
Q psy4798 342 SFIVSLILSIPLTGAVVYIIQTYGFFVKDQIKS----FIVSLILSIPLTGAVVYIIQVGGNMV--FLYLWVFIILMSLFL 415 (623)
Q Consensus 342 ~~~l~~~~s~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~pl~~~~~~li~~~g~~~--~l~~w~~~~~~~l~~ 415 (623)
+.+++.+++.+.++.++-++.=.+....+.-+. .+...+.+.|+++.+..+.+..+... |++......++.++.
T Consensus 120 al~lg~~l~~sS~~i~~~iL~e~~~~~~~~g~~~l~~~i~~Di~~i~lLai~~~l~~~g~~~~~~~~~~~~~~~~f~~~~ 199 (397)
T COG0475 120 ALFLGAALALSSTAIVLKILMELGLLKTREGQLILGALVFDDIAAILLLAIVPALAGGGSGSVGFILGLLLAILAFLALL 199 (397)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCccHhHHHHHHHHHHHHHHHH
Confidence 444444445555555555554444333222221 24444677777776666654333222 444445555555555
Q ss_pred HhhchHhhhhhhccCCcCCchhH
Q psy4798 416 MTIYPEFIAPLFDKYTPLPDGEL 438 (623)
Q Consensus 416 ~~i~P~~I~plf~k~~pl~~~~L 438 (623)
..+.+.+..|++++....+++|+
T Consensus 200 l~~g~~l~~~~~r~~~~~~~~e~ 222 (397)
T COG0475 200 LLLGRYLLPPLFRRVAKTESSEL 222 (397)
T ss_pred HHHHHHHHHHHHHHHHhccchHH
Confidence 55567778888887766555553
No 164
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=28.39 E-value=33 Score=33.19 Aligned_cols=13 Identities=46% Similarity=0.728 Sum_probs=11.0
Q ss_pred HHHHHHHHhhhhh
Q psy4798 524 VLAVLAHELGHWK 536 (623)
Q Consensus 524 i~AVLAHELGH~k 536 (623)
+.+|..||+||..
T Consensus 8 ~i~i~iHE~gH~~ 20 (180)
T cd05709 8 LISVTVHELGHAL 20 (180)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999974
No 165
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.91 E-value=63 Score=35.15 Aligned_cols=24 Identities=38% Similarity=0.625 Sum_probs=22.8
Q ss_pred CCHHHHHHHHHHHhhhhhccCHHH
Q psy4798 519 CDTEEVLAVLAHELGHWKYNHVLK 542 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k~~H~~k 542 (623)
|..+|+.|.|-||.=|+.++|+.+
T Consensus 64 ~p~~~~~aLl~HEV~Hi~l~Hi~r 87 (396)
T COG3864 64 CPISEMKALLKHEVYHIMLNHIKR 87 (396)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHh
Confidence 999999999999999999999964
No 166
>PRK01415 hypothetical protein; Validated
Probab=27.60 E-value=74 Score=33.29 Aligned_cols=45 Identities=9% Similarity=0.087 Sum_probs=37.1
Q ss_pred hhhccCCcCCch-hHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEecc
Q psy4798 425 PLFDKYTPLPDG-ELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGF 475 (623)
Q Consensus 425 plf~k~~pl~~~-~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~ 475 (623)
-.|.+|+|++|+ ++++.+.++|++.|+. .+|++-+. ..|+-+.|-
T Consensus 8 ~~fY~f~~i~~~~~~~~~l~~~~~~~~~~-G~i~la~E-----GIN~tisg~ 53 (247)
T PRK01415 8 LSAYSFVNIEEPANLIPKLLLIGKRKYVR-GTILLANE-----GFNGSFSGS 53 (247)
T ss_pred EEEEccccCCCHHHHHHHHHHHHHHcCCe-eEEEEccC-----ccceEeeCC
Confidence 368899999985 4899999999999975 88998663 479998885
No 167
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=27.40 E-value=47 Score=34.05 Aligned_cols=20 Identities=35% Similarity=0.232 Sum_probs=16.5
Q ss_pred CCCHHHHHHHHHHHhhhhhc
Q psy4798 518 GCDTEEVLAVLAHELGHWKY 537 (623)
Q Consensus 518 gl~~~Ei~AVLAHELGH~k~ 537 (623)
|+...=.-+|||||++|.-.
T Consensus 87 GLPrll~gsiLAHE~mHa~L 106 (212)
T PF12315_consen 87 GLPRLLTGSILAHELMHAWL 106 (212)
T ss_pred CCCHHHHhhHHHHHHHHHHh
Confidence 47777779999999999754
No 168
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=27.39 E-value=74 Score=38.74 Aligned_cols=18 Identities=28% Similarity=0.503 Sum_probs=14.3
Q ss_pred HHHHHHHHHHhhhhhccC
Q psy4798 522 EEVLAVLAHELGHWKYNH 539 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k~~H 539 (623)
+.+..|+|||++|---++
T Consensus 285 ~~~~~viaHElAHqWFGn 302 (831)
T TIGR02412 285 ENRAGVILHEMAHMWFGD 302 (831)
T ss_pred HHHHHHHHHHHHHHHhCC
Confidence 457889999999966653
No 169
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=27.36 E-value=38 Score=33.66 Aligned_cols=14 Identities=43% Similarity=0.598 Sum_probs=11.9
Q ss_pred HHHHHHHHHHhhhh
Q psy4798 522 EEVLAVLAHELGHW 535 (623)
Q Consensus 522 ~Ei~AVLAHELGH~ 535 (623)
.+..+++.|||||.
T Consensus 90 ~~~~~~i~HElgHa 103 (198)
T cd04327 90 PEFSRVVLHEFGHA 103 (198)
T ss_pred hhHHHHHHHHHHHH
Confidence 35679999999997
No 170
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=27.31 E-value=14 Score=39.00 Aligned_cols=11 Identities=64% Similarity=0.966 Sum_probs=10.1
Q ss_pred HHHHHHHhhhh
Q psy4798 525 LAVLAHELGHW 535 (623)
Q Consensus 525 ~AVLAHELGH~ 535 (623)
.+|+|||+||.
T Consensus 166 igv~~HE~gH~ 176 (286)
T TIGR03296 166 VGVIAHELGHD 176 (286)
T ss_pred eeeeehhhhcc
Confidence 79999999995
No 171
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=27.28 E-value=4.3e+02 Score=25.63 Aligned_cols=24 Identities=17% Similarity=0.319 Sum_probs=21.0
Q ss_pred hhHHHHHHHHHHHcCCCCCeEEEE
Q psy4798 436 GELKSRIEQLSASVKFPLKKLYVV 459 (623)
Q Consensus 436 ~~L~~~I~~la~~~gfp~~~v~vv 459 (623)
+.+-.-|+.+|++.|.|..+|+|=
T Consensus 79 ~~~~~fi~~vA~~~~V~~~~v~VN 102 (149)
T PF11694_consen 79 SQMVHFIESVAKDLGVSKEEVYVN 102 (149)
T ss_pred HHHHHHHHHHHHHhCCChheEEEe
Confidence 557788999999999999999973
No 172
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=27.23 E-value=11 Score=38.68 Aligned_cols=20 Identities=30% Similarity=0.694 Sum_probs=14.4
Q ss_pred HHHHHHHHHhhhhh-ccCHHH
Q psy4798 523 EVLAVLAHELGHWK-YNHVLK 542 (623)
Q Consensus 523 Ei~AVLAHELGH~k-~~H~~k 542 (623)
..--++.||+|||- +.|+..
T Consensus 136 n~g~t~~HEvGH~lGL~HtF~ 156 (225)
T cd04275 136 NLGDTATHEVGHWLGLYHTFQ 156 (225)
T ss_pred cccceeEEeccceeeeeeeec
Confidence 34468999999995 566654
No 173
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=27.12 E-value=34 Score=36.24 Aligned_cols=21 Identities=48% Similarity=0.816 Sum_probs=16.8
Q ss_pred CHHHHHHHHHHHhhhhhccCHH
Q psy4798 520 DTEEVLAVLAHELGHWKYNHVL 541 (623)
Q Consensus 520 ~~~Ei~AVLAHELGH~k~~H~~ 541 (623)
+..--.|.+.||+|||. +|..
T Consensus 199 da~~yyaTl~HElghwt-gh~~ 219 (316)
T COG4227 199 DAINYYATLLHELGHWT-GHEA 219 (316)
T ss_pred chHhHHHHHHHHhcccc-Cchh
Confidence 66788999999999994 4443
No 174
>KOG3733|consensus
Probab=27.10 E-value=1.5e+02 Score=33.64 Aligned_cols=61 Identities=25% Similarity=0.420 Sum_probs=36.4
Q ss_pred HHHHHHHhhchHhhhhhhccCC--------cCCchhHHHHHHHHHHHcCCCCCeEEEEeCCCCCCcccEEEeccCCCce
Q psy4798 410 LMSLFLMTIYPEFIAPLFDKYT--------PLPDGELKSRIEQLSASVKFPLKKLYVVEGSKRSEHSNAYFYGFFKNKR 480 (623)
Q Consensus 410 ~~~l~~~~i~P~~I~plf~k~~--------pl~~~~L~~~I~~la~~~gfp~~~v~vv~~s~rs~~~NAy~~G~~~~kr 480 (623)
.+++++..+...||+-+-+.|. -.||.||++-|.+-+++ |.+..|.-+ +.+++.-++|++|
T Consensus 487 FISLfIYmvLSLFIAlItdaYDTIK~~qq~GfpeTdLqafi~ec~d~---~~sg~f~~e-------s~~f~s~iCCc~R 555 (566)
T KOG3733|consen 487 FISLFIYMVLSLFIALITDAYDTIKAYQQDGFPETDLQAFITECADD---PSSGKFRTE-------SDDFVSLICCCGR 555 (566)
T ss_pred HHHHHHHHHHHHHHHHHhchHHHHHHhhhCCCChhHHHHHHHHhccC---CCccceecc-------CCCceeeEEecCc
Confidence 4455666667777777766542 24678899888554443 445555544 4455665555543
No 175
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=26.69 E-value=44 Score=34.27 Aligned_cols=20 Identities=35% Similarity=0.197 Sum_probs=16.6
Q ss_pred CCCChHHHHHHHHHhhhhhh
Q psy4798 138 KGCDTEEVLAVLAHELGHWK 157 (623)
Q Consensus 138 ~~l~~~El~AVlAHELGH~k 157 (623)
.||+..-.-+|||||++|.-
T Consensus 86 ~GLPrll~gsiLAHE~mHa~ 105 (212)
T PF12315_consen 86 YGLPRLLTGSILAHELMHAW 105 (212)
T ss_pred CCCCHHHHhhHHHHHHHHHH
Confidence 35778888999999999963
No 176
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=26.58 E-value=84 Score=35.16 Aligned_cols=14 Identities=36% Similarity=0.470 Sum_probs=11.3
Q ss_pred HHHHHhhhhhhcch
Q psy4798 147 AVLAHELGHWKYNH 160 (623)
Q Consensus 147 AVlAHELGH~k~~h 160 (623)
.-++||+||-.+..
T Consensus 161 ~t~iHE~GHalye~ 174 (396)
T cd06460 161 FSTIHETGHALYEQ 174 (396)
T ss_pred HHHHHHhhHHHHHh
Confidence 45799999987766
No 177
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=26.57 E-value=14 Score=36.78 Aligned_cols=12 Identities=50% Similarity=0.645 Sum_probs=10.1
Q ss_pred HHHHHHhhhhhh
Q psy4798 146 LAVLAHELGHWK 157 (623)
Q Consensus 146 ~AVlAHELGH~k 157 (623)
..++||||||--
T Consensus 141 a~~~aHElGH~L 152 (207)
T cd04273 141 AFTIAHELGHVL 152 (207)
T ss_pred EEeeeeechhhc
Confidence 578999999964
No 178
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=26.41 E-value=37 Score=33.10 Aligned_cols=19 Identities=26% Similarity=0.359 Sum_probs=14.3
Q ss_pred HHHHHHHHHHhhhhh-ccCH
Q psy4798 522 EEVLAVLAHELGHWK-YNHV 540 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k-~~H~ 540 (623)
.....++.||+||.- +.|.
T Consensus 111 ~~~~~t~~HEiGHaLGL~H~ 130 (186)
T cd04277 111 SYGYQTIIHEIGHALGLEHP 130 (186)
T ss_pred hhhHHHHHHHHHHHhcCCCC
Confidence 457899999999973 3444
No 179
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=26.35 E-value=35 Score=33.26 Aligned_cols=15 Identities=27% Similarity=0.364 Sum_probs=12.5
Q ss_pred hHHHHHHHHHhhhhh
Q psy4798 142 TEEVLAVLAHELGHW 156 (623)
Q Consensus 142 ~~El~AVlAHELGH~ 156 (623)
......++.||+||.
T Consensus 110 g~~~~~t~~HEiGHa 124 (186)
T cd04277 110 GSYGYQTIIHEIGHA 124 (186)
T ss_pred ChhhHHHHHHHHHHH
Confidence 355789999999996
No 180
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.27 E-value=48 Score=36.04 Aligned_cols=25 Identities=36% Similarity=0.606 Sum_probs=23.0
Q ss_pred CCChHHHHHHHHHhhhhhhcchHHH
Q psy4798 139 GCDTEEVLAVLAHELGHWKYNHVLK 163 (623)
Q Consensus 139 ~l~~~El~AVlAHELGH~k~~h~~k 163 (623)
.|+.+|+.|.|-||+-|+..+|+.+
T Consensus 63 ~~p~~~~~aLl~HEV~Hi~l~Hi~r 87 (396)
T COG3864 63 NCPISEMKALLKHEVYHIMLNHIKR 87 (396)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHh
Confidence 3999999999999999999999864
No 181
>TIGR03513 GldL_gliding gliding motility-associated protein GldL. This protein family, GldL, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile
Probab=26.22 E-value=2.3e+02 Score=29.01 Aligned_cols=46 Identities=9% Similarity=0.099 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHHHHhhch
Q psy4798 375 FIVSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMSLFLMTIYP 420 (623)
Q Consensus 375 ~~~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~l~~~~i~P 420 (623)
+++++...+.++.++..|..|.+..+++.+..+.=++.+++.-+-|
T Consensus 6 ~~~~~ga~~vi~GalfKi~h~~~~~~~l~~g~~te~lvFfiSAFe~ 51 (202)
T TIGR03513 6 IVYGWGAAVVILGALFKIMHWPMGNPMLFVGLITEALIFAISAFEK 51 (202)
T ss_pred HHHHHhHHHHHHHHHHHHhccCCchHHHHHHHHHHHHHHHHhccCC
Confidence 3444455555666777888888888777777666666666554433
No 182
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=25.85 E-value=39 Score=34.03 Aligned_cols=13 Identities=62% Similarity=0.800 Sum_probs=10.9
Q ss_pred HHHHHHHHHhhhh
Q psy4798 523 EVLAVLAHELGHW 535 (623)
Q Consensus 523 Ei~AVLAHELGH~ 535 (623)
=..+|+.||+||.
T Consensus 37 l~~~v~iHElgH~ 49 (208)
T cd06161 37 LFLSVLLHELGHA 49 (208)
T ss_pred HHHHHHHHHHHHH
Confidence 3568999999997
No 183
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=25.20 E-value=41 Score=33.75 Aligned_cols=15 Identities=40% Similarity=0.605 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhhhh
Q psy4798 521 TEEVLAVLAHELGHW 535 (623)
Q Consensus 521 ~~Ei~AVLAHELGH~ 535 (623)
+-.+..++.||+||.
T Consensus 19 ~~~~l~t~~HE~gHa 33 (200)
T PF13398_consen 19 PFRLLVTFVHELGHA 33 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346788999999996
No 184
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=25.15 E-value=41 Score=33.41 Aligned_cols=14 Identities=43% Similarity=0.598 Sum_probs=11.6
Q ss_pred HHHHHHHHHhhhhh
Q psy4798 143 EEVLAVLAHELGHW 156 (623)
Q Consensus 143 ~El~AVlAHELGH~ 156 (623)
.+..+++.||+||.
T Consensus 90 ~~~~~~i~HElgHa 103 (198)
T cd04327 90 PEFSRVVLHEFGHA 103 (198)
T ss_pred hhHHHHHHHHHHHH
Confidence 35678999999995
No 185
>KOG3607|consensus
Probab=24.78 E-value=49 Score=39.77 Aligned_cols=16 Identities=50% Similarity=0.526 Sum_probs=13.4
Q ss_pred hHHHHHHHHHhhhhhh
Q psy4798 142 TEEVLAVLAHELGHWK 157 (623)
Q Consensus 142 ~~El~AVlAHELGH~k 157 (623)
......|+||||||.-
T Consensus 320 ~~~~a~v~AhelgH~l 335 (716)
T KOG3607|consen 320 LLAFAVVLAHELGHNL 335 (716)
T ss_pred chhHHHHHHHHHHhhc
Confidence 5668899999999964
No 186
>COG3402 Uncharacterized conserved protein [Function unknown]
Probab=24.50 E-value=1.9e+02 Score=28.47 Aligned_cols=44 Identities=20% Similarity=0.221 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHcchhHHHHHHHHHHHHH-HHHHhhch
Q psy4798 377 VSLILSIPLTGAVVYIIQVGGNMVFLYLWVFIILMS-LFLMTIYP 420 (623)
Q Consensus 377 ~~~~l~~pl~~~~~~li~~~g~~~~l~~w~~~~~~~-l~~~~i~P 420 (623)
.++++.+|+..++++..-..-++|+.+..++.+++. ++..+++|
T Consensus 25 ~~l~Ll~av~~~~~~~~~~~~~~w~~~a~~av~l~~~vv~l~iiP 69 (161)
T COG3402 25 IALVLLIAVAAGVLLYFVGLDPNWSSVAAVAVILLAAVVTLFIIP 69 (161)
T ss_pred HHHHHHHHHHHHHHHheeccCCccHHHHHHHHHHHHHHHHhhhhh
Confidence 344667777777666665555667776665554433 33344444
No 187
>TIGR02140 permease_CysW sulfate ABC transporter, permease protein CysW. This model represents CysW, one of two homologous, tandem permeases in the sulfate ABC transporter system; the other is CysT (TIGR02139). The sulfate transporter has been described in E. coli as transporting sulfate, thiosulfate, selenate, and selenite. Sulfate transporters may also transport molybdate ion if a specific molybdate transporter is not present.
Probab=24.05 E-value=7.7e+02 Score=25.05 Aligned_cols=36 Identities=8% Similarity=0.047 Sum_probs=22.3
Q ss_pred hhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCCeEE
Q psy4798 417 TIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLKKLY 457 (623)
Q Consensus 417 ~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v~ 457 (623)
...|.....+...++..+ +.+++.|+..|...-+.+
T Consensus 139 ~~~p~~~~~~~~~l~~i~-----~~~~eAA~~~Gas~~~~~ 174 (261)
T TIGR02140 139 VTCPFVARELIPVMEEQG-----TEQEEAALTLGASWWQTF 174 (261)
T ss_pred HHHHHHHHHHHHHHHhCC-----ccHHHHHHhcCCCcceee
Confidence 456666655555455543 456777999998765443
No 188
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=24.01 E-value=44 Score=35.46 Aligned_cols=24 Identities=38% Similarity=0.547 Sum_probs=19.5
Q ss_pred CChHHHHHHHHHhhhhhhcchHHH
Q psy4798 140 CDTEEVLAVLAHELGHWKYNHVLK 163 (623)
Q Consensus 140 l~~~El~AVlAHELGH~k~~h~~k 163 (623)
-+...-.|.+.||+|||..+-...
T Consensus 198 ~da~~yyaTl~HElghwtgh~~rl 221 (316)
T COG4227 198 RDAINYYATLLHELGHWTGHEARL 221 (316)
T ss_pred cchHhHHHHHHHHhccccCchhhh
Confidence 566778999999999998776553
No 189
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=23.81 E-value=46 Score=33.59 Aligned_cols=14 Identities=43% Similarity=0.627 Sum_probs=12.5
Q ss_pred HHHHHHHHHhhhhh
Q psy4798 143 EEVLAVLAHELGHW 156 (623)
Q Consensus 143 ~El~AVlAHELGH~ 156 (623)
+-+.++++||+||.
T Consensus 114 ~~~~~~~~he~gh~ 127 (197)
T cd04276 114 ASLRYLLAHEVGHT 127 (197)
T ss_pred HHHHHHHHHHHHHH
Confidence 56889999999996
No 190
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=23.75 E-value=50 Score=33.37 Aligned_cols=19 Identities=37% Similarity=0.497 Sum_probs=14.8
Q ss_pred HHHHHHHHHHhhhhh-ccCH
Q psy4798 522 EEVLAVLAHELGHWK-YNHV 540 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k-~~H~ 540 (623)
+-+..+++||+||.- +.|.
T Consensus 114 ~~~~~~~~he~gh~lGl~hn 133 (197)
T cd04276 114 ASLRYLLAHEVGHTLGLRHN 133 (197)
T ss_pred HHHHHHHHHHHHHHhcCccc
Confidence 568899999999973 4454
No 191
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=23.73 E-value=57 Score=34.73 Aligned_cols=26 Identities=27% Similarity=0.288 Sum_probs=20.8
Q ss_pred CCChHHHHHHHHHhhhhhh---cchHHHH
Q psy4798 139 GCDTEEVLAVLAHELGHWK---YNHVLKS 164 (623)
Q Consensus 139 ~l~~~El~AVlAHELGH~k---~~h~~k~ 164 (623)
+.++++|...|-|||+|-| ++|+...
T Consensus 191 r~dd~~lA~LIFHELAHQk~Y~~~DtAFN 219 (376)
T COG4324 191 RQDDTYLASLIFHELAHQKIYVNNDTAFN 219 (376)
T ss_pred cCChHHHHHHHHHHHhhheEeecCcchHh
Confidence 4789999999999999976 5565543
No 192
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=23.58 E-value=91 Score=38.33 Aligned_cols=18 Identities=33% Similarity=0.316 Sum_probs=14.1
Q ss_pred HHHHHHHHHHhhhhhccC
Q psy4798 522 EEVLAVLAHELGHWKYNH 539 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k~~H 539 (623)
..+..|+|||++|--.|+
T Consensus 281 ~~i~~VIaHElaHqWfGN 298 (863)
T TIGR02414 281 ERIESVIAHEYFHNWTGN 298 (863)
T ss_pred HHHHHHHHHHHHHHHhcc
Confidence 458899999999955543
No 193
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=23.57 E-value=48 Score=32.99 Aligned_cols=11 Identities=45% Similarity=1.108 Sum_probs=9.6
Q ss_pred HHHHHHHhhhh
Q psy4798 525 LAVLAHELGHW 535 (623)
Q Consensus 525 ~AVLAHELGH~ 535 (623)
..|+-||+||.
T Consensus 10 ~~v~iHElGH~ 20 (182)
T cd06163 10 ILIFVHELGHF 20 (182)
T ss_pred HHHHHHHHHHH
Confidence 46899999997
No 194
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=23.57 E-value=43 Score=32.38 Aligned_cols=13 Identities=46% Similarity=0.728 Sum_probs=10.6
Q ss_pred HHHHHHHhhhhhh
Q psy4798 145 VLAVLAHELGHWK 157 (623)
Q Consensus 145 l~AVlAHELGH~k 157 (623)
+.+|..||+||..
T Consensus 8 ~i~i~iHE~gH~~ 20 (180)
T cd05709 8 LISVTVHELGHAL 20 (180)
T ss_pred HHHHHHHHHHHHH
Confidence 4578999999964
No 195
>KOG3714|consensus
Probab=23.31 E-value=34 Score=38.23 Aligned_cols=18 Identities=39% Similarity=0.684 Sum_probs=15.0
Q ss_pred CCCCCCHHHHHHHHHHHhhhh
Q psy4798 515 NKKGCDTEEVLAVLAHELGHW 535 (623)
Q Consensus 515 ~g~gl~~~Ei~AVLAHELGH~ 535 (623)
.|.||.. .++++|||+|.
T Consensus 153 l~~~C~~---~G~i~HEl~Ha 170 (411)
T KOG3714|consen 153 LGDGCDR---FGTIVHELMHA 170 (411)
T ss_pred cCCCcCc---CchhHHHHHHH
Confidence 3556877 99999999997
No 196
>COG2274 SunT ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]
Probab=23.20 E-value=3.3e+02 Score=32.91 Aligned_cols=20 Identities=20% Similarity=0.509 Sum_probs=15.8
Q ss_pred HHHHHHHHhhhccHHHhhhc
Q psy4798 300 VLFNIFSTVIGLPISFYHHF 319 (623)
Q Consensus 300 ~~~~l~~~li~lP~~~Y~~f 319 (623)
+...++..++++|.++|++.
T Consensus 229 l~~~~~~hll~Lpl~~f~~r 248 (709)
T COG2274 229 LSGRFFRHLLRLPLSYFEKR 248 (709)
T ss_pred HHHHHHHHHHcCcHHHccCC
Confidence 44567889999999999654
No 197
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=22.78 E-value=48 Score=34.01 Aligned_cols=13 Identities=54% Similarity=0.674 Sum_probs=11.1
Q ss_pred HHHHHHHHHhhhh
Q psy4798 523 EVLAVLAHELGHW 535 (623)
Q Consensus 523 Ei~AVLAHELGH~ 535 (623)
=..+|+.||+||.
T Consensus 52 l~~~v~iHElgH~ 64 (227)
T cd06164 52 LFASVLLHELGHS 64 (227)
T ss_pred HHHHHHHHHHHHH
Confidence 3578999999996
No 198
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=22.37 E-value=47 Score=33.45 Aligned_cols=12 Identities=67% Similarity=0.944 Sum_probs=10.3
Q ss_pred HHHHHHHhhhhh
Q psy4798 145 VLAVLAHELGHW 156 (623)
Q Consensus 145 l~AVlAHELGH~ 156 (623)
..+|+.||+||.
T Consensus 38 ~~~v~iHElgH~ 49 (208)
T cd06161 38 FLSVLLHELGHA 49 (208)
T ss_pred HHHHHHHHHHHH
Confidence 458999999995
No 199
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=22.21 E-value=50 Score=34.85 Aligned_cols=13 Identities=46% Similarity=0.621 Sum_probs=11.1
Q ss_pred HHHHHHHHHhhhh
Q psy4798 523 EVLAVLAHELGHW 535 (623)
Q Consensus 523 Ei~AVLAHELGH~ 535 (623)
=..+|+.||+||.
T Consensus 117 L~isv~iHElgHa 129 (263)
T cd06159 117 LVVGVVVHELSHG 129 (263)
T ss_pred HHHHHHHHHHHHH
Confidence 3678999999996
No 200
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=21.68 E-value=52 Score=35.09 Aligned_cols=14 Identities=29% Similarity=0.496 Sum_probs=11.8
Q ss_pred HHHHHHHHHhhhhh
Q psy4798 523 EVLAVLAHELGHWK 536 (623)
Q Consensus 523 Ei~AVLAHELGH~k 536 (623)
=+.+|+.||+||--
T Consensus 134 l~isvvvHElgHal 147 (277)
T cd06162 134 LLISGVVHEMGHGV 147 (277)
T ss_pred HHHHHHHHHHHHHH
Confidence 37799999999973
No 201
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=21.58 E-value=61 Score=29.23 Aligned_cols=19 Identities=26% Similarity=0.246 Sum_probs=13.6
Q ss_pred hHHHHHHHHHhhhhhhcch
Q psy4798 142 TEEVLAVLAHELGHWKYNH 160 (623)
Q Consensus 142 ~~El~AVlAHELGH~k~~h 160 (623)
.+++.-|+-||+||.-..+
T Consensus 70 ~~~I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 70 AELIRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHHHHcCCC
Confidence 3567889999999986543
No 202
>PRK14015 pepN aminopeptidase N; Provisional
Probab=21.34 E-value=64 Score=39.67 Aligned_cols=19 Identities=32% Similarity=0.265 Sum_probs=14.6
Q ss_pred HHHHHHHHHHhhhhhccCH
Q psy4798 522 EEVLAVLAHELGHWKYNHV 540 (623)
Q Consensus 522 ~Ei~AVLAHELGH~k~~H~ 540 (623)
..+..|+|||++|--.|..
T Consensus 294 ~~i~~vIaHElaHqWFGNl 312 (875)
T PRK14015 294 ERIESVIAHEYFHNWTGNR 312 (875)
T ss_pred HHHHHHHHHHHHHHHHhCc
Confidence 4588999999999655543
No 203
>smart00793 AgrB Accessory gene regulator B. The accessory gene regulator (agr) of Staphylococcus aureus is the central regulatory system that controls the gene expression for a large set of virulence factors. The arg locus consists of two transcripts: RNAII and RNAIII. RNAII encodes four genes (agrA, B, C, and D) whose gene products assemble a quorum sensing system. At low cell density, the agr genes are continuously expressed at basal levels. A signal molecule, autoinducing peptide (AIP), produced and secreted by the bacteria, accumulates outside of the cells. When the cell density increases and the AIP concentration reaches a threshold, it activates the agr response, i.e. activation of secreted protein gene expression and subsequent repression of cell wall-associated protein genes. AgrB and AgrD are essential for the production of the autoinducing peptide which functions as a signal for quorum sensing. AgrB is a transmembrane protein PUBMED:11195102. AgrB is involved in the proteolyt
Probab=21.09 E-value=4.3e+02 Score=26.07 Aligned_cols=24 Identities=25% Similarity=0.346 Sum_probs=19.1
Q ss_pred hhhhhccCCcCCchhHHHHHHHHH
Q psy4798 423 IAPLFDKYTPLPDGELKSRIEQLS 446 (623)
Q Consensus 423 I~plf~k~~pl~~~~L~~~I~~la 446 (623)
.+|.-++-+|+.++|.+.+....+
T Consensus 118 ~APv~~~~kpi~~~e~~k~lk~~s 141 (184)
T smart00793 118 YAPADTEKQPVIPEKLKKKLKKKS 141 (184)
T ss_pred cCCcccccCCCCCHHHHHHHHHHH
Confidence 478989999999998877765544
No 204
>COG4662 TupA ABC-type tungstate transport system, periplasmic component [Coenzyme metabolism]
Probab=20.93 E-value=5.4e+02 Score=26.48 Aligned_cols=63 Identities=21% Similarity=0.248 Sum_probs=37.1
Q ss_pred HHHHHHHcchhHHHHHHHHHHHHHHH-HHhhchHhhhhhhccCCcCCchhHHHHHHHHHHHcCCCCCeE
Q psy4798 389 VVYIIQVGGNMVFLYLWVFIILMSLF-LMTIYPEFIAPLFDKYTPLPDGELKSRIEQLSASVKFPLKKL 456 (623)
Q Consensus 389 ~~~li~~~g~~~~l~~w~~~~~~~l~-~~~i~P~~I~plf~k~~pl~~~~L~~~I~~la~~~gfp~~~v 456 (623)
+|.++++.|+.-|+-+....-++.+- ...+.|.+|+-..+..+.. +.++.++++.+|..+.|+
T Consensus 80 LylLlSr~GPlG~f~LLfT~~amILGq~iL~lPlvia~~l~ale~~-----dpr~~ela~~lgas~~kl 143 (227)
T COG4662 80 LYLLLSRSGPLGWFNLLFTQDAMILGQAILILPLVIAFVLTALESV-----DPRLKELARSLGASPLKL 143 (227)
T ss_pred HHHHHhccCCCccchhHhhhhhHHHHHHHHHHHHHHHHHHHHHHhc-----CHHHHHHHHHcCCcHHHH
Confidence 45677888888776544333332222 1235677776655543332 346677889988876444
No 205
>PF11677 DUF3273: Protein of unknown function (DUF3273); InterPro: IPR021691 Some members in this family of proteins are annotated as multi-transmembrane proteins however this cannot be confirmed. Currently this family has no known function.
Probab=20.83 E-value=2.9e+02 Score=29.39 Aligned_cols=48 Identities=25% Similarity=0.519 Sum_probs=30.9
Q ss_pred HHHcchhHHHHHHHHHHHHHHHHHhhchHhhhhhhccCCc-CCchhHHHHHHHH
Q psy4798 393 IQVGGNMVFLYLWVFIILMSLFLMTIYPEFIAPLFDKYTP-LPDGELKSRIEQL 445 (623)
Q Consensus 393 i~~~g~~~~l~~w~~~~~~~l~~~~i~P~~I~plf~k~~p-l~~~~L~~~I~~l 445 (623)
+.-.|..+.+..|+ ++....+.-.+..|+.+|+.| |.+.++++.+++-
T Consensus 207 f~~~g~~~tL~~l~-----al~~~t~WA~~FEP~~~~~sp~L~~s~l~ndv~~~ 255 (265)
T PF11677_consen 207 FTGFGAFFTLFLLL-----ALMAATLWAFLFEPLLNKWSPKLHQSALRNDVEPQ 255 (265)
T ss_pred HcCCchHHHHHHHH-----HHHHHHHHHHHhhHHhhhcCchhhHHHhhhhhhHH
Confidence 34445555555443 333344455577899999999 6788898886654
No 206
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=20.79 E-value=44 Score=33.81 Aligned_cols=17 Identities=47% Similarity=0.978 Sum_probs=12.8
Q ss_pred CCCCCHHHHHHHHHHHhhhh
Q psy4798 516 KKGCDTEEVLAVLAHELGHW 535 (623)
Q Consensus 516 g~gl~~~Ei~AVLAHELGH~ 535 (623)
|.||.. .+++.|||||.
T Consensus 82 ~~~C~~---~Gti~HEl~Ha 98 (200)
T cd04281 82 GKNCDK---FGIVVHELGHV 98 (200)
T ss_pred CCCcCc---CchHHHHHHHH
Confidence 345652 58999999996
No 207
>KOG3314|consensus
Probab=20.61 E-value=1e+02 Score=30.70 Aligned_cols=44 Identities=27% Similarity=0.231 Sum_probs=31.7
Q ss_pred cEEEe-ccCCCceEEEehhHHhhccCCCccccCCCCCCCccccccccCCCCCCHHHHHHHHHHHhhhhhccCHHH
Q psy4798 469 NAYFY-GFFKNKRIVLFDTLLKDYVPLNADKKDKAGDSEPLISTEGANKKGCDTEEVLAVLAHELGHWKYNHVLK 542 (623)
Q Consensus 469 NAy~~-G~~~~krIVL~dtLl~~~~~~~~~~~~~~~~~~~~~~~~~~~g~gl~~~Ei~AVLAHELGH~k~~H~~k 542 (623)
|+... |+-+.+-|||..+=| -+.+++.-|++|||=|.--.|-++
T Consensus 65 ~~~~~GGy~~~~gIvlCqN~l------------------------------~~q~h~n~vv~HElIH~fDd~r~~ 109 (194)
T KOG3314|consen 65 TGPVAGGYTPGRGIVLCQNRL------------------------------TIQDHVNQVVIHELIHAFDDCRAK 109 (194)
T ss_pred CCCccCCccCCCceEEecccc------------------------------chHHHHHHHHHHHHHHHHHhhhhh
Confidence 34444 444666799998765 356899999999999986665544
No 208
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=20.50 E-value=76 Score=33.85 Aligned_cols=28 Identities=25% Similarity=0.258 Sum_probs=21.9
Q ss_pred CCHHHHHHHHHHHhhhhh---ccCHHHHHHH
Q psy4798 519 CDTEEVLAVLAHELGHWK---YNHVLKSMIL 546 (623)
Q Consensus 519 l~~~Ei~AVLAHELGH~k---~~H~~k~~~~ 546 (623)
.+++++...+-|||+|-| ++++..+-.+
T Consensus 192 ~dd~~lA~LIFHELAHQk~Y~~~DtAFNEsF 222 (376)
T COG4324 192 QDDTYLASLIFHELAHQKIYVNNDTAFNESF 222 (376)
T ss_pred CChHHHHHHHHHHHhhheEeecCcchHhHHH
Confidence 588999999999999976 5666555443
No 209
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=20.29 E-value=51 Score=32.81 Aligned_cols=11 Identities=45% Similarity=1.108 Sum_probs=9.3
Q ss_pred HHHHHHhhhhh
Q psy4798 146 LAVLAHELGHW 156 (623)
Q Consensus 146 ~AVlAHELGH~ 156 (623)
..|+.||+||.
T Consensus 10 ~~v~iHElGH~ 20 (182)
T cd06163 10 ILIFVHELGHF 20 (182)
T ss_pred HHHHHHHHHHH
Confidence 46899999996
Done!