Query 039411
Match_columns 222
No_of_seqs 226 out of 1513
Neff 7.9
Searched_HMMs 46136
Date Fri Mar 29 09:25:39 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/039411.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/039411hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 1.5E-54 3.2E-59 391.9 13.5 193 1-221 62-262 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 100.0 2.2E-39 4.7E-44 256.2 15.9 150 69-220 1-157 (157)
3 PF00413 Peptidase_M10: Matrix 100.0 7.3E-37 1.6E-41 240.0 14.0 147 69-220 1-154 (154)
4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 9.9E-30 2.1E-34 200.7 14.5 141 75-220 2-156 (156)
5 cd04268 ZnMc_MMP_like Zinc-dep 99.9 8.6E-27 1.9E-31 184.9 8.8 142 74-219 1-165 (165)
6 smart00235 ZnMc Zinc-dependent 99.9 4.1E-25 9E-30 171.1 11.2 132 68-221 3-140 (140)
7 cd04277 ZnMc_serralysin_like Z 99.9 8.9E-24 1.9E-28 171.3 14.2 123 87-220 33-186 (186)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 99.8 7.8E-20 1.7E-24 149.7 13.0 109 69-189 2-110 (198)
9 cd00203 ZnMc Zinc-dependent me 99.8 6.1E-19 1.3E-23 139.8 13.0 129 76-219 2-167 (167)
10 cd04280 ZnMc_astacin_like Zinc 99.7 5.5E-17 1.2E-21 131.0 11.5 91 75-189 2-92 (180)
11 cd04276 ZnMc_MMP_like_2 Zinc-d 99.7 1.6E-16 3.5E-21 129.5 11.6 144 74-221 7-197 (197)
12 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.7 3.8E-16 8.2E-21 127.7 11.9 99 67-189 7-105 (200)
13 PF12388 Peptidase_M57: Dual-a 99.7 6.3E-16 1.4E-20 126.0 13.0 137 69-219 33-211 (211)
14 PF01400 Astacin: Astacin (Pep 99.7 2E-16 4.3E-21 128.9 8.6 96 69-189 2-97 (191)
15 cd04282 ZnMc_meprin Zinc-depen 99.6 2.2E-15 4.9E-20 125.3 12.3 97 66-189 42-138 (230)
16 cd04283 ZnMc_hatching_enzyme Z 99.6 1.2E-14 2.5E-19 117.2 11.9 93 74-189 3-95 (182)
17 KOG3714 Meprin A metalloprotea 99.5 4.3E-14 9.3E-19 127.5 10.6 103 63-189 75-177 (411)
18 PF02031 Peptidase_M7: Strepto 99.1 2.5E-10 5.4E-15 85.5 7.3 122 76-220 3-130 (132)
19 COG5549 Predicted Zn-dependent 99.1 3.1E-10 6.7E-15 91.5 7.5 121 88-221 104-234 (236)
20 cd04267 ZnMc_ADAM_like Zinc-de 98.1 2E-05 4.4E-10 63.8 9.8 48 170-217 132-189 (192)
21 PF13583 Reprolysin_4: Metallo 98.1 5.5E-06 1.2E-10 68.3 5.1 48 171-219 137-198 (206)
22 cd04273 ZnMc_ADAMTS_like Zinc- 98.0 3.1E-05 6.8E-10 63.7 7.5 116 93-218 72-197 (207)
23 PF01471 PG_binding_1: Putativ 97.9 4.8E-06 1E-10 54.2 1.7 24 1-24 34-57 (57)
24 PF05572 Peptidase_M43: Pregna 97.9 8.8E-06 1.9E-10 64.1 3.3 23 169-191 67-89 (154)
25 PF11350 DUF3152: Protein of u 97.9 2.3E-05 5E-10 63.6 5.7 115 74-192 28-160 (203)
26 PF13574 Reprolysin_2: Metallo 97.7 1.1E-05 2.3E-10 64.8 0.5 48 171-219 111-171 (173)
27 COG1913 Predicted Zn-dependent 97.6 2.9E-05 6.3E-10 61.5 1.9 30 171-200 124-153 (181)
28 cd04271 ZnMc_ADAM_fungal Zinc- 97.5 9.7E-05 2.1E-09 61.8 3.8 47 173-220 147-218 (228)
29 PF13582 Reprolysin_3: Metallo 97.5 3.7E-05 8.1E-10 57.7 1.2 17 172-188 108-124 (124)
30 PRK13267 archaemetzincin-like 97.4 7.6E-05 1.7E-09 60.1 2.4 27 171-197 125-151 (179)
31 PF13688 Reprolysin_5: Metallo 97.4 4.7E-05 1E-09 61.8 1.0 23 169-191 140-162 (196)
32 cd04269 ZnMc_adamalysin_II_lik 97.4 0.00017 3.6E-09 58.6 4.2 48 170-218 130-183 (194)
33 cd04275 ZnMc_pappalysin_like Z 97.4 3.4E-05 7.4E-10 64.4 -0.4 24 169-192 135-158 (225)
34 PF07998 Peptidase_M54: Peptid 97.3 0.00012 2.7E-09 59.5 1.8 29 171-200 145-173 (194)
35 PF01421 Reprolysin: Reprolysi 97.0 0.00095 2.1E-08 54.4 4.2 50 169-218 129-186 (199)
36 cd04270 ZnMc_TACE_like Zinc-de 97.0 0.0012 2.6E-08 55.8 4.8 48 171-218 167-228 (244)
37 cd04272 ZnMc_salivary_gland_MP 96.9 0.0013 2.8E-08 54.6 4.6 49 170-218 144-210 (220)
38 TIGR03296 M6dom_TIGR03296 M6 f 96.2 0.0093 2E-07 51.6 5.6 19 172-190 166-184 (286)
39 PF11150 DUF2927: Protein of u 95.7 0.37 7.9E-06 40.0 12.7 145 68-220 28-194 (213)
40 PF10462 Peptidase_M66: Peptid 95.6 0.0053 1.2E-07 53.4 1.4 21 169-189 191-211 (305)
41 PF01457 Peptidase_M8: Leishma 95.6 0.0082 1.8E-07 56.2 2.6 60 116-187 159-226 (521)
42 PF05548 Peptidase_M11: Gameto 95.4 0.0088 1.9E-07 52.4 2.2 20 170-189 149-168 (314)
43 PTZ00337 surface protease GP63 95.4 0.026 5.7E-07 53.2 5.2 37 146-188 210-246 (567)
44 PF12044 Metallopep: Putative 95.0 0.015 3.2E-07 52.9 2.3 25 171-197 316-340 (423)
45 PTZ00257 Glycoprotein GP63 (le 94.2 0.059 1.3E-06 51.1 4.4 61 116-188 205-273 (622)
46 COG2738 Predicted Zn-dependent 93.8 0.082 1.8E-06 42.9 3.8 81 89-187 20-108 (226)
47 PF04298 Zn_peptidase_2: Putat 93.3 0.079 1.7E-06 44.0 3.0 76 90-183 18-101 (222)
48 KOG3658 Tumor necrosis factor- 93.1 0.082 1.8E-06 50.2 3.1 46 172-217 393-451 (764)
49 COG3824 Predicted Zn-dependent 92.5 0.058 1.3E-06 40.3 1.1 18 170-187 108-125 (136)
50 TIGR02869 spore_SleB spore cor 91.9 0.11 2.3E-06 42.7 2.1 32 1-33 45-76 (201)
51 COG4783 Putative Zn-dependent 91.5 0.088 1.9E-06 48.2 1.3 18 167-184 126-143 (484)
52 KOG4525 Jacalin-like lectin do 91.1 0.15 3.2E-06 46.2 2.3 24 171-196 301-324 (614)
53 PF05547 Peptidase_M6: Immune 91.0 0.04 8.7E-07 52.7 -1.4 20 171-190 221-240 (645)
54 PF09471 Peptidase_M64: IgA Pe 90.5 0.14 3.1E-06 43.8 1.6 24 170-193 215-239 (264)
55 KOG3607 Meltrins, fertilins an 90.3 0.14 3.1E-06 49.6 1.6 45 170-215 322-374 (716)
56 PF06262 DUF1025: Possibl zinc 88.5 0.27 5.8E-06 35.6 1.6 16 171-186 73-88 (97)
57 PRK10594 murein L,D-transpepti 88.0 0.32 6.9E-06 46.2 2.1 27 1-27 317-343 (608)
58 PF04228 Zn_peptidase: Putativ 87.8 1.1 2.4E-05 38.9 5.2 40 145-186 141-189 (292)
59 PF12725 DUF3810: Protein of u 87.7 0.27 5.9E-06 43.2 1.4 38 145-190 178-215 (318)
60 PF13398 Peptidase_M50B: Pepti 86.6 0.33 7.2E-06 39.7 1.2 15 170-184 21-35 (200)
61 PF01863 DUF45: Protein of unk 86.1 3.7 8E-05 33.1 7.2 77 91-195 112-188 (205)
62 PF14247 DUF4344: Domain of un 84.9 0.44 9.6E-06 39.7 1.2 14 170-183 91-104 (220)
63 COG2856 Predicted Zn peptidase 84.8 0.4 8.6E-06 39.8 0.8 16 172-187 73-88 (213)
64 PF06114 DUF955: Domain of unk 84.8 0.41 8.9E-06 34.6 0.8 15 169-183 40-54 (122)
65 COG5549 Predicted Zn-dependent 82.5 0.73 1.6E-05 37.9 1.5 110 91-200 49-182 (236)
66 PF01435 Peptidase_M48: Peptid 81.4 0.77 1.7E-05 37.4 1.3 20 169-189 87-106 (226)
67 cd06456 M3A_DCP_Oligopeptidase 81.2 14 0.0003 33.8 9.5 24 90-113 107-130 (422)
68 COG2989 Uncharacterized protei 81.0 0.91 2E-05 42.3 1.7 26 1-26 273-298 (561)
69 cd06457 M3A_MIP Peptidase M3 m 80.8 13 0.00029 34.2 9.3 25 89-113 132-156 (458)
70 PRK02870 heat shock protein Ht 80.2 1.3 2.8E-05 39.3 2.4 33 146-182 152-184 (336)
71 cd06163 S2P-M50_PDZ_RseP-like 79.7 0.94 2E-05 36.6 1.2 12 172-183 10-21 (182)
72 PRK03982 heat shock protein Ht 79.7 0.93 2E-05 39.1 1.3 15 169-183 123-137 (288)
73 COG0501 HtpX Zn-dependent prot 79.0 1 2.2E-05 38.6 1.3 16 168-183 154-169 (302)
74 cd06161 S2P-M50_SpoIVFB SpoIVF 78.8 1 2.2E-05 37.0 1.2 14 170-183 37-50 (208)
75 PRK03072 heat shock protein Ht 78.7 1 2.3E-05 38.9 1.3 17 168-184 124-140 (288)
76 PRK03001 M48 family peptidase; 78.6 1.1 2.3E-05 38.7 1.3 16 168-183 121-136 (283)
77 PRK01345 heat shock protein Ht 78.4 1.1 2.3E-05 39.4 1.3 19 169-188 122-140 (317)
78 cd06455 M3A_TOP Peptidase M3 T 76.0 22 0.00047 32.9 9.2 14 170-183 262-275 (472)
79 PF13699 DUF4157: Domain of un 75.9 3 6.5E-05 28.9 2.7 30 149-183 44-73 (79)
80 PRK04897 heat shock protein Ht 75.4 1.5 3.2E-05 38.2 1.3 15 169-183 135-149 (298)
81 cd06164 S2P-M50_SpoIVFB_CBS Sp 75.4 1.5 3.2E-05 36.7 1.2 15 171-185 53-67 (227)
82 cd06258 Peptidase_M3_like The 75.3 1.4 2.9E-05 39.1 1.1 15 169-183 152-166 (365)
83 KOG3538 Disintegrin metallopro 74.9 0.87 1.9E-05 45.3 -0.3 31 170-200 316-352 (845)
84 PRK05457 heat shock protein Ht 74.7 1.6 3.4E-05 37.8 1.3 15 168-182 131-145 (284)
85 cd06459 M3B_Oligoendopeptidase 74.5 1.4 3.1E-05 39.6 1.0 15 169-183 220-234 (427)
86 PF01432 Peptidase_M3: Peptida 74.5 1.5 3.2E-05 40.3 1.1 25 89-113 136-160 (458)
87 PRK02391 heat shock protein Ht 73.9 1.7 3.7E-05 37.8 1.3 15 168-182 130-144 (296)
88 PRK01265 heat shock protein Ht 73.7 1.7 3.7E-05 38.3 1.3 15 168-182 137-151 (324)
89 PF13485 Peptidase_MA_2: Pepti 73.2 2.2 4.8E-05 30.9 1.7 14 170-183 24-37 (128)
90 cd06162 S2P-M50_PDZ_SREBP Ster 72.2 1.9 4.2E-05 37.1 1.2 13 171-183 135-147 (277)
91 TIGR00181 pepF oligoendopeptid 72.2 2.4 5.2E-05 40.3 2.0 15 169-183 376-390 (591)
92 cd06159 S2P-M50_PDZ_Arch Uncha 71.8 2 4.3E-05 36.8 1.2 15 171-185 118-132 (263)
93 TIGR02289 M3_not_pepF oligoend 71.2 2.1 4.6E-05 40.3 1.4 15 169-183 335-349 (549)
94 TIGR02290 M3_fam_3 oligoendope 70.3 1.9 4E-05 41.0 0.8 14 170-183 374-387 (587)
95 PF01447 Peptidase_M4: Thermol 69.4 2.2 4.7E-05 33.4 0.8 12 172-183 136-147 (150)
96 cd06160 S2P-M50_like_2 Unchara 68.6 2.6 5.7E-05 34.0 1.2 20 171-190 41-60 (183)
97 PF09374 PG_binding_3: Predict 67.7 2.6 5.6E-05 28.6 0.9 24 1-24 3-30 (72)
98 PLN02791 Nudix hydrolase homol 65.4 3.3 7.1E-05 40.7 1.4 21 170-190 584-604 (770)
99 COG4227 Antirestriction protei 64.3 4.9 0.00011 34.3 2.1 23 168-190 200-222 (316)
100 KOG2921 Intramembrane metallop 63.4 3.5 7.5E-05 37.2 1.0 19 171-189 131-149 (484)
101 cd06158 S2P-M50_like_1 Unchara 61.8 4.2 9.2E-05 32.7 1.2 13 171-183 9-21 (181)
102 PF14891 Peptidase_M91: Effect 61.2 8.7 0.00019 30.5 2.9 17 170-186 102-118 (174)
103 COG3409 Putative peptidoglycan 61.1 5.2 0.00011 31.4 1.6 27 1-27 78-105 (185)
104 cd06460 M32_Taq Peptidase fami 60.5 3.8 8.2E-05 37.2 0.8 15 172-186 160-177 (396)
105 COG2317 Zn-dependent carboxype 60.5 4.7 0.0001 37.2 1.4 12 171-183 260-271 (497)
106 KOG2089 Metalloendopeptidase f 60.5 24 0.00052 34.1 6.0 33 91-123 392-426 (718)
107 COG1164 Oligoendopeptidase F [ 60.1 4 8.7E-05 39.0 0.9 15 169-183 378-392 (598)
108 PRK10911 oligopeptidase A; Pro 58.6 4.2 9.1E-05 39.5 0.8 22 91-112 361-382 (680)
109 PF01434 Peptidase_M41: Peptid 58.0 5.3 0.00011 32.9 1.2 15 169-183 26-40 (213)
110 COG4784 Putative Zn-dependent 57.7 5.9 0.00013 35.3 1.5 16 168-183 121-136 (479)
111 KOG2719 Metalloprotease [Gener 57.1 5.5 0.00012 36.4 1.2 14 169-182 278-291 (428)
112 PF02102 Peptidase_M35: Deuter 56.9 13 0.00028 33.3 3.5 47 138-185 262-311 (359)
113 PF04450 BSP: Peptidase of pla 56.5 81 0.0018 25.9 7.9 38 147-184 71-109 (205)
114 TIGR00054 RIP metalloprotease 56.2 5.6 0.00012 36.3 1.2 12 172-183 15-26 (420)
115 PRK10779 zinc metallopeptidase 54.8 5.4 0.00012 36.7 0.8 12 172-183 16-27 (449)
116 PF03571 Peptidase_M49: Peptid 52.7 6.9 0.00015 37.0 1.2 15 171-185 293-308 (549)
117 cd02641 R3H_Smubp-2_like R3H d 51.1 7.9 0.00017 25.3 0.9 21 170-190 28-48 (60)
118 COG2321 Predicted metalloprote 50.2 7.5 0.00016 33.4 0.9 39 145-183 138-179 (295)
119 PRK10280 dipeptidyl carboxypep 49.6 7.3 0.00016 37.9 0.8 32 90-121 363-397 (681)
120 KOG2661 Peptidase family M48 [ 46.9 10 0.00022 33.7 1.2 19 170-189 274-292 (424)
121 PF06167 Peptidase_M90: Glucos 45.0 9.5 0.00021 32.4 0.7 21 168-188 151-171 (253)
122 PF12315 DUF3633: Protein of u 42.8 13 0.00029 30.6 1.3 16 170-185 92-108 (212)
123 PF01431 Peptidase_M13: Peptid 40.3 16 0.00034 29.5 1.4 17 171-187 36-52 (206)
124 cd02639 R3H_RRM R3H domain of 39.9 16 0.00034 24.0 1.0 23 170-192 28-50 (60)
125 COG0339 Dcp Zn-dependent oligo 39.0 13 0.00029 35.9 0.8 33 90-122 364-398 (683)
126 cd06461 M2_ACE Peptidase famil 38.5 16 0.00035 34.0 1.2 17 169-185 245-261 (477)
127 KOG2090 Metalloendopeptidase f 38.0 14 0.00031 35.2 0.8 22 13-34 318-341 (704)
128 COG3590 PepO Predicted metallo 37.6 9.7 0.00021 36.1 -0.3 14 171-184 487-500 (654)
129 PF08434 CLCA_N: Calcium-activ 36.3 14 0.00031 31.6 0.5 26 172-200 150-176 (262)
130 PRK15410 DgsA anti-repressor M 35.2 18 0.00039 30.9 0.9 20 170-189 141-160 (260)
131 COG3091 SprT Zn-dependent meta 34.9 9 0.00019 30.0 -0.9 39 140-183 35-73 (156)
132 smart00731 SprT SprT homologue 34.5 20 0.00042 27.6 1.0 32 148-183 40-71 (146)
133 PF02074 Peptidase_M32: Carbox 33.8 15 0.00033 34.3 0.3 15 174-188 262-279 (494)
134 PF10263 SprT-like: SprT-like 33.5 23 0.0005 27.2 1.2 16 169-184 58-73 (157)
135 PF10026 DUF2268: Predicted Zn 33.4 1.2E+02 0.0025 24.5 5.4 19 169-187 63-81 (195)
136 PRK10733 hflB ATP-dependent me 32.9 21 0.00047 34.4 1.2 14 170-183 407-420 (644)
137 PHA00657 crystallin beta/gamma 32.3 34 0.00074 35.7 2.4 13 170-182 790-802 (2052)
138 PF13402 M60-like: Peptidase M 31.0 21 0.00045 30.7 0.7 10 174-183 222-231 (307)
139 CHL00176 ftsH cell division pr 30.7 21 0.00046 34.5 0.7 14 170-183 437-450 (638)
140 COG5504 Predicted Zn-dependent 30.3 1E+02 0.0022 26.4 4.6 78 89-191 81-160 (280)
141 cd02640 R3H_NRF R3H domain of 30.2 30 0.00065 22.6 1.2 20 170-189 28-47 (60)
142 PRK04860 hypothetical protein; 29.7 15 0.00032 29.1 -0.5 32 147-182 43-74 (160)
143 COG4412 Uncharacterized protei 29.6 15 0.00033 34.6 -0.4 18 173-190 348-365 (760)
144 PF10460 Peptidase_M30: Peptid 28.6 38 0.00082 30.5 1.9 16 169-184 137-152 (366)
145 PF09768 Peptidase_M76: Peptid 28.5 49 0.0011 26.5 2.3 35 145-185 51-85 (173)
146 cd06007 R3H_DEXH_helicase R3H 28.2 33 0.00072 22.3 1.1 20 170-189 27-46 (59)
147 KOG1047 Bifunctional leukotrie 28.0 31 0.00067 32.7 1.3 20 167-186 284-304 (613)
148 PRK04351 hypothetical protein; 26.6 27 0.00057 27.3 0.5 32 147-182 41-72 (149)
149 PF14521 Aspzincin_M35: Lysine 25.6 48 0.001 25.6 1.8 39 150-190 77-115 (148)
150 PF13058 DUF3920: Protein of u 23.8 46 0.001 24.7 1.3 15 170-184 75-89 (126)
151 TIGR00043 metalloprotein, YbeY 22.7 69 0.0015 23.5 2.1 22 170-191 72-93 (110)
152 PF02130 UPF0054: Uncharacteri 22.5 69 0.0015 24.7 2.1 22 169-190 106-127 (145)
153 PF03272 Enhancin: Viral enhan 22.3 34 0.00073 33.9 0.4 18 94-111 173-190 (775)
154 TIGR01241 FtsH_fam ATP-depende 22.1 43 0.00094 31.1 1.1 14 170-183 310-323 (495)
155 COG5570 Uncharacterized small 22.0 44 0.00096 21.3 0.8 18 13-30 14-31 (57)
156 PRK00016 metal-binding heat sh 22.0 63 0.0014 25.3 1.9 43 148-190 87-132 (159)
157 KOG3314 Ku70-binding protein [ 21.4 73 0.0016 25.4 2.0 15 169-183 89-103 (194)
158 KOG3624 M13 family peptidase [ 21.3 43 0.00094 32.5 0.9 14 171-184 518-531 (687)
159 PF10463 Peptidase_U49: Peptid 20.9 42 0.00091 27.7 0.7 19 172-191 102-120 (206)
160 PF13203 DUF2201_N: Putative m 20.4 52 0.0011 28.2 1.1 22 169-191 58-79 (292)
161 PF15639 Tox-MPTase3: Metallop 20.2 1.8E+02 0.0038 22.2 3.8 36 91-126 23-59 (135)
162 PF15127 DUF4565: Protein of u 20.1 89 0.0019 22.2 2.1 14 90-103 63-76 (91)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=1.5e-54 Score=391.87 Aligned_cols=193 Identities=45% Similarity=0.789 Sum_probs=178.1
Q ss_pred ChhhhhcCCCCCccCCCHHHHHHhcCCCCCCCCCCCCcccccCCCcccCCCCCcccccceeeeCCCCCCCCCCCceeEEE
Q 039411 1 MKTYQLNYHLNSTGVLDGNTVAMMTRPRCGVADIVNGTTRMRSGMKSHQHGANHFHTVSHYEFFPGRPRWPPSNSHLTYA 80 (222)
Q Consensus 1 i~~~Q~~~~L~~TG~ld~~T~~~m~~pRCg~pD~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~W~~~~~~lty~ 80 (222)
|+.||++++|++||+||.+|+++|.+||||+||. +|.+++++.||+ +.+|||+
T Consensus 62 l~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~-------------------------~~~~~~~~~kW~--k~~lT~r 114 (469)
T KOG1565|consen 62 LKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG-------------------------RYRYFPGKPKWN--KEHLTYR 114 (469)
T ss_pred HHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------------------------ccccCcccCccc--cccccee
Confidence 5789999999999999999999999999999991 456677999999 9999999
Q ss_pred ecCCC----hHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCC---CCceee
Q 039411 81 FLPGT----PAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPP---RGGIFD 153 (222)
Q Consensus 81 ~~~~~----~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P---~~g~i~ 153 (222)
|.+.+ ..+++.++++||+.|+++++|+|.|+.....|||+|.|..+.|+|++||||++|+|||||+| .+|++|
T Consensus 115 i~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~h 194 (469)
T KOG1565|consen 115 IKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLH 194 (469)
T ss_pred ccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCcccCCCCceecccCCCCCCCCccc
Confidence 99885 33899999999999999999999999986699999999999999999999999999999999 689999
Q ss_pred ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCC-CCCCCCCHhhHHHHHHHhcC
Q 039411 154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDP-GATKGLHTDDVQGIRALYNN 221 (222)
Q Consensus 154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~-~~~~~l~~~Di~~iq~LYg~ 221 (222)
||++|.|+++ ...+.+++.||+|||||+|||.||.++++||||+|.+ .....|+.|||.+||.|||.
T Consensus 195 fD~dE~Wt~~-~~~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~ 262 (469)
T KOG1565|consen 195 FDKDETWTYG-DSNGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGG 262 (469)
T ss_pred cCcccceecc-CCccchhHHHhhhhcccccccCCCCCcccccccccccCCCCcccChhhhhhhHHHhCC
Confidence 9999999997 3457899999999999999999999999999999994 34568999999999999996
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00 E-value=2.2e-39 Score=256.16 Aligned_cols=150 Identities=44% Similarity=0.798 Sum_probs=135.7
Q ss_pred CCCCCCceeEEEecCCC----hHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeee
Q 039411 69 RWPPSNSHLTYAFLPGT----PAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHA 144 (222)
Q Consensus 69 ~W~~~~~~lty~~~~~~----~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a 144 (222)
||+ +++|+|+|.+.. ...++++|++||+.|+++++|+|+|+.+.+.|||+|.|....++|+++|+|.++++|||
T Consensus 1 kW~--~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a 78 (157)
T cd04278 1 KWS--KTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA 78 (157)
T ss_pred CCC--CCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence 799 999999998754 35889999999999999999999999876689999999999999889999999999999
Q ss_pred cCC--CCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCCC-CCCCCCHhhHHHHHHHhc
Q 039411 145 APP--RGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDPG-ATKGLHTDDVQGIRALYN 220 (222)
Q Consensus 145 ~~P--~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~~-~~~~l~~~Di~~iq~LYg 220 (222)
++| ..|+|+||.++.|+......+.+++.|++|||||+|||.|+.++.+||||++... ....|+.+||.+||+|||
T Consensus 79 ~~p~~~~g~i~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~~~lYg 157 (157)
T cd04278 79 FFPGGIGGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGIQALYG 157 (157)
T ss_pred cCCCCcceeEEECCCcccccCCCCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHHHHhcC
Confidence 998 7899999999999887534567899999999999999999999999999999843 256899999999999998
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00 E-value=7.3e-37 Score=240.03 Aligned_cols=147 Identities=42% Similarity=0.798 Sum_probs=130.9
Q ss_pred CCCCCCceeEEEecCCCh----HHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeee
Q 039411 69 RWPPSNSHLTYAFLPGTP----AEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHA 144 (222)
Q Consensus 69 ~W~~~~~~lty~~~~~~~----~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a 144 (222)
||+ +++|||+|.+..+ .++++++++||+.|+++++|+|+++.+. ++||+|.|....+++...|++.+++++++
T Consensus 1 ~W~--~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~~-~adi~i~~~~~~~~~~~~~~~~~~~~~~~ 77 (154)
T PF00413_consen 1 KWP--KKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSDG-NADIRISFGSNNHGDGYSFDGSGGTLAHA 77 (154)
T ss_dssp SSS--SSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESSS-SCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred CCC--CCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccCC-CcceeeeeeccccCcccccccceeeeecc
Confidence 799 8999999987542 3788999999999999999999999853 69999999988888888899999999999
Q ss_pred cCCC---CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCCceeeeeCCCCCCCCCHhhHHHHHHHhc
Q 039411 145 APPR---GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGAIMYAYIDPGATKGLHTDDVQGIRALYN 220 (222)
Q Consensus 145 ~~P~---~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~~~~~~~l~~~Di~~iq~LYg 220 (222)
+.|. .|+++|+.++.|... ..+.++..|++|||||+|||.|+.+++|||||++.......|+.+||.+||+|||
T Consensus 78 ~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~~~l~~~Di~~i~~lYg 154 (154)
T PF00413_consen 78 YFPNNIVSGDIHFNDDESWTID--DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDNKTLSEDDIDGIQYLYG 154 (154)
T ss_dssp EESSSTTTTEEEEETTSHEESS--SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSSTSTTHHHHHHHHHHHS
T ss_pred ccccccccccccccccccchhh--hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCCCCCCHHHHHHHHHHhC
Confidence 9984 799999999999876 3457899999999999999999999999999999854435899999999999998
No 4
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.97 E-value=9.9e-30 Score=200.68 Aligned_cols=141 Identities=29% Similarity=0.427 Sum_probs=109.6
Q ss_pred ceeEEEecCCC------hHHHHHHHHHHHHHHHhcccceEEEecCC-CCceEEEEEeecCCCCCCCCCCCCCeeeeecCC
Q 039411 75 SHLTYAFLPGT------PAEAINAVGRAFQTWAAISHFKFSQTQDY-TNANLKISFQRGWHGDSSPFDGRGGTIAHAAPP 147 (222)
Q Consensus 75 ~~lty~~~~~~------~~~~~~~i~~A~~~W~~~~~l~F~ev~~~-~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P 147 (222)
..|+|++.+.. +.+++++|++||+.|+++++|+|+++... .++||+|.|.... +++|.|+++||+++|
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p 76 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP 76 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence 46888888653 34789999999999999999999998753 3799999998532 788999999999999
Q ss_pred CCce---eeecCCCC-cccCCCCCccchhHHHHHhhhhhcCCCCCCCCC-CceeeeeCCCC--CCCCCHhhHHHHHHHhc
Q 039411 148 RGGI---FDYDADEQ-WAVGARPGAFDLETVALHEIGHLLGLDHSLVEG-AIMYAYIDPGA--TKGLHTDDVQGIRALYN 220 (222)
Q Consensus 148 ~~g~---i~fd~~~~-w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~-siM~p~~~~~~--~~~l~~~Di~~iq~LYg 220 (222)
..|. ..++.... +.......+.+++.|++|||||+|||.|+.++. ++|||++..+. ...|+.+|+++|++|||
T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~lY~ 156 (156)
T cd04279 77 LISDGNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRLYG 156 (156)
T ss_pred ccCCCcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHHhC
Confidence 6432 22221111 111111124588999999999999999999998 99999998543 35899999999999997
No 5
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.94 E-value=8.6e-27 Score=184.87 Aligned_cols=142 Identities=22% Similarity=0.273 Sum_probs=106.5
Q ss_pred CceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceee
Q 039411 74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFD 153 (222)
Q Consensus 74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~ 153 (222)
++.|+|+|.+..+...+++|++||+.|+++++|+|+|+...+.+||+|.+....+.. +|..+.+++...|..|+|+
T Consensus 1 ~~~I~y~i~~~~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~~g~i~ 76 (165)
T cd04268 1 KKPITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPLTGEIL 76 (165)
T ss_pred CCCEEEEEcCCCCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCCCccEE
Confidence 468999999987788999999999999999999999998766799999987632111 1111122222245678999
Q ss_pred ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC----------------CCCceeeeeCC-------CCCCCCCHh
Q 039411 154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV----------------EGAIMYAYIDP-------GATKGLHTD 210 (222)
Q Consensus 154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~----------------~~siM~p~~~~-------~~~~~l~~~ 210 (222)
++....|.......+..+..|++|||||+|||.|+.+ ..|||++.... +....|+..
T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~~~~ 156 (165)
T cd04268 77 LARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTIGPY 156 (165)
T ss_pred eeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCCCHH
Confidence 8776544311001234688999999999999999998 89999876542 134689999
Q ss_pred hHHHHHHHh
Q 039411 211 DVQGIRALY 219 (222)
Q Consensus 211 Di~~iq~LY 219 (222)
||.+||+||
T Consensus 157 Di~ai~~lY 165 (165)
T cd04268 157 DIAAIKKLY 165 (165)
T ss_pred HHHHHHhcC
Confidence 999999999
No 6
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.92 E-value=4.1e-25 Score=171.12 Aligned_cols=132 Identities=30% Similarity=0.533 Sum_probs=107.1
Q ss_pred CCCCCCCceeEEEec-CCChH-HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411 68 PRWPPSNSHLTYAFL-PGTPA-EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAA 145 (222)
Q Consensus 68 ~~W~~~~~~lty~~~-~~~~~-~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~ 145 (222)
.+|+ +.+|+|+|. +..+. +++++|++||+.|+++++|+|+|+.+ .+||.|.|....++ |..+++++
T Consensus 3 ~~W~--~~~v~Y~i~~~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~~~a~~g 70 (140)
T smart00235 3 KKWP--KGTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GCTLSHAG 70 (140)
T ss_pred CcCC--CCcEEEEEcCCCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Ccceeeee
Confidence 5899 899999997 54444 45999999999999999999999875 79999999986651 22268888
Q ss_pred CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCCCCC---ceeeeeCCC-CCCCCCHhhHHHHHHHhcC
Q 039411 146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLVEGA---IMYAYIDPG-ATKGLHTDDVQGIRALYNN 221 (222)
Q Consensus 146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~~s---iM~p~~~~~-~~~~l~~~Di~~iq~LYg~ 221 (222)
.| .|+++|+. +.|... ..|++|||||+|||.|++++.+ +|++.+..- ..+.|+++|..+++++||.
T Consensus 71 ~~-~g~~~~~~-~~~~~~--------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~~yg~ 140 (140)
T smart00235 71 RP-GGDQHFSL-GNGCIN--------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPYDYGS 140 (140)
T ss_pred cC-CCceEEEc-cCCcCC--------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchhccCc
Confidence 88 78899875 455432 3599999999999999999888 999999732 1457888899999999984
No 7
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.91 E-value=8.9e-24 Score=171.26 Aligned_cols=123 Identities=22% Similarity=0.280 Sum_probs=96.3
Q ss_pred HHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC-------CceeeecCCCC
Q 039411 87 AEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR-------GGIFDYDADEQ 159 (222)
Q Consensus 87 ~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~-------~g~i~fd~~~~ 159 (222)
...+++|++||+.|+++++|+|+|+.+...+||+|.+..... ++..+.|++|. .|+|+++....
T Consensus 33 ~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~~ 103 (186)
T cd04277 33 AAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSYD 103 (186)
T ss_pred HHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCcc
Confidence 378899999999999999999999987668999998765321 34667788773 46899998776
Q ss_pred cccCCCCCccchhHHHHHhhhhhcCCCCCCCC----------------CCceeeeeCCC--------CCCCCCHhhHHHH
Q 039411 160 WAVGARPGAFDLETVALHEIGHLLGLDHSLVE----------------GAIMYAYIDPG--------ATKGLHTDDVQGI 215 (222)
Q Consensus 160 w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~~----------------~siM~p~~~~~--------~~~~l~~~Di~~i 215 (222)
+... ..+.....|++|||||+|||.|+++. -+||......+ ....++..||++|
T Consensus 104 ~~~~--~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~Al 181 (186)
T cd04277 104 TNSD--SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIAAL 181 (186)
T ss_pred cccC--CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHHHH
Confidence 6543 23456789999999999999999853 36886544321 2347889999999
Q ss_pred HHHhc
Q 039411 216 RALYN 220 (222)
Q Consensus 216 q~LYg 220 (222)
|+|||
T Consensus 182 Q~lYG 186 (186)
T cd04277 182 QYLYG 186 (186)
T ss_pred HHhhC
Confidence 99998
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.83 E-value=7.8e-20 Score=149.68 Aligned_cols=109 Identities=28% Similarity=0.460 Sum_probs=78.0
Q ss_pred CCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC
Q 039411 69 RWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR 148 (222)
Q Consensus 69 ~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~ 148 (222)
+|+. .+.|+|.|..+.+.+.++.|++||+.|++++||+|++++.. .++|+|.|..+. | ++++.|..+.. ..+.
T Consensus 2 ~W~~-~~~~~~~f~~~~~~~~r~~I~~A~~~W~~~t~i~F~~~~~~-~adi~I~f~~~~-G-c~S~vG~~~~~---~~~~ 74 (198)
T cd04327 2 LWRN-GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDA-DADIRISFTPGD-G-YWSYVGTDALL---IGAD 74 (198)
T ss_pred CCCC-CCeEEEEeCCCCcHHHHHHHHHHHHHHhhhcCeEEEECCCC-CCCEEEEEecCC-C-CCCCcCCcccc---cCCC
Confidence 6996 66788999887666889999999999999999999998764 789999998643 4 45666654321 1112
Q ss_pred CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
...+.+. |.... ..+++...|++|||||||||.|++
T Consensus 75 ~~t~~l~----~~~~~-~~~~~~~~~i~HElgHaLG~~HEh 110 (198)
T cd04327 75 APTMNLG----WFTDD-TPDPEFSRVVLHEFGHALGFIHEH 110 (198)
T ss_pred Cceeeee----eecCC-CchhhHHHHHHHHHHHHhcCcccc
Confidence 2344443 21110 123567789999999999999973
No 9
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.80 E-value=6.1e-19 Score=139.82 Aligned_cols=129 Identities=22% Similarity=0.267 Sum_probs=95.1
Q ss_pred eeEEEecCC--------ChHHHHHHHHHHHHHHHhcccceEEEecCC-CCceEEEEEeecCCCCCCCCCCCCCeeeeecC
Q 039411 76 HLTYAFLPG--------TPAEAINAVGRAFQTWAAISHFKFSQTQDY-TNANLKISFQRGWHGDSSPFDGRGGTIAHAAP 146 (222)
Q Consensus 76 ~lty~~~~~--------~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~-~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~ 146 (222)
.|+|.|... .+..++++|+.|++.|+++++++|+++... ..+||.+.+..... .++.++.|+.
T Consensus 2 ~i~~~i~~d~~~~~~~~~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~ 73 (167)
T cd00203 2 VIPYVVVADDRDVEEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYL 73 (167)
T ss_pred EEEEEEEecccccchhhhHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEec
Confidence 455555543 345788999999999999999999999864 47999988865332 2234566666
Q ss_pred CC-----CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC--------------------CCCceeeeeCC
Q 039411 147 PR-----GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV--------------------EGAIMYAYIDP 201 (222)
Q Consensus 147 P~-----~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~--------------------~~siM~p~~~~ 201 (222)
|. .+.+.+.....+ ......|++|||||+|||.|... ..+||+|....
T Consensus 74 ~~~c~~~~~~~~~~~~~~~-------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~ 146 (167)
T cd00203 74 GRVCDSLRGVGVLQDNQSG-------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGS 146 (167)
T ss_pred CCccCCCCCcEEEecCCcc-------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccc
Confidence 52 355655443222 23567899999999999999876 36899987764
Q ss_pred ---CCCCCCCHhhHHHHHHHh
Q 039411 202 ---GATKGLHTDDVQGIRALY 219 (222)
Q Consensus 202 ---~~~~~l~~~Di~~iq~LY 219 (222)
.....++..||+.||+||
T Consensus 147 ~~~~~~~~fS~~d~~~i~~~Y 167 (167)
T cd00203 147 FSDGQRKDFSQCDIDQINKLY 167 (167)
T ss_pred cCcccCCCcCHHHHHHHHhhC
Confidence 245689999999999998
No 10
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.72 E-value=5.5e-17 Score=130.95 Aligned_cols=91 Identities=18% Similarity=0.219 Sum_probs=65.5
Q ss_pred ceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeee
Q 039411 75 SHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDY 154 (222)
Q Consensus 75 ~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~f 154 (222)
..|.|.|.+..+...++.|++||+.|++++||+|+|.+. ++..|.|..+ .|| ++..|.-+ ....+.+
T Consensus 2 ~~VpY~i~~~~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc-~S~vG~~~--------~~q~i~l 68 (180)
T cd04280 2 GTVPYVIDGSFDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGC-WSYVGRVG--------GRQVVSL 68 (180)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCc-ceecCccC--------CceeEEe
Confidence 579999998766688999999999999999999999875 4445666654 443 33333211 1233444
Q ss_pred cCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 155 DADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 155 d~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
.. ++....+++|||||||||.|++
T Consensus 69 ~~-----------~c~~~g~v~HE~~HalG~~HEh 92 (180)
T cd04280 69 GS-----------GCFSLGTIVHELMHALGFYHEQ 92 (180)
T ss_pred CC-----------CcCcCchhHHHHHHHhcCcchh
Confidence 21 2344689999999999999977
No 11
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.70 E-value=1.6e-16 Score=129.51 Aligned_cols=144 Identities=16% Similarity=0.133 Sum_probs=89.8
Q ss_pred CceeEEEecCCChHHHHHHHHHHHHHHHhc------ccceEEEecCC--CCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411 74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAI------SHFKFSQTQDY--TNANLKISFQRGWHGDSSPFDGRGGTIAHAA 145 (222)
Q Consensus 74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~------~~l~F~ev~~~--~~adI~I~f~~~~~gd~~~f~g~g~~la~a~ 145 (222)
|+.|||+|.+.+|.+++++|++||+.|+++ .++.+.++... ...||++...+--+.+ ++..+...+..
T Consensus 7 k~pItyyI~~~~p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~----~~~~~~gps~~ 82 (197)
T cd04276 7 KEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP----NGGWAYGPSVV 82 (197)
T ss_pred CCCEEEEecCCCcHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC----CCcceeccccc
Confidence 899999999998889999999999999987 34555555332 2468884433322221 12222333444
Q ss_pred CCCC-----ceeeecCCCCcccCCC---CCccchhHHHHHhhhhhcCCCCCCC---------------------CCCcee
Q 039411 146 PPRG-----GIFDYDADEQWAVGAR---PGAFDLETVALHEIGHLLGLDHSLV---------------------EGAIMY 196 (222)
Q Consensus 146 ~P~~-----g~i~fd~~~~w~~~~~---~~~~~~~~va~HEiGHaLGL~Hs~~---------------------~~siM~ 196 (222)
.|.. ++|.++.......... ..+..+..+++||+||+|||.|... ..|||-
T Consensus 83 dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~SVMd 162 (197)
T cd04276 83 DPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSSVMD 162 (197)
T ss_pred CCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcceeec
Confidence 5644 4566654332111100 0123467899999999999999642 246662
Q ss_pred e---eeCC-------CCCCCCCHhhHHHHHHHhcC
Q 039411 197 A---YIDP-------GATKGLHTDDVQGIRALYNN 221 (222)
Q Consensus 197 p---~~~~-------~~~~~l~~~Di~~iq~LYg~ 221 (222)
. .+.. .....+...||.+||+|||.
T Consensus 163 Y~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~ 197 (197)
T cd04276 163 YPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP 197 (197)
T ss_pred CccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence 1 1110 12347889999999999984
No 12
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.68 E-value=3.8e-16 Score=127.66 Aligned_cols=99 Identities=20% Similarity=0.353 Sum_probs=68.8
Q ss_pred CCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecC
Q 039411 67 RPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAP 146 (222)
Q Consensus 67 ~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~ 146 (222)
..+|+ +..|.|.|....+...+++|++|++.|++.+||+|++..+. .. .|.|..+..|| +++.|.-+.-
T Consensus 7 ~~~Wp--~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~~-~~--yi~f~~~~~GC-~S~vG~~~~g----- 75 (200)
T cd04281 7 ERIWP--GGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPE-EN--YIVFTYRPCGC-CSYVGRRGNG----- 75 (200)
T ss_pred cCcCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCceEEEECCCC-CC--EEEEEECCCCe-eEcCCCcCCC-----
Confidence 46899 99999999877666778999999999999999999997652 33 34454443454 4444432100
Q ss_pred CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
.-.+.+ + .++....|++|||||||||.|++
T Consensus 76 --~q~isl--------~---~~C~~~Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 76 --PQAISI--------G---KNCDKFGIVVHELGHVIGFWHEH 105 (200)
T ss_pred --ceeeec--------C---CCcCcCchHHHHHHHHhcCcchh
Confidence 012222 2 12344689999999999999977
No 13
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.68 E-value=6.3e-16 Score=126.05 Aligned_cols=137 Identities=22% Similarity=0.321 Sum_probs=94.8
Q ss_pred CCCCCCceeEEEecC----CChHHHHHHHHHHHHHHHhc-ccceEEEe--cCCCCceEEEEEeecCCCCCCCCCCCCCee
Q 039411 69 RWPPSNSHLTYAFLP----GTPAEAINAVGRAFQTWAAI-SHFKFSQT--QDYTNANLKISFQRGWHGDSSPFDGRGGTI 141 (222)
Q Consensus 69 ~W~~~~~~lty~~~~----~~~~~~~~~i~~A~~~W~~~-~~l~F~ev--~~~~~adI~I~f~~~~~gd~~~f~g~g~~l 141 (222)
.|....++|+..+.+ ..+..++.++.+|++.|+.+ ..|+|+-+ .....+||.|.-... +.+.|.+
T Consensus 33 lV~~~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~g 104 (211)
T PF12388_consen 33 LVGSSPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAG 104 (211)
T ss_pred eecCCCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcc
Confidence 453226788888887 34457889999999999997 45777422 222356776642211 1233456
Q ss_pred eeecCC-CCc----eeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC---------------------------
Q 039411 142 AHAAPP-RGG----IFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL--------------------------- 189 (222)
Q Consensus 142 a~a~~P-~~g----~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~--------------------------- 189 (222)
+.|.|| .+| .|.+. ... .........|++|||||+|||.|++
T Consensus 105 gsAGFP~s~G~P~~~I~I~-----~~~-~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGTP 178 (211)
T PF12388_consen 105 GSAGFPTSNGNPYKFIQIY-----GLS-NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGTP 178 (211)
T ss_pred eeccCCCCCCCCCceEEEE-----ecC-CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCCC
Confidence 778888 555 45551 111 1122345679999999999999986
Q ss_pred ---CCCCceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411 190 ---VEGAIMYAYIDPGATKGLHTDDVQGIRALY 219 (222)
Q Consensus 190 ---~~~siM~p~~~~~~~~~l~~~Di~~iq~LY 219 (222)
+++|||..++..+....++..||.+|++||
T Consensus 179 t~~d~~SiM~ac~~~~~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 179 TGADPNSIMNACFSSGEDGEFTSNDITALNYLY 211 (211)
T ss_pred CCCCCchhhhccccCCCCCCcChhhHHHHHhhC
Confidence 457899998876677799999999999999
No 14
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.67 E-value=2e-16 Score=128.87 Aligned_cols=96 Identities=25% Similarity=0.439 Sum_probs=63.1
Q ss_pred CCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCC
Q 039411 69 RWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPR 148 (222)
Q Consensus 69 ~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~ 148 (222)
+|+ +.+|.|.|.+..+...++.|++|+..|++.+||+|+++. . .....|.|.. ..|| +++.|..+ .
T Consensus 2 ~Wp--~~~IpY~~~~~~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~-~~gC-~S~vG~~~--------g 67 (191)
T PF01400_consen 2 KWP--NGTIPYYIDPSFSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSN-GSGC-WSYVGRQG--------G 67 (191)
T ss_dssp S-G--GGEEEEEEETTS-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEES-SSSE-EEESS--S--------S
T ss_pred cCC--CCEEEEEECCCCCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeec-Cccc-cchhhhcC--------c
Confidence 799 889999999887778889999999999999999999998 3 3334455543 3343 34444221 0
Q ss_pred CceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
.-.|.+. .++....|++|||||||||.|++
T Consensus 68 ~q~i~l~-----------~~c~~~~~i~HEl~HaLG~~HEh 97 (191)
T PF01400_consen 68 EQTINLG-----------DGCFSVGTILHELGHALGFWHEH 97 (191)
T ss_dssp EEEEEE------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred ceeEEec-----------ceeCCccchHHHHHHHHhhhhhh
Confidence 1233332 12345679999999999999977
No 15
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.64 E-value=2.2e-15 Score=125.29 Aligned_cols=97 Identities=24% Similarity=0.342 Sum_probs=68.5
Q ss_pred CCCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeec
Q 039411 66 GRPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAA 145 (222)
Q Consensus 66 ~~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~ 145 (222)
...+|+ + .|.|.|....+...++.|++|++.|++.+||+|++.+.. .. .|.|..+ .|| +++.|.-+
T Consensus 42 ~~~~Wp--~-~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~rt~e-~~--yi~i~~~-~GC-~S~vG~~g------ 107 (230)
T cd04282 42 DTYRWP--F-PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGE-SN--YIFFFKG-SGC-WSMVGDQQ------ 107 (230)
T ss_pred cccCCC--c-ceeEEECCCCCHHHHHHHHHHHHHHHhCCCeeEEECCCC-Cc--EEEEEcC-CCe-eeccCccC------
Confidence 457999 7 899999887766788999999999999999999987643 33 3445443 344 44444311
Q ss_pred CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
....+.+ + .++....+++|||||||||.|++
T Consensus 108 --g~q~isl--------~---~~C~~~Gti~HEl~HalGf~HEq 138 (230)
T cd04282 108 --GGQNLSI--------G---AGCDYKATVEHEFLHALGFYHEQ 138 (230)
T ss_pred --CeEEEEE--------C---CCcCCCchHHHHHHHHhCCcccc
Confidence 0123333 2 13455689999999999999987
No 16
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.60 E-value=1.2e-14 Score=117.20 Aligned_cols=93 Identities=17% Similarity=0.270 Sum_probs=62.8
Q ss_pred CceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceee
Q 039411 74 NSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFD 153 (222)
Q Consensus 74 ~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~ 153 (222)
.-.|.|.|....+...++.|++|++.|++.+||+|++.+.. ...|. |..+ .|| ++..|..+. ...|.
T Consensus 3 ~~~IpY~i~~~~~~~~~~~I~~A~~~~~~~TCirF~~~~~~-~~yi~--~~~~-~gC-~S~vG~~gg--------~q~i~ 69 (182)
T cd04283 3 IVYVPYVISPQYSENERAVIEKAMQEFETLTCVRFVPRTTE-RDYLN--IESR-SGC-WSYIGRQGG--------RQTVS 69 (182)
T ss_pred eEEEEEEECCCCCHHHHHHHHHHHHHHHhCCceeeEECCCC-CcEEE--EEcC-CCc-eEecCccCC--------ceeEe
Confidence 35789999887666778999999999999999999987642 34444 4432 343 444443210 11222
Q ss_pred ecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 154 YDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 154 fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
+. ..++....+++|||||||||.|++
T Consensus 70 l~----------~~~C~~~G~i~HEl~HaLG~~HEh 95 (182)
T cd04283 70 LQ----------KQGCMYKGIIQHELLHALGFYHEQ 95 (182)
T ss_pred cC----------CCCcCccchHHHHHHHHhCCcccc
Confidence 21 123445689999999999999987
No 17
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.53 E-value=4.3e-14 Score=127.46 Aligned_cols=103 Identities=22% Similarity=0.350 Sum_probs=72.5
Q ss_pred eCCCCCCCCCCCceeEEEecCCChHHHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeee
Q 039411 63 FFPGRPRWPPSNSHLTYAFLPGTPAEAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIA 142 (222)
Q Consensus 63 ~~~~~~~W~~~~~~lty~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la 142 (222)
+.....+|+ ...|.|.|........+++|++||+.|++.++|+|+|..+. ..+..+.+... || +++.|..|.
T Consensus 75 ~~~~~~~Wp--~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~-~~~~~~~~~~~--gC-~S~VGr~gg-- 146 (411)
T KOG3714|consen 75 TSNPERRWP--NGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTP-DKDYLIVFTGG--GC-YSYVGRRGG-- 146 (411)
T ss_pred ccChhhcCC--CCeeeeEECCCCCHHHHHHHHHHHHHHhhCcceEEEeCCCC-CcceEEEeCCC--cc-eeeeCccCC--
Confidence 334567999 89999999988666889999999999999999999998743 34444444332 44 344443221
Q ss_pred eecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC
Q 039411 143 HAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 143 ~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~ 189 (222)
+. +.-+++ .++..+.|++|||+||||+.|++
T Consensus 147 -------~~------q~~sl~---~~C~~~G~i~HEl~HaLGf~Heh 177 (411)
T KOG3714|consen 147 -------GQ------QLLSLG---DGCDRFGTIVHELMHALGFWHEH 177 (411)
T ss_pred -------Cc------cceecC---CCcCcCchhHHHHHHHhhhhhcc
Confidence 10 111222 34566899999999999999987
No 18
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=99.11 E-value=2.5e-10 Score=85.46 Aligned_cols=122 Identities=19% Similarity=0.303 Sum_probs=70.2
Q ss_pred eeEEEecCCChHHHHHHHHHHHHHHHh-cccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeee
Q 039411 76 HLTYAFLPGTPAEAINAVGRAFQTWAA-ISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDY 154 (222)
Q Consensus 76 ~lty~~~~~~~~~~~~~i~~A~~~W~~-~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~f 154 (222)
+|+|.-... .+.+..|.+|.+.|+. |.+++|++.. .++|+|.-... +++..+..-....|.|.+
T Consensus 3 tv~Ydas~A--~~f~~~i~~aa~iWN~sV~NV~L~~~s---~a~~~~~~~~~----------~~~sya~~~g~G~G~I~l 67 (132)
T PF02031_consen 3 TVYYDASRA--PEFRSAIAQAAQIWNSSVSNVRLVEGS---SADIRYYEGND----------PRGSYASTDGLGSGYIFL 67 (132)
T ss_dssp EEEEEEEE---GGGHHHHHHHHHHHHHH-SSEEEEE-S---S-SEEEEEE------------TT--EEEE-SSS-EEEEE
T ss_pred EEEEeCCCC--chhHHHHHHHHHHHhcccCceEEeecC---CCcEEEEEecC----------CCCcccccCCCCcEEEEe
Confidence 566654433 2678999999999976 8999999975 48888764321 233444333335788988
Q ss_pred cCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCC-CC-CCceeeeeCCCC---CCCCCHhhHHHHHHHhc
Q 039411 155 DADEQWAVGARPGAFDLETVALHEIGHLLGLDHSL-VE-GAIMYAYIDPGA---TKGLHTDDVQGIRALYN 220 (222)
Q Consensus 155 d~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~-~~-~siM~p~~~~~~---~~~l~~~Di~~iq~LYg 220 (222)
|.. +. .+.+-..|++|||||+|||..-- -+ .++|.-.-. |. .......++..+.+||.
T Consensus 68 ~~~--~~-----qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~a-G~sCtN~~Pna~E~a~V~~~~a 130 (132)
T PF02031_consen 68 DYQ--QN-----QGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSA-GTSCTNAYPNAAERARVNSLFA 130 (132)
T ss_dssp EHH--HH-----HHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT--TT----S--HHHHHHHHHHCT
T ss_pred chH--Hh-----hCCccceeeeehhccccCCCCCCCCccHHhhcCCCC-CCCCCCCCCCHHHHHHHHHHhc
Confidence 742 21 24677889999999999996433 23 457763222 21 23567889999999985
No 19
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.09 E-value=3.1e-10 Score=91.54 Aligned_cols=121 Identities=21% Similarity=0.205 Sum_probs=75.8
Q ss_pred HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCC---------eeeeecCCCCceeeecCCC
Q 039411 88 EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGG---------TIAHAAPPRGGIFDYDADE 158 (222)
Q Consensus 88 ~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~---------~la~a~~P~~g~i~fd~~~ 158 (222)
.+++++..|++.|++.-|+..+|-. +.|||+|...+.. | .+.++.|. .+.|+..| -|++...
T Consensus 104 ~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~p-g--tg~~~~grArtal~~~efl~~~~~~----~~lnhr~ 174 (236)
T COG5549 104 RWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNPP-G--TGWRQYGRARTALIAYEFLGHALGL----GHLNHRG 174 (236)
T ss_pred hHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCCC-C--CcccccchHHHHHHHHHHhhhccCc----ccccccc
Confidence 5688999999999999777666544 4899999876532 2 12222211 12222222 1222211
Q ss_pred CcccCCCCCccchhHHHHHhhhhhcCCC-CCCCCCCceeeeeCCCCCCCCCHhhHHHHHHHhcC
Q 039411 159 QWAVGARPGAFDLETVALHEIGHLLGLD-HSLVEGAIMYAYIDPGATKGLHTDDVQGIRALYNN 221 (222)
Q Consensus 159 ~w~~~~~~~~~~~~~va~HEiGHaLGL~-Hs~~~~siM~p~~~~~~~~~l~~~Di~~iq~LYg~ 221 (222)
.-.+.......++..+|.||+|||||+. ||+-+ +.|.+..+ ...++..|++.+..+|-.
T Consensus 175 ~i~~~pg~~~e~L~~tarhElGhaLgi~ghsd~k-s~aly~~s---~~~is~rdv~tL~r~Yeq 234 (236)
T COG5549 175 DIMYPPGELRENLNPTARHELGHALGIWGHSDLK-SDALYGIS---WQGISRRDVKTLERKYEQ 234 (236)
T ss_pred cccCCcccchhhhhHHHHHhhcchheecccccch-hhhhcchh---hcccCHHHHHHHHHHhcC
Confidence 1111111234578899999999999996 98754 45554333 345999999999999953
No 20
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=98.15 E-value=2e-05 Score=63.82 Aligned_cols=48 Identities=23% Similarity=0.208 Sum_probs=35.3
Q ss_pred chhHHHHHhhhhhcCCCCCCCC----------CCceeeeeCCCCCCCCCHhhHHHHHH
Q 039411 170 DLETVALHEIGHLLGLDHSLVE----------GAIMYAYIDPGATKGLHTDDVQGIRA 217 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~~----------~siM~p~~~~~~~~~l~~~Di~~iq~ 217 (222)
....|++||+||.|||.|.... .-||.|.........+|+..+..|+.
T Consensus 132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~~~~~~FS~CS~~~i~~ 189 (192)
T cd04267 132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSGLNSYRFSQCSIGSIRE 189 (192)
T ss_pred eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccCCCCCccChhhHHHHHH
Confidence 4456999999999999997652 24899876531345788877777765
No 21
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=98.07 E-value=5.5e-06 Score=68.27 Aligned_cols=48 Identities=27% Similarity=0.272 Sum_probs=32.8
Q ss_pred hhHHHHHhhhhhcCCCCCCCC--------------CCceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411 171 LETVALHEIGHLLGLDHSLVE--------------GAIMYAYIDPGATKGLHTDDVQGIRALY 219 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~--------------~siM~p~~~~~~~~~l~~~Di~~iq~LY 219 (222)
...+++|||||+|||.|..+. .+||.+ .......-+++-+|..|+...
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY-~~~~~~~~FS~~~i~~i~~~~ 198 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSY-ASRPSRPYFSPPSIEYIREVV 198 (206)
T ss_pred cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCC-CCCCCCCCCCchhHHHHhCCC
Confidence 346799999999999997642 268862 221222348888888886543
No 22
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.96 E-value=3.1e-05 Score=63.68 Aligned_cols=116 Identities=16% Similarity=0.182 Sum_probs=65.4
Q ss_pred HHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCCCccchh
Q 039411 93 VGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFDLE 172 (222)
Q Consensus 93 i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~ 172 (222)
.-++|..|.......-. .+....|+.+.+...+.... ++++++++.|+.- .+- .......+.. ..+....
T Consensus 72 ~L~~F~~w~~~~~~~~~--~~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~g---giC-~~~~s~~i~~-~~~~~~a 141 (207)
T cd04273 72 SLKSFCRWQKKLNPPND--SDPEHHDHAILLTRQDICRS---NGNCDTLGLAPVG---GMC-SPSRSCSINE-DTGLSSA 141 (207)
T ss_pred HHHHHHHHHHHcCCccc--ccccccceEEEEeeeccccc---CCCCCceEEeccc---cCC-CCCcceEEEc-CCCceeE
Confidence 45789999764332110 01235777776655332211 3566777777642 110 0000011111 1123345
Q ss_pred HHHHHhhhhhcCCCCCCC---------CCCceeeeeCCC-CCCCCCHhhHHHHHHH
Q 039411 173 TVALHEIGHLLGLDHSLV---------EGAIMYAYIDPG-ATKGLHTDDVQGIRAL 218 (222)
Q Consensus 173 ~va~HEiGHaLGL~Hs~~---------~~siM~p~~~~~-~~~~l~~~Di~~iq~L 218 (222)
.|++|||||.||+.|-.+ .+-||+|..... ....+|.--+..++.+
T Consensus 142 ~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~ 197 (207)
T cd04273 142 FTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSF 197 (207)
T ss_pred EeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHH
Confidence 799999999999999775 268999988742 3456777766666654
No 23
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=97.92 E-value=4.8e-06 Score=54.20 Aligned_cols=24 Identities=33% Similarity=0.423 Sum_probs=21.8
Q ss_pred ChhhhhcCCCCCccCCCHHHHHHh
Q 039411 1 MKTYQLNYHLNSTGVLDGNTVAMM 24 (222)
Q Consensus 1 i~~~Q~~~~L~~TG~ld~~T~~~m 24 (222)
|+.||++.||++||++|.+|+++|
T Consensus 34 v~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 34 VKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp HHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred HHHHHHHcCcCCCCccCHHHHhcC
Confidence 589999999999999999999987
No 24
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.91 E-value=8.8e-06 Score=64.07 Aligned_cols=23 Identities=35% Similarity=0.694 Sum_probs=15.2
Q ss_pred cchhHHHHHhhhhhcCCCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~~ 191 (222)
.+...|++||+||-|||.|....
T Consensus 67 ~~~g~TltHEvGH~LGL~HtF~~ 89 (154)
T PF05572_consen 67 YNFGKTLTHEVGHWLGLYHTFGG 89 (154)
T ss_dssp S-SSHHHHHHHHHHTT---TT--
T ss_pred cccccchhhhhhhhhcccccccC
Confidence 45668999999999999998753
No 25
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.90 E-value=2.3e-05 Score=63.58 Aligned_cols=115 Identities=23% Similarity=0.345 Sum_probs=72.9
Q ss_pred CceeEEEecC--CC---h----HHHHHHHHHHH---HHHHhcccceEEEecCCCCceEEEEEeecCCCCCC--CCCCCCC
Q 039411 74 NSHLTYAFLP--GT---P----AEAINAVGRAF---QTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSS--PFDGRGG 139 (222)
Q Consensus 74 ~~~lty~~~~--~~---~----~~~~~~i~~A~---~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~--~f~g~g~ 139 (222)
.+.++|.+.- +. + .+..++|...+ .-|.....+.|+.|... .+|++|...+...-+.. +.+- .+
T Consensus 28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~~-~~Df~I~Lasp~T~~~lC~g~~~-~~ 105 (203)
T PF11350_consen 28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDSG-APDFRISLASPGTTDRLCAGLDT-SG 105 (203)
T ss_pred CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCCC-CCCEEEEECCcchhhhhccCcCc-Cc
Confidence 4788998863 21 1 23444444444 45888788999999876 48999999876543322 2221 12
Q ss_pred eeeeecCCCCceeeecCCCCcccCCCCCc----cchhHHHHHhhhhhcCCCCCCCCC
Q 039411 140 TIAHAAPPRGGIFDYDADEQWAVGARPGA----FDLETVALHEIGHLLGLDHSLVEG 192 (222)
Q Consensus 140 ~la~a~~P~~g~i~fd~~~~w~~~~~~~~----~~~~~va~HEiGHaLGL~Hs~~~~ 192 (222)
.. .-+.|..++|.+|. ..|..+...-+ .+..-|+-||+||+||-.|..-+.
T Consensus 106 e~-SC~~~~g~~VvIN~-~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cpg 160 (203)
T PF11350_consen 106 ET-SCRNPAGGRVVINL-ARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCPG 160 (203)
T ss_pred ee-EeecCCCCeEEEeh-HHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCCC
Confidence 22 22334467787776 46876654322 345569999999999999987643
No 26
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.69 E-value=1.1e-05 Score=64.75 Aligned_cols=48 Identities=27% Similarity=0.275 Sum_probs=18.8
Q ss_pred hhHHHHHhhhhhcCCCCCCCCC-------------CceeeeeCCCCCCCCCHhhHHHHHHHh
Q 039411 171 LETVALHEIGHLLGLDHSLVEG-------------AIMYAYIDPGATKGLHTDDVQGIRALY 219 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~-------------siM~p~~~~~~~~~l~~~Di~~iq~LY 219 (222)
...|++|||||.||+.|..+.. .||++... .....+++.-|..|+.+-
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~~-~~~~~fS~cS~~~i~~~l 171 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAGI-CNNDKFSPCSIRQIRAVL 171 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS-G-GGG---------------
T ss_pred eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCCC-CCCccccccccccccccc
Confidence 4568999999999999987642 58975422 123467888777777653
No 27
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.60 E-value=2.9e-05 Score=61.49 Aligned_cols=30 Identities=37% Similarity=0.516 Sum_probs=25.5
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIMYAYID 200 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM~p~~~ 200 (222)
.+.=++||+||+|||.|=.++.=+|++..+
T Consensus 124 v~KEv~HElGH~~GL~HC~N~~CVM~FSnS 153 (181)
T COG1913 124 VVKEVLHELGHLLGLSHCPNPRCVMNFSNS 153 (181)
T ss_pred HHHHHHHHhhhhcCcccCCCCCcEEeCCcc
Confidence 345679999999999999999999986544
No 28
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=97.50 E-value=9.7e-05 Score=61.84 Aligned_cols=47 Identities=26% Similarity=0.309 Sum_probs=35.5
Q ss_pred HHHHHhhhhhcCCCCCCCC-------------------------CCceeeeeCCCCCCCCCHhhHHHHHHHhc
Q 039411 173 TVALHEIGHLLGLDHSLVE-------------------------GAIMYAYIDPGATKGLHTDDVQGIRALYN 220 (222)
Q Consensus 173 ~va~HEiGHaLGL~Hs~~~-------------------------~siM~p~~~~~~~~~l~~~Di~~iq~LYg 220 (222)
.|++|||||.||+.|=.+. .=||+|.... ..+.++.--+..|+.+..
T Consensus 147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~-~~~~FS~CS~~~~~~~l~ 218 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS-GITEFSPCTIGNICSLLG 218 (228)
T ss_pred eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC-CCCccChhhHHHHHHHHh
Confidence 5999999999999995431 1199998664 456788888888876653
No 29
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.50 E-value=3.7e-05 Score=57.69 Aligned_cols=17 Identities=53% Similarity=0.806 Sum_probs=12.0
Q ss_pred hHHHHHhhhhhcCCCCC
Q 039411 172 ETVALHEIGHLLGLDHS 188 (222)
Q Consensus 172 ~~va~HEiGHaLGL~Hs 188 (222)
..|++|||||.|||.|+
T Consensus 108 ~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 108 VDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp TTHHHHHHHHHTT----
T ss_pred ceEeeehhhHhcCCCCC
Confidence 37999999999999995
No 30
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.45 E-value=7.6e-05 Score=60.14 Aligned_cols=27 Identities=33% Similarity=0.529 Sum_probs=23.9
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCceee
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIMYA 197 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM~p 197 (222)
+..+++||+||.|||.|-.++.-+|+.
T Consensus 125 ~~k~~~HElGH~lGL~HC~~~~CvM~~ 151 (179)
T PRK13267 125 VRKEVTHELGHTLGLEHCDNPRCVMNF 151 (179)
T ss_pred HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence 345699999999999999999999984
No 31
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.43 E-value=4.7e-05 Score=61.79 Aligned_cols=23 Identities=35% Similarity=0.395 Sum_probs=14.8
Q ss_pred cchhHHHHHhhhhhcCCCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~~ 191 (222)
.....|++|||||.||+.|..+.
T Consensus 140 ~~~~~~~AHEiGH~lGa~HD~~~ 162 (196)
T PF13688_consen 140 YNGAITFAHEIGHNLGAPHDGDY 162 (196)
T ss_dssp HHHHHHHHHHHHHHTT-----SS
T ss_pred CceehhhHHhHHHhcCCCCCCCC
Confidence 34557999999999999997765
No 32
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.42 E-value=0.00017 Score=58.56 Aligned_cols=48 Identities=21% Similarity=0.285 Sum_probs=32.7
Q ss_pred chhHHHHHhhhhhcCCCCCCC------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411 170 DLETVALHEIGHLLGLDHSLV------EGAIMYAYIDPGATKGLHTDDVQGIRAL 218 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~------~~siM~p~~~~~~~~~l~~~Di~~iq~L 218 (222)
....|++|||||.|||.|-.+ .+-||+|.... ....+|.--..-++.+
T Consensus 130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~-~~~~fS~CS~~~~~~~ 183 (194)
T cd04269 130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS-LTDAFSNCSYEDYQKF 183 (194)
T ss_pred HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC-CCCCCChhhHHHHHHH
Confidence 345799999999999999543 35799987653 3445665555444443
No 33
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.37 E-value=3.4e-05 Score=64.40 Aligned_cols=24 Identities=46% Similarity=0.858 Sum_probs=20.8
Q ss_pred cchhHHHHHhhhhhcCCCCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSLVEG 192 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~~~ 192 (222)
.++-.|++||+||.|||.|+....
T Consensus 135 ~n~g~t~~HEvGH~lGL~HtF~~~ 158 (225)
T cd04275 135 YNLGDTATHEVGHWLGLYHTFQGG 158 (225)
T ss_pred ccccceeEEeccceeeeeeeecCC
Confidence 466789999999999999988665
No 34
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=97.28 E-value=0.00012 Score=59.53 Aligned_cols=29 Identities=38% Similarity=0.548 Sum_probs=20.0
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIMYAYID 200 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM~p~~~ 200 (222)
+..+++||+||++||.|=.+ .-+|++..+
T Consensus 145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s~s 173 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDHCEN-RCVMNFSNS 173 (194)
T ss_dssp HHHHHHHHHHHHTT----SS-TSTTS--SS
T ss_pred HHHHHHHHHHHHcCCcCCCC-CCccCCCCC
Confidence 45799999999999999999 999997433
No 35
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=96.97 E-value=0.00095 Score=54.36 Aligned_cols=50 Identities=24% Similarity=0.466 Sum_probs=31.6
Q ss_pred cchhHHHHHhhhhhcCCCCCCC--------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411 169 FDLETVALHEIGHLLGLDHSLV--------EGAIMYAYIDPGATKGLHTDDVQGIRAL 218 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~--------~~siM~p~~~~~~~~~l~~~Di~~iq~L 218 (222)
..+..+++|||||.||+.|-.. ..-||.|...+.....+|.--+..++.+
T Consensus 129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~fS~CS~~~~~~~ 186 (199)
T PF01421_consen 129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKFSNCSRRQFEEF 186 (199)
T ss_dssp HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSSTSSS-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCCCCCcCHHHHHHHHHH
Confidence 3455799999999999999665 2458998655333346777666666554
No 36
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=96.96 E-value=0.0012 Score=55.78 Aligned_cols=48 Identities=29% Similarity=0.409 Sum_probs=33.4
Q ss_pred hhHHHHHhhhhhcCCCCCCC-----C------CCceeeeeCCC---CCCCCCHhhHHHHHHH
Q 039411 171 LETVALHEIGHLLGLDHSLV-----E------GAIMYAYIDPG---ATKGLHTDDVQGIRAL 218 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~-----~------~siM~p~~~~~---~~~~l~~~Di~~iq~L 218 (222)
...|++|||||.||+.|-.. + .=||+|....+ ....++.--++.++.+
T Consensus 167 ~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~ 228 (244)
T cd04270 167 SDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKV 228 (244)
T ss_pred HHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHH
Confidence 34699999999999999554 1 24899976422 2357777666666543
No 37
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=96.92 E-value=0.0013 Score=54.61 Aligned_cols=49 Identities=24% Similarity=0.256 Sum_probs=33.2
Q ss_pred chhHHHHHhhhhhcCCCCCCC------------------CCCceeeeeCCCCCCCCCHhhHHHHHHH
Q 039411 170 DLETVALHEIGHLLGLDHSLV------------------EGAIMYAYIDPGATKGLHTDDVQGIRAL 218 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~------------------~~siM~p~~~~~~~~~l~~~Di~~iq~L 218 (222)
....+++|||||.||+.|-.+ .+-||.|.........++.--+.-++.+
T Consensus 144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~fS~CS~~~~~~~ 210 (220)
T cd04272 144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNGERQYRFSQCSQRQIRNV 210 (220)
T ss_pred ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCCCCCCccCcchHHHHHHH
Confidence 345799999999999999532 2468987644223446777666666554
No 38
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=96.23 E-value=0.0093 Score=51.58 Aligned_cols=19 Identities=42% Similarity=0.504 Sum_probs=16.6
Q ss_pred hHHHHHhhhhhcCCCCCCC
Q 039411 172 ETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 172 ~~va~HEiGHaLGL~Hs~~ 190 (222)
..|++||+||+|||.+--+
T Consensus 166 igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred eeeeehhhhcccCCCcccc
Confidence 5799999999999987654
No 39
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=95.73 E-value=0.37 Score=39.97 Aligned_cols=145 Identities=19% Similarity=0.136 Sum_probs=80.6
Q ss_pred CCCCCCCceeEEEecCCChH---HHHHHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCC---------CCCCC
Q 039411 68 PRWPPSNSHLTYAFLPGTPA---EAINAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGD---------SSPFD 135 (222)
Q Consensus 68 ~~W~~~~~~lty~~~~~~~~---~~~~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd---------~~~f~ 135 (222)
.||. ..|+..+....+. ...+.+..-++.-..++++.+..++.. +|+|.|.|...+... ..++.
T Consensus 28 ~Rw~---~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~ 103 (213)
T PF11150_consen 28 RRWE---GPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV 103 (213)
T ss_pred ccCC---CCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence 5898 4688888844332 233556666777788999999888876 899999998765410 01110
Q ss_pred C--------CCCeeeeecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCC-CCCC-CCCceeeeeCCCCCC
Q 039411 136 G--------RGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLD-HSLV-EGAIMYAYIDPGATK 205 (222)
Q Consensus 136 g--------~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~-Hs~~-~~siM~p~~~~~~~~ 205 (222)
- ...-++.++....+.+....... .+........+...+..||-.+|||. .|.. +.||-+- .....
T Consensus 104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFND---d~~~~ 179 (213)
T PF11150_consen 104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFND---DNEFA 179 (213)
T ss_pred chhhhcccccceEEEEEEecCCCceEEEEEEE-EEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeC---CCccc
Confidence 0 00112222222111111100000 01111112346678888899999995 3332 3343321 12234
Q ss_pred CCCHhhHHHHHHHhc
Q 039411 206 GLHTDDVQGIRALYN 220 (222)
Q Consensus 206 ~l~~~Di~~iq~LYg 220 (222)
.|+.-|..-|+.||-
T Consensus 180 ~LT~~D~lLLr~LYd 194 (213)
T PF11150_consen 180 LLTGFDELLLRMLYD 194 (213)
T ss_pred cccHHHHHHHHHHcC
Confidence 799999999999995
No 40
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=95.62 E-value=0.0053 Score=53.39 Aligned_cols=21 Identities=33% Similarity=0.564 Sum_probs=15.1
Q ss_pred cchhHHHHHhhhhhcCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~ 189 (222)
.....+++||+||++||+|-.
T Consensus 191 ~s~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 191 YSYGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp S-SHHHHHHHHHHTTT--SST
T ss_pred cCccceeehhhhhhcCCCCCC
Confidence 345689999999999999933
No 41
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=95.58 E-value=0.0082 Score=56.20 Aligned_cols=60 Identities=23% Similarity=0.329 Sum_probs=30.4
Q ss_pred CceEEEEEeecCCCCCCCCCCCCCeeeeec--------CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411 116 NANLKISFQRGWHGDSSPFDGRGGTIAHAA--------PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDH 187 (222)
Q Consensus 116 ~adI~I~f~~~~~gd~~~f~g~g~~la~a~--------~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~H 187 (222)
++|+.|.......+ .+++|.|. .|..|.|.|+... +... ....+..|++|||.|+||+.-
T Consensus 159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~~~-~~~~~~~~~~HEi~HaLGFs~ 226 (521)
T PF01457_consen 159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSY---IPSF-YFQEFFRTVIHEIAHALGFSS 226 (521)
T ss_dssp S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG------S---HHHHHHHHHHHHHHTT-SH
T ss_pred CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhH---ccch-hhhcccceeeeeeeeeeeecc
Confidence 57877665543221 15666663 3567888887643 2211 123567899999999999975
No 42
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=95.44 E-value=0.0088 Score=52.42 Aligned_cols=20 Identities=40% Similarity=0.489 Sum_probs=17.4
Q ss_pred chhHHHHHhhhhhcCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~ 189 (222)
....|++||+||-|||.|+.
T Consensus 149 ~~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 149 QDWATIMHELGHNLGLWHAG 168 (314)
T ss_pred ccHHHHHHHhhhhccccccC
Confidence 34569999999999999996
No 43
>PTZ00337 surface protease GP63; Provisional
Probab=95.37 E-value=0.026 Score=53.20 Aligned_cols=37 Identities=27% Similarity=0.387 Sum_probs=26.6
Q ss_pred CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCC
Q 039411 146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHS 188 (222)
Q Consensus 146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs 188 (222)
.|..|.|.|+....-. ....+.|++|||.||||+...
T Consensus 210 RPi~G~in~np~~i~~------~~~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 210 RPFAAAVNFDPRQIAV------TNGDVRVAAHELGHALGFVRD 246 (567)
T ss_pred CceEEEEEECHHHccc------hhHHHHHHHHHHHHHHccCHH
Confidence 3677889988643211 235678999999999999643
No 44
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=95.02 E-value=0.015 Score=52.90 Aligned_cols=25 Identities=44% Similarity=0.519 Sum_probs=20.2
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCceee
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIMYA 197 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM~p 197 (222)
.....+|||||++||.|..+ .||--
T Consensus 316 ~iGA~lHEiGH~fg~pH~~~--GIM~R 340 (423)
T PF12044_consen 316 GIGAFLHEIGHLFGCPHQED--GIMLR 340 (423)
T ss_pred hhHHHHHHHHHhcCCCCCCC--CeeeC
Confidence 34578999999999999875 67753
No 45
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=94.23 E-value=0.059 Score=51.07 Aligned_cols=61 Identities=21% Similarity=0.242 Sum_probs=37.6
Q ss_pred CceEEEEEeecCCCCCCCCCCCCCeeeeec--------CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411 116 NANLKISFQRGWHGDSSPFDGRGGTIAHAA--------PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDH 187 (222)
Q Consensus 116 ~adI~I~f~~~~~gd~~~f~g~g~~la~a~--------~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~H 187 (222)
++|+.|...... + ..+++|.|. .|..|-|.|+.... .+ ........|++|||.|+||+.-
T Consensus 205 naD~vLYVta~P------~--~~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-~~~~~~~rv~~HEi~HALGFS~ 272 (622)
T PTZ00257 205 NADFVLYVASVP------S--EPGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-RYDQGTTRTVTHEVAHALGFSS 272 (622)
T ss_pred ccCEEEEEEEec------C--CCCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-ccchHHHHHHHHHHHHHhcCCH
Confidence 577766655422 1 134555543 36778898876432 11 1113467899999999999964
Q ss_pred C
Q 039411 188 S 188 (222)
Q Consensus 188 s 188 (222)
+
T Consensus 273 ~ 273 (622)
T PTZ00257 273 V 273 (622)
T ss_pred H
Confidence 3
No 46
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=93.77 E-value=0.082 Score=42.92 Aligned_cols=81 Identities=16% Similarity=0.272 Sum_probs=43.1
Q ss_pred HHHHHHHHHHHHHhccc---ceEEEecC-----CCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCc
Q 039411 89 AINAVGRAFQTWAAISH---FKFSQTQD-----YTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQW 160 (222)
Q Consensus 89 ~~~~i~~A~~~W~~~~~---l~F~ev~~-----~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w 160 (222)
.+.-++.+++.|+++.. ++=.||.. ..--|+.|.-.+ |.|..-|.|..-.+.+.+.. +
T Consensus 20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~-------------G~LTDHYDP~~kvvrLSe~~-y 85 (226)
T COG2738 20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVP-------------GTLTDHYDPRRKVVRLSEAN-Y 85 (226)
T ss_pred HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeec-------------CCcccccChhhheeeccccc-c
Confidence 45678899999999843 33233321 112344443222 34444455655555443321 1
Q ss_pred ccCCCCCccchhHHHHHhhhhhcCCCC
Q 039411 161 AVGARPGAFDLETVALHEIGHLLGLDH 187 (222)
Q Consensus 161 ~~~~~~~~~~~~~va~HEiGHaLGL~H 187 (222)
. + ....-..|++||.|||+==.|
T Consensus 86 ~-g---~Sia~~aVAAHEVGHAiQd~~ 108 (226)
T COG2738 86 Y-G---PSIAAIAVAAHEVGHAIQDQE 108 (226)
T ss_pred C-C---ccHHHHHHHHHHhhHHHhhhc
Confidence 1 1 113456799999999985443
No 47
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=93.28 E-value=0.079 Score=44.00 Aligned_cols=76 Identities=17% Similarity=0.272 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHhccc---ceEEEec-----CCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcc
Q 039411 90 INAVGRAFQTWAAISH---FKFSQTQ-----DYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWA 161 (222)
Q Consensus 90 ~~~i~~A~~~W~~~~~---l~F~ev~-----~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~ 161 (222)
+.-++..|+.|+++.. ++=.|+. ...-.|++|.-..+.-+| -|.|.+..+.+.++. +
T Consensus 18 q~~vk~~~~kYs~v~~~~g~TGae~Ar~iL~~~gl~~V~Ve~~~G~LtD-------------HYdP~~k~vrLS~~v-y- 82 (222)
T PF04298_consen 18 QWWVKSTFKKYSQVRSSSGMTGAEVARHILDRNGLSDVRVERVPGELTD-------------HYDPRNKVVRLSEDV-Y- 82 (222)
T ss_pred HHHHHHHHHhhCcCCCCCCCCHHHHHHHHHHHCCCCCeeEEEeCCCCCC-------------CcCCCCCEEEeCCcc-C-
Confidence 3456777777777632 2222221 112346666654432222 244666666664432 1
Q ss_pred cCCCCCccchhHHHHHhhhhhc
Q 039411 162 VGARPGAFDLETVALHEIGHLL 183 (222)
Q Consensus 162 ~~~~~~~~~~~~va~HEiGHaL 183 (222)
+. ....-..||+||.|||+
T Consensus 83 -~~--~SiaAvaVAAHEvGHAi 101 (222)
T PF04298_consen 83 -NG--RSIAAVAVAAHEVGHAI 101 (222)
T ss_pred -CC--CCHHHHHHHHHHHhHHH
Confidence 10 11234579999999996
No 48
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=93.06 E-value=0.082 Score=50.15 Aligned_cols=46 Identities=33% Similarity=0.492 Sum_probs=35.1
Q ss_pred hHHHHHhhhhhcCCCCCCC----------CCCceeeeeCCCC---CCCCCHhhHHHHHH
Q 039411 172 ETVALHEIGHLLGLDHSLV----------EGAIMYAYIDPGA---TKGLHTDDVQGIRA 217 (222)
Q Consensus 172 ~~va~HEiGHaLGL~Hs~~----------~~siM~p~~~~~~---~~~l~~~Di~~iq~ 217 (222)
.-|++|||||..|=.|-.+ -+=|||++...|+ ..++++--+..|.+
T Consensus 393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~ 451 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISK 451 (764)
T ss_pred heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHH
Confidence 3489999999999999664 2348999988664 34788887777754
No 49
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=92.46 E-value=0.058 Score=40.32 Aligned_cols=18 Identities=44% Similarity=0.835 Sum_probs=15.3
Q ss_pred chhHHHHHhhhhhcCCCC
Q 039411 170 DLETVALHEIGHLLGLDH 187 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~H 187 (222)
-...|++|||||-.||..
T Consensus 108 ~vthvliHEIgHhFGLsD 125 (136)
T COG3824 108 QVTHVLIHEIGHHFGLSD 125 (136)
T ss_pred HhhhhhhhhhhhhcCCCh
Confidence 456799999999999974
No 50
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=91.87 E-value=0.11 Score=42.73 Aligned_cols=32 Identities=16% Similarity=0.228 Sum_probs=26.6
Q ss_pred ChhhhhcCCCCCccCCCHHHHHHhcCCCCCCCC
Q 039411 1 MKTYQLNYHLNSTGVLDGNTVAMMTRPRCGVAD 33 (222)
Q Consensus 1 i~~~Q~~~~L~~TG~ld~~T~~~m~~pRCg~pD 33 (222)
|+.||+..||+++|.+|..|...|..| |..+.
T Consensus 45 V~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~~ 76 (201)
T TIGR02869 45 VRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQS 76 (201)
T ss_pred HHHHHHHhCCCCCCCcCHHHHHHhCCc-cccCc
Confidence 578999999999999999999999544 44443
No 51
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=91.50 E-value=0.088 Score=48.21 Aligned_cols=18 Identities=39% Similarity=0.564 Sum_probs=15.4
Q ss_pred CccchhHHHHHhhhhhcC
Q 039411 167 GAFDLETVALHEIGHLLG 184 (222)
Q Consensus 167 ~~~~~~~va~HEiGHaLG 184 (222)
....+.+|++|||||+.|
T Consensus 126 ~esElagViAHEigHv~q 143 (484)
T COG4783 126 NESELAGVIAHEIGHVAQ 143 (484)
T ss_pred CHHHHHHHHHHHHHHHhh
Confidence 345789999999999987
No 52
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=91.06 E-value=0.15 Score=46.23 Aligned_cols=24 Identities=33% Similarity=0.509 Sum_probs=19.3
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCcee
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIMY 196 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM~ 196 (222)
....+.|||||.||+.|.. ..||-
T Consensus 301 tlGA~~HElGH~lgcpHq~--~GvMl 324 (614)
T KOG4525|consen 301 TLGAVCHELGHCLGCPHQS--EGVML 324 (614)
T ss_pred HHHHHHHHhhhccCCCCCC--CceEe
Confidence 3567899999999999965 55774
No 53
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=91.00 E-value=0.04 Score=52.68 Aligned_cols=20 Identities=40% Similarity=0.338 Sum_probs=16.6
Q ss_pred hhHHHHHhhhhhcCCCCCCC
Q 039411 171 LETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~ 190 (222)
-..|++||+||.|||...-+
T Consensus 221 giGVfaHEfGH~LGLPDlYD 240 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLPDLYD 240 (645)
T ss_pred ceEEEEeeccccCCCCCccc
Confidence 45799999999999986544
No 54
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=90.52 E-value=0.14 Score=43.80 Aligned_cols=24 Identities=33% Similarity=0.548 Sum_probs=13.4
Q ss_pred chhHHHHHhhhhh-cCCCCCCCCCC
Q 039411 170 DLETVALHEIGHL-LGLDHSLVEGA 193 (222)
Q Consensus 170 ~~~~va~HEiGHa-LGL~Hs~~~~s 193 (222)
....|++||+||+ .||+.+-....
T Consensus 215 ~~~~v~vHE~GHsf~~LaDEY~~~~ 239 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADEYYYGE 239 (264)
T ss_dssp THHHHHHHHHHHHTT------ECS-
T ss_pred cccceeeeecccccccccccccCCC
Confidence 5678999999996 45677665443
No 55
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=90.28 E-value=0.14 Score=49.60 Aligned_cols=45 Identities=27% Similarity=0.251 Sum_probs=28.3
Q ss_pred chhHHHHHhhhhhcCCCCCCC-------CCCceeeeeCC-CCCCCCCHhhHHHH
Q 039411 170 DLETVALHEIGHLLGLDHSLV-------EGAIMYAYIDP-GATKGLHTDDVQGI 215 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~-------~~siM~p~~~~-~~~~~l~~~Di~~i 215 (222)
....|++|||||.||+.|-.. ..=|| +-+.. .....++.--++.+
T Consensus 322 ~~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~ 374 (716)
T KOG3607|consen 322 AFAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF 374 (716)
T ss_pred hHHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence 455799999999999999762 22367 54441 12335555444444
No 56
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=88.54 E-value=0.27 Score=35.62 Aligned_cols=16 Identities=31% Similarity=0.706 Sum_probs=12.6
Q ss_pred hhHHHHHhhhhhcCCC
Q 039411 171 LETVALHEIGHLLGLD 186 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~ 186 (222)
...|++||+||.+|+.
T Consensus 73 I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHcCCC
Confidence 4579999999999985
No 57
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=87.95 E-value=0.32 Score=46.22 Aligned_cols=27 Identities=22% Similarity=0.276 Sum_probs=24.9
Q ss_pred ChhhhhcCCCCCccCCCHHHHHHhcCC
Q 039411 1 MKTYQLNYHLNSTGVLDGNTVAMMTRP 27 (222)
Q Consensus 1 i~~~Q~~~~L~~TG~ld~~T~~~m~~p 27 (222)
+|+||+..||..+|+++..|++.|+.|
T Consensus 317 VK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 317 VKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred HHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 589999999999999999999998755
No 58
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=87.77 E-value=1.1 Score=38.89 Aligned_cols=40 Identities=33% Similarity=0.424 Sum_probs=24.8
Q ss_pred cCCCCceeeecCCCCcc-----cCCCCCccchhHHHHHhhhhh----cCCC
Q 039411 145 APPRGGIFDYDADEQWA-----VGARPGAFDLETVALHEIGHL----LGLD 186 (222)
Q Consensus 145 ~~P~~g~i~fd~~~~w~-----~~~~~~~~~~~~va~HEiGHa----LGL~ 186 (222)
|-|.+..|.+|... |. ++. .+......|++||+||. ||+.
T Consensus 141 YCp~D~tIYlD~~f-~~~L~~~~ga-~G~~a~ayVlAHEyGHHVQ~l~Gil 189 (292)
T PF04228_consen 141 YCPADQTIYLDLSF-FDELQQRFGA-SGDFAQAYVLAHEYGHHVQNLLGIL 189 (292)
T ss_pred eCCCCCEEEechHH-HHHHHHHhCC-ccHHHHHHHHHHHHHHHHHHHhhhH
Confidence 45788899998642 21 111 12233456999999996 5664
No 59
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=87.69 E-value=0.27 Score=43.21 Aligned_cols=38 Identities=24% Similarity=0.305 Sum_probs=28.5
Q ss_pred cCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC
Q 039411 145 APPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 145 ~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~ 190 (222)
+.|-.++.+.|.+- ....+..|++||++|-+|...+.+
T Consensus 178 y~PFT~EA~vN~~~--------p~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 178 YNPFTGEANVNTDL--------PPYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred EcCCcceeecCCCC--------CcccccHHHHHHHHHHhCCCCHHH
Confidence 45767777776532 224677899999999999998774
No 60
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=86.56 E-value=0.33 Score=39.69 Aligned_cols=15 Identities=40% Similarity=0.638 Sum_probs=12.4
Q ss_pred chhHHHHHhhhhhcC
Q 039411 170 DLETVALHEIGHLLG 184 (222)
Q Consensus 170 ~~~~va~HEiGHaLG 184 (222)
....|+.||+||++-
T Consensus 21 ~~l~t~~HE~gHal~ 35 (200)
T PF13398_consen 21 RLLVTFVHELGHALA 35 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456799999999964
No 61
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=86.10 E-value=3.7 Score=33.09 Aligned_cols=77 Identities=18% Similarity=0.198 Sum_probs=44.7
Q ss_pred HHHHHHHHHHHhcccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCCCccc
Q 039411 91 NAVGRAFQTWAAISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARPGAFD 170 (222)
Q Consensus 91 ~~i~~A~~~W~~~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~ 170 (222)
+.+..-+..|+...++. ...|.|.-....-|.| +..|.|.||- -+-..+ ..-
T Consensus 112 ~~l~~~~~~~~~~~~~~--------~~~i~ir~~ksrWGsc---------------~~~~~I~ln~----~L~~~P-~~~ 163 (205)
T PF01863_consen 112 EYLPERLKKYAKKLGLP--------PPKIKIRDMKSRWGSC---------------SSKGNITLNW----RLVMAP-PEV 163 (205)
T ss_pred HHHHHHHHHHHHHcCCC--------cceEEEeehhhccccC---------------CCCCcEEeec----ccccCC-ccH
Confidence 44455555566555542 4556666544333322 3357777753 222211 235
Q ss_pred hhHHHHHhhhhhcCCCCCCCCCCce
Q 039411 171 LETVALHEIGHLLGLDHSLVEGAIM 195 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~~~siM 195 (222)
+..|++||+-|..-..||..--+.|
T Consensus 164 idYVvvHEL~Hl~~~nHs~~Fw~~v 188 (205)
T PF01863_consen 164 IDYVVVHELCHLRHPNHSKRFWALV 188 (205)
T ss_pred HHHHHHHHHHHhccCCCCHHHHHHH
Confidence 6689999999999999987643333
No 62
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=84.86 E-value=0.44 Score=39.67 Aligned_cols=14 Identities=36% Similarity=0.610 Sum_probs=11.9
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
.+..|+.||+||+|
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 46779999999994
No 63
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=84.76 E-value=0.4 Score=39.76 Aligned_cols=16 Identities=31% Similarity=0.370 Sum_probs=11.9
Q ss_pred hHHHHHhhhhhcCCCC
Q 039411 172 ETVALHEIGHLLGLDH 187 (222)
Q Consensus 172 ~~va~HEiGHaLGL~H 187 (222)
.-|++||+||+|=-.+
T Consensus 73 rFtlAHELGH~llH~~ 88 (213)
T COG2856 73 RFTLAHELGHALLHTD 88 (213)
T ss_pred HHHHHHHHhHHHhccc
Confidence 3588999999964333
No 64
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=84.76 E-value=0.41 Score=34.60 Aligned_cols=15 Identities=33% Similarity=0.485 Sum_probs=12.0
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.....+++|||||.+
T Consensus 40 ~~~~f~laHELgH~~ 54 (122)
T PF06114_consen 40 ERQRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 345679999999985
No 65
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=82.54 E-value=0.73 Score=37.90 Aligned_cols=110 Identities=21% Similarity=0.320 Sum_probs=58.5
Q ss_pred HHHHHHHHHHHhcccce--EEEec---------CCCCceEEEEEeecCC-CCCCC-CCCC-CCeee--eecCC-------
Q 039411 91 NAVGRAFQTWAAISHFK--FSQTQ---------DYTNANLKISFQRGWH-GDSSP-FDGR-GGTIA--HAAPP------- 147 (222)
Q Consensus 91 ~~i~~A~~~W~~~~~l~--F~ev~---------~~~~adI~I~f~~~~~-gd~~~-f~g~-g~~la--~a~~P------- 147 (222)
.-+..|++.|++-++.. |..+. +..++++.|.+..+-- ....| |.+. -..++ .-++|
T Consensus 49 ~dl~~aLaq~e~sgn~gdyfsQv~tspv~~L~~S~f~v~v~irwpq~l~n~~~ap~wq~a~~tava~wa~~fpl~ive~~ 128 (236)
T COG5549 49 EDLEKALAQWENSGNGGDYFSQVETSPVGYLIWSEFPVDVRIRWPQNLENVEGAPRWQGAYLTAVAGWAKTFPLIIVERF 128 (236)
T ss_pred ccHHHHHHHHHhcCCcccccceecccccceeecccCcceeeecccccccccccchhHHHHHHHHHHHHHHhCCceeeecc
Confidence 56788999999876543 54433 2236777887765221 10101 0000 00000 01345
Q ss_pred CCceeeecCCCCcccCCCCCc-cchhHHHHHhhhhhcCCCCCCCCCCceeeeeC
Q 039411 148 RGGIFDYDADEQWAVGARPGA-FDLETVALHEIGHLLGLDHSLVEGAIMYAYID 200 (222)
Q Consensus 148 ~~g~i~fd~~~~w~~~~~~~~-~~~~~va~HEiGHaLGL~Hs~~~~siM~p~~~ 200 (222)
+..||.+...+.=..+-...+ .....+.-|++||++||+|.+.+..||+|.+.
T Consensus 129 eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 129 EEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred eeeeEEEEecCCCCCcccccchHHHHHHHHHHhhhccCcccccccccccCCccc
Confidence 234555433221111111111 23455778999999999999999999987554
No 66
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=81.42 E-value=0.77 Score=37.43 Aligned_cols=20 Identities=40% Similarity=0.592 Sum_probs=15.8
Q ss_pred cchhHHHHHhhhhhcCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~ 189 (222)
..+..|++||+||.. ..|..
T Consensus 87 ~el~aVlaHElgH~~-~~h~~ 106 (226)
T PF01435_consen 87 DELAAVLAHELGHIK-HRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHHH-TTHCC
T ss_pred HHHHHHHHHHHHHHH-cCCcc
Confidence 478899999999998 44544
No 67
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=81.19 E-value=14 Score=33.79 Aligned_cols=24 Identities=13% Similarity=0.210 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHhcccceEEEecC
Q 039411 90 INAVGRAFQTWAAISHFKFSQTQD 113 (222)
Q Consensus 90 ~~~i~~A~~~W~~~~~l~F~ev~~ 113 (222)
..+++.-|..-+..-+++|+++..
T Consensus 107 ~~v~~gl~~~~~~lfgi~~~~~~~ 130 (422)
T cd06456 107 EKVLDGLFELAERLYGITFKERTD 130 (422)
T ss_pred HHHHHHHHHHHHHHcCeeEEECCC
Confidence 345566666677788889988654
No 68
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=80.99 E-value=0.91 Score=42.33 Aligned_cols=26 Identities=27% Similarity=0.342 Sum_probs=24.3
Q ss_pred ChhhhhcCCCCCccCCCHHHHHHhcC
Q 039411 1 MKTYQLNYHLNSTGVLDGNTVAMMTR 26 (222)
Q Consensus 1 i~~~Q~~~~L~~TG~ld~~T~~~m~~ 26 (222)
||+||...||+..|+++++|++.|+-
T Consensus 273 vKrfQ~~~GL~~DGviG~~T~~aLn~ 298 (561)
T COG2989 273 VKRFQARHGLPADGVIGPATRAALNV 298 (561)
T ss_pred HHHHHHHhCCCCCCcccHHHHHHhcc
Confidence 58999999999999999999999973
No 69
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=80.79 E-value=13 Score=34.24 Aligned_cols=25 Identities=0% Similarity=-0.067 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHhcccceEEEecC
Q 039411 89 AINAVGRAFQTWAAISHFKFSQTQD 113 (222)
Q Consensus 89 ~~~~i~~A~~~W~~~~~l~F~ev~~ 113 (222)
+..+++--|..-+..=+|+|+++..
T Consensus 132 l~~vl~gl~~~~~~lfgi~~~~~~~ 156 (458)
T cd06457 132 LGTVMEGLSRLFSRLYGIRLVPVPL 156 (458)
T ss_pred HHHHHHHHHHHHHHHhCeEEEecCC
Confidence 4456677777778889999999753
No 70
>PRK02870 heat shock protein HtpX; Provisional
Probab=80.16 E-value=1.3 Score=39.26 Aligned_cols=33 Identities=15% Similarity=0.260 Sum_probs=20.6
Q ss_pred CCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411 146 PPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL 182 (222)
Q Consensus 146 ~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa 182 (222)
.|..+.|-+.+.-.-.+ ....+..|++||+||.
T Consensus 152 ~~~~~~Ivvt~GLL~~L----~~dEL~aVlAHELgHi 184 (336)
T PRK02870 152 SEKSAMVAITTGLLEKL----DRDELQAVMAHELSHI 184 (336)
T ss_pred CCCCcEEEEehHHhhhC----CHHHHHHHHHHHHHHH
Confidence 44566666644221111 2357889999999998
No 71
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=79.71 E-value=0.94 Score=36.57 Aligned_cols=12 Identities=42% Similarity=0.822 Sum_probs=10.3
Q ss_pred hHHHHHhhhhhc
Q 039411 172 ETVALHEIGHLL 183 (222)
Q Consensus 172 ~~va~HEiGHaL 183 (222)
..|++||+||.+
T Consensus 10 ~~v~iHElGH~~ 21 (182)
T cd06163 10 ILIFVHELGHFL 21 (182)
T ss_pred HHHHHHHHHHHH
Confidence 468999999975
No 72
>PRK03982 heat shock protein HtpX; Provisional
Probab=79.69 E-value=0.93 Score=39.13 Aligned_cols=15 Identities=40% Similarity=0.536 Sum_probs=13.2
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
..+..|++||+||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 478999999999975
No 73
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=79.04 E-value=1 Score=38.61 Aligned_cols=16 Identities=44% Similarity=0.684 Sum_probs=13.6
Q ss_pred ccchhHHHHHhhhhhc
Q 039411 168 AFDLETVALHEIGHLL 183 (222)
Q Consensus 168 ~~~~~~va~HEiGHaL 183 (222)
...+..|++||+||..
T Consensus 154 ~dEl~aVlaHElgHi~ 169 (302)
T COG0501 154 DDELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3578999999999975
No 74
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=78.82 E-value=1 Score=37.00 Aligned_cols=14 Identities=50% Similarity=0.731 Sum_probs=11.6
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
-+..|+.||+||++
T Consensus 37 l~~~v~iHElgH~~ 50 (208)
T cd06161 37 LFLSVLLHELGHAL 50 (208)
T ss_pred HHHHHHHHHHHHHH
Confidence 35679999999985
No 75
>PRK03072 heat shock protein HtpX; Provisional
Probab=78.68 E-value=1 Score=38.91 Aligned_cols=17 Identities=29% Similarity=0.425 Sum_probs=13.9
Q ss_pred ccchhHHHHHhhhhhcC
Q 039411 168 AFDLETVALHEIGHLLG 184 (222)
Q Consensus 168 ~~~~~~va~HEiGHaLG 184 (222)
...+..|++||+||.-.
T Consensus 124 ~~El~aVlAHElgHi~~ 140 (288)
T PRK03072 124 ERELRGVLGHELSHVYN 140 (288)
T ss_pred HHHHHHHHHHHHHHHhc
Confidence 35789999999999653
No 76
>PRK03001 M48 family peptidase; Provisional
Probab=78.59 E-value=1.1 Score=38.69 Aligned_cols=16 Identities=25% Similarity=0.453 Sum_probs=13.6
Q ss_pred ccchhHHHHHhhhhhc
Q 039411 168 AFDLETVALHEIGHLL 183 (222)
Q Consensus 168 ~~~~~~va~HEiGHaL 183 (222)
...+..|++||+||.-
T Consensus 121 ~~El~aVlAHElgHi~ 136 (283)
T PRK03001 121 EREIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3578999999999975
No 77
>PRK01345 heat shock protein HtpX; Provisional
Probab=78.38 E-value=1.1 Score=39.42 Aligned_cols=19 Identities=21% Similarity=0.321 Sum_probs=15.3
Q ss_pred cchhHHHHHhhhhhcCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHS 188 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs 188 (222)
..+..|++|||||.- ..|.
T Consensus 122 dEL~aVlAHElgHi~-~~d~ 140 (317)
T PRK01345 122 EEVAGVMAHELAHVK-NRDT 140 (317)
T ss_pred HHHHHHHHHHHHHHH-cCCH
Confidence 478999999999987 3553
No 78
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=75.99 E-value=22 Score=32.93 Aligned_cols=14 Identities=36% Similarity=0.653 Sum_probs=11.7
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
+-..|+.||+||+|
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 44568999999998
No 79
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=75.86 E-value=3 Score=28.89 Aligned_cols=30 Identities=13% Similarity=0.317 Sum_probs=19.0
Q ss_pred CceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411 149 GGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL 183 (222)
Q Consensus 149 ~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL 183 (222)
..+|+|.... +... ...-..+++||+.|++
T Consensus 44 G~~I~f~~g~-~~~~----s~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 44 GNDIYFAPGK-YNPD----SPEGRALLAHELAHVV 73 (79)
T ss_pred CCEEEEcCCC-cCCC----CCCcchhHhHHHHHHH
Confidence 6788885532 1111 1234579999999985
No 80
>PRK04897 heat shock protein HtpX; Provisional
Probab=75.39 E-value=1.5 Score=38.18 Aligned_cols=15 Identities=47% Similarity=0.536 Sum_probs=13.2
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
..+..|++||+||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 578899999999965
No 81
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=75.39 E-value=1.5 Score=36.70 Aligned_cols=15 Identities=47% Similarity=0.713 Sum_probs=11.9
Q ss_pred hhHHHHHhhhhhcCC
Q 039411 171 LETVALHEIGHLLGL 185 (222)
Q Consensus 171 ~~~va~HEiGHaLGL 185 (222)
+..|++||+||++=-
T Consensus 53 ~~~v~iHElgH~~~A 67 (227)
T cd06164 53 FASVLLHELGHSLVA 67 (227)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457999999998633
No 82
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=75.26 E-value=1.4 Score=39.14 Aligned_cols=15 Identities=33% Similarity=0.512 Sum_probs=12.5
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.+-..++.||+||++
T Consensus 152 ~~~v~tl~HE~GHa~ 166 (365)
T cd06258 152 HDDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHHhHHH
Confidence 455678999999998
No 83
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=74.86 E-value=0.87 Score=45.26 Aligned_cols=31 Identities=23% Similarity=0.303 Sum_probs=24.4
Q ss_pred chhHHHHHhhhhhcCCCCCCCCC------CceeeeeC
Q 039411 170 DLETVALHEIGHLLGLDHSLVEG------AIMYAYID 200 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~~~------siM~p~~~ 200 (222)
....+++||+||.+|+.|-...+ .||.|...
T Consensus 316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~ 352 (845)
T KOG3538|consen 316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTC 352 (845)
T ss_pred ccceeeeeccccccCcccCCcCCCCCCcccccCcccc
Confidence 34569999999999999976543 58888765
No 84
>PRK05457 heat shock protein HtpX; Provisional
Probab=74.74 E-value=1.6 Score=37.78 Aligned_cols=15 Identities=40% Similarity=0.609 Sum_probs=12.9
Q ss_pred ccchhHHHHHhhhhh
Q 039411 168 AFDLETVALHEIGHL 182 (222)
Q Consensus 168 ~~~~~~va~HEiGHa 182 (222)
...+..|++||+||.
T Consensus 131 ~~El~aVlAHElgHi 145 (284)
T PRK05457 131 RDEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHHH
Confidence 357889999999996
No 85
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=74.51 E-value=1.4 Score=39.64 Aligned_cols=15 Identities=33% Similarity=0.490 Sum_probs=12.0
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.+-..+++||+||++
T Consensus 220 ~~~v~tl~HE~GHa~ 234 (427)
T cd06459 220 LDDVFTLAHELGHAF 234 (427)
T ss_pred hhhHHHHHHHhhHHH
Confidence 344678999999985
No 86
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=74.50 E-value=1.5 Score=40.25 Aligned_cols=25 Identities=12% Similarity=0.199 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHhcccceEEEecC
Q 039411 89 AINAVGRAFQTWAAISHFKFSQTQD 113 (222)
Q Consensus 89 ~~~~i~~A~~~W~~~~~l~F~ev~~ 113 (222)
+.+++..-+...+.+=+|+|+++..
T Consensus 136 ~~~~~~g~~~l~~~lfgi~~~~~~~ 160 (458)
T PF01432_consen 136 LEEVLEGLFELAERLFGIRFEEVPD 160 (458)
T ss_dssp HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred HHHHHHHHHHHHHHHhcEEEEeccc
Confidence 3456666666777888899988764
No 87
>PRK02391 heat shock protein HtpX; Provisional
Probab=73.87 E-value=1.7 Score=37.81 Aligned_cols=15 Identities=40% Similarity=0.629 Sum_probs=12.8
Q ss_pred ccchhHHHHHhhhhh
Q 039411 168 AFDLETVALHEIGHL 182 (222)
Q Consensus 168 ~~~~~~va~HEiGHa 182 (222)
..++..|++||+||.
T Consensus 130 ~~El~aVlaHElgHi 144 (296)
T PRK02391 130 PDELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHHH
Confidence 357889999999995
No 88
>PRK01265 heat shock protein HtpX; Provisional
Probab=73.72 E-value=1.7 Score=38.32 Aligned_cols=15 Identities=47% Similarity=0.724 Sum_probs=12.8
Q ss_pred ccchhHHHHHhhhhh
Q 039411 168 AFDLETVALHEIGHL 182 (222)
Q Consensus 168 ~~~~~~va~HEiGHa 182 (222)
..++..|++||+||.
T Consensus 137 ~~El~aVlAHElgHi 151 (324)
T PRK01265 137 RDEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHHH
Confidence 357899999999994
No 89
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=73.24 E-value=2.2 Score=30.89 Aligned_cols=14 Identities=36% Similarity=0.451 Sum_probs=12.0
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
.+..|++||+.|++
T Consensus 24 ~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 24 WLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 45689999999995
No 90
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=72.18 E-value=1.9 Score=37.13 Aligned_cols=13 Identities=38% Similarity=0.603 Sum_probs=11.1
Q ss_pred hhHHHHHhhhhhc
Q 039411 171 LETVALHEIGHLL 183 (222)
Q Consensus 171 ~~~va~HEiGHaL 183 (222)
+..|++||+||++
T Consensus 135 ~isvvvHElgHal 147 (277)
T cd06162 135 LISGVVHEMGHGV 147 (277)
T ss_pred HHHHHHHHHHHHH
Confidence 4579999999996
No 91
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=72.16 E-value=2.4 Score=40.28 Aligned_cols=15 Identities=27% Similarity=0.474 Sum_probs=12.3
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.+-..|++||+||++
T Consensus 376 ~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 376 LNSVFTLAHELGHSM 390 (591)
T ss_pred cchHHHHHHHhhhHH
Confidence 355679999999996
No 92
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=71.79 E-value=2 Score=36.77 Aligned_cols=15 Identities=27% Similarity=0.350 Sum_probs=11.7
Q ss_pred hhHHHHHhhhhhcCC
Q 039411 171 LETVALHEIGHLLGL 185 (222)
Q Consensus 171 ~~~va~HEiGHaLGL 185 (222)
...+++||+||++=-
T Consensus 118 ~isv~iHElgHa~~A 132 (263)
T cd06159 118 VVGVVVHELSHGILA 132 (263)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456999999999633
No 93
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=71.23 E-value=2.1 Score=40.32 Aligned_cols=15 Identities=33% Similarity=0.306 Sum_probs=12.5
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.+-..|++||+||++
T Consensus 335 ~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 335 SGDIDVLTHEAGHAF 349 (549)
T ss_pred hhHHHHHHHHhhHHH
Confidence 455679999999997
No 94
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=70.27 E-value=1.9 Score=41.05 Aligned_cols=14 Identities=29% Similarity=0.399 Sum_probs=11.8
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
+-..|++||+||++
T Consensus 374 ~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 374 RDVSTLAHELGHAY 387 (587)
T ss_pred hhHHHHHHHhhHHH
Confidence 44578999999998
No 95
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=69.36 E-value=2.2 Score=33.39 Aligned_cols=12 Identities=33% Similarity=0.396 Sum_probs=10.2
Q ss_pred hHHHHHhhhhhc
Q 039411 172 ETVALHEIGHLL 183 (222)
Q Consensus 172 ~~va~HEiGHaL 183 (222)
+-|++|||+|.+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 459999999974
No 96
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=68.56 E-value=2.6 Score=33.99 Aligned_cols=20 Identities=35% Similarity=0.594 Sum_probs=14.3
Q ss_pred hhHHHHHhhhhhcCCCCCCC
Q 039411 171 LETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~~ 190 (222)
+..+..||+||++--..-.+
T Consensus 41 ~~~l~iHElgH~~~A~~~G~ 60 (183)
T cd06160 41 LAILGIHEMGHYLAARRHGV 60 (183)
T ss_pred HHHHHHHHHHHHHHHHHCCC
Confidence 45688999999976654443
No 97
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=67.72 E-value=2.6 Score=28.63 Aligned_cols=24 Identities=21% Similarity=0.326 Sum_probs=18.2
Q ss_pred ChhhhhcCC----CCCccCCCHHHHHHh
Q 039411 1 MKTYQLNYH----LNSTGVLDGNTVAMM 24 (222)
Q Consensus 1 i~~~Q~~~~----L~~TG~ld~~T~~~m 24 (222)
+|.+|+..| +.+.|++.+.|++.+
T Consensus 3 ik~LQ~alg~~~~v~~DG~iGp~Tl~Al 30 (72)
T PF09374_consen 3 IKFLQRALGLGADVAVDGIIGPKTLAAL 30 (72)
T ss_dssp HHHHHHHHT----S--SS--SHHHHHHH
T ss_pred HHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence 467899999 999999999999998
No 98
>PLN02791 Nudix hydrolase homolog
Probab=65.36 E-value=3.3 Score=40.74 Aligned_cols=21 Identities=19% Similarity=0.142 Sum_probs=17.1
Q ss_pred chhHHHHHhhhhhcCCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~ 190 (222)
.+..|++||+||.+|=.+.-.
T Consensus 584 f~~~v~lHElgHGsG~~~~~~ 604 (770)
T PLN02791 584 FFTHTICHECCHGIGPHTITL 604 (770)
T ss_pred HHHHHHHHHhhccccccceec
Confidence 457899999999999877544
No 99
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=64.29 E-value=4.9 Score=34.32 Aligned_cols=23 Identities=35% Similarity=0.534 Sum_probs=18.5
Q ss_pred ccchhHHHHHhhhhhcCCCCCCC
Q 039411 168 AFDLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 168 ~~~~~~va~HEiGHaLGL~Hs~~ 190 (222)
..+...|+.||+||.-|-.+--+
T Consensus 200 a~~yyaTl~HElghwtgh~~rl~ 222 (316)
T COG4227 200 AINYYATLLHELGHWTGHEARLD 222 (316)
T ss_pred hHhHHHHHHHHhccccCchhhhh
Confidence 45788999999999998766544
No 100
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=63.36 E-value=3.5 Score=37.23 Aligned_cols=19 Identities=37% Similarity=0.515 Sum_probs=13.9
Q ss_pred hhHHHHHhhhhhcCCCCCC
Q 039411 171 LETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~Hs~ 189 (222)
+..+..||+||||--.-+.
T Consensus 131 vi~~vvHElGHalAA~seg 149 (484)
T KOG2921|consen 131 VITVVVHELGHALAAASEG 149 (484)
T ss_pred HHHHHHHHhhHHHHHHhcC
Confidence 4567899999998654443
No 101
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=61.84 E-value=4.2 Score=32.65 Aligned_cols=13 Identities=38% Similarity=0.652 Sum_probs=11.3
Q ss_pred hhHHHHHhhhhhc
Q 039411 171 LETVALHEIGHLL 183 (222)
Q Consensus 171 ~~~va~HEiGHaL 183 (222)
+.++.+||+||++
T Consensus 9 ~~~i~~HE~aHa~ 21 (181)
T cd06158 9 LLAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4578999999998
No 102
>PF14891 Peptidase_M91: Effector protein
Probab=61.23 E-value=8.7 Score=30.55 Aligned_cols=17 Identities=24% Similarity=0.102 Sum_probs=13.2
Q ss_pred chhHHHHHhhhhhcCCC
Q 039411 170 DLETVALHEIGHLLGLD 186 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~ 186 (222)
.-+.++.|||.||.-..
T Consensus 102 ~p~v~L~HEL~HA~~~~ 118 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYM 118 (174)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34679999999997543
No 103
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=61.08 E-value=5.2 Score=31.38 Aligned_cols=27 Identities=19% Similarity=0.318 Sum_probs=23.9
Q ss_pred ChhhhhcCCC-CCccCCCHHHHHHhcCC
Q 039411 1 MKTYQLNYHL-NSTGVLDGNTVAMMTRP 27 (222)
Q Consensus 1 i~~~Q~~~~L-~~TG~ld~~T~~~m~~p 27 (222)
|+.||+..+| +++|.++++|...+...
T Consensus 78 v~~fQ~~~gl~~~dG~~g~~t~~al~~~ 105 (185)
T COG3409 78 VRAFQQKNGLSPVDGIVGPATRAALPSQ 105 (185)
T ss_pred HHHHhhhcCcccccccccHHHHHHHHHH
Confidence 4789999999 89999999998888654
No 104
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=60.54 E-value=3.8 Score=37.20 Aligned_cols=15 Identities=47% Similarity=0.705 Sum_probs=11.2
Q ss_pred hHHHHHhhhhhc---CCC
Q 039411 172 ETVALHEIGHLL---GLD 186 (222)
Q Consensus 172 ~~va~HEiGHaL---GL~ 186 (222)
..+++||+||+| ||.
T Consensus 160 l~t~iHE~GHalye~~l~ 177 (396)
T cd06460 160 LFSTIHETGHALYEQGLP 177 (396)
T ss_pred HHHHHHHhhHHHHHhcCC
Confidence 347799999995 554
No 105
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=60.54 E-value=4.7 Score=37.19 Aligned_cols=12 Identities=50% Similarity=0.728 Sum_probs=9.0
Q ss_pred hhHHHHHhhhhhc
Q 039411 171 LETVALHEIGHLL 183 (222)
Q Consensus 171 ~~~va~HEiGHaL 183 (222)
+++| +||.||||
T Consensus 260 L~g~-iHE~GHAl 271 (497)
T COG2317 260 LFGT-IHETGHAL 271 (497)
T ss_pred HHHH-HHhhhhHH
Confidence 4444 89999995
No 106
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=60.51 E-value=24 Score=34.06 Aligned_cols=33 Identities=15% Similarity=0.298 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHhcccceEEEecCC--CCceEEEEE
Q 039411 91 NAVGRAFQTWAAISHFKFSQTQDY--TNANLKISF 123 (222)
Q Consensus 91 ~~i~~A~~~W~~~~~l~F~ev~~~--~~adI~I~f 123 (222)
.++..-|......-+|.|.|+++. -.+|++|.-
T Consensus 392 ~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~y~ 426 (718)
T KOG2089|consen 392 VVLSGLFGIYQTLFGLKFEEATDAEVWHADVRVYT 426 (718)
T ss_pred HHHHHHHHHHHHHhCceeeecCCchhcccceeEEe
Confidence 466777888888999999998863 257888643
No 107
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=60.07 E-value=4 Score=39.04 Aligned_cols=15 Identities=27% Similarity=0.355 Sum_probs=12.1
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
..-..|++||+||++
T Consensus 378 ~~dV~TLaHElGHs~ 392 (598)
T COG1164 378 LRDVFTLAHELGHSV 392 (598)
T ss_pred hhHHHHHHHHccHHH
Confidence 345579999999996
No 108
>PRK10911 oligopeptidase A; Provisional
Probab=58.56 E-value=4.2 Score=39.51 Aligned_cols=22 Identities=18% Similarity=0.191 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHhcccceEEEec
Q 039411 91 NAVGRAFQTWAAISHFKFSQTQ 112 (222)
Q Consensus 91 ~~i~~A~~~W~~~~~l~F~ev~ 112 (222)
.++..-|..-+..=+++|.++.
T Consensus 361 ~v~~gl~~~~~~Lfgi~~~e~~ 382 (680)
T PRK10911 361 KAVNGLFEVVKRIYGITAKERK 382 (680)
T ss_pred HHHHHHHHHHHHHcCeeEEecC
Confidence 4455566666677778898864
No 109
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=57.98 E-value=5.3 Score=32.88 Aligned_cols=15 Identities=33% Similarity=0.543 Sum_probs=12.8
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
.....+|.||-||||
T Consensus 26 ~~~~~~A~HEAGhAv 40 (213)
T PF01434_consen 26 EEKRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 466789999999995
No 110
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=57.67 E-value=5.9 Score=35.27 Aligned_cols=16 Identities=31% Similarity=0.617 Sum_probs=13.5
Q ss_pred ccchhHHHHHhhhhhc
Q 039411 168 AFDLETVALHEIGHLL 183 (222)
Q Consensus 168 ~~~~~~va~HEiGHaL 183 (222)
...+..|+.||+||+.
T Consensus 121 ~sEvAAVl~HEmgHVt 136 (479)
T COG4784 121 SSEVAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHHHHhhhhhee
Confidence 3578899999999985
No 111
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=57.09 E-value=5.5 Score=36.36 Aligned_cols=14 Identities=43% Similarity=0.639 Sum_probs=12.4
Q ss_pred cchhHHHHHhhhhh
Q 039411 169 FDLETVALHEIGHL 182 (222)
Q Consensus 169 ~~~~~va~HEiGHa 182 (222)
..+.+|++||+||-
T Consensus 278 eel~AVl~HELGHW 291 (428)
T KOG2719|consen 278 EELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHHhhHH
Confidence 46889999999996
No 112
>PF02102 Peptidase_M35: Deuterolysin metalloprotease (M35) family; InterPro: IPR001384 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M35 (deuterolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Deuterolysin is a microbial zinc-containing metalloprotease that shows some similarity to thermolysin []. The protein is expressed with a possible 19-residue signal sequence, a 155-residue propeptide, and an active peptide of 177 residues []. The latter contains an HEXXH motif towards the C terminus, but the other zinc ligands are as yet undetermined [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 1EB6_A.
Probab=56.92 E-value=13 Score=33.29 Aligned_cols=47 Identities=23% Similarity=0.115 Sum_probs=23.7
Q ss_pred CCeeeeecCCCCceeeecCCCCcccCCC---CCccchhHHHHHhhhhhcCC
Q 039411 138 GGTIAHAAPPRGGIFDYDADEQWAVGAR---PGAFDLETVALHEIGHLLGL 185 (222)
Q Consensus 138 g~~la~a~~P~~g~i~fd~~~~w~~~~~---~~~~~~~~va~HEiGHaLGL 185 (222)
.|++|.+. |..+.|.+-....+.+... -.+.|.-.+++||+-|+.+.
T Consensus 262 ~~vlAYT~-p~~~~I~~Cp~ff~~lp~~~~~C~~qDqatt~LHE~TH~~~V 311 (359)
T PF02102_consen 262 SGVLAYTL-PSQNQIVNCPIFFSDLPALSNRCHAQDQATTTLHEMTHAPAV 311 (359)
T ss_dssp TT--EEEE-GGGTEEEE-HHHHHH--SS--STT---HHHHHHHHHHT-TTT
T ss_pred CCeEEEEE-cCCCeEEECchhhccCCCccccccCCCccchhhhhhhccccc
Confidence 46888886 4455665432221122221 13457778999999999987
No 113
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=56.53 E-value=81 Score=25.91 Aligned_cols=38 Identities=16% Similarity=-0.003 Sum_probs=23.5
Q ss_pred CCCceeeecCCCCcccCC-CCCccchhHHHHHhhhhhcC
Q 039411 147 PRGGIFDYDADEQWAVGA-RPGAFDLETVALHEIGHLLG 184 (222)
Q Consensus 147 P~~g~i~fd~~~~w~~~~-~~~~~~~~~va~HEiGHaLG 184 (222)
+....|||..+..=.... ......+..|+.||+-|+.=
T Consensus 71 ~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~Q 109 (205)
T PF04450_consen 71 DDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCWQ 109 (205)
T ss_pred CCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHhh
Confidence 456789997643211111 11124688999999999943
No 114
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=56.21 E-value=5.6 Score=36.26 Aligned_cols=12 Identities=42% Similarity=0.844 Sum_probs=10.0
Q ss_pred hHHHHHhhhhhc
Q 039411 172 ETVALHEIGHLL 183 (222)
Q Consensus 172 ~~va~HEiGHaL 183 (222)
..|++||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 469999999964
No 115
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=54.77 E-value=5.4 Score=36.67 Aligned_cols=12 Identities=33% Similarity=0.744 Sum_probs=9.5
Q ss_pred hHHHHHhhhhhc
Q 039411 172 ETVALHEIGHLL 183 (222)
Q Consensus 172 ~~va~HEiGHaL 183 (222)
.-|++||+||-|
T Consensus 16 ~li~vHElGHfl 27 (449)
T PRK10779 16 VLITVHEFGHFW 27 (449)
T ss_pred HHHHHHHHHHHH
Confidence 358899999953
No 116
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=52.69 E-value=6.9 Score=36.99 Aligned_cols=15 Identities=47% Similarity=0.689 Sum_probs=11.3
Q ss_pred hhHHHHHhh-hhhcCC
Q 039411 171 LETVALHEI-GHLLGL 185 (222)
Q Consensus 171 ~~~va~HEi-GHaLGL 185 (222)
...|++||+ ||..|=
T Consensus 293 ~~~v~lHEllGHGsGk 308 (549)
T PF03571_consen 293 FVQVGLHELLGHGSGK 308 (549)
T ss_dssp HHHHHHHHHCCCCS--
T ss_pred hhhhhHHhhccCcCcc
Confidence 457999998 999984
No 117
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=51.08 E-value=7.9 Score=25.28 Aligned_cols=21 Identities=29% Similarity=0.515 Sum_probs=17.1
Q ss_pred chhHHHHHhhhhhcCCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~ 190 (222)
....-++||+.+.+||.|.+.
T Consensus 28 ~~eR~~vH~lA~~~gL~s~S~ 48 (60)
T cd02641 28 SHDRLLVHELAEELGLRHEST 48 (60)
T ss_pred HHHHHHHHHHHHHcCCceEee
Confidence 445678999999999998653
No 118
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=50.15 E-value=7.5 Score=33.42 Aligned_cols=39 Identities=23% Similarity=0.216 Sum_probs=22.7
Q ss_pred cCCCCceeeecCCCCcccC---CCCCccchhHHHHHhhhhhc
Q 039411 145 APPRGGIFDYDADEQWAVG---ARPGAFDLETVALHEIGHLL 183 (222)
Q Consensus 145 ~~P~~g~i~fd~~~~w~~~---~~~~~~~~~~va~HEiGHaL 183 (222)
|-|...++.||....-.+. ...+.....-|++||.||.+
T Consensus 138 YCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHHV 179 (295)
T COG2321 138 YCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHHV 179 (295)
T ss_pred ecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHHH
Confidence 5578888888864211110 01122334569999999963
No 119
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=49.58 E-value=7.3 Score=37.92 Aligned_cols=32 Identities=16% Similarity=0.228 Sum_probs=21.5
Q ss_pred HHHH-HHHHHHHHhcccceEEEecCC--CCceEEE
Q 039411 90 INAV-GRAFQTWAAISHFKFSQTQDY--TNANLKI 121 (222)
Q Consensus 90 ~~~i-~~A~~~W~~~~~l~F~ev~~~--~~adI~I 121 (222)
..++ .-.|..-+..=+|+|+++.+. -..|+++
T Consensus 363 ~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~ 397 (681)
T PRK10280 363 NTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV 397 (681)
T ss_pred HHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence 3566 467777888899999996532 2456554
No 120
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=46.91 E-value=10 Score=33.66 Aligned_cols=19 Identities=42% Similarity=0.518 Sum_probs=14.9
Q ss_pred chhHHHHHhhhhhcCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~ 189 (222)
.+..|+.||++|++ +.|+.
T Consensus 274 glAtvLgHE~aHaV-arH~A 292 (424)
T KOG2661|consen 274 GLATVLGHEIAHAV-ARHAA 292 (424)
T ss_pred HHHHHHHHHHHHHH-HHHHH
Confidence 46789999999997 45654
No 121
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=44.96 E-value=9.5 Score=32.45 Aligned_cols=21 Identities=29% Similarity=0.479 Sum_probs=14.9
Q ss_pred ccchhHHHHHhhhhhcCCCCC
Q 039411 168 AFDLETVALHEIGHLLGLDHS 188 (222)
Q Consensus 168 ~~~~~~va~HEiGHaLGL~Hs 188 (222)
..+-.+|++||+.|.|=+..-
T Consensus 151 ~~dg~NVviHEfAH~LD~~~g 171 (253)
T PF06167_consen 151 PNDGHNVVIHEFAHKLDMEDG 171 (253)
T ss_dssp SSSS--HHHHHHHHHHHCTTS
T ss_pred CCCCcchHHHHHHHHHHhhcC
Confidence 346678999999999977543
No 122
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=42.78 E-value=13 Score=30.60 Aligned_cols=16 Identities=31% Similarity=0.351 Sum_probs=12.3
Q ss_pred chhHHHHHhhhhh-cCC
Q 039411 170 DLETVALHEIGHL-LGL 185 (222)
Q Consensus 170 ~~~~va~HEiGHa-LGL 185 (222)
.+-+|++||++|| |=|
T Consensus 92 l~gsiLAHE~mHa~Lrl 108 (212)
T PF12315_consen 92 LTGSILAHELMHAWLRL 108 (212)
T ss_pred HHhhHHHHHHHHHHhcc
Confidence 3457999999999 444
No 123
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=40.26 E-value=16 Score=29.47 Aligned_cols=17 Identities=24% Similarity=0.325 Sum_probs=13.4
Q ss_pred hhHHHHHhhhhhcCCCC
Q 039411 171 LETVALHEIGHLLGLDH 187 (222)
Q Consensus 171 ~~~va~HEiGHaLGL~H 187 (222)
+-.+++|||+|++.-..
T Consensus 36 lG~ilahel~hafd~~g 52 (206)
T PF01431_consen 36 LGFILAHELMHAFDPEG 52 (206)
T ss_dssp HHHHHHHHHHHCTSTTG
T ss_pred HHHHHHHHHHHHHHHhH
Confidence 55799999999886543
No 124
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=39.87 E-value=16 Score=23.99 Aligned_cols=23 Identities=26% Similarity=0.284 Sum_probs=18.2
Q ss_pred chhHHHHHhhhhhcCCCCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSLVEG 192 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~~~ 192 (222)
.-..-++|.+-|-|||.|..+..
T Consensus 28 ~~eRriih~la~~lGL~~~s~G~ 50 (60)
T cd02639 28 PAERRIVHLLASRLGLNHVSDGT 50 (60)
T ss_pred HHHHHHHHHHHHHcCCceEEeCC
Confidence 34467799999999999987644
No 125
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=38.98 E-value=13 Score=35.86 Aligned_cols=33 Identities=9% Similarity=0.191 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHHhcccceEEEecCC--CCceEEEE
Q 039411 90 INAVGRAFQTWAAISHFKFSQTQDY--TNANLKIS 122 (222)
Q Consensus 90 ~~~i~~A~~~W~~~~~l~F~ev~~~--~~adI~I~ 122 (222)
-.+|.-.|..-+..=+|+|+|..+. -.+|+++.
T Consensus 364 ~~Vl~GlF~~~~rLfGI~~~e~~~~~vwHpDVr~~ 398 (683)
T COG0339 364 NKVLEGLFEVAKRLFGITFVERKDIPVWHPDVRVF 398 (683)
T ss_pred hHHHHHHHHHHHHHcCeEEEECCCCCccCCCceEE
Confidence 3567777888888999999997432 24566643
No 126
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=38.55 E-value=16 Score=34.02 Aligned_cols=17 Identities=24% Similarity=0.325 Sum_probs=13.3
Q ss_pred cchhHHHHHhhhhhcCC
Q 039411 169 FDLETVALHEIGHLLGL 185 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL 185 (222)
.+-+.|+.||+||+...
T Consensus 245 ~~D~~t~~HE~GH~~yy 261 (477)
T cd06461 245 MEDFVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34567889999999854
No 127
>KOG2090 consensus Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.96 E-value=14 Score=35.19 Aligned_cols=22 Identities=9% Similarity=0.075 Sum_probs=14.2
Q ss_pred ccCCCHHH--HHHhcCCCCCCCCC
Q 039411 13 TGVLDGNT--VAMMTRPRCGVADI 34 (222)
Q Consensus 13 TG~ld~~T--~~~m~~pRCg~pD~ 34 (222)
.++.|.+. +..|++.-|+.++.
T Consensus 318 ~~~~~kel~~i~~mk~k~~~~~~~ 341 (704)
T KOG2090|consen 318 SEKTDKELAVIRDMKKKENNNNNA 341 (704)
T ss_pred hHHHHHHHHHHHHHHhhhcCCCCC
Confidence 34444443 67788888887765
No 128
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=37.57 E-value=9.7 Score=36.07 Aligned_cols=14 Identities=43% Similarity=0.589 Sum_probs=11.3
Q ss_pred hhHHHHHhhhhhcC
Q 039411 171 LETVALHEIGHLLG 184 (222)
Q Consensus 171 ~~~va~HEiGHaLG 184 (222)
+-.|+.|||||..-
T Consensus 487 IGaVIgHEI~HgFD 500 (654)
T COG3590 487 IGAVIGHEIGHGFD 500 (654)
T ss_pred ccceehhhhccccc
Confidence 45799999999753
No 129
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=36.32 E-value=14 Score=31.57 Aligned_cols=26 Identities=23% Similarity=0.300 Sum_probs=18.3
Q ss_pred hHHHHHhhhhh-cCCCCCCCCCCceeeeeC
Q 039411 172 ETVALHEIGHL-LGLDHSLVEGAIMYAYID 200 (222)
Q Consensus 172 ~~va~HEiGHa-LGL~Hs~~~~siM~p~~~ 200 (222)
-.|++||-+|. -|+-.+..... |+|.
T Consensus 150 grv~VhEWAhlRWGVFDEY~~d~---pfY~ 176 (262)
T PF08434_consen 150 GRVFVHEWAHLRWGVFDEYNNDK---PFYI 176 (262)
T ss_pred ceeeeehhhhhccccccccCCCC---ceeE
Confidence 36999999998 88876654332 5554
No 130
>PRK15410 DgsA anti-repressor MtfA; Provisional
Probab=35.21 E-value=18 Score=30.93 Aligned_cols=20 Identities=20% Similarity=0.298 Sum_probs=16.6
Q ss_pred chhHHHHHhhhhhcCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~ 189 (222)
+-.+|++|||-|.|=+.+..
T Consensus 141 dg~NVvIHEFAH~LDm~~G~ 160 (260)
T PRK15410 141 SGFNLIIHEVAHKLDMRNGD 160 (260)
T ss_pred CCcchhHhHHHhHhhhhcCC
Confidence 45789999999999887744
No 131
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=34.86 E-value=9 Score=29.99 Aligned_cols=39 Identities=23% Similarity=0.148 Sum_probs=23.7
Q ss_pred eeeeecCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411 140 TIAHAAPPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL 183 (222)
Q Consensus 140 ~la~a~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL 183 (222)
+-+-||.+.+ +|-||....-. .+...+..|+.||+-|.+
T Consensus 35 taG~ayL~~~-~I~lNP~ll~e----n~~~f~~~vV~HELaHl~ 73 (156)
T COG3091 35 TAGGAYLLKS-EIRLNPKLLEE----NGEDFIEQVVPHELAHLH 73 (156)
T ss_pred hcchhhcccc-ccccCHHHHHH----ccHHHHHHHHHHHHHHHH
Confidence 3345677765 77776522111 122356789999999975
No 132
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=34.49 E-value=20 Score=27.59 Aligned_cols=32 Identities=19% Similarity=0.246 Sum_probs=19.9
Q ss_pred CCceeeecCCCCcccCCCCCccchhHHHHHhhhhhc
Q 039411 148 RGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLL 183 (222)
Q Consensus 148 ~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaL 183 (222)
....|.|+. .+-.......+..|++||+-|++
T Consensus 40 ~~~~I~ln~----~l~~~~~~~~l~~~l~HEm~H~~ 71 (146)
T smart00731 40 KSAEIRLNP----KLLTENGRDRLRETLLHELCHAA 71 (146)
T ss_pred CCCEEEeCH----HHHhhccHHHHHhhHHHHHHHHH
Confidence 356777753 12111122467789999999996
No 133
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=33.80 E-value=15 Score=34.30 Aligned_cols=15 Identities=47% Similarity=0.795 Sum_probs=10.6
Q ss_pred HHHHhhhhhc---CCCCC
Q 039411 174 VALHEIGHLL---GLDHS 188 (222)
Q Consensus 174 va~HEiGHaL---GL~Hs 188 (222)
-++||.||+| ||...
T Consensus 262 s~iHE~GHalYEq~~~~~ 279 (494)
T PF02074_consen 262 STIHETGHALYEQGLPPE 279 (494)
T ss_dssp HHHHHHHHHHHHHTS-GG
T ss_pred HHHHHHHHHHHHcCCChh
Confidence 4489999995 77543
No 134
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=33.48 E-value=23 Score=27.17 Aligned_cols=16 Identities=31% Similarity=0.409 Sum_probs=13.6
Q ss_pred cchhHHHHHhhhhhcC
Q 039411 169 FDLETVALHEIGHLLG 184 (222)
Q Consensus 169 ~~~~~va~HEiGHaLG 184 (222)
..+..|++||+=|+.=
T Consensus 58 ~~~~~tL~HEm~H~~~ 73 (157)
T PF10263_consen 58 EELIDTLLHEMAHAAA 73 (157)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4678899999999964
No 135
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=33.38 E-value=1.2e+02 Score=24.47 Aligned_cols=19 Identities=21% Similarity=0.258 Sum_probs=15.6
Q ss_pred cchhHHHHHhhhhhcCCCC
Q 039411 169 FDLETVALHEIGHLLGLDH 187 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~H 187 (222)
..+..+++||+-|++=+..
T Consensus 63 ~~l~~~iaHE~hH~~r~~~ 81 (195)
T PF10026_consen 63 EELPALIAHEYHHNCRYEQ 81 (195)
T ss_pred HHHHHHHHHHHHHHHHHhc
Confidence 4788999999999976554
No 136
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=32.91 E-value=21 Score=34.43 Aligned_cols=14 Identities=43% Similarity=0.698 Sum_probs=12.0
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
....|+.||+||||
T Consensus 407 ~~~~~a~he~gha~ 420 (644)
T PRK10733 407 QKESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHHH
Confidence 45679999999996
No 137
>PHA00657 crystallin beta/gamma motif-containing protein
Probab=32.33 E-value=34 Score=35.69 Aligned_cols=13 Identities=38% Similarity=0.475 Sum_probs=11.2
Q ss_pred chhHHHHHhhhhh
Q 039411 170 DLETVALHEIGHL 182 (222)
Q Consensus 170 ~~~~va~HEiGHa 182 (222)
.-+++.+||+||.
T Consensus 790 AD~STFvHE~gH~ 802 (2052)
T PHA00657 790 ADLSTFLHESGHF 802 (2052)
T ss_pred ccHHHHHHHHHHH
Confidence 4568999999998
No 138
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=31.01 E-value=21 Score=30.67 Aligned_cols=10 Identities=40% Similarity=0.547 Sum_probs=8.4
Q ss_pred HHHHhhhhhc
Q 039411 174 VALHEIGHLL 183 (222)
Q Consensus 174 va~HEiGHaL 183 (222)
-..|||||.+
T Consensus 222 G~~HE~GH~~ 231 (307)
T PF13402_consen 222 GPWHELGHNH 231 (307)
T ss_dssp HHHHHHHHHH
T ss_pred eehhhhhhhc
Confidence 4689999985
No 139
>CHL00176 ftsH cell division protein; Validated
Probab=30.67 E-value=21 Score=34.49 Aligned_cols=14 Identities=36% Similarity=0.752 Sum_probs=11.9
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
....||.||.||||
T Consensus 437 ~~~~vA~hEaGhA~ 450 (638)
T CHL00176 437 NKRLIAYHEVGHAI 450 (638)
T ss_pred HHHHHHHHhhhhHH
Confidence 45679999999997
No 140
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=30.33 E-value=1e+02 Score=26.36 Aligned_cols=78 Identities=17% Similarity=0.254 Sum_probs=44.4
Q ss_pred HHHHHHHHHHHHHh--cccceEEEecCCCCceEEEEEeecCCCCCCCCCCCCCeeeeecCCCCceeeecCCCCcccCCCC
Q 039411 89 AINAVGRAFQTWAA--ISHFKFSQTQDYTNANLKISFQRGWHGDSSPFDGRGGTIAHAAPPRGGIFDYDADEQWAVGARP 166 (222)
Q Consensus 89 ~~~~i~~A~~~W~~--~~~l~F~ev~~~~~adI~I~f~~~~~gd~~~f~g~g~~la~a~~P~~g~i~fd~~~~w~~~~~~ 166 (222)
+....+++...|+. |..+.|.- ... ++.|+..++. .| .|+. .|.|++ |-... .
T Consensus 81 ~ek~l~~~~~k~~~pnv~~v~f~V-~~~-dp~irlqkg~--kg-------gg~~--------~~ki~l-----~l~p~-~ 135 (280)
T COG5504 81 CEKYLPELIVKWYGPNVPIVDFPV-TPG-DPQIRLQKGR--KG-------GGGI--------PGKIML-----WLVPS-S 135 (280)
T ss_pred HHHHHHHHHHhhcCCCCceeeecc-cCC-CchHHHhhcc--cC-------CCCc--------CceEEE-----EEecC-C
Confidence 45567778888865 45566633 222 3445554443 11 1221 355554 21111 1
Q ss_pred CccchhHHHHHhhhhhcCCCCCCCC
Q 039411 167 GAFDLETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 167 ~~~~~~~va~HEiGHaLGL~Hs~~~ 191 (222)
.-.+..++++||+=|..-|.+-.-.
T Consensus 136 ~~~~v~aliaHE~HH~~R~~~i~~~ 160 (280)
T COG5504 136 TITSVPALIAHEYHHNCRLRYIDYG 160 (280)
T ss_pred CccchHHHHHHHHHhhheecccccC
Confidence 2236789999999999999887654
No 141
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=30.18 E-value=30 Score=22.58 Aligned_cols=20 Identities=25% Similarity=0.380 Sum_probs=16.5
Q ss_pred chhHHHHHhhhhhcCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~ 189 (222)
....-++|++-+.+||.|.+
T Consensus 28 ~~eR~~vH~~a~~~gL~s~S 47 (60)
T cd02640 28 KEERALIHQIAQKYGLKSRS 47 (60)
T ss_pred HHHHHHHHHHHHHcCCceee
Confidence 45568899999999999864
No 142
>PRK04860 hypothetical protein; Provisional
Probab=29.68 E-value=15 Score=29.08 Aligned_cols=32 Identities=16% Similarity=0.135 Sum_probs=19.4
Q ss_pred CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411 147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL 182 (222)
Q Consensus 147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa 182 (222)
.....|.||... +.. .+...+..|+.||+-|.
T Consensus 43 l~~~~I~~Np~l---l~~-~~~~~l~~~v~HEl~H~ 74 (160)
T PRK04860 43 LQSNEIRLNPVL---LLE-NQQAFIDEVVPHELAHL 74 (160)
T ss_pred HhcCCeeeCHHH---Hhh-CcHHHHHhHHHHHHHHH
Confidence 345667776421 111 12346778999999986
No 143
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.63 E-value=15 Score=34.63 Aligned_cols=18 Identities=44% Similarity=0.560 Sum_probs=15.6
Q ss_pred HHHHHhhhhhcCCCCCCC
Q 039411 173 TVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 173 ~va~HEiGHaLGL~Hs~~ 190 (222)
.|.+||.||-|||..+.+
T Consensus 348 Gv~ahEyghdlglPDEyd 365 (760)
T COG4412 348 GVEAHEYGHDLGLPDEYD 365 (760)
T ss_pred ceeeeecccccCCCcccc
Confidence 489999999999997654
No 144
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=28.56 E-value=38 Score=30.48 Aligned_cols=16 Identities=19% Similarity=0.320 Sum_probs=13.7
Q ss_pred cchhHHHHHhhhhhcC
Q 039411 169 FDLETVALHEIGHLLG 184 (222)
Q Consensus 169 ~~~~~va~HEiGHaLG 184 (222)
....+|++||+=|.+=
T Consensus 137 ~~~~sTlAHEfQHmIn 152 (366)
T PF10460_consen 137 DTVYSTLAHEFQHMIN 152 (366)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4578999999999983
No 145
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=28.48 E-value=49 Score=26.51 Aligned_cols=35 Identities=17% Similarity=0.030 Sum_probs=21.8
Q ss_pred cCCCCceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCC
Q 039411 145 APPRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGL 185 (222)
Q Consensus 145 ~~P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL 185 (222)
|.|....|.+-.+... ....+..|++||+=||.--
T Consensus 51 f~p~~~~I~lC~N~~~------~~~~l~~~l~HELIHayD~ 85 (173)
T PF09768_consen 51 FDPSKKGIVLCQNRIR------SQGHLEDTLTHELIHAYDH 85 (173)
T ss_pred ccCCCCCEEEeeCCCC------CHHHHHHHHHHHHHHHHHH
Confidence 5564444555443322 2347889999999999643
No 146
>cd06007 R3H_DEXH_helicase R3H domain of a group of proteins which also contain a DEXH-box helicase domain, and may function as ATP-dependent DNA or RNA helicases. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=28.23 E-value=33 Score=22.31 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=16.7
Q ss_pred chhHHHHHhhhhhcCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSL 189 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~ 189 (222)
....-++|++-+.+||.|.+
T Consensus 27 ~~eR~~vH~~a~~~gL~s~S 46 (59)
T cd06007 27 NHERAVIHRLCRKLGLKSKS 46 (59)
T ss_pred HHHHHHHHHHHHHcCCCcee
Confidence 45568899999999999864
No 147
>KOG1047 consensus Bifunctional leukotriene A4 hydrolase/aminopeptidase LTA4H [Lipid transport and metabolism; Posttranslational modification, protein turnover, chaperones; Defense mechanisms; Amino acid transport and metabolism]
Probab=27.98 E-value=31 Score=32.71 Aligned_cols=20 Identities=40% Similarity=0.391 Sum_probs=15.1
Q ss_pred CccchhHHHHHhhhhh-cCCC
Q 039411 167 GAFDLETVALHEIGHL-LGLD 186 (222)
Q Consensus 167 ~~~~~~~va~HEiGHa-LGL~ 186 (222)
+...+..|++|||-|+ -|..
T Consensus 284 GDrsl~~vIaHEIAHSWtGNl 304 (613)
T KOG1047|consen 284 GDRSLVDVIAHEIAHSWTGNL 304 (613)
T ss_pred CCcchhhHHHHHhhhhhcccc
Confidence 4467789999999887 5554
No 148
>PRK04351 hypothetical protein; Provisional
Probab=26.59 E-value=27 Score=27.26 Aligned_cols=32 Identities=16% Similarity=0.216 Sum_probs=19.6
Q ss_pred CCCceeeecCCCCcccCCCCCccchhHHHHHhhhhh
Q 039411 147 PRGGIFDYDADEQWAVGARPGAFDLETVALHEIGHL 182 (222)
Q Consensus 147 P~~g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHa 182 (222)
.....|.|+..- -...+...+..|+.||+-|.
T Consensus 41 l~~~~I~lnp~l----l~~~~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 41 LKDHHIEFNPKM----LEEYGLEELIGIIKHELCHY 72 (149)
T ss_pred cCCCeEEeCHHH----HhhccHHHHHhhHHHHHHHH
Confidence 445667775321 11112346788999999997
No 149
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=25.56 E-value=48 Score=25.59 Aligned_cols=39 Identities=13% Similarity=0.060 Sum_probs=19.4
Q ss_pred ceeeecCCCCcccCCCCCccchhHHHHHhhhhhcCCCCCCC
Q 039411 150 GIFDYDADEQWAVGARPGAFDLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 150 g~i~fd~~~~w~~~~~~~~~~~~~va~HEiGHaLGL~Hs~~ 190 (222)
..|.+-. ..|.... .+......|++||+=|..=.+.+.|
T Consensus 77 ~~IyLc~-~F~~~p~-~g~~Sk~~TLiHE~SHf~~~~~T~D 115 (148)
T PF14521_consen 77 YTIYLCP-AFFSAPT-TGKDSKEGTLIHEWSHFTAVGGTDD 115 (148)
T ss_dssp TEEEE-H-HHHHS-S-SSTT-HHHHHHHHHHHSCCCT--B-
T ss_pred eEEEECh-hhcCCCC-CCCCchHHHHHHhhhhhhhhcCCCc
Confidence 4566543 2343322 2334678999999999433444444
No 150
>PF13058 DUF3920: Protein of unknown function (DUF3920)
Probab=23.77 E-value=46 Score=24.68 Aligned_cols=15 Identities=27% Similarity=0.423 Sum_probs=13.1
Q ss_pred chhHHHHHhhhhhcC
Q 039411 170 DLETVALHEIGHLLG 184 (222)
Q Consensus 170 ~~~~va~HEiGHaLG 184 (222)
.+..++.||+-|++-
T Consensus 75 qvlktllhefrh~mQ 89 (126)
T PF13058_consen 75 QVLKTLLHEFRHAMQ 89 (126)
T ss_pred HHHHHHHHHHHHHHH
Confidence 567899999999986
No 151
>TIGR00043 metalloprotein, YbeY/UPF0054 family. This minimally protein family is represented by a single member sequence only in nearly every bacterium. Crystallography demonstrates metal-binding activity, possibly to nickel. It is a predicted to be a metallohydrolase, and somewhat more weakly to be a metalloproteinase.
Probab=22.71 E-value=69 Score=23.48 Aligned_cols=22 Identities=36% Similarity=0.685 Sum_probs=18.9
Q ss_pred chhHHHHHhhhhhcCCCCCCCC
Q 039411 170 DLETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 170 ~~~~va~HEiGHaLGL~Hs~~~ 191 (222)
.+..+++|-+=|.||..|..+.
T Consensus 72 e~~~l~iHG~LHLlGyDH~~~~ 93 (110)
T TIGR00043 72 ELAHLTVHGLLHLLGYDHETED 93 (110)
T ss_pred HHHHHHHHHHHHHcCCCCCChH
Confidence 5677999999999999997653
No 152
>PF02130 UPF0054: Uncharacterized protein family UPF0054; InterPro: IPR002036 These, as yet, uncharacterised proteins are of 17 to 21 kDa. They contain a conserved region with three histidines at the C terminus. The protein family is represented by a single member sequence only in nearly every bacterium. The crystal structure of the protein from the hyperthermophilic bacteria Aquifex aeolicus has been determined. The overall fold consists of one central alpha-helix surrounded by a four-stranded beta-sheet and four other alpha-helices. Structure-based homology analysis reveals a good resemblance to the metal-dependent proteinases such as collagenases and gelatinases. However, experimental tests for collagenase and gelatinase-type function show no detectable activity under standard assay conditions [].; GO: 0046872 metal ion binding; PDB: 1TVI_A 1OZ9_A 1XM5_A 1XAX_A.
Probab=22.52 E-value=69 Score=24.65 Aligned_cols=22 Identities=36% Similarity=0.560 Sum_probs=17.7
Q ss_pred cchhHHHHHhhhhhcCCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~ 190 (222)
..+...++|-+-|.||..|..+
T Consensus 106 ~el~~l~vHG~LHLlGyDH~~~ 127 (145)
T PF02130_consen 106 EELARLLVHGLLHLLGYDHETE 127 (145)
T ss_dssp HHHHHHHHHHHHHHTT-SSTTT
T ss_pred HHHhHHHHHHHHHHcCCCCCCH
Confidence 3567799999999999999854
No 153
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=22.35 E-value=34 Score=33.94 Aligned_cols=18 Identities=6% Similarity=0.213 Sum_probs=13.1
Q ss_pred HHHHHHHHhcccceEEEe
Q 039411 94 GRAFQTWAAISHFKFSQT 111 (222)
Q Consensus 94 ~~A~~~W~~~~~l~F~ev 111 (222)
.+.|+...+.++|.|...
T Consensus 173 ~~i~~~Yd~l~Gl~~~~~ 190 (775)
T PF03272_consen 173 NDIFKFYDDLTGLSDDPS 190 (775)
T ss_pred HHHHHHHHhhhCCCCCCC
Confidence 567777778888887654
No 154
>TIGR01241 FtsH_fam ATP-dependent metalloprotease FtsH. HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Probab=22.07 E-value=43 Score=31.09 Aligned_cols=14 Identities=50% Similarity=0.726 Sum_probs=11.6
Q ss_pred chhHHHHHhhhhhc
Q 039411 170 DLETVALHEIGHLL 183 (222)
Q Consensus 170 ~~~~va~HEiGHaL 183 (222)
....++.||.||+|
T Consensus 310 ~~~~~A~hEaGhAl 323 (495)
T TIGR01241 310 EKKLVAYHEAGHAL 323 (495)
T ss_pred HHHHHHHHHHhHHH
Confidence 34579999999996
No 155
>COG5570 Uncharacterized small protein [Function unknown]
Probab=22.04 E-value=44 Score=21.25 Aligned_cols=18 Identities=22% Similarity=0.374 Sum_probs=15.6
Q ss_pred ccCCCHHHHHHhcCCCCC
Q 039411 13 TGVLDGNTVAMMTRPRCG 30 (222)
Q Consensus 13 TG~ld~~T~~~m~~pRCg 30 (222)
-|.|+++..+.|+.|-|-
T Consensus 14 Hg~le~ei~ea~n~Ps~d 31 (57)
T COG5570 14 HGNLEREIQEAMNSPSSD 31 (57)
T ss_pred hchHHHHHHHHhcCCCcc
Confidence 389999999999999774
No 156
>PRK00016 metal-binding heat shock protein; Provisional
Probab=22.01 E-value=63 Score=25.34 Aligned_cols=43 Identities=23% Similarity=0.149 Sum_probs=26.6
Q ss_pred CCceeeecCCCCcccCC---CCCccchhHHHHHhhhhhcCCCCCCC
Q 039411 148 RGGIFDYDADEQWAVGA---RPGAFDLETVALHEIGHLLGLDHSLV 190 (222)
Q Consensus 148 ~~g~i~fd~~~~w~~~~---~~~~~~~~~va~HEiGHaLGL~Hs~~ 190 (222)
.-|+|.+..+.-..... ..-...+..+++|-+-|.||..|..+
T Consensus 87 ~LGDI~Is~~~~~~qa~~~~~s~~~e~~~l~iHG~LHLlGYDH~~~ 132 (159)
T PRK00016 87 YLGDIVICPEVAEEQAEEQGHSLERELAHLTVHGILHLLGYDHIED 132 (159)
T ss_pred CceeEEEcHHHHHHHHHHcCCCHHHHHHHHHHHhhHHhcCCCCCCh
Confidence 35888775432111100 00123567799999999999999764
No 157
>KOG3314 consensus Ku70-binding protein [Replication, recombination and repair]
Probab=21.38 E-value=73 Score=25.42 Aligned_cols=15 Identities=27% Similarity=0.437 Sum_probs=12.4
Q ss_pred cchhHHHHHhhhhhc
Q 039411 169 FDLETVALHEIGHLL 183 (222)
Q Consensus 169 ~~~~~va~HEiGHaL 183 (222)
..+..|++||+=|+.
T Consensus 89 ~h~n~vv~HElIH~f 103 (194)
T KOG3314|consen 89 DHVNQVVIHELIHAF 103 (194)
T ss_pred HHHHHHHHHHHHHHH
Confidence 356789999999985
No 158
>KOG3624 consensus M13 family peptidase [Amino acid transport and metabolism]
Probab=21.26 E-value=43 Score=32.48 Aligned_cols=14 Identities=29% Similarity=0.311 Sum_probs=11.0
Q ss_pred hhHHHHHhhhhhcC
Q 039411 171 LETVALHEIGHLLG 184 (222)
Q Consensus 171 ~~~va~HEiGHaLG 184 (222)
+-.|+.|||+|++-
T Consensus 518 iG~vigHEl~H~FD 531 (687)
T KOG3624|consen 518 IGFVIGHELTHGFD 531 (687)
T ss_pred HHHHHHHHHhhccc
Confidence 44699999999653
No 159
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=20.88 E-value=42 Score=27.67 Aligned_cols=19 Identities=32% Similarity=0.427 Sum_probs=14.9
Q ss_pred hHHHHHhhhhhcCCCCCCCC
Q 039411 172 ETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 172 ~~va~HEiGHaLGL~Hs~~~ 191 (222)
.-++.||++| +=|+|....
T Consensus 102 ~fil~HE~~H-v~~~h~~~~ 120 (206)
T PF10463_consen 102 AFILLHELAH-VVLGHEGDS 120 (206)
T ss_pred HHHHHHHHHH-HHHcCcccc
Confidence 3488999999 558887764
No 160
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=20.41 E-value=52 Score=28.17 Aligned_cols=22 Identities=32% Similarity=0.347 Sum_probs=16.9
Q ss_pred cchhHHHHHhhhhhcCCCCCCCC
Q 039411 169 FDLETVALHEIGHLLGLDHSLVE 191 (222)
Q Consensus 169 ~~~~~va~HEiGHaLGL~Hs~~~ 191 (222)
..+..|++||+-|++ |.|..-.
T Consensus 58 ~~~~~~l~HevlH~~-~~H~~r~ 79 (292)
T PF13203_consen 58 EERVGLLLHEVLHCL-LRHPWRR 79 (292)
T ss_pred HHHHHHHHHHHHHHH-ccchhhh
Confidence 356789999999998 7776543
No 161
>PF15639 Tox-MPTase3: Metallopeptidase toxin 3
Probab=20.23 E-value=1.8e+02 Score=22.22 Aligned_cols=36 Identities=19% Similarity=0.395 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHhcccceEEEecC-CCCceEEEEEeec
Q 039411 91 NAVGRAFQTWAAISHFKFSQTQD-YTNANLKISFQRG 126 (222)
Q Consensus 91 ~~i~~A~~~W~~~~~l~F~ev~~-~~~adI~I~f~~~ 126 (222)
..+-+|+..|...+....++.=. +..+.|+|.-..+
T Consensus 23 p~iv~A~~K~~~l~~~~l~~~l~WGkGP~IkI~~~~~ 59 (135)
T PF15639_consen 23 PAIVKAYKKWGGLNKAKLQEALTWGKGPYIKITPLVG 59 (135)
T ss_pred hHHHHHHHHHcccchhhhhhccccCCCCEEEEeeccc
Confidence 46778888887765544444321 2367888876554
No 162
>PF15127 DUF4565: Protein of unknown function (DUF4565)
Probab=20.08 E-value=89 Score=22.20 Aligned_cols=14 Identities=29% Similarity=0.501 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHhc
Q 039411 90 INAVGRAFQTWAAI 103 (222)
Q Consensus 90 ~~~i~~A~~~W~~~ 103 (222)
++++..|++.|...
T Consensus 63 qEIl~dAlqQWA~~ 76 (91)
T PF15127_consen 63 QEILSDALQQWAEN 76 (91)
T ss_pred HHHHHHHHHHHHHh
Confidence 57899999999864
Done!