Query 021444
Match_columns 312
No_of_seqs 270 out of 1540
Neff 7.1
Searched_HMMs 46136
Date Fri Mar 29 03:00:43 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/021444.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/021444hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 2.8E-56 6E-61 438.2 20.4 235 3-275 29-268 (469)
2 cd04278 ZnMc_MMP Zinc-dependen 100.0 3.9E-36 8.5E-41 257.2 15.4 153 108-268 1-157 (157)
3 PF00413 Peptidase_M10: Matrix 100.0 2.2E-34 4.7E-39 244.2 15.1 149 108-268 1-154 (154)
4 cd04279 ZnMc_MMP_like_1 Zinc-d 100.0 5.7E-28 1.2E-32 206.4 13.9 148 114-268 2-156 (156)
5 cd04268 ZnMc_MMP_like Zinc-dep 99.9 5E-24 1.1E-28 182.7 9.3 143 113-267 1-165 (165)
6 smart00235 ZnMc Zinc-dependent 99.9 1.6E-23 3.5E-28 175.6 10.5 134 107-269 3-140 (140)
7 cd04277 ZnMc_serralysin_like Z 99.9 6.7E-23 1.5E-27 180.0 13.4 125 127-268 32-186 (186)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 99.7 1.8E-17 3.9E-22 147.2 12.3 109 108-236 2-110 (198)
9 cd00203 ZnMc Zinc-dependent me 99.7 6.7E-17 1.5E-21 138.5 13.0 121 126-267 19-167 (167)
10 cd04276 ZnMc_MMP_like_2 Zinc-d 99.6 2.1E-14 4.6E-19 127.2 11.1 146 112-269 6-197 (197)
11 cd04280 ZnMc_astacin_like Zinc 99.5 1.1E-13 2.4E-18 121.2 11.7 92 114-237 2-93 (180)
12 PF12388 Peptidase_M57: Dual-a 99.5 1.5E-13 3.2E-18 122.0 12.2 135 112-267 37-211 (211)
13 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.5 5.7E-13 1.2E-17 118.4 12.9 99 106-236 7-105 (200)
14 PF01400 Astacin: Astacin (Pep 99.4 2.6E-13 5.7E-18 119.9 8.4 96 108-236 2-97 (191)
15 cd04282 ZnMc_meprin Zinc-depen 99.3 1.5E-11 3.3E-16 111.2 12.6 98 105-237 42-139 (230)
16 cd04283 ZnMc_hatching_enzyme Z 99.3 1.2E-11 2.6E-16 108.4 11.0 92 114-236 4-95 (182)
17 KOG3714 Meprin A metalloprotea 99.3 3.1E-11 6.8E-16 118.4 11.6 106 99-236 72-177 (411)
18 PF01471 PG_binding_1: Putativ 99.3 1.4E-11 3.1E-16 87.5 6.2 55 2-59 3-57 (57)
19 COG5549 Predicted Zn-dependent 98.9 6.8E-09 1.5E-13 91.1 9.1 123 128-270 103-235 (236)
20 PF02031 Peptidase_M7: Strepto 98.9 5.7E-09 1.2E-13 85.1 8.1 124 115-268 3-130 (132)
21 TIGR02869 spore_SleB spore cor 97.7 6.5E-05 1.4E-09 67.0 6.6 56 3-61 15-70 (201)
22 cd04267 ZnMc_ADAM_like Zinc-de 97.7 0.00019 4E-09 63.0 9.3 49 216-265 131-189 (192)
23 cd04273 ZnMc_ADAMTS_like Zinc- 97.7 0.00019 4.1E-09 64.0 9.0 119 135-269 73-200 (207)
24 PF13583 Reprolysin_4: Metallo 97.7 5E-05 1.1E-09 67.9 5.2 48 218-267 137-198 (206)
25 PF11350 DUF3152: Protein of u 97.7 0.00013 2.9E-09 64.4 7.4 139 113-263 28-183 (203)
26 PF05572 Peptidase_M43: Pregna 97.4 6.3E-05 1.4E-09 64.3 2.0 25 215-239 66-90 (154)
27 PF13574 Reprolysin_2: Metallo 97.4 4.8E-05 1E-09 66.2 1.0 47 218-266 111-170 (173)
28 COG1913 Predicted Zn-dependent 97.3 9.3E-05 2E-09 63.8 1.7 28 218-245 124-151 (181)
29 cd04275 ZnMc_pappalysin_like Z 97.3 0.00011 2.4E-09 66.6 1.9 75 215-299 134-209 (225)
30 PF13688 Reprolysin_5: Metallo 97.3 9.5E-05 2.1E-09 65.0 1.0 23 216-238 140-162 (196)
31 PRK13267 archaemetzincin-like 97.1 0.00023 4.9E-09 62.4 2.0 27 218-244 125-151 (179)
32 PF13582 Reprolysin_3: Metallo 97.1 0.00015 3.2E-09 58.9 0.8 18 218-235 107-124 (124)
33 PRK10594 murein L,D-transpepti 97.0 0.0018 3.8E-08 66.3 7.1 37 26-62 307-343 (608)
34 PF07998 Peptidase_M54: Peptid 96.9 0.00036 7.8E-09 61.8 1.4 28 217-245 144-171 (194)
35 cd04271 ZnMc_ADAM_fungal Zinc- 96.9 0.0011 2.4E-08 60.3 4.2 49 220-270 147-220 (228)
36 cd04269 ZnMc_adamalysin_II_lik 96.8 0.0017 3.6E-08 57.1 4.8 50 217-268 130-185 (194)
37 cd04270 ZnMc_TACE_like Zinc-de 96.0 0.0083 1.8E-07 55.1 4.4 50 219-268 168-230 (244)
38 PF01421 Reprolysin: Reprolysi 95.9 0.011 2.3E-07 52.2 4.5 51 216-267 129-187 (199)
39 cd04272 ZnMc_salivary_gland_MP 95.5 0.018 3.8E-07 51.8 4.5 51 217-268 144-212 (220)
40 COG3409 Putative peptidoglycan 95.5 0.022 4.8E-07 49.0 4.8 59 3-63 47-106 (185)
41 COG3023 ampD N-acetyl-anhydrom 95.3 0.046 1E-06 50.1 6.4 54 2-60 198-252 (257)
42 COG2989 Uncharacterized protei 95.0 0.048 1E-06 54.8 5.9 58 3-61 240-298 (561)
43 KOG4525 Jacalin-like lectin do 94.9 0.034 7.4E-07 54.4 4.6 74 218-293 301-397 (614)
44 PF10462 Peptidase_M66: Peptid 94.6 0.014 3E-07 55.2 1.2 21 216-236 191-211 (305)
45 PF05548 Peptidase_M11: Gameto 94.5 0.019 4.1E-07 54.7 1.9 19 218-236 150-168 (314)
46 PF12044 Metallopep: Putative 93.4 0.04 8.8E-07 54.4 1.7 57 218-276 316-394 (423)
47 TIGR03296 M6dom_TIGR03296 M6 f 93.2 0.028 6.1E-07 52.8 0.3 20 218-237 165-184 (286)
48 PF11150 DUF2927: Protein of u 91.9 3.7 7.9E-05 37.0 12.2 150 107-271 28-197 (213)
49 PTZ00337 surface protease GP63 91.9 0.18 4E-06 51.7 4.2 36 188-235 211-246 (567)
50 PF01457 Peptidase_M8: Leishma 91.4 0.093 2E-06 53.4 1.6 60 157-234 159-226 (521)
51 PF08823 PG_binding_2: Putativ 91.4 0.66 1.4E-05 34.7 5.7 53 4-59 18-74 (74)
52 COG3824 Predicted Zn-dependent 91.0 0.091 2E-06 42.8 0.8 19 216-234 107-125 (136)
53 PTZ00257 Glycoprotein GP63 (le 90.6 0.48 1E-05 48.9 5.8 97 156-270 204-313 (622)
54 PF04298 Zn_peptidase_2: Putat 90.2 0.35 7.6E-06 43.7 3.9 20 216-237 87-106 (222)
55 PF05547 Peptidase_M6: Immune 88.1 0.13 2.8E-06 53.5 -0.4 20 218-237 221-240 (645)
56 COG2738 Predicted Zn-dependent 87.3 0.45 9.8E-06 42.1 2.5 16 216-231 90-105 (226)
57 PF09471 Peptidase_M64: IgA Pe 87.0 0.25 5.5E-06 45.9 0.8 21 217-237 215-236 (264)
58 KOG3607 Meltrins, fertilins an 85.4 0.76 1.7E-05 48.5 3.5 46 217-263 322-374 (716)
59 KOG3658 Tumor necrosis factor- 83.2 0.85 1.8E-05 47.2 2.6 48 219-266 393-452 (764)
60 PF06262 DUF1025: Possibl zinc 83.1 0.51 1.1E-05 37.2 0.8 17 217-233 72-88 (97)
61 PF09374 PG_binding_3: Predict 78.2 2.5 5.3E-05 31.4 3.0 26 34-59 1-30 (72)
62 PF13398 Peptidase_M50B: Pepti 78.0 0.84 1.8E-05 40.6 0.5 15 217-231 21-35 (200)
63 PF14247 DUF4344: Domain of un 77.8 0.82 1.8E-05 41.4 0.4 14 217-230 91-104 (220)
64 COG4783 Putative Zn-dependent 77.3 0.82 1.8E-05 45.7 0.2 19 216-235 128-146 (484)
65 KOG3538 Disintegrin metallopro 76.7 3.3 7.2E-05 44.7 4.7 34 215-248 314-353 (845)
66 PF06114 DUF955: Domain of unk 76.7 0.99 2.2E-05 35.2 0.6 14 217-230 41-54 (122)
67 COG2856 Predicted Zn peptidase 76.4 1 2.2E-05 40.6 0.6 19 217-237 71-89 (213)
68 COG5549 Predicted Zn-dependent 75.0 1.6 3.5E-05 39.1 1.4 30 218-247 153-182 (236)
69 PF02163 Peptidase_M50: Peptid 70.9 1.6 3.5E-05 37.9 0.5 22 218-239 7-28 (192)
70 cd05709 S2P-M50 Site-2 proteas 70.6 1.7 3.6E-05 37.5 0.5 20 218-237 8-27 (180)
71 cd06163 S2P-M50_PDZ_RseP-like 69.8 1.8 4E-05 37.9 0.6 12 219-230 10-21 (182)
72 PF04228 Zn_peptidase: Putativ 68.8 1.5 3.2E-05 41.4 -0.2 40 186-229 141-181 (292)
73 PF12725 DUF3810: Protein of u 68.6 3.3 7.1E-05 39.6 2.1 38 186-237 178-215 (318)
74 cd06161 S2P-M50_SpoIVFB SpoIVF 67.0 2.2 4.8E-05 38.1 0.5 17 217-233 37-53 (208)
75 PF01435 Peptidase_M48: Peptid 66.5 2.4 5.1E-05 37.4 0.6 20 216-236 87-106 (226)
76 cd06459 M3B_Oligoendopeptidase 62.6 2.9 6.3E-05 40.8 0.5 15 216-230 220-234 (427)
77 cd06164 S2P-M50_SpoIVFB_CBS Sp 62.5 3 6.5E-05 37.8 0.5 17 218-234 53-69 (227)
78 cd06258 Peptidase_M3_like The 62.4 3 6.6E-05 40.0 0.6 16 215-230 151-166 (365)
79 PRK03982 heat shock protein Ht 62.1 3.2 6.9E-05 38.9 0.6 15 216-230 123-137 (288)
80 COG0501 HtpX Zn-dependent prot 61.6 3.3 7.1E-05 38.4 0.6 15 216-230 155-169 (302)
81 PF01432 Peptidase_M3: Peptida 61.0 3.3 7.1E-05 41.2 0.6 25 130-154 136-160 (458)
82 PRK03001 M48 family peptidase; 59.9 3.7 7.9E-05 38.4 0.6 15 216-230 122-136 (283)
83 COG3409 Putative peptidoglycan 59.6 13 0.00028 31.6 4.0 37 24-60 148-184 (185)
84 cd06162 S2P-M50_PDZ_SREBP Ster 59.5 3.6 7.9E-05 38.5 0.5 14 218-231 135-148 (277)
85 cd06159 S2P-M50_PDZ_Arch Uncha 59.2 3.7 8E-05 38.2 0.5 19 218-236 118-136 (263)
86 PRK03072 heat shock protein Ht 59.2 3.8 8.3E-05 38.5 0.6 15 216-230 125-139 (288)
87 PRK01345 heat shock protein Ht 59.0 3.9 8.4E-05 39.0 0.6 18 216-234 122-139 (317)
88 TIGR02289 M3_not_pepF oligoend 58.8 3.7 8.1E-05 42.0 0.5 15 216-230 335-349 (549)
89 PF13699 DUF4157: Domain of un 58.7 8.6 0.00019 29.0 2.4 30 190-230 44-73 (79)
90 cd06455 M3A_TOP Peptidase M3 T 58.3 3.9 8.5E-05 41.1 0.6 14 217-230 262-275 (472)
91 TIGR02290 M3_fam_3 oligoendope 56.5 4.3 9.3E-05 41.9 0.5 15 216-230 373-387 (587)
92 TIGR00181 pepF oligoendopeptid 55.6 5.1 0.00011 41.3 0.9 15 216-230 376-390 (591)
93 PRK04897 heat shock protein Ht 55.5 4.6 0.0001 38.1 0.5 15 216-230 135-149 (298)
94 PF01447 Peptidase_M4: Thermol 55.3 4.9 0.00011 34.2 0.6 12 219-230 136-147 (150)
95 cd06160 S2P-M50_like_2 Unchara 55.0 4.8 0.0001 35.3 0.5 23 218-240 41-63 (183)
96 PF13485 Peptidase_MA_2: Pepti 54.8 5 0.00011 31.4 0.6 14 217-230 24-37 (128)
97 cd06457 M3A_MIP Peptidase M3 m 54.3 5.1 0.00011 40.1 0.6 33 130-162 132-168 (458)
98 PRK05457 heat shock protein Ht 54.3 5 0.00011 37.7 0.5 14 216-229 132-145 (284)
99 PRK02870 heat shock protein Ht 53.9 5.3 0.00011 38.5 0.6 14 216-229 171-184 (336)
100 PRK02391 heat shock protein Ht 53.7 5.1 0.00011 37.8 0.5 14 216-229 131-144 (296)
101 PRK01265 heat shock protein Ht 53.0 5.3 0.00012 38.3 0.5 14 216-229 138-151 (324)
102 KOG2921 Intramembrane metallop 52.7 5.5 0.00012 39.1 0.5 18 218-235 131-148 (484)
103 cd06456 M3A_DCP_Oligopeptidase 52.5 5.7 0.00012 39.4 0.6 32 131-162 107-140 (422)
104 PF08823 PG_binding_2: Putativ 52.0 39 0.00084 25.2 4.9 29 31-60 20-48 (74)
105 PLN02791 Nudix hydrolase homol 51.8 5.8 0.00013 42.4 0.6 21 217-237 584-604 (770)
106 cd06158 S2P-M50_like_1 Unchara 48.4 7.1 0.00015 34.1 0.5 13 218-230 9-21 (181)
107 PRK10779 zinc metallopeptidase 47.4 7.4 0.00016 38.8 0.5 12 219-230 16-27 (449)
108 PHA00024 IX minor coat protein 47.3 8 0.00017 24.3 0.5 15 291-305 10-24 (33)
109 TIGR00054 RIP metalloprotease 47.0 7.6 0.00016 38.5 0.5 12 219-230 15-26 (420)
110 cd06460 M32_Taq Peptidase fami 46.9 7.6 0.00017 38.3 0.5 13 218-230 159-171 (396)
111 COG1164 Oligoendopeptidase F [ 46.7 7.6 0.00016 40.4 0.5 14 217-230 379-392 (598)
112 COG1059 Thermostable 8-oxoguan 46.2 27 0.00059 31.0 3.7 47 7-54 153-199 (210)
113 PF01434 Peptidase_M41: Peptid 43.5 8.9 0.00019 34.3 0.4 15 216-230 26-40 (213)
114 COG4227 Antirestriction protei 41.5 12 0.00027 34.7 1.0 23 215-237 200-222 (316)
115 PF01863 DUF45: Protein of unk 41.4 14 0.0003 32.2 1.3 28 215-242 161-188 (205)
116 PRK10911 oligopeptidase A; Pro 40.7 11 0.00023 39.9 0.5 14 217-230 462-475 (680)
117 PF10281 Ish1: Putative stress 37.7 82 0.0018 20.0 4.2 29 5-37 7-35 (38)
118 PF03571 Peptidase_M49: Peptid 37.5 12 0.00027 38.3 0.4 16 217-232 292-308 (549)
119 KOG2719 Metalloprotease [Gener 37.1 13 0.00029 36.8 0.5 26 216-242 278-303 (428)
120 PF03272 Enhancin: Viral enhan 35.8 52 0.0011 35.4 4.6 22 132-153 170-191 (775)
121 cd02641 R3H_Smubp-2_like R3H d 33.2 19 0.00042 25.6 0.7 21 217-237 28-48 (60)
122 PRK03094 hypothetical protein; 33.2 31 0.00066 26.3 1.8 16 2-17 8-23 (80)
123 COG5570 Uncharacterized small 32.5 43 0.00094 23.3 2.3 31 32-67 3-33 (57)
124 PF10463 Peptidase_U49: Peptid 32.3 14 0.00031 33.1 -0.1 46 218-272 101-146 (206)
125 PF06167 Peptidase_M90: Glucos 31.1 21 0.00046 33.0 0.8 21 215-235 151-171 (253)
126 PF03698 UPF0180: Uncharacteri 29.8 38 0.00081 25.8 1.8 16 2-17 8-23 (80)
127 PRK10280 dipeptidyl carboxypep 29.5 21 0.00046 37.7 0.6 32 131-162 363-397 (681)
128 KOG2661 Peptidase family M48 [ 28.2 23 0.00051 34.1 0.5 19 217-236 274-292 (424)
129 COG3590 PepO Predicted metallo 28.1 19 0.00042 37.0 -0.1 13 218-230 487-499 (654)
130 PF03801 Ndc80_HEC: HEC/Ndc80p 28.1 1.2E+02 0.0027 25.7 5.0 63 4-66 42-108 (157)
131 PF08434 CLCA_N: Calcium-activ 25.5 43 0.00094 31.2 1.8 12 218-229 149-160 (262)
132 COG2321 Predicted metalloprote 25.4 28 0.00061 32.6 0.5 82 185-269 137-224 (295)
133 PRK14749 hypothetical protein; 24.6 47 0.001 20.4 1.2 18 292-309 5-22 (30)
134 PF12315 DUF3633: Protein of u 24.1 33 0.00072 30.8 0.7 14 217-230 92-105 (212)
135 COG2317 Zn-dependent carboxype 23.6 31 0.00067 34.7 0.5 44 254-301 374-424 (497)
136 cd06461 M2_ACE Peptidase famil 23.2 35 0.00075 34.6 0.7 16 217-232 246-261 (477)
137 TIGR02106 cyd_oper_ybgT cyd op 22.7 53 0.0012 20.2 1.2 17 293-309 6-22 (30)
138 PF01431 Peptidase_M13: Peptid 22.7 34 0.00075 29.9 0.5 16 218-233 36-51 (206)
139 TIGR03141 cytochro_ccmD heme e 22.6 61 0.0013 21.6 1.6 22 288-310 2-23 (45)
140 COG5504 Predicted Zn-dependent 22.1 1.1E+02 0.0025 28.4 3.7 78 128-238 79-160 (280)
141 cd02639 R3H_RRM R3H domain of 22.0 42 0.00091 24.0 0.8 22 218-239 29-50 (60)
142 CHL00176 ftsH cell division pr 21.9 35 0.00076 35.8 0.5 14 217-230 437-450 (638)
143 PRK10733 hflB ATP-dependent me 21.8 35 0.00075 35.8 0.4 14 217-230 407-420 (644)
144 PF14891 Peptidase_M91: Effect 21.6 40 0.00088 29.0 0.7 18 217-234 102-119 (174)
145 PF02074 Peptidase_M32: Carbox 20.9 38 0.00082 34.5 0.5 17 219-235 260-279 (494)
146 COG4412 Uncharacterized protei 20.6 34 0.00073 35.1 0.0 51 218-268 346-411 (760)
147 PF13402 M60-like: Peptidase M 20.6 39 0.00085 31.4 0.5 11 220-230 221-231 (307)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=2.8e-56 Score=438.18 Aligned_cols=235 Identities=47% Similarity=0.793 Sum_probs=210.2
Q ss_pred ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcCCCCCCCCCCCCccccCCCCCC
Q 021444 3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILRPRCGNPDIVNGTTTMNSGKPT 82 (312)
Q Consensus 3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~pRCG~pD~~~~~~~~~~~~~~ 82 (312)
++..++.||++|||+++.+...+...++.+++||+.||++++|++||++|.+|++.|++|||||||.
T Consensus 29 ~~~~~~~yl~~~~y~~~~~~~~~~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd~------------- 95 (469)
T KOG1565|consen 29 DKVALQDYLECYGYLPPTDLTATRASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPDG------------- 95 (469)
T ss_pred chhHHHHHhhhcccCCCccccccccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCCC-------------
Confidence 3577899999999998765432222578999999999999999999999999999999999999992
Q ss_pred CCCCCCCCCCCcccccccccCCCCCCCCCCCCceEEeecCC-CCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccE
Q 021444 83 SYNATSSTNGHLHSVGHYSFFPGMPRWPSNKRDLTYAFLPD-NQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADI 160 (312)
Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~kW~~~~~~LtY~i~~~-~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI 160 (312)
+|++++++.|| ++.+|||+|.++ +.++ .+++.++++||+.|+++++++|.|+.....|||
T Consensus 96 ----------------~~~~~~~~~kW--~k~~lT~ri~n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi 157 (469)
T KOG1565|consen 96 ----------------RYRYFPGKPKW--NKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADI 157 (469)
T ss_pred ----------------ccccCcccCcc--cccccceeccccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCce
Confidence 25567788999 788999999987 3344 899999999999999999999999996679999
Q ss_pred EEEeeecCCCCCCCcCCCCCcceeeeCCC---CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444 161 RIGFFTGDHGDGEPFDGPMGTLAHAFSPP---NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 161 ~I~f~~~~hgd~~~fdG~gg~lAhA~~P~---~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
+|.|..+.|||++||||++|+|||||+|+ .|++|||++|.|++. ...+++++.||+|||||+|||.||.+
T Consensus 158 ~i~F~~~~h~d~~PFDG~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-------~~~g~~l~~Va~HEiGH~LGL~HS~~ 230 (469)
T KOG1565|consen 158 RISFFPGDHGDGFPFDGPGGVLAHAFFPGPGIGGDLHFDKDETWTYG-------DSNGVDLFLVAAHEIGHALGLGHSSD 230 (469)
T ss_pred eeeeeccCCCCCCcccCCCCceecccCCCCCCCCccccCcccceecc-------CCccchhHHHhhhhcccccccCCCCC
Confidence 99999999999999999999999999994 689999999999995 13488999999999999999999999
Q ss_pred CCCccccccCCCCCcCCCCHHHHHHHHHHhCCCCCCCC
Q 021444 238 EDAIMYPSISSGSRKVELANDDIQGIQALYGNNPNFNG 275 (312)
Q Consensus 238 ~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~~~~~~ 275 (312)
+++||||+|++...++.|+.|||.|||.|||.+++...
T Consensus 231 ~~aiM~P~y~~~~~~~~L~~DDv~giq~lYG~~~~~~~ 268 (469)
T KOG1565|consen 231 PDAIMYPFYQPDSGNFDLSQDDVRGIQHLYGGPPRDYF 268 (469)
T ss_pred cccccccccccCCCCcccChhhhhhhHHHhCCCccccc
Confidence 99999999997667889999999999999999888643
No 2
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=100.00 E-value=3.9e-36 Score=257.24 Aligned_cols=153 Identities=54% Similarity=0.876 Sum_probs=135.5
Q ss_pred CCCCCCCceEEeecCCC--CChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444 108 RWPSNKRDLTYAFLPDN--QLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA 185 (312)
Q Consensus 108 kW~~~~~~LtY~i~~~~--~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA 185 (312)
|| .+++|||+|.+.. .....++++|++||+.|+++++|+|+|+.+...|||+|.|....++++.+|+|.+|+++||
T Consensus 1 kW--~~~~itY~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l~~a 78 (157)
T cd04278 1 KW--SKTNLTYRILNYPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTLAHA 78 (157)
T ss_pred CC--CCCceeEEEECCCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCccccccc
Confidence 79 6899999998543 2348899999999999999999999999865589999999998899888999999999999
Q ss_pred eCC--CCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHHH
Q 021444 186 FSP--PNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQGI 263 (312)
Q Consensus 186 ~~P--~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~I 263 (312)
++| ..|+|+||.++.|.... ...+.+++.|++|||||||||+|+.++.+||||.+.+......|+.+||.+|
T Consensus 79 ~~p~~~~g~i~~~~~~~~~~~~------~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~~ 152 (157)
T cd04278 79 FFPGGIGGDIHFDDDEQWTLGS------DSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRGI 152 (157)
T ss_pred cCCCCcceeEEECCCcccccCC------CCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHHH
Confidence 999 67999999999998753 1346789999999999999999999999999999986434678999999999
Q ss_pred HHHhC
Q 021444 264 QALYG 268 (312)
Q Consensus 264 q~LYG 268 (312)
|+|||
T Consensus 153 ~~lYg 157 (157)
T cd04278 153 QALYG 157 (157)
T ss_pred HHhcC
Confidence 99998
No 3
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=100.00 E-value=2.2e-34 Score=244.22 Aligned_cols=149 Identities=46% Similarity=0.822 Sum_probs=131.6
Q ss_pred CCCCCCCceEEeecCCC-CCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444 108 RWPSNKRDLTYAFLPDN-QLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA 185 (312)
Q Consensus 108 kW~~~~~~LtY~i~~~~-~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA 185 (312)
|| ++++|||+|.+.+ ..+ +++++++++||+.|+++++++|+++.+ ..+||+|.|....+++...|++..++++++
T Consensus 1 ~W--~~~~ity~i~~~~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~~~~ 77 (154)
T PF00413_consen 1 KW--PKKTITYSISNTTPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTLAHA 77 (154)
T ss_dssp SS--SSSEEEEEESSBCTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSESEEE
T ss_pred CC--CCCcEEEEEECCCCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeeeecc
Confidence 79 6799999999754 333 789999999999999999999999985 579999999988888888899999999999
Q ss_pred eCCC---CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHH
Q 021444 186 FSPP---NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQG 262 (312)
Q Consensus 186 ~~P~---~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~ 262 (312)
+.|. .|+++|+.++.|... ....++..|++|||||||||+|++++.|||||.+.+.. ...|+.+||.+
T Consensus 78 ~~~~~~~~~~i~~~~~~~~~~~--------~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~-~~~l~~~Di~~ 148 (154)
T PF00413_consen 78 YFPNNIVSGDIHFNDDESWTID--------DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPD-NKTLSEDDIDG 148 (154)
T ss_dssp EESSSTTTTEEEEETTSHEESS--------SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSS-STSTTHHHHHH
T ss_pred ccccccccccccccccccchhh--------hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCC-CCCCCHHHHHH
Confidence 9997 689999999999874 23578999999999999999999999999999998644 24899999999
Q ss_pred HHHHhC
Q 021444 263 IQALYG 268 (312)
Q Consensus 263 Iq~LYG 268 (312)
||+|||
T Consensus 149 i~~lYg 154 (154)
T PF00413_consen 149 IQYLYG 154 (154)
T ss_dssp HHHHHS
T ss_pred HHHHhC
Confidence 999998
No 4
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.95 E-value=5.7e-28 Score=206.44 Aligned_cols=148 Identities=26% Similarity=0.311 Sum_probs=113.2
Q ss_pred CceEEeecCCCC----ChHHHHHHHHHHHHHHhhcccceEEEcccC-CcccEEEEeeecCCCCCCCcCCCCCcceeeeCC
Q 021444 114 RDLTYAFLPDNQ----LTDEVKNVFATAFDRWSEATSLTFTRTDVY-TTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSP 188 (312)
Q Consensus 114 ~~LtY~i~~~~~----~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P 188 (312)
..|+|++.+... ....++++|++||+.|+++++|+|++++.. .++||+|.|.... +|+|.++++|||++|
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~-----~~~~~g~~~a~a~~p 76 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPP-----PVGGAGGGLARAGFP 76 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCC-----CCCCCCCceEEeccc
Confidence 478999987543 247899999999999999999999998743 3899999997532 678999999999999
Q ss_pred CCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCC-CccccccCCCC-CcCCCCHHHHHHHHHH
Q 021444 189 PNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVED-AIMYPSISSGS-RKVELANDDIQGIQAL 266 (312)
Q Consensus 189 ~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~-sIM~p~y~~~~-~~~~L~~dDi~~Iq~L 266 (312)
..|. ++....|...-+..........+++.|++|||||+|||.|++++. +||||++.... .+..|+.+|+++||.|
T Consensus 77 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~l 154 (156)
T cd04279 77 LISD--GNRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRL 154 (156)
T ss_pred ccCC--CcccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHH
Confidence 7543 333322222110000000124689999999999999999999999 99999997632 3568999999999999
Q ss_pred hC
Q 021444 267 YG 268 (312)
Q Consensus 267 YG 268 (312)
||
T Consensus 155 Y~ 156 (156)
T cd04279 155 YG 156 (156)
T ss_pred hC
Confidence 97
No 5
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.90 E-value=5e-24 Score=182.74 Aligned_cols=143 Identities=25% Similarity=0.348 Sum_probs=105.8
Q ss_pred CCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCc
Q 021444 113 KRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGR 192 (312)
Q Consensus 113 ~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~ 192 (312)
++.|+|.|++. .+..++++|++||+.|+++++|+|+++.....+||+|.+.+..+.. +|..+..++...|..|+
T Consensus 1 ~~~I~y~i~~~--~~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~----~~~~~~~~~~~~~~~g~ 74 (165)
T cd04268 1 KKPITYYIDDS--VPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYN----DGTWSYGPSQVDPLTGE 74 (165)
T ss_pred CCCEEEEEcCC--CCHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCC----CCccccCCccCCCCCcc
Confidence 46899999864 3478999999999999999999999998655899999987643111 11112222333466789
Q ss_pred eEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC----------------CCCccccccCCC------C
Q 021444 193 FHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV----------------EDAIMYPSISSG------S 250 (312)
Q Consensus 193 ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~----------------~~sIM~p~y~~~------~ 250 (312)
|+++....|.... ...+..+..|++|||||||||.|+++ ..|||++..... .
T Consensus 75 i~~~~~~~~~~~~------~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~ 148 (165)
T cd04268 75 ILLARVYLYSSFV------EYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDG 148 (165)
T ss_pred EEeeEEEEchhHH------HHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccc
Confidence 9998765443210 01245689999999999999999998 899999865421 2
Q ss_pred CcCCCCHHHHHHHHHHh
Q 021444 251 RKVELANDDIQGIQALY 267 (312)
Q Consensus 251 ~~~~L~~dDi~~Iq~LY 267 (312)
....|+.+||.+||.||
T Consensus 149 ~~~~~~~~Di~ai~~lY 165 (165)
T cd04268 149 QKYTIGPYDIAAIKKLY 165 (165)
T ss_pred cCCCCCHHHHHHHHhcC
Confidence 35689999999999998
No 6
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.90 E-value=1.6e-23 Score=175.55 Aligned_cols=134 Identities=39% Similarity=0.597 Sum_probs=107.7
Q ss_pred CCCCCCCCceEEeecCCCCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444 107 PRWPSNKRDLTYAFLPDNQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA 185 (312)
Q Consensus 107 ~kW~~~~~~LtY~i~~~~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA 185 (312)
.+| .+.+|+|+|.. +.++ .+++++|++||+.|+++++|+|+|+.+ ++||.|.|...+++ +..++++
T Consensus 3 ~~W--~~~~v~Y~i~~-~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~--------g~~~a~~ 69 (140)
T smart00235 3 KKW--PKGTVPYVIDS-SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGS--------GCTLSHA 69 (140)
T ss_pred CcC--CCCcEEEEEcC-CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCC--------Ccceeee
Confidence 589 56899999972 2233 446999999999999999999999874 89999999986641 1126889
Q ss_pred eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCCCC---ccccccCCCCCcCCCCHHHHHH
Q 021444 186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVEDA---IMYPSISSGSRKVELANDDIQG 262 (312)
Q Consensus 186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~~s---IM~p~y~~~~~~~~L~~dDi~~ 262 (312)
+.| .|+++++. +.|... ..|++|||||||||.|++++.+ +|++.+.+....+.++.+|..+
T Consensus 70 g~~-~g~~~~~~-~~~~~~--------------~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~ 133 (140)
T smart00235 70 GRP-GGDQHFSL-GNGCIN--------------TGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLG 133 (140)
T ss_pred ecC-CCceEEEc-cCCcCC--------------cccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCC
Confidence 888 78899976 556542 3499999999999999998887 9999997644467788889999
Q ss_pred HHHHhCC
Q 021444 263 IQALYGN 269 (312)
Q Consensus 263 Iq~LYG~ 269 (312)
++++||+
T Consensus 134 ~~~~yg~ 140 (140)
T smart00235 134 IPYDYGS 140 (140)
T ss_pred chhccCc
Confidence 9999984
No 7
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.89 E-value=6.7e-23 Score=179.96 Aligned_cols=125 Identities=23% Similarity=0.301 Sum_probs=99.0
Q ss_pred hHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCC-------CCceEecCCC
Q 021444 127 TDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPP-------NGRFHLDGDE 199 (312)
Q Consensus 127 ~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~-------~G~ihfd~~e 199 (312)
+...+++|++||+.|+++++|+|+|+.+...+||+|.+..... +++.+.|++|. .|+|++|.+.
T Consensus 32 ~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~---------~~~~g~a~~p~~~~~~~~~g~i~~~~~~ 102 (186)
T cd04277 32 SAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPD---------GNTAGYAYYPGSGSGTAYGGDIWFNSSY 102 (186)
T ss_pred CHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCC---------CCccEEEECCCCCccccccceeEEecCc
Confidence 4789999999999999999999999987668999998764321 35678888885 3689999877
Q ss_pred CceecCCCccccCCccccchhhhhhhhhhhcCCCCCCCC----------------CCccccccCCC-------CCcCCCC
Q 021444 200 NWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVE----------------DAIMYPSISSG-------SRKVELA 256 (312)
Q Consensus 200 ~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~----------------~sIM~p~y~~~-------~~~~~L~ 256 (312)
.|... ..+.....|++|||||+|||.|+++. .|||.+..... .....++
T Consensus 103 ~~~~~--------~~g~~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~ 174 (186)
T cd04277 103 DTNSD--------SPGSYGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPM 174 (186)
T ss_pred ccccC--------CCChhhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCcc
Confidence 66532 23556899999999999999999853 46897654332 2346889
Q ss_pred HHHHHHHHHHhC
Q 021444 257 NDDIQGIQALYG 268 (312)
Q Consensus 257 ~dDi~~Iq~LYG 268 (312)
..||+|||+|||
T Consensus 175 ~~DI~AlQ~lYG 186 (186)
T cd04277 175 LLDIAALQYLYG 186 (186)
T ss_pred HHHHHHHHHhhC
Confidence 999999999998
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74 E-value=1.8e-17 Score=147.17 Aligned_cols=109 Identities=29% Similarity=0.456 Sum_probs=76.2
Q ss_pred CCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeC
Q 021444 108 RWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFS 187 (312)
Q Consensus 108 kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~ 187 (312)
+|| +.++|+|.|... .+...++.|++||+.|+++++|+|++++. .++||+|.|..+. | ++++.|..+.. ..
T Consensus 2 ~W~-~~~~~~~~f~~~--~~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-G-c~S~vG~~~~~---~~ 72 (198)
T cd04327 2 LWR-NGTVLRIAFLGG--PDAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-G-YWSYVGTDALL---IG 72 (198)
T ss_pred CCC-CCCeEEEEeCCC--CcHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-C-CCCCcCCcccc---cC
Confidence 697 477899998754 24788999999999999999999999985 5799999998643 3 45555543221 11
Q ss_pred CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444 188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
+....+.++ |... ...+.+...|++|||||||||.|++
T Consensus 73 ~~~~t~~l~----~~~~-------~~~~~~~~~~i~HElgHaLG~~HEh 110 (198)
T cd04327 73 ADAPTMNLG----WFTD-------DTPDPEFSRVVLHEFGHALGFIHEH 110 (198)
T ss_pred CCCceeeee----eecC-------CCchhhHHHHHHHHHHHHhcCcccc
Confidence 112234442 2111 0123467789999999999999984
No 9
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.72 E-value=6.7e-17 Score=138.49 Aligned_cols=121 Identities=24% Similarity=0.309 Sum_probs=91.8
Q ss_pred ChHHHHHHHHHHHHHHhhcccceEEEcccC-CcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCC-----CceEecCCC
Q 021444 126 LTDEVKNVFATAFDRWSEATSLTFTRTDVY-TTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPN-----GRFHLDGDE 199 (312)
Q Consensus 126 ~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~-~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~-----G~ihfd~~e 199 (312)
+...++++|+.|++.|+++++++|+++... ..+||.+.+...+. .++.+++||.|.. +.++++...
T Consensus 19 ~~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~--------~~~~~g~a~~~~~c~~~~~~~~~~~~~ 90 (167)
T cd00203 19 LSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDF--------DGGTGGWAYLGRVCDSLRGVGVLQDNQ 90 (167)
T ss_pred hHHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccC--------CCCceEEEecCCccCCCCCcEEEecCC
Confidence 447899999999999999999999999853 38999998865432 2235667776653 455554432
Q ss_pred CceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC--------------------CCCccccccCCC--CCcCCCCH
Q 021444 200 NWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV--------------------EDAIMYPSISSG--SRKVELAN 257 (312)
Q Consensus 200 ~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~--------------------~~sIM~p~y~~~--~~~~~L~~ 257 (312)
.| +.....|++|||||+|||.|..+ ..+||+|..... .....++.
T Consensus 91 ~~-------------~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~ 157 (167)
T cd00203 91 SG-------------TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQ 157 (167)
T ss_pred cc-------------cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCH
Confidence 22 23578899999999999999876 368999876532 34568999
Q ss_pred HHHHHHHHHh
Q 021444 258 DDIQGIQALY 267 (312)
Q Consensus 258 dDi~~Iq~LY 267 (312)
.||+.||.+|
T Consensus 158 ~d~~~i~~~Y 167 (167)
T cd00203 158 CDIDQINKLY 167 (167)
T ss_pred HHHHHHHhhC
Confidence 9999999998
No 10
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.56 E-value=2.1e-14 Score=127.21 Aligned_cols=146 Identities=18% Similarity=0.196 Sum_probs=90.1
Q ss_pred CCCceEEeecCCCCChHHHHHHHHHHHHHHhhc------ccceEEEcccC--CcccEE---EEeeecCCCCCCCcCCCCC
Q 021444 112 NKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEA------TSLTFTRTDVY--TTADIR---IGFFTGDHGDGEPFDGPMG 180 (312)
Q Consensus 112 ~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~------s~l~F~ev~~~--~~aDI~---I~f~~~~hgd~~~fdG~gg 180 (312)
.|+.|||+|+++. +.+++++|++||+.|+++ .++.++++... ...||+ |+|.... ++..+
T Consensus 6 pk~pItyyI~~~~--p~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~-------~~~~~ 76 (197)
T cd04276 6 PKEPIVYYLDNTF--PEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSP-------NGGWA 76 (197)
T ss_pred CCCCEEEEecCCC--cHHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecC-------CCcce
Confidence 4789999999753 388999999999999987 35555555321 246888 5554322 12223
Q ss_pred cceeeeCCCCC-----ceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC------------------
Q 021444 181 TLAHAFSPPNG-----RFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV------------------ 237 (312)
Q Consensus 181 ~lAhA~~P~~G-----~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~------------------ 237 (312)
...+...|..| +|+++........... ....+..+..+++||+||+|||.|...
T Consensus 77 ~gps~~dPrTGeIl~a~V~l~~~~~~~~~~~~---~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~ 153 (197)
T cd04276 77 YGPSVVDPRTGEILKADVILYSGFLRQDQLWY---EDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKE 153 (197)
T ss_pred ecccccCCCCCCeEEEEEEeCchhhccchhHH---HHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhh
Confidence 33444556555 4556543221110000 001123577899999999999999752
Q ss_pred ---CCCccccc---cCC------CCCcCCCCHHHHHHHHHHhCC
Q 021444 238 ---EDAIMYPS---ISS------GSRKVELANDDIQGIQALYGN 269 (312)
Q Consensus 238 ---~~sIM~p~---y~~------~~~~~~L~~dDi~~Iq~LYG~ 269 (312)
..|||.+. +.. ......+...||.+||++||.
T Consensus 154 ~~~~~SVMdY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~ 197 (197)
T cd04276 154 KGATSSVMDYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP 197 (197)
T ss_pred cCCcceeecCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence 35777432 210 012457899999999999984
No 11
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.52 E-value=1.1e-13 Score=121.20 Aligned_cols=92 Identities=18% Similarity=0.249 Sum_probs=64.4
Q ss_pred CceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444 114 RDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF 193 (312)
Q Consensus 114 ~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i 193 (312)
..|.|.|++. ++...++.|++||+.|+++++|+|++.+. ++..|.|..+ .| ++.+.|..+ +...+
T Consensus 2 ~~VpY~i~~~--~~~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~G-c~S~vG~~~--------~~q~i 66 (180)
T cd04280 2 GTVPYVIDGS--FDESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SG-CWSYVGRVG--------GRQVV 66 (180)
T ss_pred CEEEEEECCC--CCHHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CC-cceecCccC--------CceeE
Confidence 4789999863 34788999999999999999999999874 4556666654 33 333333211 11223
Q ss_pred EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444 194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
.+.. ++....+++|||||||||.|++.
T Consensus 67 ~l~~-----------------~c~~~g~v~HE~~HalG~~HEh~ 93 (180)
T cd04280 67 SLGS-----------------GCFSLGTIVHELMHALGFYHEQS 93 (180)
T ss_pred EeCC-----------------CcCcCchhHHHHHHHhcCcchhc
Confidence 3321 34447899999999999999873
No 12
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.51 E-value=1.5e-13 Score=121.97 Aligned_cols=135 Identities=28% Similarity=0.422 Sum_probs=91.7
Q ss_pred CCCceEEeecCCC--CChHHHHHHHHHHHHHHhhcc-cceEEEc--ccCCcccEEEEeeecCCCCCCCcCCCCCcceeee
Q 021444 112 NKRDLTYAFLPDN--QLTDEVKNVFATAFDRWSEAT-SLTFTRT--DVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAF 186 (312)
Q Consensus 112 ~~~~LtY~i~~~~--~~~~~~~~~i~~A~~~Ws~~s-~l~F~ev--~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~ 186 (312)
..++|+..+.+.. .+...++.++++|++.|+++. .|+|+-+ .....+||.|.-... +.+.|..+-|.
T Consensus 37 ~~~~I~i~~~~~~~~~l~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~--------~~~~G~ggsAG 108 (211)
T PF12388_consen 37 SPRTITIIGYTGGSQSLNSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSS--------NNPSGAGGSAG 108 (211)
T ss_pred CCCEEEEEeCCCccccccHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccC--------CCCCCcceecc
Confidence 3568888887632 344789999999999999973 5677522 223466776632211 12334567788
Q ss_pred CC-CCC----ceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC-------------------------
Q 021444 187 SP-PNG----RFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS------------------------- 236 (312)
Q Consensus 187 ~P-~~G----~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~------------------------- 236 (312)
|| ..| .|.+ +.+. .........|++|||||+|||.|++
T Consensus 109 FP~s~G~P~~~I~I-----~~~~-------~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpG 176 (211)
T PF12388_consen 109 FPTSNGNPYKFIQI-----YGLS-------NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPG 176 (211)
T ss_pred CCCCCCCCCceEEE-----EecC-------CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCC
Confidence 99 544 3555 1111 1122346789999999999999986
Q ss_pred -----CCCCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444 237 -----VEDAIMYPSISSGSRKVELANDDIQGIQALY 267 (312)
Q Consensus 237 -----~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY 267 (312)
|++|||...+.. .....++..||.|+++||
T Consensus 177 TPt~~d~~SiM~ac~~~-~~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 177 TPTGADPNSIMNACFSS-GEDGEFTSNDITALNYLY 211 (211)
T ss_pred CCCCCCCchhhhccccC-CCCCCcChhhHHHHHhhC
Confidence 356889887654 345689999999999999
No 13
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.47 E-value=5.7e-13 Score=118.38 Aligned_cols=99 Identities=23% Similarity=0.400 Sum_probs=66.5
Q ss_pred CCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceee
Q 021444 106 MPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHA 185 (312)
Q Consensus 106 ~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA 185 (312)
..+|| +..|.|.|... .+...+.+|++||+.|++.++|+|++... . .-.|.|..+..| ++++.|..+.-.+
T Consensus 7 ~~~Wp--~~~VpY~i~~~--~~~~~~~~I~~Am~~~~~~TCirF~~r~~-~--~~yi~f~~~~~G-C~S~vG~~~~g~q- 77 (200)
T cd04281 7 ERIWP--GGVIPYVIDGN--FTGSQRAMFKQAMRHWENFTCVTFVERTP-E--ENYIVFTYRPCG-CCSYVGRRGNGPQ- 77 (200)
T ss_pred cCcCC--CCEEEEEECCC--CCHHHHHHHHHHHHHHHhCCceEEEECCC-C--CCEEEEEECCCC-eeEcCCCcCCCce-
Confidence 46894 67999999752 34667899999999999999999998764 2 234455543334 3444554221011
Q ss_pred eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444 186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
.+.+ ..++....+++|||||||||-|++
T Consensus 78 ------~isl-----------------~~~C~~~Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 78 ------AISI-----------------GKNCDKFGIVVHELGHVIGFWHEH 105 (200)
T ss_pred ------eeec-----------------CCCcCcCchHHHHHHHHhcCcchh
Confidence 1211 123445689999999999999986
No 14
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.44 E-value=2.6e-13 Score=119.93 Aligned_cols=96 Identities=25% Similarity=0.492 Sum_probs=60.0
Q ss_pred CCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeC
Q 021444 108 RWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFS 187 (312)
Q Consensus 108 kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~ 187 (312)
||| ..+|.|.|.+. .+...++.|++||+.|++.++|+|+++. . .....|.|..+ .| ++++.|..+
T Consensus 2 ~Wp--~~~IpY~~~~~--~~~~~~~~I~~Am~~~e~~TcI~F~~~~-~-~~~~~i~~~~~-~g-C~S~vG~~~------- 66 (191)
T PF01400_consen 2 KWP--NGTIPYYIDPS--FSSSQRQRIRKAMDEWEKNTCIRFVERT-E-NEDDYISFSNG-SG-CWSYVGRQG------- 66 (191)
T ss_dssp S-G--GGEEEEEEETT--S-HHHHHHHHHHHHHHHHHSSEEEEE-S-S-SSSSEEEEESS-SS-EEEESS--S-------
T ss_pred cCC--CCEEEEEECCC--CCHHHHHHHHHHHHHHHhCCCEEEEECC-C-CCceEEEeecC-cc-ccchhhhcC-------
Confidence 794 56999999863 3477889999999999999999999998 3 34444556432 23 233333210
Q ss_pred CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444 188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
+.-.|.+. .++....|++|||||||||.|++
T Consensus 67 -g~q~i~l~-----------------~~c~~~~~i~HEl~HaLG~~HEh 97 (191)
T PF01400_consen 67 -GEQTINLG-----------------DGCFSVGTILHELGHALGFWHEH 97 (191)
T ss_dssp -SEEEEEE------------------TTC-SHHHHHHHHHHHHTB--GG
T ss_pred -cceeEEec-----------------ceeCCccchHHHHHHHHhhhhhh
Confidence 00122222 13345679999999999999986
No 15
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.33 E-value=1.5e-11 Score=111.25 Aligned_cols=98 Identities=23% Similarity=0.380 Sum_probs=66.0
Q ss_pred CCCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCccee
Q 021444 105 GMPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAH 184 (312)
Q Consensus 105 ~~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAh 184 (312)
...+|| + .|.|.|.+. .+...+..|++||+.|++.++|+|++.+. .. | .|.|..+. | |+++.|..|
T Consensus 42 ~~~~Wp--~-~VPY~i~~~--~~~~~~~~I~~Am~~~~~~TCirF~~rt~-e~-~-yi~i~~~~-G-C~S~vG~~g---- 107 (230)
T cd04282 42 DTYRWP--F-PIPYILDDS--LDLNAKGVILKAFEMYRLKSCVDFKPYEG-ES-N-YIFFFKGS-G-CWSMVGDQQ---- 107 (230)
T ss_pred cccCCC--c-ceeEEECCC--CCHHHHHHHHHHHHHHHhCCCeeEEECCC-CC-c-EEEEEcCC-C-eeeccCccC----
Confidence 356995 3 899999753 34678899999999999999999998763 22 2 34555432 3 344444311
Q ss_pred eeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444 185 AFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 185 A~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
+...+.+. .++....+++|||||||||-|++.
T Consensus 108 ----g~q~isl~-----------------~~C~~~Gti~HEl~HalGf~HEqs 139 (230)
T cd04282 108 ----GGQNLSIG-----------------AGCDYKATVEHEFLHALGFYHEQS 139 (230)
T ss_pred ----CeEEEEEC-----------------CCcCCCchHHHHHHHHhCCccccc
Confidence 01122221 245557899999999999999873
No 16
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.32 E-value=1.2e-11 Score=108.37 Aligned_cols=92 Identities=17% Similarity=0.288 Sum_probs=60.0
Q ss_pred CceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444 114 RDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF 193 (312)
Q Consensus 114 ~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i 193 (312)
-.|.|.|+.. .+...++.|++||+.|++.++|+|++.+. ...- |.|..+ .| ++++.|..|. ...|
T Consensus 4 ~~IpY~i~~~--~~~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~y--i~~~~~-~g-C~S~vG~~gg--------~q~i 68 (182)
T cd04283 4 VYVPYVISPQ--YSENERAVIEKAMQEFETLTCVRFVPRTT-ERDY--LNIESR-SG-CWSYIGRQGG--------RQTV 68 (182)
T ss_pred EEEEEEECCC--CCHHHHHHHHHHHHHHHhCCceeeEECCC-CCcE--EEEEcC-CC-ceEecCccCC--------ceeE
Confidence 3688999742 34677899999999999999999998763 3333 444432 23 3333343210 0112
Q ss_pred EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444 194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
.+. ..++....+++|||||||||.|++
T Consensus 69 ~l~----------------~~~C~~~G~i~HEl~HaLG~~HEh 95 (182)
T cd04283 69 SLQ----------------KQGCMYKGIIQHELLHALGFYHEQ 95 (182)
T ss_pred ecC----------------CCCcCccchHHHHHHHHhCCcccc
Confidence 221 123445689999999999999987
No 17
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.26 E-value=3.1e-11 Score=118.37 Aligned_cols=106 Identities=22% Similarity=0.273 Sum_probs=73.4
Q ss_pred ccccCCCCCCCCCCCCceEEeecCCCCChHHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCC
Q 021444 99 HYSFFPGMPRWPSNKRDLTYAFLPDNQLTDEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGP 178 (312)
Q Consensus 99 ~~~~~~~~~kW~~~~~~LtY~i~~~~~~~~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~ 178 (312)
|........+|| ...|.|.|... .+...+.+|+.||+.|++.++|+|++..+ .+.+..+.+... | ++.+.|.
T Consensus 72 r~~~~~~~~~Wp--~~~ipY~i~~~--~~~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--g-C~S~VGr 143 (411)
T KOG3714|consen 72 RNGTSNPERRWP--NGVIPYYIDGS--FTSSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--G-CYSYVGR 143 (411)
T ss_pred hhcccChhhcCC--CCeeeeEECCC--CCHHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--c-ceeeeCc
Confidence 333444567995 48999999864 44889999999999999999999999874 345555544332 3 3444454
Q ss_pred CCcceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC
Q 021444 179 MGTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 179 gg~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
.|.--. .+ + ...++.-..|++|||+||||+-|.+
T Consensus 144 ~gg~~q-------~~--------s---------l~~~C~~~G~i~HEl~HaLGf~Heh 177 (411)
T KOG3714|consen 144 RGGGQQ-------LL--------S---------LGDGCDRFGTIVHELMHALGFWHEH 177 (411)
T ss_pred cCCCcc-------ce--------e---------cCCCcCcCchhHHHHHHHhhhhhcc
Confidence 221100 11 1 1235666899999999999999987
No 18
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.25 E-value=1.4e-11 Score=87.47 Aligned_cols=55 Identities=31% Similarity=0.513 Sum_probs=47.3
Q ss_pred CChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHh
Q 021444 2 DGLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHI 59 (312)
Q Consensus 2 ~~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M 59 (312)
+.|..++.+|.++||++. ..+..+++.+++||+.||+++||++||++|.+|+++|
T Consensus 3 ~~v~~lq~~L~~~gy~~~---~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 3 PDVKALQQYLNRLGYYPG---PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp HHHHHHHHHHHHTTTT-S---STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred HHHHHHHHHHHHcCCCCC---CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence 357899999999999943 2346789999999999999999999999999999987
No 19
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.90 E-value=6.8e-09 Score=91.12 Aligned_cols=123 Identities=24% Similarity=0.296 Sum_probs=75.9
Q ss_pred HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCC---------cceeeeCCCCCceEecCC
Q 021444 128 DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMG---------TLAHAFSPPNGRFHLDGD 198 (312)
Q Consensus 128 ~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg---------~lAhA~~P~~G~ihfd~~ 198 (312)
..+..++..|++.|++.-|+..+|-. ..|||+|...+. .|.+ .++.+. .+.|+-.|+ |++..
T Consensus 103 p~wq~a~~tava~wa~~fpl~ive~~--eeaDItie~~n~-pgtg--~~~~grArtal~~~efl~~~~~~~----~lnhr 173 (236)
T COG5549 103 PRWQGAYLTAVAGWAKTFPLIIVERF--EEADITIEVGNP-PGTG--WRQYGRARTALIAYEFLGHALGLG----HLNHR 173 (236)
T ss_pred hhHHHHHHHHHHHHHHhCCceeeecc--eeeeEEEEecCC-CCCc--ccccchHHHHHHHHHHhhhccCcc----ccccc
Confidence 45788999999999999666665554 689999976542 2222 222211 122232222 22110
Q ss_pred CCceecCCCccccCCccccchhhhhhhhhhhcCCC-CCCCCCCccccccCCCCCcCCCCHHHHHHHHHHhCCC
Q 021444 199 ENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLG-HSSVEDAIMYPSISSGSRKVELANDDIQGIQALYGNN 270 (312)
Q Consensus 199 e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~-Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~ 270 (312)
..+.. .+.....++..+|.||+|||||+. |+.- .++|.+..+ ...++..|++.+..+|-.+
T Consensus 174 --~~i~~----~pg~~~e~L~~tarhElGhaLgi~ghsd~-ks~aly~~s----~~~is~rdv~tL~r~Yeqp 235 (236)
T COG5549 174 --GDIMY----PPGELRENLNPTARHELGHALGIWGHSDL-KSDALYGIS----WQGISRRDVKTLERKYEQP 235 (236)
T ss_pred --ccccC----CcccchhhhhHHHHHhhcchheecccccc-hhhhhcchh----hcccCHHHHHHHHHHhcCC
Confidence 00000 012335579999999999999997 8774 455544333 2348999999999999754
No 20
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.90 E-value=5.7e-09 Score=85.10 Aligned_cols=124 Identities=22% Similarity=0.296 Sum_probs=71.8
Q ss_pred ceEEeecCCCCChHHHHHHHHHHHHHHh-hcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCce
Q 021444 115 DLTYAFLPDNQLTDEVKNVFATAFDRWS-EATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRF 193 (312)
Q Consensus 115 ~LtY~i~~~~~~~~~~~~~i~~A~~~Ws-~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~i 193 (312)
+|+|.-... .+++..|.+|.+.|+ .|.+++|++... ++|+|.-.. + +++..+..-..+.|.|
T Consensus 3 tv~Ydas~A----~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~----~------~~~sya~~~g~G~G~I 65 (132)
T PF02031_consen 3 TVYYDASRA----PEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN----D------PRGSYASTDGLGSGYI 65 (132)
T ss_dssp EEEEEEEE-----GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE------------TT--EEEE-SSS-EEE
T ss_pred EEEEeCCCC----chhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec----C------CCCcccccCCCCcEEE
Confidence 466654421 568899999999997 578999999863 888876432 1 1233333333456788
Q ss_pred EecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCC-C-CCCCccccccCCCC-CcCCCCHHHHHHHHHHhC
Q 021444 194 HLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHS-S-VEDAIMYPSISSGS-RKVELANDDIQGIQALYG 268 (312)
Q Consensus 194 hfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs-~-~~~sIM~p~y~~~~-~~~~L~~dDi~~Iq~LYG 268 (312)
.+|.. + ..+++-..|++|||||.|||..- . .-..+|.-.-.+-. .+..+...++..+++||.
T Consensus 66 ~l~~~--~-----------~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~a 130 (132)
T PF02031_consen 66 FLDYQ--Q-----------NQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLFA 130 (132)
T ss_dssp EEEHH--H-----------HHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHCT
T ss_pred EechH--H-----------hhCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHhc
Confidence 88742 2 13567788999999999999543 2 34578874332211 234678899999999986
No 21
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.75 E-value=6.5e-05 Score=67.03 Aligned_cols=56 Identities=16% Similarity=0.230 Sum_probs=48.8
Q ss_pred ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcC
Q 021444 3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILR 61 (312)
Q Consensus 3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~ 61 (312)
.+..++..|.+.||.... .+..++..+.+||+.||+.+||+++|++|..|...|..
T Consensus 15 ~V~~LQ~~L~~lG~~~g~---idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~ 70 (201)
T TIGR02869 15 DVIEIQRRLKAWGYYNGK---VDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGI 70 (201)
T ss_pred HHHHHHHHHHHcCCCCCC---CCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCC
Confidence 477899999999997532 24568899999999999999999999999999999944
No 22
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.74 E-value=0.00019 Score=63.05 Aligned_cols=49 Identities=27% Similarity=0.347 Sum_probs=34.0
Q ss_pred ccchhhhhhhhhhhcCCCCCCCC----------CCccccccCCCCCcCCCCHHHHHHHHH
Q 021444 216 IDLESVAVHEIGHLLGLGHSSVE----------DAIMYPSISSGSRKVELANDDIQGIQA 265 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~~~----------~sIM~p~y~~~~~~~~L~~dDi~~Iq~ 265 (312)
.....+++|||||.|||.|.... .-||.|.... .....++.--+.-|+.
T Consensus 131 ~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~ 189 (192)
T cd04267 131 LLTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE 189 (192)
T ss_pred eeehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence 34567999999999999998753 2489886643 2334566666665554
No 23
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=97.71 E-value=0.00019 Score=64.04 Aligned_cols=119 Identities=18% Similarity=0.234 Sum_probs=68.4
Q ss_pred HHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCceEecCCCCceecCCCccccCCc
Q 021444 135 ATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMS 214 (312)
Q Consensus 135 ~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~ 214 (312)
-+.|..|.......-.. +....|+.+-+...+.... ++.+++++.||.-+ + ........+.. ..
T Consensus 73 L~~F~~w~~~~~~~~~~--~~~~~D~a~llt~~d~~~~---~~~~~~~G~A~~gg---i-C~~~~s~~i~~-------~~ 136 (207)
T cd04273 73 LKSFCRWQKKLNPPNDS--DPEHHDHAILLTRQDICRS---NGNCDTLGLAPVGG---M-CSPSRSCSINE-------DT 136 (207)
T ss_pred HHHHHHHHHHcCCcccc--cccccceEEEEeeeccccc---CCCCCceEEecccc---C-CCCCcceEEEc-------CC
Confidence 57888997643221111 1236777777765443211 35566777776422 1 00001111111 12
Q ss_pred cccchhhhhhhhhhhcCCCCCCC---------CCCccccccCCCCCcCCCCHHHHHHHHHHhCC
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSV---------EDAIMYPSISSGSRKVELANDDIQGIQALYGN 269 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~---------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~ 269 (312)
+.....|++|||||.||+.|-.+ ..-||+|..........++.-=+..++.+...
T Consensus 137 ~~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~~ 200 (207)
T cd04273 137 GLSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLDT 200 (207)
T ss_pred CceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHcc
Confidence 34556799999999999999875 26899998764334456677667667665543
No 24
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.70 E-value=5e-05 Score=67.95 Aligned_cols=48 Identities=27% Similarity=0.265 Sum_probs=31.6
Q ss_pred chhhhhhhhhhhcCCCCCCCC--------------CCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444 218 LESVAVHEIGHLLGLGHSSVE--------------DAIMYPSISSGSRKVELANDDIQGIQALY 267 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~--------------~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY 267 (312)
...+++|||||+|||.|..+. .+||.+.-.. ....++.-+|.-|+...
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~~--~~~~FS~~~i~~i~~~~ 198 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASRP--SRPYFSPPSIEYIREVV 198 (206)
T ss_pred cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCCC--CCCCCCchhHHHHhCCC
Confidence 457899999999999998742 3688632111 11236777777775443
No 25
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.68 E-value=0.00013 Score=64.37 Aligned_cols=139 Identities=21% Similarity=0.330 Sum_probs=83.3
Q ss_pred CCceEEeecCCCC---Ch----HHHHHHHHHHHH---HHhhcccceEEEcccCCcccEEEEeeecCCCCCC--CcCCCCC
Q 021444 113 KRDLTYAFLPDNQ---LT----DEVKNVFATAFD---RWSEATSLTFTRTDVYTTADIRIGFFTGDHGDGE--PFDGPMG 180 (312)
Q Consensus 113 ~~~LtY~i~~~~~---~~----~~~~~~i~~A~~---~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~--~fdG~gg 180 (312)
.+.++|++.-... .. ++..++|...+. -|.....+.|+.|.. ..+|++|...+...-+.. +.+-.+.
T Consensus 28 ~~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e 106 (203)
T PF11350_consen 28 GRLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGE 106 (203)
T ss_pred CceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCce
Confidence 3789999873221 11 445555555554 688888899999986 459999999875432211 1221111
Q ss_pred cceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC-----CCCccccccCCCCCcCCC
Q 021444 181 TLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV-----EDAIMYPSISSGSRKVEL 255 (312)
Q Consensus 181 ~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~-----~~sIM~p~y~~~~~~~~L 255 (312)
..-+.+..++|.+|.+ .|..+..... ..-..+.+.|+=||+||+||-+|..= +..||-. +.+.|
T Consensus 107 --~SC~~~~g~~VvIN~~-RWv~Ga~~f~--gdl~~YRqYvINHEVGH~LGh~H~~Cpg~G~~APVMmQ------QT~gL 175 (203)
T PF11350_consen 107 --TSCRNPAGGRVVINLA-RWVRGAPAFG--GDLASYRQYVINHEVGHALGHGHEPCPGPGRPAPVMMQ------QTLGL 175 (203)
T ss_pred --eEeecCCCCeEEEehH-HhhccCcccC--CcHHHHHHHhhhhhhhhhcccCCCcCCCCCCcCCcchh------hhccc
Confidence 1223444567777763 6776533210 12234678899999999999988764 3346642 23345
Q ss_pred CHHHHHHH
Q 021444 256 ANDDIQGI 263 (312)
Q Consensus 256 ~~dDi~~I 263 (312)
.-++|..|
T Consensus 176 ~g~~~~~~ 183 (203)
T PF11350_consen 176 DGCELAKL 183 (203)
T ss_pred CCcccccc
Confidence 55555444
No 26
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=97.44 E-value=6.3e-05 Score=64.33 Aligned_cols=25 Identities=32% Similarity=0.498 Sum_probs=16.1
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCC
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSVED 239 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~~~ 239 (312)
..+...|++||+||-|||.|.....
T Consensus 66 ~~~~g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp TS-SSHHHHHHHHHHTT---TT---
T ss_pred ccccccchhhhhhhhhcccccccCC
Confidence 4567889999999999999998654
No 27
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.42 E-value=4.8e-05 Score=66.19 Aligned_cols=47 Identities=26% Similarity=0.219 Sum_probs=19.7
Q ss_pred chhhhhhhhhhhcCCCCCCCCC-------------CccccccCCCCCcCCCCHHHHHHHHHH
Q 021444 218 LESVAVHEIGHLLGLGHSSVED-------------AIMYPSISSGSRKVELANDDIQGIQAL 266 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~-------------sIM~p~y~~~~~~~~L~~dDi~~Iq~L 266 (312)
...|++|||||.||+.|..+.. .||++.. ......+++--|..|+.+
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~~--~~~~~~fS~cS~~~i~~~ 170 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYAG--ICNNDKFSPCSIRQIRAV 170 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG--------------
T ss_pred eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeCC--CCCCcccccccccccccc
Confidence 4568999999999999987642 6898552 233445666666666654
No 28
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.33 E-value=9.3e-05 Score=63.83 Aligned_cols=28 Identities=39% Similarity=0.624 Sum_probs=25.2
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccccc
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMYPS 245 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~ 245 (312)
...=++||+||.|||.|=.++.-||++.
T Consensus 124 v~KEv~HElGH~~GL~HC~N~~CVM~FS 151 (181)
T COG1913 124 VVKEVLHELGHLLGLSHCPNPRCVMNFS 151 (181)
T ss_pred HHHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence 4567899999999999999999999865
No 29
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.30 E-value=0.00011 Score=66.64 Aligned_cols=75 Identities=27% Similarity=0.377 Sum_probs=42.3
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCCCccccccCCCCC-cCCCCHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCCCCcch
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSR-KVELANDDIQGIQALYGNNPNFNGSSVPTDQQRDTSGAHGAY 293 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~-~~~L~~dDi~~Iq~LYG~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (312)
.+++-.|++|||||.|||.|.+...+ +.....+. .-++... ...||.+.....+ |..+..|...|.=+|
T Consensus 134 ~~n~g~t~~HEvGH~lGL~HtF~~~~---~c~~~gD~v~DTP~~~-----~~~~~cp~~~dsC--~~~~~~d~~~N~MdY 203 (225)
T cd04275 134 PYNLGDTATHEVGHWLGLYHTFQGGS---PCCTTGDYVADTPAEA-----SPSYGCPAGRDTC--PGQPGLDPIHNYMDY 203 (225)
T ss_pred cccccceeEEeccceeeeeeeecCCC---CCcCCCCccCCCcccc-----ccccCCCCCCCCC--CCCCCCCchhhcccc
Confidence 45788899999999999999998764 11111100 0011100 1157754433333 333456666776666
Q ss_pred hcccch
Q 021444 294 FLGSCL 299 (312)
Q Consensus 294 ~~~~~~ 299 (312)
---+|.
T Consensus 204 s~d~C~ 209 (225)
T cd04275 204 SDDSCM 209 (225)
T ss_pred cCcchh
Confidence 666664
No 30
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.25 E-value=9.5e-05 Score=65.00 Aligned_cols=23 Identities=35% Similarity=0.369 Sum_probs=15.2
Q ss_pred ccchhhhhhhhhhhcCCCCCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGHSSVE 238 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~~~ 238 (312)
.....|++|||||.||+.|..+.
T Consensus 140 ~~~~~~~AHEiGH~lGa~HD~~~ 162 (196)
T PF13688_consen 140 YNGAITFAHEIGHNLGAPHDGDY 162 (196)
T ss_dssp HHHHHHHHHHHHHHTT-----SS
T ss_pred CceehhhHHhHHHhcCCCCCCCC
Confidence 45677999999999999998775
No 31
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.13 E-value=0.00023 Score=62.41 Aligned_cols=27 Identities=33% Similarity=0.548 Sum_probs=24.3
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCcccc
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMYP 244 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p 244 (312)
+..+++|||||.|||.|-.++.-||+.
T Consensus 125 ~~k~~~HElGH~lGL~HC~~~~CvM~~ 151 (179)
T PRK13267 125 VRKEVTHELGHTLGLEHCDNPRCVMNF 151 (179)
T ss_pred HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence 455799999999999999999999985
No 32
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=97.13 E-value=0.00015 Score=58.94 Aligned_cols=18 Identities=50% Similarity=0.680 Sum_probs=12.7
Q ss_pred chhhhhhhhhhhcCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHS 235 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs 235 (312)
...+++|||||.|||.|.
T Consensus 107 ~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 107 GVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp STTHHHHHHHHHTT----
T ss_pred cceEeeehhhHhcCCCCC
Confidence 458999999999999995
No 33
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=96.98 E-value=0.0018 Score=66.31 Aligned_cols=37 Identities=24% Similarity=0.311 Sum_probs=33.9
Q ss_pred CcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcCC
Q 021444 26 DDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILRP 62 (312)
Q Consensus 26 ~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~p 62 (312)
..+|.++.+|||+||+..||..+|+++..|++.|+.|
T Consensus 307 ~~YD~~Lv~AVK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 307 AAYDNELVEAVKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred cccCHHHHHHHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 4578999999999999999999999999999998754
No 34
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.92 E-value=0.00036 Score=61.81 Aligned_cols=28 Identities=43% Similarity=0.630 Sum_probs=20.0
Q ss_pred cchhhhhhhhhhhcCCCCCCCCCCccccc
Q 021444 217 DLESVAVHEIGHLLGLGHSSVEDAIMYPS 245 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~~~sIM~p~ 245 (312)
.+..+++||+||.+||.|=.+ .-||++.
T Consensus 144 R~~Kea~HElGH~~GL~HC~~-~CvM~~s 171 (194)
T PF07998_consen 144 RVCKEAVHELGHLFGLDHCEN-RCVMNFS 171 (194)
T ss_dssp HHHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred HHHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence 456799999999999999999 9999974
No 35
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.87 E-value=0.0011 Score=60.27 Aligned_cols=49 Identities=41% Similarity=0.514 Sum_probs=37.3
Q ss_pred hhhhhhhhhhcCCCCCCCC-------------------------CCccccccCCCCCcCCCCHHHHHHHHHHhCCC
Q 021444 220 SVAVHEIGHLLGLGHSSVE-------------------------DAIMYPSISSGSRKVELANDDIQGIQALYGNN 270 (312)
Q Consensus 220 ~Va~HEiGHaLGL~Hs~~~-------------------------~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~ 270 (312)
.|++|||||.||+.|-.+. .=||+|.... ....++.-=+.-|+.+...+
T Consensus 147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~ 220 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN 220 (228)
T ss_pred eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence 6999999999999996431 1199997554 45678888888888887643
No 36
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.81 E-value=0.0017 Score=57.15 Aligned_cols=50 Identities=30% Similarity=0.384 Sum_probs=33.5
Q ss_pred cchhhhhhhhhhhcCCCCCCC------CCCccccccCCCCCcCCCCHHHHHHHHHHhC
Q 021444 217 DLESVAVHEIGHLLGLGHSSV------EDAIMYPSISSGSRKVELANDDIQGIQALYG 268 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG 268 (312)
....|++|||||.||+.|-.+ ..-||+|.... ....++.-=+.-++.+..
T Consensus 130 ~~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~ 185 (194)
T cd04269 130 LFAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLS 185 (194)
T ss_pred HHHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHh
Confidence 346799999999999999653 35799987553 233455544555555444
No 37
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=95.99 E-value=0.0083 Score=55.06 Aligned_cols=50 Identities=24% Similarity=0.388 Sum_probs=34.4
Q ss_pred hhhhhhhhhhhcCCCCCCC-----C------CCccccccCC--CCCcCCCCHHHHHHHHHHhC
Q 021444 219 ESVAVHEIGHLLGLGHSSV-----E------DAIMYPSISS--GSRKVELANDDIQGIQALYG 268 (312)
Q Consensus 219 ~~Va~HEiGHaLGL~Hs~~-----~------~sIM~p~y~~--~~~~~~L~~dDi~~Iq~LYG 268 (312)
..|++|||||.||+.|-.. + .=||+|.-.. ......++.-=+..++.+.-
T Consensus 168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~ 230 (244)
T cd04270 168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE 230 (244)
T ss_pred HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence 4699999999999999665 1 3499987532 12345677766666665544
No 38
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=95.88 E-value=0.011 Score=52.23 Aligned_cols=51 Identities=25% Similarity=0.487 Sum_probs=31.1
Q ss_pred ccchhhhhhhhhhhcCCCCCCC--------CCCccccccCCCCCcCCCCHHHHHHHHHHh
Q 021444 216 IDLESVAVHEIGHLLGLGHSSV--------EDAIMYPSISSGSRKVELANDDIQGIQALY 267 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~~--------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LY 267 (312)
..+-.+++|||||.||+.|-.. ..-||.+...+. ....++.-=+..++.+.
T Consensus 129 ~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~-~~~~fS~CS~~~~~~~l 187 (199)
T PF01421_consen 129 LSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPS-SSDKFSNCSRRQFEEFL 187 (199)
T ss_dssp HHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSS-STSSS-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCC-CCCCcCHHHHHHHHHHH
Confidence 4566799999999999999776 256899865432 22356665555555543
No 39
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=95.51 E-value=0.018 Score=51.83 Aligned_cols=51 Identities=22% Similarity=0.320 Sum_probs=34.0
Q ss_pred cchhhhhhhhhhhcCCCCCCC------------------CCCccccccCCCCCcCCCCHHHHHHHHHHhC
Q 021444 217 DLESVAVHEIGHLLGLGHSSV------------------EDAIMYPSISSGSRKVELANDDIQGIQALYG 268 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~------------------~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG 268 (312)
....+++|||||.||+.|-.+ ..-||.+.... .....++.--+.-++.+..
T Consensus 144 ~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~~~~~IM~~~~~~-~~~~~fS~CS~~~~~~~l~ 212 (220)
T cd04272 144 YGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPWDDGYIMSYVVNG-ERQYRFSQCSQRQIRNVFR 212 (220)
T ss_pred ccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCCCCCeEEeeccCC-CCCCccCcchHHHHHHHHc
Confidence 456899999999999999532 23688875432 2344566666666665543
No 40
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=95.46 E-value=0.022 Score=48.98 Aligned_cols=59 Identities=19% Similarity=0.300 Sum_probs=49.0
Q ss_pred ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCCCCCHHHHHHhcCCC
Q 021444 3 GLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNL-EPTGQLDSQTMQHILRPR 63 (312)
Q Consensus 3 ~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gL-pvTG~lD~~T~~~M~~pR 63 (312)
-+..++.-|.+.||.+. ...+..+......||+.||+.++| +++|.++++|+..+...+
T Consensus 47 ~v~~lq~~L~~~g~~~~--~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~~~ 106 (185)
T COG3409 47 SVRILQAALNALGYYPD--GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPSQL 106 (185)
T ss_pred hHHHHHHHHHhcCCCCC--CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHHHh
Confidence 36789999999999873 122345678889999999999999 899999999999987754
No 41
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=95.31 E-value=0.046 Score=50.07 Aligned_cols=54 Identities=22% Similarity=0.429 Sum_probs=45.5
Q ss_pred CChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCCCCCHHHHHHhc
Q 021444 2 DGLAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNL-EPTGQLDSQTMQHIL 60 (312)
Q Consensus 2 ~~~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gL-pvTG~lD~~T~~~M~ 60 (312)
+++..+++-|.+|||--+. ..+|+..+.+|+.||.-|+= .++|+.|.+|+..+.
T Consensus 198 ~~v~~lq~~L~~YGY~v~~-----~~~d~~t~~vv~aFQ~hfrp~~~dg~~d~et~a~l~ 252 (257)
T COG3023 198 EDVAALQEMLARYGYGVEI-----GVFDQETQQVVRAFQMHFRPGLYDGEADVETIAILQ 252 (257)
T ss_pred CCHHHHHHHHHHhCcCCCc-----chhhHHHHHHHHHHHHHhCCCCCCCCCChHHHHHHH
Confidence 4688999999999996432 46899999999999988865 689999999988763
No 42
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=94.96 E-value=0.048 Score=54.80 Aligned_cols=58 Identities=21% Similarity=0.292 Sum_probs=46.0
Q ss_pred ChHHHHHHHHhcC-CCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhcC
Q 021444 3 GLAKLKKYFHYFG-YIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHILR 61 (312)
Q Consensus 3 ~~~~~~~yL~~fG-Yl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~ 61 (312)
.+..+.+-|.+-| -++... ..+..++.++.+|||+||+.+||+..|++++.|+++|+-
T Consensus 240 ~v~aL~~~L~~~~~d~~~a~-~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~ 298 (561)
T COG2989 240 DVPALRARLARSGMDLPSAA-GSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV 298 (561)
T ss_pred hHHHHHHHHHhcCccchhhc-cCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence 3566677777777 344322 234478999999999999999999999999999999976
No 43
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=94.92 E-value=0.034 Score=54.44 Aligned_cols=74 Identities=20% Similarity=0.284 Sum_probs=46.8
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccc---cccCCC-------------------CCcCCCCHHHHHHHHHHhCCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMY---PSISSG-------------------SRKVELANDDIQGIQALYGNNPNFNG 275 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~---p~y~~~-------------------~~~~~L~~dDi~~Iq~LYG~~~~~~~ 275 (312)
-...++|||||.||+.|-. +-||- ++++.. -.+-+.++.-+++++-+|-+.-..++
T Consensus 301 tlGA~~HElGH~lgcpHq~--~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~ 378 (614)
T KOG4525|consen 301 TLGAVCHELGHCLGCPHQS--EGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPG 378 (614)
T ss_pred HHHHHHHHhhhccCCCCCC--CceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCC
Confidence 3557899999999999965 56773 332210 01335677888999999987666555
Q ss_pred -CCCCCCCCCCCCCCCcch
Q 021444 276 -SSVPTDQQRDTSGAHGAY 293 (312)
Q Consensus 276 -~~~~~~~~~~~~~~~~~~ 293 (312)
+|=|+....-++...++-
T Consensus 379 dptyps~~~s~y~~~~~s~ 397 (614)
T KOG4525|consen 379 DPTYPSSSSSSYSSSSSSA 397 (614)
T ss_pred CCCCCCccccceecCCCcc
Confidence 333555555554444443
No 44
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=94.62 E-value=0.014 Score=55.22 Aligned_cols=21 Identities=38% Similarity=0.642 Sum_probs=15.3
Q ss_pred ccchhhhhhhhhhhcCCCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
.....+++||+||++||+|..
T Consensus 191 ~s~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 191 YSYGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp S-SHHHHHHHHHHTTT--SST
T ss_pred cCccceeehhhhhhcCCCCCC
Confidence 346789999999999999943
No 45
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.53 E-value=0.019 Score=54.72 Aligned_cols=19 Identities=42% Similarity=0.592 Sum_probs=17.0
Q ss_pred chhhhhhhhhhhcCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~ 236 (312)
-..|++||+||-|||.|+.
T Consensus 150 ~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 150 DWATIMHELGHNLGLWHAG 168 (314)
T ss_pred cHHHHHHHhhhhccccccC
Confidence 3569999999999999996
No 46
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.37 E-value=0.04 Score=54.40 Aligned_cols=57 Identities=23% Similarity=0.296 Sum_probs=36.2
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccccccCC-------------CCC---------cCCCCHHHHHHHHHHhCCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMYPSISS-------------GSR---------KVELANDDIQGIQALYGNNPNFNG 275 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~-------------~~~---------~~~L~~dDi~~Iq~LYG~~~~~~~ 275 (312)
.....+|||||+|||.|..+ .||---|.. .+. +.+-.+.....++.+|.+.-..+.
T Consensus 316 ~iGA~lHEiGH~fg~pH~~~--GIM~Rdy~~~NR~F~~~E~~~~rtk~~g~~~~~~~~ec~whrld~lrf~~hP~F~lp~ 393 (423)
T PF12044_consen 316 GIGAFLHEIGHLFGCPHQED--GIMLRDYDRLNRSFTTREPYSTRTKSKGLRPCLPKDECHWHRLDALRFRYHPSFRLPS 393 (423)
T ss_pred hhHHHHHHHHHhcCCCCCCC--CeeeCCchhhcceeeccccccccccCCCcCcccCCccccccHhhhHhhhcCccccCCC
Confidence 35578999999999999876 777433311 000 112345667778888876655544
Q ss_pred C
Q 021444 276 S 276 (312)
Q Consensus 276 ~ 276 (312)
.
T Consensus 394 D 394 (423)
T PF12044_consen 394 D 394 (423)
T ss_pred c
Confidence 4
No 47
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=93.15 E-value=0.028 Score=52.78 Aligned_cols=20 Identities=40% Similarity=0.406 Sum_probs=17.2
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~ 237 (312)
...|++||+||.|||.+--+
T Consensus 165 ~igv~~HE~gH~lGLPDlY~ 184 (286)
T TIGR03296 165 GVGVIAHELGHDLGLPDLYD 184 (286)
T ss_pred ceeeeehhhhcccCCCcccc
Confidence 36899999999999988654
No 48
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=91.94 E-value=3.7 Score=37.03 Aligned_cols=150 Identities=15% Similarity=0.095 Sum_probs=81.8
Q ss_pred CCCCCCCCceEEeecCCCCCh-HHHHHHHHHHHHHHhhcccceEEEcccCCcccEEEEeeecCCCCC---------CCcC
Q 021444 107 PRWPSNKRDLTYAFLPDNQLT-DEVKNVFATAFDRWSEATSLTFTRTDVYTTADIRIGFFTGDHGDG---------EPFD 176 (312)
Q Consensus 107 ~kW~~~~~~LtY~i~~~~~~~-~~~~~~i~~A~~~Ws~~s~l~F~ev~~~~~aDI~I~f~~~~hgd~---------~~fd 176 (312)
.|| ...++..+....... ....+.+..-++.-+.++.+.+..+... +|+|.|.|...+.... ..+.
T Consensus 28 ~Rw---~~PVrv~v~~~~~~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~~~~~ 103 (213)
T PF11150_consen 28 RRW---EGPVRVRVEGVPPADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGAACFV 103 (213)
T ss_pred ccC---CCCeEEEEeccChhhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccccccc
Confidence 489 357888887322111 3344555555777788899999888864 8999999987654110 0000
Q ss_pred C-------CC-CcceeeeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCC--CCCCcccccc
Q 021444 177 G-------PM-GTLAHAFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSS--VEDAIMYPSI 246 (312)
Q Consensus 177 G-------~g-g~lAhA~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~--~~~sIM~p~y 246 (312)
- .. .-+..++....+.+.....-- .+. ......-+...+..||-.+|||.--+ .+.||-+-
T Consensus 104 ~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi-~i~------~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFND-- 174 (213)
T PF11150_consen 104 VPNFSSLPRYARCLVFARSNWSGEITRARAVI-FIR------ADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFND-- 174 (213)
T ss_pred chhhhcccccceEEEEEEecCCCceEEEEEEE-EEe------ccCCchHHHHHHHHHHHHHcCCccCCCcCCCceeeC--
Confidence 0 00 001122221111111100000 000 01112346678888899999995433 23444331
Q ss_pred CCCCCcCCCCHHHHHHHHHHhCCCC
Q 021444 247 SSGSRKVELANDDIQGIQALYGNNP 271 (312)
Q Consensus 247 ~~~~~~~~L~~dDi~~Iq~LYG~~~ 271 (312)
......|+.-|..-|+.||-+.-
T Consensus 175 --d~~~~~LT~~D~lLLr~LYdP~L 197 (213)
T PF11150_consen 175 --DNEFALLTGFDELLLRMLYDPRL 197 (213)
T ss_pred --CCccccccHHHHHHHHHHcCccC
Confidence 23445789999999999998543
No 49
>PTZ00337 surface protease GP63; Provisional
Probab=91.86 E-value=0.18 Score=51.68 Aligned_cols=36 Identities=28% Similarity=0.371 Sum_probs=24.5
Q ss_pred CCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCC
Q 021444 188 PPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHS 235 (312)
Q Consensus 188 P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs 235 (312)
|..|.|.|+.... . .......|++|||.||||+...
T Consensus 211 Pi~G~in~np~~i---~---------~~~~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 211 PFAAAVNFDPRQI---A---------VTNGDVRVAAHELGHALGFVRD 246 (567)
T ss_pred ceEEEEEECHHHc---c---------chhHHHHHHHHHHHHHHccCHH
Confidence 4457787876321 1 1234678999999999999654
No 50
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=91.44 E-value=0.093 Score=53.40 Aligned_cols=60 Identities=27% Similarity=0.331 Sum_probs=29.2
Q ss_pred cccEEEEeeecCCCCCCCcCCCCCcceeee--------CCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhh
Q 021444 157 TADIRIGFFTGDHGDGEPFDGPMGTLAHAF--------SPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGH 228 (312)
Q Consensus 157 ~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGH 228 (312)
++|+.|.....+.+ .+++|.|. .|..|.|.|+... +. ......+..|++|||.|
T Consensus 159 ~~D~vlyV~~~~~~--------~~~~A~A~~C~~~~~~RP~~G~in~~p~~---i~-------~~~~~~~~~~~~HEi~H 220 (521)
T PF01457_consen 159 NADFVLYVTARPSS--------SSTLAWAAPCQQDSDGRPIAGVININPSY---IP-------SFYFQEFFRTVIHEIAH 220 (521)
T ss_dssp S-SEEEEEE----S--------TT---EEEEEEE-TTS-EEEEEEE--GGG-------------S--HHHHHHHHHHHHH
T ss_pred CccEEEEEEEeecc--------CCceEEEeehhhccCCCCeeEEEEEchhH---cc-------chhhhcccceeeeeeee
Confidence 67887765543221 15677763 2334677776531 11 00123577899999999
Q ss_pred hcCCCC
Q 021444 229 LLGLGH 234 (312)
Q Consensus 229 aLGL~H 234 (312)
|||+.-
T Consensus 221 aLGFs~ 226 (521)
T PF01457_consen 221 ALGFSS 226 (521)
T ss_dssp HTT-SH
T ss_pred eeeecc
Confidence 999976
No 51
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=91.36 E-value=0.66 Score=34.71 Aligned_cols=53 Identities=17% Similarity=0.349 Sum_probs=41.1
Q ss_pred hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC----CCCCCCHHHHHHh
Q 021444 4 LAKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLE----PTGQLDSQTMQHI 59 (312)
Q Consensus 4 ~~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLp----vTG~lD~~T~~~M 59 (312)
..+|+.=|++-||+... .+..+++++.+|++.|+..-|+. .-|++|...++.|
T Consensus 18 ~~evq~~L~~lGyy~g~---~~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L 74 (74)
T PF08823_consen 18 AREVQEALKRLGYYKGE---ADGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL 74 (74)
T ss_pred HHHHHHHHHHcCCccCC---CCCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence 46789999999998743 24567899999999999988873 3455888877654
No 52
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=90.99 E-value=0.091 Score=42.80 Aligned_cols=19 Identities=42% Similarity=0.730 Sum_probs=16.0
Q ss_pred ccchhhhhhhhhhhcCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGH 234 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~H 234 (312)
-....|++|||||-.||+.
T Consensus 107 d~vthvliHEIgHhFGLsD 125 (136)
T COG3824 107 DQVTHVLIHEIGHHFGLSD 125 (136)
T ss_pred hHhhhhhhhhhhhhcCCCh
Confidence 3567899999999999964
No 53
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=90.64 E-value=0.48 Score=48.92 Aligned_cols=97 Identities=21% Similarity=0.235 Sum_probs=50.1
Q ss_pred CcccEEEEeeecCCCCCCCcCCCCCcceeee--------CCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhh
Q 021444 156 TTADIRIGFFTGDHGDGEPFDGPMGTLAHAF--------SPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIG 227 (312)
Q Consensus 156 ~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~--------~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiG 227 (312)
.++|+.|....... . .+++|.|- .|..|.|.|+.... .+ ........|++|||.
T Consensus 204 ~naD~vLYVta~P~------~--~~~lA~A~~C~~~~d~RPi~G~iNinp~~i---~s-------~~~~~~~rv~~HEi~ 265 (622)
T PTZ00257 204 SNADFVLYVASVPS------E--PGVLAWATTCQVFSDDHPAVGVMNIPAANI---VS-------RYDQGTTRTVTHEVA 265 (622)
T ss_pred cccCEEEEEEEecC------C--CCeeEEEEEeccccCCCceEEEEeeCHHHC---CC-------ccchHHHHHHHHHHH
Confidence 36777776654322 1 24555552 35557777775321 00 011246789999999
Q ss_pred hhcCCCCCCCCCCcccccc---CCCCCc--CCCCHHHHHHHHHHhCCC
Q 021444 228 HLLGLGHSSVEDAIMYPSI---SSGSRK--VELANDDIQGIQALYGNN 270 (312)
Q Consensus 228 HaLGL~Hs~~~~sIM~p~y---~~~~~~--~~L~~dDi~~Iq~LYG~~ 270 (312)
||||+.-+.-..--|.-.. .+.... ...++.-++-.+.-||.+
T Consensus 266 HALGFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~ 313 (622)
T PTZ00257 266 HALGFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP 313 (622)
T ss_pred HHhcCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence 9999965432221111111 111111 112556666677777754
No 54
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=90.20 E-value=0.35 Score=43.71 Aligned_cols=20 Identities=40% Similarity=0.625 Sum_probs=15.2
Q ss_pred ccchhhhhhhhhhhcCCCCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
..-..||+||.||| +.|...
T Consensus 87 iaAvaVAAHEvGHA--iQ~a~~ 106 (222)
T PF04298_consen 87 IAAVAVAAHEVGHA--IQHAEG 106 (222)
T ss_pred HHHHHHHHHHHhHH--Hhcccc
Confidence 34567999999999 566653
No 55
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=88.11 E-value=0.13 Score=53.46 Aligned_cols=20 Identities=40% Similarity=0.338 Sum_probs=16.5
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~ 237 (312)
-..|++||+||.|||.-.-+
T Consensus 221 giGVfaHEfGH~LGLPDlYD 240 (645)
T PF05547_consen 221 GIGVFAHEFGHDLGLPDLYD 240 (645)
T ss_pred ceEEEEeeccccCCCCCccc
Confidence 47799999999999966443
No 56
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=87.33 E-value=0.45 Score=42.08 Aligned_cols=16 Identities=44% Similarity=0.681 Sum_probs=12.9
Q ss_pred ccchhhhhhhhhhhcC
Q 021444 216 IDLESVAVHEIGHLLG 231 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLG 231 (312)
..-..|++||.|||+=
T Consensus 90 ia~~aVAAHEVGHAiQ 105 (226)
T COG2738 90 IAAIAVAAHEVGHAIQ 105 (226)
T ss_pred HHHHHHHHHHhhHHHh
Confidence 3457799999999964
No 57
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=87.01 E-value=0.25 Score=45.95 Aligned_cols=21 Identities=38% Similarity=0.676 Sum_probs=13.0
Q ss_pred cchhhhhhhhhhhcC-CCCCCC
Q 021444 217 DLESVAVHEIGHLLG-LGHSSV 237 (312)
Q Consensus 217 ~l~~Va~HEiGHaLG-L~Hs~~ 237 (312)
....|++||+||++| |+-+-.
T Consensus 215 ~~~~v~vHE~GHsf~~LaDEY~ 236 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADEYY 236 (264)
T ss_dssp THHHHHHHHHHHHTT------E
T ss_pred cccceeeeeccccccccccccc
Confidence 567899999999654 565543
No 58
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=85.40 E-value=0.76 Score=48.53 Aligned_cols=46 Identities=28% Similarity=0.344 Sum_probs=30.1
Q ss_pred cchhhhhhhhhhhcCCCCCCC-------CCCccccccCCCCCcCCCCHHHHHHH
Q 021444 217 DLESVAVHEIGHLLGLGHSSV-------EDAIMYPSISSGSRKVELANDDIQGI 263 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~-------~~sIM~p~y~~~~~~~~L~~dDi~~I 263 (312)
....|++|||||.||+.|-.. ..=|| +.+.+...+..++.--++-+
T Consensus 322 ~~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~ 374 (716)
T KOG3607|consen 322 AFAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF 374 (716)
T ss_pred hHHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence 467799999999999999772 22478 66553223334555444444
No 59
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=83.20 E-value=0.85 Score=47.19 Aligned_cols=48 Identities=27% Similarity=0.457 Sum_probs=33.5
Q ss_pred hhhhhhhhhhhcCCCCCCC----------CCCccccccCCC--CCcCCCCHHHHHHHHHH
Q 021444 219 ESVAVHEIGHLLGLGHSSV----------EDAIMYPSISSG--SRKVELANDDIQGIQAL 266 (312)
Q Consensus 219 ~~Va~HEiGHaLGL~Hs~~----------~~sIM~p~y~~~--~~~~~L~~dDi~~Iq~L 266 (312)
..|++|||||..|=.|-.+ .+=|||++-..+ .++..+++--+.-|...
T Consensus 393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~V 452 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKV 452 (764)
T ss_pred heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHH
Confidence 4599999999999999765 234899886543 23445666666666544
No 60
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=83.07 E-value=0.51 Score=37.23 Aligned_cols=17 Identities=35% Similarity=0.626 Sum_probs=13.1
Q ss_pred cchhhhhhhhhhhcCCC
Q 021444 217 DLESVAVHEIGHLLGLG 233 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~ 233 (312)
-+..|++|||||.+|+.
T Consensus 72 ~I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 72 LIRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHHcCCC
Confidence 35679999999999985
No 61
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=78.21 E-value=2.5 Score=31.36 Aligned_cols=26 Identities=19% Similarity=0.391 Sum_probs=20.7
Q ss_pred HHHHHHHHhcC----CCCCCCCCHHHHHHh
Q 021444 34 SAVKTYQKNFN----LEPTGQLDSQTMQHI 59 (312)
Q Consensus 34 ~Ai~~fQ~~~g----LpvTG~lD~~T~~~M 59 (312)
.|+|.+|+..| +.+.|++-+.|+.++
T Consensus 1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al 30 (72)
T PF09374_consen 1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL 30 (72)
T ss_dssp HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence 48999999999 999999999999998
No 62
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=78.02 E-value=0.84 Score=40.56 Aligned_cols=15 Identities=47% Similarity=0.651 Sum_probs=12.7
Q ss_pred cchhhhhhhhhhhcC
Q 021444 217 DLESVAVHEIGHLLG 231 (312)
Q Consensus 217 ~l~~Va~HEiGHaLG 231 (312)
.+..|+.||+||++-
T Consensus 21 ~~l~t~~HE~gHal~ 35 (200)
T PF13398_consen 21 RLLVTFVHELGHALA 35 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456799999999975
No 63
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=77.81 E-value=0.82 Score=41.42 Aligned_cols=14 Identities=36% Similarity=0.570 Sum_probs=12.4
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
++..|+.||+||||
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 57889999999995
No 64
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=77.30 E-value=0.82 Score=45.65 Aligned_cols=19 Identities=42% Similarity=0.644 Sum_probs=15.5
Q ss_pred ccchhhhhhhhhhhcCCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGHS 235 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs 235 (312)
..+-.|++|||||..| +|-
T Consensus 128 sElagViAHEigHv~q-rH~ 146 (484)
T COG4783 128 SELAGVIAHEIGHVAQ-RHL 146 (484)
T ss_pred HHHHHHHHHHHHHHhh-hhH
Confidence 4689999999999976 453
No 65
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=76.74 E-value=3.3 Score=44.74 Aligned_cols=34 Identities=21% Similarity=0.407 Sum_probs=26.6
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCC------CccccccCC
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSVED------AIMYPSISS 248 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~~~------sIM~p~y~~ 248 (312)
+.....+++||+||.+|+.|-...+ .||.|....
T Consensus 314 gl~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~ 353 (845)
T KOG3538|consen 314 GLPSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG 353 (845)
T ss_pred CCccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence 4446679999999999999986543 589887654
No 66
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=76.70 E-value=0.99 Score=35.22 Aligned_cols=14 Identities=36% Similarity=0.629 Sum_probs=11.8
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
....+++|||||.+
T Consensus 41 ~~~f~laHELgH~~ 54 (122)
T PF06114_consen 41 RQRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 45679999999995
No 67
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=76.43 E-value=1 Score=40.64 Aligned_cols=19 Identities=32% Similarity=0.480 Sum_probs=14.3
Q ss_pred cchhhhhhhhhhhcCCCCCCC
Q 021444 217 DLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
.-.-|++||+||+ |-|...
T Consensus 71 r~rFtlAHELGH~--llH~~~ 89 (213)
T COG2856 71 RKRFTLAHELGHA--LLHTDL 89 (213)
T ss_pred HHHHHHHHHHhHH--Hhcccc
Confidence 3466999999999 556554
No 68
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=74.96 E-value=1.6 Score=39.11 Aligned_cols=30 Identities=40% Similarity=0.619 Sum_probs=26.0
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccccccC
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMYPSIS 247 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~ 247 (312)
...++-|++||++||+|.+.+..||+|.+.
T Consensus 153 tal~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 153 TALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred HHHHHHHHhhhccCcccccccccccCCccc
Confidence 455788999999999999999999987653
No 69
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=70.93 E-value=1.6 Score=37.90 Aligned_cols=22 Identities=32% Similarity=0.404 Sum_probs=15.7
Q ss_pred chhhhhhhhhhhcCCCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVED 239 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~ 239 (312)
+..+++||+||++=-....++.
T Consensus 7 ~i~i~~HE~gH~~~a~~~G~~~ 28 (192)
T PF02163_consen 7 LISIVLHELGHALAARLYGDKV 28 (192)
T ss_dssp HHHHHHHHHHHHHHHHTTT--B
T ss_pred cccccccccccccccccccccc
Confidence 4578999999998776665544
No 70
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=70.60 E-value=1.7 Score=37.53 Aligned_cols=20 Identities=50% Similarity=0.592 Sum_probs=14.5
Q ss_pred chhhhhhhhhhhcCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~ 237 (312)
+..|.+||+||++=-..-.+
T Consensus 8 ~i~i~iHE~gH~~~A~~~G~ 27 (180)
T cd05709 8 LISVTVHELGHALVARRLGV 27 (180)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 45789999999985444443
No 71
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=69.83 E-value=1.8 Score=37.93 Aligned_cols=12 Identities=50% Similarity=0.844 Sum_probs=10.4
Q ss_pred hhhhhhhhhhhc
Q 021444 219 ESVAVHEIGHLL 230 (312)
Q Consensus 219 ~~Va~HEiGHaL 230 (312)
..|++||+||.+
T Consensus 10 ~~v~iHElGH~~ 21 (182)
T cd06163 10 ILIFVHELGHFL 21 (182)
T ss_pred HHHHHHHHHHHH
Confidence 468999999986
No 72
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=68.80 E-value=1.5 Score=41.43 Aligned_cols=40 Identities=20% Similarity=0.241 Sum_probs=23.0
Q ss_pred eCCCCCceEecCCCCce-ecCCCccccCCccccchhhhhhhhhhh
Q 021444 186 FSPPNGRFHLDGDENWV-ITSGTTTSISMSAIDLESVAVHEIGHL 229 (312)
Q Consensus 186 ~~P~~G~ihfd~~e~wt-~~~~~~~~~~~~~~~l~~Va~HEiGHa 229 (312)
|-|.+..|.+|... |. +.... ......-...|++||+||.
T Consensus 141 YCp~D~tIYlD~~f-~~~L~~~~---ga~G~~a~ayVlAHEyGHH 181 (292)
T PF04228_consen 141 YCPADQTIYLDLSF-FDELQQRF---GASGDFAQAYVLAHEYGHH 181 (292)
T ss_pred eCCCCCEEEechHH-HHHHHHHh---CCccHHHHHHHHHHHHHHH
Confidence 56888889998752 11 11000 0011223566999999996
No 73
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=68.57 E-value=3.3 Score=39.60 Aligned_cols=38 Identities=24% Similarity=0.443 Sum_probs=27.9
Q ss_pred eCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcCCCCCCC
Q 021444 186 FSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 186 ~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
++|=.++.+.|.+- ....+..|++||++|-+|...+..
T Consensus 178 y~PFT~EA~vN~~~--------------p~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 178 YNPFTGEANVNTDL--------------PPYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred EcCCcceeecCCCC--------------CcccccHHHHHHHHHHhCCCCHHH
Confidence 35556666666532 235678899999999999999874
No 74
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=66.97 E-value=2.2 Score=38.07 Aligned_cols=17 Identities=41% Similarity=0.466 Sum_probs=13.0
Q ss_pred cchhhhhhhhhhhcCCC
Q 021444 217 DLESVAVHEIGHLLGLG 233 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~ 233 (312)
-+..|++||+||++=-.
T Consensus 37 l~~~v~iHElgH~~~A~ 53 (208)
T cd06161 37 LFLSVLLHELGHALVAR 53 (208)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 35679999999986433
No 75
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=66.47 E-value=2.4 Score=37.39 Aligned_cols=20 Identities=35% Similarity=0.521 Sum_probs=16.0
Q ss_pred ccchhhhhhhhhhhcCCCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~ 236 (312)
..+..|++||+||.. .+|..
T Consensus 87 ~el~aVlaHElgH~~-~~h~~ 106 (226)
T PF01435_consen 87 DELAAVLAHELGHIK-HRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHHH-TTHCC
T ss_pred HHHHHHHHHHHHHHH-cCCcc
Confidence 368899999999997 45554
No 76
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=62.58 E-value=2.9 Score=40.79 Aligned_cols=15 Identities=33% Similarity=0.523 Sum_probs=12.2
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
.+-..+++||+|||+
T Consensus 220 ~~~v~tl~HE~GHa~ 234 (427)
T cd06459 220 LDDVFTLAHELGHAF 234 (427)
T ss_pred hhhHHHHHHHhhHHH
Confidence 445789999999985
No 77
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=62.48 E-value=3 Score=37.83 Aligned_cols=17 Identities=41% Similarity=0.593 Sum_probs=12.8
Q ss_pred chhhhhhhhhhhcCCCC
Q 021444 218 LESVAVHEIGHLLGLGH 234 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~H 234 (312)
+..|++||+||++=-.+
T Consensus 53 ~~~v~iHElgH~~~A~~ 69 (227)
T cd06164 53 FASVLLHELGHSLVARR 69 (227)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 56799999999864433
No 78
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=62.43 E-value=3 Score=40.02 Aligned_cols=16 Identities=31% Similarity=0.416 Sum_probs=13.1
Q ss_pred cccchhhhhhhhhhhc
Q 021444 215 AIDLESVAVHEIGHLL 230 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaL 230 (312)
..+-..++.||+|||+
T Consensus 151 ~~~~v~tl~HE~GHa~ 166 (365)
T cd06258 151 GHDDINTLFHEFGHAV 166 (365)
T ss_pred CHHHHHHHHHHHhHHH
Confidence 3455679999999998
No 79
>PRK03982 heat shock protein HtpX; Provisional
Probab=62.13 E-value=3.2 Score=38.92 Aligned_cols=15 Identities=40% Similarity=0.614 Sum_probs=13.0
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
..+..|++|||||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 468999999999974
No 80
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=61.58 E-value=3.3 Score=38.39 Aligned_cols=15 Identities=47% Similarity=0.791 Sum_probs=13.0
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
..+..|+.||+||..
T Consensus 155 dEl~aVlaHElgHi~ 169 (302)
T COG0501 155 DELEAVLAHELGHIK 169 (302)
T ss_pred HHHHHHHHHHHHHHh
Confidence 368999999999984
No 81
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=60.99 E-value=3.3 Score=41.16 Aligned_cols=25 Identities=12% Similarity=0.199 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHhhcccceEEEccc
Q 021444 130 VKNVFATAFDRWSEATSLTFTRTDV 154 (312)
Q Consensus 130 ~~~~i~~A~~~Ws~~s~l~F~ev~~ 154 (312)
+..+++.-+...+.+-+|+|+++..
T Consensus 136 ~~~~~~g~~~l~~~lfgi~~~~~~~ 160 (458)
T PF01432_consen 136 LEEVLEGLFELAERLFGIRFEEVPD 160 (458)
T ss_dssp HHHHHHHHHHHHHHHHTEEEEECTC
T ss_pred HHHHHHHHHHHHHHHhcEEEEeccc
Confidence 3455566666778888899998864
No 82
>PRK03001 M48 family peptidase; Provisional
Probab=59.90 E-value=3.7 Score=38.39 Aligned_cols=15 Identities=27% Similarity=0.563 Sum_probs=13.0
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
..+..|++||+||.-
T Consensus 122 ~El~aVlAHElgHi~ 136 (283)
T PRK03001 122 REIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHh
Confidence 468999999999974
No 83
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=59.60 E-value=13 Score=31.58 Aligned_cols=37 Identities=27% Similarity=0.401 Sum_probs=31.9
Q ss_pred CCCcchHHHHHHHHHHHHhcCCCCCCCCCHHHHHHhc
Q 021444 24 FTDDFDDMLESAVKTYQKNFNLEPTGQLDSQTMQHIL 60 (312)
Q Consensus 24 ~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~ 60 (312)
.+..+....+.|++.||+.++|+++|+.-..|...+.
T Consensus 148 ~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~ 184 (185)
T COG3409 148 VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR 184 (185)
T ss_pred CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence 3356778899999999999999999999999987663
No 84
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=59.47 E-value=3.6 Score=38.54 Aligned_cols=14 Identities=50% Similarity=0.627 Sum_probs=11.7
Q ss_pred chhhhhhhhhhhcC
Q 021444 218 LESVAVHEIGHLLG 231 (312)
Q Consensus 218 l~~Va~HEiGHaLG 231 (312)
+..|++||+||++=
T Consensus 135 ~isvvvHElgHal~ 148 (277)
T cd06162 135 LISGVVHEMGHGVA 148 (277)
T ss_pred HHHHHHHHHHHHHH
Confidence 45799999999963
No 85
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=59.24 E-value=3.7 Score=38.19 Aligned_cols=19 Identities=26% Similarity=0.289 Sum_probs=13.8
Q ss_pred chhhhhhhhhhhcCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~ 236 (312)
...|++||+||++=-.+..
T Consensus 118 ~isv~iHElgHa~~Ar~~G 136 (263)
T cd06159 118 VVGVVVHELSHGILARVEG 136 (263)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 4569999999997544433
No 86
>PRK03072 heat shock protein HtpX; Provisional
Probab=59.19 E-value=3.8 Score=38.48 Aligned_cols=15 Identities=33% Similarity=0.547 Sum_probs=13.0
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
..+..|++||+||.-
T Consensus 125 ~El~aVlAHElgHi~ 139 (288)
T PRK03072 125 RELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHHh
Confidence 468999999999974
No 87
>PRK01345 heat shock protein HtpX; Provisional
Probab=59.01 E-value=3.9 Score=39.05 Aligned_cols=18 Identities=22% Similarity=0.372 Sum_probs=14.6
Q ss_pred ccchhhhhhhhhhhcCCCC
Q 021444 216 IDLESVAVHEIGHLLGLGH 234 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~H 234 (312)
..+..|++|||||.- .+|
T Consensus 122 dEL~aVlAHElgHi~-~~d 139 (317)
T PRK01345 122 EEVAGVMAHELAHVK-NRD 139 (317)
T ss_pred HHHHHHHHHHHHHHH-cCC
Confidence 368999999999986 344
No 88
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=58.81 E-value=3.7 Score=42.02 Aligned_cols=15 Identities=33% Similarity=0.368 Sum_probs=12.6
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
.+-..|++||+|||+
T Consensus 335 ~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 335 SGDIDVLTHEAGHAF 349 (549)
T ss_pred hhHHHHHHHHhhHHH
Confidence 455779999999997
No 89
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=58.67 E-value=8.6 Score=28.97 Aligned_cols=30 Identities=10% Similarity=0.187 Sum_probs=19.4
Q ss_pred CCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhc
Q 021444 190 NGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLL 230 (312)
Q Consensus 190 ~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaL 230 (312)
..+|+|..... . .....-..+++||+.|++
T Consensus 44 G~~I~f~~g~~-~----------~~s~~~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 44 GNDIYFAPGKY-N----------PDSPEGRALLAHELAHVV 73 (79)
T ss_pred CCEEEEcCCCc-C----------CCCCCcchhHhHHHHHHH
Confidence 46788865431 0 112245789999999985
No 90
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=58.32 E-value=3.9 Score=41.07 Aligned_cols=14 Identities=36% Similarity=0.615 Sum_probs=11.8
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
+-..|+.||+||+|
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 44668999999998
No 91
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=56.48 E-value=4.3 Score=41.89 Aligned_cols=15 Identities=33% Similarity=0.364 Sum_probs=12.4
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
.+-..|++||+|||+
T Consensus 373 ~~~v~TL~HE~GHa~ 387 (587)
T TIGR02290 373 RRDVSTLAHELGHAY 387 (587)
T ss_pred chhHHHHHHHhhHHH
Confidence 345679999999998
No 92
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=55.61 E-value=5.1 Score=41.28 Aligned_cols=15 Identities=27% Similarity=0.507 Sum_probs=12.7
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
.+-..|++||+||++
T Consensus 376 ~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 376 LNSVFTLAHELGHSM 390 (591)
T ss_pred cchHHHHHHHhhhHH
Confidence 455789999999997
No 93
>PRK04897 heat shock protein HtpX; Provisional
Probab=55.47 E-value=4.6 Score=38.12 Aligned_cols=15 Identities=47% Similarity=0.620 Sum_probs=12.9
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
..+..|++||+||.-
T Consensus 135 ~El~aVlAHElgHi~ 149 (298)
T PRK04897 135 EELEGVIGHEISHIR 149 (298)
T ss_pred HHHHHHHHHHHHHHh
Confidence 468999999999974
No 94
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=55.27 E-value=4.9 Score=34.18 Aligned_cols=12 Identities=33% Similarity=0.445 Sum_probs=10.3
Q ss_pred hhhhhhhhhhhc
Q 021444 219 ESVAVHEIGHLL 230 (312)
Q Consensus 219 ~~Va~HEiGHaL 230 (312)
.-|++|||+|++
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 559999999974
No 95
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=55.01 E-value=4.8 Score=35.31 Aligned_cols=23 Identities=30% Similarity=0.592 Sum_probs=16.8
Q ss_pred chhhhhhhhhhhcCCCCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDA 240 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~s 240 (312)
+..+.+||+||++--..-.++..
T Consensus 41 ~~~l~iHElgH~~~A~~~G~~~~ 63 (183)
T cd06160 41 LAILGIHEMGHYLAARRHGVKAS 63 (183)
T ss_pred HHHHHHHHHHHHHHHHHCCCCcc
Confidence 45688999999987666555443
No 96
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=54.82 E-value=5 Score=31.39 Aligned_cols=14 Identities=36% Similarity=0.482 Sum_probs=12.2
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
.+..|++||+.|++
T Consensus 24 ~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 24 WLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 46799999999996
No 97
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=54.32 E-value=5.1 Score=40.12 Aligned_cols=33 Identities=12% Similarity=0.130 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHhhcccceEEEccc----CCcccEEE
Q 021444 130 VKNVFATAFDRWSEATSLTFTRTDV----YTTADIRI 162 (312)
Q Consensus 130 ~~~~i~~A~~~Ws~~s~l~F~ev~~----~~~aDI~I 162 (312)
+..+++-.|..-+..-+|+|++++. ...+|++.
T Consensus 132 l~~vl~gl~~~~~~lfgi~~~~~~~~~~~~Wh~dV~~ 168 (458)
T cd06457 132 LGTVMEGLSRLFSRLYGIRLVPVPLAPGEVWHPDVRK 168 (458)
T ss_pred HHHHHHHHHHHHHHHhCeEEEecCCCCCCCcCccceE
Confidence 3455666677777888999999853 23567654
No 98
>PRK05457 heat shock protein HtpX; Provisional
Probab=54.26 E-value=5 Score=37.69 Aligned_cols=14 Identities=43% Similarity=0.726 Sum_probs=12.5
Q ss_pred ccchhhhhhhhhhh
Q 021444 216 IDLESVAVHEIGHL 229 (312)
Q Consensus 216 ~~l~~Va~HEiGHa 229 (312)
..+..|++||+||.
T Consensus 132 ~El~aVlAHElgHi 145 (284)
T PRK05457 132 DEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHH
Confidence 46899999999996
No 99
>PRK02870 heat shock protein HtpX; Provisional
Probab=53.86 E-value=5.3 Score=38.51 Aligned_cols=14 Identities=36% Similarity=0.717 Sum_probs=12.7
Q ss_pred ccchhhhhhhhhhh
Q 021444 216 IDLESVAVHEIGHL 229 (312)
Q Consensus 216 ~~l~~Va~HEiGHa 229 (312)
..+..|++||+||.
T Consensus 171 dEL~aVlAHELgHi 184 (336)
T PRK02870 171 DELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHH
Confidence 46899999999998
No 100
>PRK02391 heat shock protein HtpX; Provisional
Probab=53.70 E-value=5.1 Score=37.84 Aligned_cols=14 Identities=43% Similarity=0.726 Sum_probs=12.4
Q ss_pred ccchhhhhhhhhhh
Q 021444 216 IDLESVAVHEIGHL 229 (312)
Q Consensus 216 ~~l~~Va~HEiGHa 229 (312)
..+..|++||+||.
T Consensus 131 ~El~aVlaHElgHi 144 (296)
T PRK02391 131 DELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHH
Confidence 46899999999995
No 101
>PRK01265 heat shock protein HtpX; Provisional
Probab=53.05 E-value=5.3 Score=38.30 Aligned_cols=14 Identities=50% Similarity=0.845 Sum_probs=12.4
Q ss_pred ccchhhhhhhhhhh
Q 021444 216 IDLESVAVHEIGHL 229 (312)
Q Consensus 216 ~~l~~Va~HEiGHa 229 (312)
..+..|++||+||.
T Consensus 138 ~El~aVlAHElgHi 151 (324)
T PRK01265 138 DEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 46899999999995
No 102
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=52.75 E-value=5.5 Score=39.07 Aligned_cols=18 Identities=39% Similarity=0.650 Sum_probs=13.4
Q ss_pred chhhhhhhhhhhcCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHS 235 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs 235 (312)
+..+.+||+||||--.-+
T Consensus 131 vi~~vvHElGHalAA~se 148 (484)
T KOG2921|consen 131 VITVVVHELGHALAAASE 148 (484)
T ss_pred HHHHHHHHhhHHHHHHhc
Confidence 456889999999854443
No 103
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=52.48 E-value=5.7 Score=39.41 Aligned_cols=32 Identities=19% Similarity=0.286 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHhhcccceEEEccc--CCcccEEE
Q 021444 131 KNVFATAFDRWSEATSLTFTRTDV--YTTADIRI 162 (312)
Q Consensus 131 ~~~i~~A~~~Ws~~s~l~F~ev~~--~~~aDI~I 162 (312)
..+++.-|..-+..-+++|+++.. ....|+++
T Consensus 107 ~~v~~gl~~~~~~lfgi~~~~~~~~~~Wh~dV~~ 140 (422)
T cd06456 107 EKVLDGLFELAERLYGITFKERTDLPVWHPDVRV 140 (422)
T ss_pred HHHHHHHHHHHHHHcCeeEEECCCCCCCCCCceE
Confidence 344555566667777888888753 23455543
No 104
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=51.97 E-value=39 Score=25.18 Aligned_cols=29 Identities=21% Similarity=0.256 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHhcCCCCCCCCCHHHHHHhc
Q 021444 31 MLESAVKTYQKNFNLEPTGQLDSQTMQHIL 60 (312)
Q Consensus 31 ~~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~ 60 (312)
++.++|++. -++.=+++|..|++|.++|.
T Consensus 20 evq~~L~~l-Gyy~g~~~g~~d~a~~~Al~ 48 (74)
T PF08823_consen 20 EVQEALKRL-GYYKGEADGVWDEATEDALR 48 (74)
T ss_pred HHHHHHHHc-CCccCCCCCcccHHHHHHHH
Confidence 444444432 22334999999999999984
No 105
>PLN02791 Nudix hydrolase homolog
Probab=51.79 E-value=5.8 Score=42.36 Aligned_cols=21 Identities=19% Similarity=0.183 Sum_probs=17.3
Q ss_pred cchhhhhhhhhhhcCCCCCCC
Q 021444 217 DLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
.+..|++||+||.+|=.+.-.
T Consensus 584 f~~~v~lHElgHGsG~~~~~~ 604 (770)
T PLN02791 584 FFTHTICHECCHGIGPHTITL 604 (770)
T ss_pred HHHHHHHHHhhccccccceec
Confidence 457899999999999877544
No 106
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=48.44 E-value=7.1 Score=34.09 Aligned_cols=13 Identities=31% Similarity=0.608 Sum_probs=11.3
Q ss_pred chhhhhhhhhhhc
Q 021444 218 LESVAVHEIGHLL 230 (312)
Q Consensus 218 l~~Va~HEiGHaL 230 (312)
+..+.+||+||++
T Consensus 9 ~~~i~~HE~aHa~ 21 (181)
T cd06158 9 LLAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4578999999997
No 107
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=47.43 E-value=7.4 Score=38.82 Aligned_cols=12 Identities=42% Similarity=0.767 Sum_probs=9.6
Q ss_pred hhhhhhhhhhhc
Q 021444 219 ESVAVHEIGHLL 230 (312)
Q Consensus 219 ~~Va~HEiGHaL 230 (312)
..|++||+||-|
T Consensus 16 ~li~vHElGHfl 27 (449)
T PRK10779 16 VLITVHEFGHFW 27 (449)
T ss_pred HHHHHHHHHHHH
Confidence 358899999963
No 108
>PHA00024 IX minor coat protein
Probab=47.33 E-value=8 Score=24.28 Aligned_cols=15 Identities=47% Similarity=0.693 Sum_probs=11.9
Q ss_pred cchhcccchhHHHHH
Q 021444 291 GAYFLGSCLGLSFFV 305 (312)
Q Consensus 291 ~~~~~~~~~~~~~~~ 305 (312)
|||+||-||+.+..+
T Consensus 10 gA~ilG~~l~~~Il~ 24 (33)
T PHA00024 10 GAYILGWALFYGILV 24 (33)
T ss_pred HHHHHHHHHHHHHHH
Confidence 789999999877554
No 109
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=47.03 E-value=7.6 Score=38.46 Aligned_cols=12 Identities=50% Similarity=0.866 Sum_probs=10.1
Q ss_pred hhhhhhhhhhhc
Q 021444 219 ESVAVHEIGHLL 230 (312)
Q Consensus 219 ~~Va~HEiGHaL 230 (312)
..|++||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 469999999974
No 110
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=46.93 E-value=7.6 Score=38.29 Aligned_cols=13 Identities=38% Similarity=0.521 Sum_probs=10.6
Q ss_pred chhhhhhhhhhhc
Q 021444 218 LESVAVHEIGHLL 230 (312)
Q Consensus 218 l~~Va~HEiGHaL 230 (312)
-..+++||+||||
T Consensus 159 ~l~t~iHE~GHal 171 (396)
T cd06460 159 ALFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHHhhHHH
Confidence 3568899999986
No 111
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=46.65 E-value=7.6 Score=40.39 Aligned_cols=14 Identities=29% Similarity=0.349 Sum_probs=11.7
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
+-..|++||+||++
T Consensus 379 ~dV~TLaHElGHs~ 392 (598)
T COG1164 379 RDVFTLAHELGHSV 392 (598)
T ss_pred hHHHHHHHHccHHH
Confidence 44679999999996
No 112
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=46.19 E-value=27 Score=30.99 Aligned_cols=47 Identities=17% Similarity=0.383 Sum_probs=33.6
Q ss_pred HHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCCCCCHH
Q 021444 7 LKKYFHYFGYIPDLPSNFTDDFDDMLESAVKTYQKNFNLEPTGQLDSQ 54 (312)
Q Consensus 7 ~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~~fQ~~~gLpvTG~lD~~ 54 (312)
+.+||.++||+.+.....+...=-++++.++..-+..|+++ |+||-=
T Consensus 153 Ilr~l~r~g~i~e~~kt~t~K~YLe~E~ilr~iae~~g~s~-gelDL~ 199 (210)
T COG1059 153 ILRWLVRYGLIDENPKTLTRKLYLEIEEILRSIAEEVGISL-GELDLY 199 (210)
T ss_pred HHHHHHHhcccccCcccccHHHHHHHHHHHHHHHHHhCCCc-chhHHH
Confidence 56899999999764222211112378899999999999996 888853
No 113
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.48 E-value=8.9 Score=34.26 Aligned_cols=15 Identities=33% Similarity=0.545 Sum_probs=12.9
Q ss_pred ccchhhhhhhhhhhc
Q 021444 216 IDLESVAVHEIGHLL 230 (312)
Q Consensus 216 ~~l~~Va~HEiGHaL 230 (312)
.....+|.||-||||
T Consensus 26 ~~~~~~A~HEAGhAv 40 (213)
T PF01434_consen 26 EEKRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 457889999999996
No 114
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=41.47 E-value=12 Score=34.73 Aligned_cols=23 Identities=30% Similarity=0.414 Sum_probs=17.4
Q ss_pred cccchhhhhhhhhhhcCCCCCCC
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
..+...++.||+||.-|=.+--+
T Consensus 200 a~~yyaTl~HElghwtgh~~rl~ 222 (316)
T COG4227 200 AINYYATLLHELGHWTGHEARLD 222 (316)
T ss_pred hHhHHHHHHHHhccccCchhhhh
Confidence 45788999999999977554433
No 115
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=41.41 E-value=14 Score=32.23 Aligned_cols=28 Identities=32% Similarity=0.289 Sum_probs=22.5
Q ss_pred cccchhhhhhhhhhhcCCCCCCCCCCcc
Q 021444 215 AIDLESVAVHEIGHLLGLGHSSVEDAIM 242 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs~~~~sIM 242 (312)
..-+..|++||+=|..-..||..--+.|
T Consensus 161 ~~~idYVvvHEL~Hl~~~nHs~~Fw~~v 188 (205)
T PF01863_consen 161 PEVIDYVVVHELCHLRHPNHSKRFWALV 188 (205)
T ss_pred ccHHHHHHHHHHHHhccCCCCHHHHHHH
Confidence 3457889999999999999998655544
No 116
>PRK10911 oligopeptidase A; Provisional
Probab=40.65 E-value=11 Score=39.88 Aligned_cols=14 Identities=43% Similarity=0.475 Sum_probs=11.6
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
+-..|+.||+||+|
T Consensus 462 ~~v~tlfHEfGHal 475 (680)
T PRK10911 462 DEVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHHhHHH
Confidence 34568999999998
No 117
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=37.70 E-value=82 Score=19.97 Aligned_cols=29 Identities=21% Similarity=0.208 Sum_probs=18.4
Q ss_pred HHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHH
Q 021444 5 AKLKKYFHYFGYIPDLPSNFTDDFDDMLESAVK 37 (312)
Q Consensus 5 ~~~~~yL~~fGYl~~~~~~~~~~~~~~~~~Ai~ 37 (312)
.++++||+++|.-.+... ...++|.+.++
T Consensus 7 ~~L~~wL~~~gi~~~~~~----~~rd~Ll~~~k 35 (38)
T PF10281_consen 7 SDLKSWLKSHGIPVPKSA----KTRDELLKLAK 35 (38)
T ss_pred HHHHHHHHHcCCCCCCCC----CCHHHHHHHHH
Confidence 579999999999654321 13455554444
No 118
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=37.48 E-value=12 Score=38.34 Aligned_cols=16 Identities=38% Similarity=0.511 Sum_probs=11.7
Q ss_pred cchhhhhhhh-hhhcCC
Q 021444 217 DLESVAVHEI-GHLLGL 232 (312)
Q Consensus 217 ~l~~Va~HEi-GHaLGL 232 (312)
....|++||+ ||..|=
T Consensus 292 f~~~v~lHEllGHGsGk 308 (549)
T PF03571_consen 292 FFVQVGLHELLGHGSGK 308 (549)
T ss_dssp HHHHHHHHHHCCCCS--
T ss_pred HhhhhhHHhhccCcCcc
Confidence 3567999998 999885
No 119
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=37.14 E-value=13 Score=36.84 Aligned_cols=26 Identities=38% Similarity=0.500 Sum_probs=17.4
Q ss_pred ccchhhhhhhhhhhcCCCCCCCCCCcc
Q 021444 216 IDLESVAVHEIGHLLGLGHSSVEDAIM 242 (312)
Q Consensus 216 ~~l~~Va~HEiGHaLGL~Hs~~~~sIM 242 (312)
..+..|+.||+||. =++|.--.-.+|
T Consensus 278 eel~AVl~HELGHW-~~~H~~K~~ii~ 303 (428)
T KOG2719|consen 278 EELVAVLAHELGHW-KLNHVLKNIIIM 303 (428)
T ss_pred HHHHHHHHHHhhHH-HHhhHHHHHHHH
Confidence 46899999999998 355543333333
No 120
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=35.76 E-value=52 Score=35.44 Aligned_cols=22 Identities=18% Similarity=0.394 Sum_probs=15.7
Q ss_pred HHHHHHHHHHhhcccceEEEcc
Q 021444 132 NVFATAFDRWSEATSLTFTRTD 153 (312)
Q Consensus 132 ~~i~~A~~~Ws~~s~l~F~ev~ 153 (312)
+--...|+...+.+.|.|....
T Consensus 170 ~~Y~~i~~~Yd~l~Gl~~~~~~ 191 (775)
T PF03272_consen 170 DFYNDIFKFYDDLTGLSDDPSD 191 (775)
T ss_pred HHHHHHHHHHHhhhCCCCCCCc
Confidence 3446678888888888887543
No 121
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=33.22 E-value=19 Score=25.59 Aligned_cols=21 Identities=38% Similarity=0.597 Sum_probs=17.6
Q ss_pred cchhhhhhhhhhhcCCCCCCC
Q 021444 217 DLESVAVHEIGHLLGLGHSSV 237 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~~ 237 (312)
...--++||+.+.+||.|.+.
T Consensus 28 ~~eR~~vH~lA~~~gL~s~S~ 48 (60)
T cd02641 28 SHDRLLVHELAEELGLRHEST 48 (60)
T ss_pred HHHHHHHHHHHHHcCCceEee
Confidence 456689999999999999763
No 122
>PRK03094 hypothetical protein; Provisional
Probab=33.20 E-value=31 Score=26.25 Aligned_cols=16 Identities=19% Similarity=0.382 Sum_probs=14.5
Q ss_pred CChHHHHHHHHhcCCC
Q 021444 2 DGLAKLKKYFHYFGYI 17 (312)
Q Consensus 2 ~~~~~~~~yL~~fGYl 17 (312)
+|+..+++||++-||-
T Consensus 8 ~~Ls~i~~~L~~~GYe 23 (80)
T PRK03094 8 QSLTDVQQALKQKGYE 23 (80)
T ss_pred cCcHHHHHHHHHCCCE
Confidence 5899999999999995
No 123
>COG5570 Uncharacterized small protein [Function unknown]
Probab=32.45 E-value=43 Score=23.32 Aligned_cols=31 Identities=19% Similarity=0.392 Sum_probs=24.7
Q ss_pred HHHHHHHHHHhcCCCCCCCCCHHHHHHhcCCCCCCC
Q 021444 32 LESAVKTYQKNFNLEPTGQLDSQTMQHILRPRCGNP 67 (312)
Q Consensus 32 ~~~Ai~~fQ~~~gLpvTG~lD~~T~~~M~~pRCG~p 67 (312)
+++-|...|+-. |.|+++..+.|+.|-|---
T Consensus 3 ieshl~eL~kkH-----g~le~ei~ea~n~Ps~dd~ 33 (57)
T COG5570 3 IESHLAELEKKH-----GNLEREIQEAMNSPSSDDL 33 (57)
T ss_pred HHHHHHHHHHhh-----chHHHHHHHHhcCCCcchH
Confidence 455677788765 7999999999999988543
No 124
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=32.27 E-value=14 Score=33.06 Aligned_cols=46 Identities=26% Similarity=0.368 Sum_probs=27.7
Q ss_pred chhhhhhhhhhhcCCCCCCCCCCccccccCCCCCcCCCCHHHHHHHHHHhCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVEDAIMYPSISSGSRKVELANDDIQGIQALYGNNPN 272 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~sIM~p~y~~~~~~~~L~~dDi~~Iq~LYG~~~~ 272 (312)
+.-++.||++| +=|+|.... .|... ..-++ ..|..++..|+....+
T Consensus 101 ~~fil~HE~~H-v~~~h~~~~----~~~~~---~~eE~-~AD~~A~~~il~~~~~ 146 (206)
T PF10463_consen 101 IAFILLHELAH-VVLGHEGDS----SPSQS---IQEEK-EADSYATEMILSDVGN 146 (206)
T ss_pred HHHHHHHHHHH-HHHcCcccc----ccchh---HHHHH-hhhHHHHHHHHHHHHh
Confidence 34589999999 569998865 11111 11122 4677777777665433
No 125
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=31.11 E-value=21 Score=32.96 Aligned_cols=21 Identities=29% Similarity=0.439 Sum_probs=15.0
Q ss_pred cccchhhhhhhhhhhcCCCCC
Q 021444 215 AIDLESVAVHEIGHLLGLGHS 235 (312)
Q Consensus 215 ~~~l~~Va~HEiGHaLGL~Hs 235 (312)
..+..+|++||+-|.|=+..-
T Consensus 151 ~~dg~NVviHEfAH~LD~~~g 171 (253)
T PF06167_consen 151 PNDGHNVVIHEFAHKLDMEDG 171 (253)
T ss_dssp SSSS--HHHHHHHHHHHCTTS
T ss_pred CCCCcchHHHHHHHHHHhhcC
Confidence 456789999999999966543
No 126
>PF03698 UPF0180: Uncharacterised protein family (UPF0180); InterPro: IPR005370 The members of this family are small uncharacterised proteins.
Probab=29.79 E-value=38 Score=25.75 Aligned_cols=16 Identities=31% Similarity=0.573 Sum_probs=14.5
Q ss_pred CChHHHHHHHHhcCCC
Q 021444 2 DGLAKLKKYFHYFGYI 17 (312)
Q Consensus 2 ~~~~~~~~yL~~fGYl 17 (312)
+|+..+++||++-||=
T Consensus 8 ~~Ls~v~~~L~~~Gye 23 (80)
T PF03698_consen 8 EGLSNVKEALREKGYE 23 (80)
T ss_pred CCchHHHHHHHHCCCE
Confidence 5899999999999994
No 127
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=29.47 E-value=21 Score=37.72 Aligned_cols=32 Identities=16% Similarity=0.196 Sum_probs=21.6
Q ss_pred HHHH-HHHHHHHhhcccceEEEccc--CCcccEEE
Q 021444 131 KNVF-ATAFDRWSEATSLTFTRTDV--YTTADIRI 162 (312)
Q Consensus 131 ~~~i-~~A~~~Ws~~s~l~F~ev~~--~~~aDI~I 162 (312)
..++ +-.|..-+..=+|+|+++++ ....|+++
T Consensus 363 ~~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~ 397 (681)
T PRK10280 363 NTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV 397 (681)
T ss_pred HHHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence 3455 35667777888999999753 34667764
No 128
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=28.18 E-value=23 Score=34.11 Aligned_cols=19 Identities=37% Similarity=0.548 Sum_probs=15.3
Q ss_pred cchhhhhhhhhhhcCCCCCC
Q 021444 217 DLESVAVHEIGHLLGLGHSS 236 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~Hs~ 236 (312)
.+..|+.||++|++ ++|+.
T Consensus 274 glAtvLgHE~aHaV-arH~A 292 (424)
T KOG2661|consen 274 GLATVLGHEIAHAV-ARHAA 292 (424)
T ss_pred HHHHHHHHHHHHHH-HHHHH
Confidence 47789999999997 56655
No 129
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=28.12 E-value=19 Score=37.02 Aligned_cols=13 Identities=46% Similarity=0.672 Sum_probs=11.2
Q ss_pred chhhhhhhhhhhc
Q 021444 218 LESVAVHEIGHLL 230 (312)
Q Consensus 218 l~~Va~HEiGHaL 230 (312)
+-.|+.|||||+.
T Consensus 487 IGaVIgHEI~HgF 499 (654)
T COG3590 487 IGAVIGHEIGHGF 499 (654)
T ss_pred ccceehhhhcccc
Confidence 5679999999974
No 130
>PF03801 Ndc80_HEC: HEC/Ndc80p family; InterPro: IPR005550 Members of this family are components of the mitotic spindle. It has been shown that Ndc80 from yeast is part of a complex called the Ndc80p complex []. This complex is thought to bind to the microtubules of the spindle.; PDB: 3IZ0_E 2VE7_B 2IGP_A.
Probab=28.06 E-value=1.2e+02 Score=25.70 Aligned_cols=63 Identities=3% Similarity=0.071 Sum_probs=37.4
Q ss_pred hHHHHHHHHhcCCC-CCCCC-CCCCcchHHHHHHHHHHHHhcC--CCCCCCCCHHHHHHhcCCCCCC
Q 021444 4 LAKLKKYFHYFGYI-PDLPS-NFTDDFDDMLESAVKTYQKNFN--LEPTGQLDSQTMQHILRPRCGN 66 (312)
Q Consensus 4 ~~~~~~yL~~fGYl-~~~~~-~~~~~~~~~~~~Ai~~fQ~~~g--LpvTG~lD~~T~~~M~~pRCG~ 66 (312)
+.+|.+||...||. .+-.. .......+++.+-++-+=++.. .+++.+++++....|+.=+|=.
T Consensus 42 ~~~I~~fL~~~~~~~~~is~k~l~~Pt~kdf~~I~~fL~~~idp~~~~~~k~eeev~~~lK~L~YP~ 108 (157)
T PF03801_consen 42 IRKIYEFLSEHGFESHPISPKTLKSPTQKDFVEIFNFLFRQIDPNFKFGKKFEEEVPFLLKALGYPF 108 (157)
T ss_dssp HHHHHHHHHHTT--SS---TTTTSS--HHHHHHHHHHHHHTTSTT---SSTHHHHHHHHHHHTT-SS
T ss_pred HHHHHHHHHHcCCCCccccccccCCCCHHHHHHHHHHHHHHhCCCCCcCcCHHHHHHHHHHHhCCCc
Confidence 56789999999992 11111 1223345677766666655555 6787799999999998777655
No 131
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=25.52 E-value=43 Score=31.17 Aligned_cols=12 Identities=50% Similarity=0.545 Sum_probs=9.9
Q ss_pred chhhhhhhhhhh
Q 021444 218 LESVAVHEIGHL 229 (312)
Q Consensus 218 l~~Va~HEiGHa 229 (312)
.-.|++||-+|-
T Consensus 149 ~grv~VhEWAhl 160 (262)
T PF08434_consen 149 RGRVFVHEWAHL 160 (262)
T ss_pred cceeeeehhhhh
Confidence 356999999996
No 132
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=25.37 E-value=28 Score=32.58 Aligned_cols=82 Identities=22% Similarity=0.178 Sum_probs=39.6
Q ss_pred eeCCCCCceEecCCCCceecCCCccccCCccccchhhhhhhhhhhcC--CCCCCCCC---CccccccC-CCCCcCCCCHH
Q 021444 185 AFSPPNGRFHLDGDENWVITSGTTTSISMSAIDLESVAVHEIGHLLG--LGHSSVED---AIMYPSIS-SGSRKVELAND 258 (312)
Q Consensus 185 A~~P~~G~ihfd~~e~wt~~~~~~~~~~~~~~~l~~Va~HEiGHaLG--L~Hs~~~~---sIM~p~y~-~~~~~~~L~~d 258 (312)
-|-|.+.++.||....=...... ......--..|++||.||.+= |+|...-+ .-|.-.-. ...-+.+|+.|
T Consensus 137 FYCP~D~kvYlDlsFf~~m~~~f---ga~GdfAqaYViAHEVGHHVQnllGi~~kv~~~~q~~s~~~an~~sVr~ELQAD 213 (295)
T COG2321 137 FYCPADTKVYLDLSFFDEMKTKF---GASGDFAQAYVIAHEVGHHVQNLLGILPKVNQNQQGMSQAEANQLSVRRELQAD 213 (295)
T ss_pred eecCCCceEEEehhHHHHHHHHh---cCCccHHHHHHHHhhhhHHHHHHhhhhHHHHHHHhhhhhhhccCcceeeeehhh
Confidence 35688888888874211010000 011123346699999999631 12211000 00211111 11234677777
Q ss_pred HHHHHHHHhCC
Q 021444 259 DIQGIQALYGN 269 (312)
Q Consensus 259 Di~~Iq~LYG~ 269 (312)
=-++|+.-|-.
T Consensus 214 CfAGVW~~~~~ 224 (295)
T COG2321 214 CFAGVWANYVQ 224 (295)
T ss_pred HHHHHHHHHhc
Confidence 77777777754
No 133
>PRK14749 hypothetical protein; Provisional
Probab=24.63 E-value=47 Score=20.37 Aligned_cols=18 Identities=28% Similarity=0.362 Sum_probs=13.3
Q ss_pred chhcccchhHHHHHHHHH
Q 021444 292 AYFLGSCLGLSFFVVAAV 309 (312)
Q Consensus 292 ~~~~~~~~~~~~~~~~~~ 309 (312)
+.+||-+|..+|-|+.|+
T Consensus 5 aWiLG~~lAc~f~ilna~ 22 (30)
T PRK14749 5 LWFVGILLMCSLSTLVLV 22 (30)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 356788888888777765
No 134
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=24.05 E-value=33 Score=30.81 Aligned_cols=14 Identities=29% Similarity=0.266 Sum_probs=11.4
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
-.-.|++||++||-
T Consensus 92 l~gsiLAHE~mHa~ 105 (212)
T PF12315_consen 92 LTGSILAHELMHAW 105 (212)
T ss_pred HHhhHHHHHHHHHH
Confidence 34679999999993
No 135
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=23.62 E-value=31 Score=34.73 Aligned_cols=44 Identities=32% Similarity=0.516 Sum_probs=24.6
Q ss_pred CCCHHHHHHH-----HHHhCCCCCCCCCCCCCCCCCCCCCCCcc--hhcccchhH
Q 021444 254 ELANDDIQGI-----QALYGNNPNFNGSSVPTDQQRDTSGAHGA--YFLGSCLGL 301 (312)
Q Consensus 254 ~L~~dDi~~I-----q~LYG~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 301 (312)
.|..+|+-.+ +..-|.+|+...+.. =||--=.||+ ||=.-.||.
T Consensus 374 ~l~v~DlP~lWn~kme~yLGirp~~d~eG~----LQDIHWs~GsfGYFPsYtlG~ 424 (497)
T COG2317 374 ELKVDDLPELWNDKMEEYLGIRPKNDAEGV----LQDIHWSHGSFGYFPTYTLGN 424 (497)
T ss_pred CcchhhhHHHHHHHHHHHcCCCCCCccccc----cccccccCCCcCcCchhhhHH
Confidence 5666777655 344487777554443 4565555664 555444443
No 136
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=23.20 E-value=35 Score=34.56 Aligned_cols=16 Identities=25% Similarity=0.289 Sum_probs=13.0
Q ss_pred cchhhhhhhhhhhcCC
Q 021444 217 DLESVAVHEIGHLLGL 232 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL 232 (312)
.-+.|+.||+||+...
T Consensus 246 ~D~~t~~HE~GH~~yy 261 (477)
T cd06461 246 EDFVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4477899999999854
No 137
>TIGR02106 cyd_oper_ybgT cyd operon protein YbgT. This model describes a very small (as short as 33 amino acids) protein of unknown function, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It begins with an aromatic motif MWYFXW and appears to contain a membrane-spanning helix. This protein appears to be restricted to the Proteobacteria and exist in a single copy only. We suggest it may be a membrane subunit of the terminal oxidase. The family is named after the E. coli member YbgT. This model excludes the apparently related protein YccB.
Probab=22.74 E-value=53 Score=20.22 Aligned_cols=17 Identities=35% Similarity=0.598 Sum_probs=11.7
Q ss_pred hhcccchhHHHHHHHHH
Q 021444 293 YFLGSCLGLSFFVVAAV 309 (312)
Q Consensus 293 ~~~~~~~~~~~~~~~~~ 309 (312)
.+||.+|..+|-|+.|+
T Consensus 6 WilG~~lA~~~~v~~a~ 22 (30)
T TIGR02106 6 WILGTLLACAFGVLNAM 22 (30)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 46777777777776665
No 138
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=22.68 E-value=34 Score=29.85 Aligned_cols=16 Identities=25% Similarity=0.316 Sum_probs=12.8
Q ss_pred chhhhhhhhhhhcCCC
Q 021444 218 LESVAVHEIGHLLGLG 233 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~ 233 (312)
+-.|++|||.|++.-.
T Consensus 36 lG~ilahel~hafd~~ 51 (206)
T PF01431_consen 36 LGFILAHELMHAFDPE 51 (206)
T ss_dssp HHHHHHHHHHHCTSTT
T ss_pred HHHHHHHHHHHHHHHh
Confidence 5679999999987553
No 139
>TIGR03141 cytochro_ccmD heme exporter protein CcmD. The model for this protein family describes a small, hydrophobic, and only moderately well-conserved protein, tricky to identify accurately for all of these reasons. However, members are found as part of large operons involved in heme export across the inner membrane for assembly of c-type cytochromes in a large number of bacteria. The gray zone between the trusted cutoff (13.0) and noise cutoff (4.75) includes both low-scoring examples and false-positive matches to hydrophobic domains of longer proteins.
Probab=22.62 E-value=61 Score=21.57 Aligned_cols=22 Identities=23% Similarity=0.451 Sum_probs=15.3
Q ss_pred CCCcchhcccchhHHHHHHHHHh
Q 021444 288 GAHGAYFLGSCLGLSFFVVAAVL 310 (312)
Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~ 310 (312)
+.||.|..+ +-|++|+++++.+
T Consensus 2 Ggy~~yVW~-sYg~t~l~l~~li 23 (45)
T TIGR03141 2 GGYAFYVWL-AYGITALVLAGLI 23 (45)
T ss_pred CCccHHHHH-HHHHHHHHHHHHH
Confidence 457777664 5688888877654
No 140
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=22.08 E-value=1.1e+02 Score=28.35 Aligned_cols=78 Identities=17% Similarity=0.207 Sum_probs=47.2
Q ss_pred HHHHHHHHHHHHHHhh--cccceEEEcccCCcccEEEEeeecCCCCCCCcCCCCCcceeeeCCCCCceEe--cCCCCcee
Q 021444 128 DEVKNVFATAFDRWSE--ATSLTFTRTDVYTTADIRIGFFTGDHGDGEPFDGPMGTLAHAFSPPNGRFHL--DGDENWVI 203 (312)
Q Consensus 128 ~~~~~~i~~A~~~Ws~--~s~l~F~ev~~~~~aDI~I~f~~~~hgd~~~fdG~gg~lAhA~~P~~G~ihf--d~~e~wt~ 203 (312)
..+....+.+...|+. |..+.|.-.. .++.|+..++.. |+ ++ + .|.|++ ..+
T Consensus 79 ~~~ek~l~~~~~k~~~pnv~~v~f~V~~--~dp~irlqkg~k--gg-------g~------~--~~ki~l~l~p~----- 134 (280)
T COG5504 79 SNCEKYLPELIVKWYGPNVPIVDFPVTP--GDPQIRLQKGRK--GG-------GG------I--PGKIMLWLVPS----- 134 (280)
T ss_pred HHHHHHHHHHHHhhcCCCCceeeecccC--CCchHHHhhccc--CC-------CC------c--CceEEEEEecC-----
Confidence 5567778888899985 4556665443 355566555332 21 11 1 233332 222
Q ss_pred cCCCccccCCccccchhhhhhhhhhhcCCCCCCCC
Q 021444 204 TSGTTTSISMSAIDLESVAVHEIGHLLGLGHSSVE 238 (312)
Q Consensus 204 ~~~~~~~~~~~~~~l~~Va~HEiGHaLGL~Hs~~~ 238 (312)
..-.+..++++||+=|..-|.|..-.
T Consensus 135 ---------~~~~~v~aliaHE~HH~~R~~~i~~~ 160 (280)
T COG5504 135 ---------STITSVPALIAHEYHHNCRLRYIDYG 160 (280)
T ss_pred ---------CCccchHHHHHHHHHhhheecccccC
Confidence 11236789999999999999887743
No 141
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.97 E-value=42 Score=23.98 Aligned_cols=22 Identities=36% Similarity=0.386 Sum_probs=18.3
Q ss_pred chhhhhhhhhhhcCCCCCCCCC
Q 021444 218 LESVAVHEIGHLLGLGHSSVED 239 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~~~ 239 (312)
-..-++|.+-|-|||.|.++..
T Consensus 29 ~eRriih~la~~lGL~~~s~G~ 50 (60)
T cd02639 29 AERRIVHLLASRLGLNHVSDGT 50 (60)
T ss_pred HHHHHHHHHHHHcCCceEEeCC
Confidence 4568899999999999987544
No 142
>CHL00176 ftsH cell division protein; Validated
Probab=21.91 E-value=35 Score=35.84 Aligned_cols=14 Identities=36% Similarity=0.707 Sum_probs=12.1
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
....||.||.||||
T Consensus 437 ~~~~vA~hEaGhA~ 450 (638)
T CHL00176 437 NKRLIAYHEVGHAI 450 (638)
T ss_pred HHHHHHHHhhhhHH
Confidence 45679999999998
No 143
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=21.82 E-value=35 Score=35.81 Aligned_cols=14 Identities=50% Similarity=0.688 Sum_probs=12.0
Q ss_pred cchhhhhhhhhhhc
Q 021444 217 DLESVAVHEIGHLL 230 (312)
Q Consensus 217 ~l~~Va~HEiGHaL 230 (312)
....|+.||+||||
T Consensus 407 ~~~~~a~he~gha~ 420 (644)
T PRK10733 407 QKESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHHH
Confidence 45779999999997
No 144
>PF14891 Peptidase_M91: Effector protein
Probab=21.57 E-value=40 Score=29.01 Aligned_cols=18 Identities=22% Similarity=0.071 Sum_probs=13.5
Q ss_pred cchhhhhhhhhhhcCCCC
Q 021444 217 DLESVAVHEIGHLLGLGH 234 (312)
Q Consensus 217 ~l~~Va~HEiGHaLGL~H 234 (312)
.-+.|+.||+.||.-..+
T Consensus 102 ~p~v~L~HEL~HA~~~~~ 119 (174)
T PF14891_consen 102 PPFVVLYHELIHAYDYMN 119 (174)
T ss_pred HHHHHHHHHHHHHHHHHC
Confidence 446799999999965433
No 145
>PF02074 Peptidase_M32: Carboxypeptidase Taq (M32) metallopeptidase; InterPro: IPR001333 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M32 (carboxypeptidase Taq family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. Carboxypeptidase Taq is a zinc-containing thermostable metallopeptidase. It was originally discovered and purified from Thermus aquaticus; optimal enzymatic activity occurs at 80 celcius. Although very little is known about this enzyme, it is thought either to be associated with a membrane or to be particle bound.; GO: 0004181 metallocarboxypeptidase activity, 0006508 proteolysis; PDB: 1K9X_A 1KA4_A 1KA2_A 3DWC_A 1WGZ_A 3HQ2_A 3HOA_B.
Probab=20.93 E-value=38 Score=34.46 Aligned_cols=17 Identities=41% Similarity=0.581 Sum_probs=11.8
Q ss_pred hhhhhhhhhhhc---CCCCC
Q 021444 219 ESVAVHEIGHLL---GLGHS 235 (312)
Q Consensus 219 ~~Va~HEiGHaL---GL~Hs 235 (312)
..-++||.|||| ||...
T Consensus 260 l~s~iHE~GHalYEq~~~~~ 279 (494)
T PF02074_consen 260 LFSTIHETGHALYEQGLPPE 279 (494)
T ss_dssp HHHHHHHHHHHHHHHTS-GG
T ss_pred HHHHHHHHHHHHHHcCCChh
Confidence 446789999996 66443
No 146
>COG4412 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.56 E-value=34 Score=35.07 Aligned_cols=51 Identities=27% Similarity=0.188 Sum_probs=30.4
Q ss_pred chhhhhhhhhhhcCCCCCCC-----------CCCccccc-cCC---CCCcCCCCHHHHHHHHHHhC
Q 021444 218 LESVAVHEIGHLLGLGHSSV-----------EDAIMYPS-ISS---GSRKVELANDDIQGIQALYG 268 (312)
Q Consensus 218 l~~Va~HEiGHaLGL~Hs~~-----------~~sIM~p~-y~~---~~~~~~L~~dDi~~Iq~LYG 268 (312)
-..|.+||+||-|||..+.+ ..|||..- +.+ ......++.-|.+-.|.-||
T Consensus 346 aaGv~ahEyghdlglPDEydtqy~GtgEPVgYwsImssGsWaGki~Gt~p~gFs~wdkef~q~~~G 411 (760)
T COG4412 346 AAGVEAHEYGHDLGLPDEYDTQYKGTGEPVGYWSIMSSGSWAGKIKGTLPGGFSFWDKEFPQPSIG 411 (760)
T ss_pred ccceeeeecccccCCCcccccccccCCCcccceeeecccccccccCCcCCcccchhhhhccCcccC
Confidence 34599999999999987665 23677421 111 12223355556665665554
No 147
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=20.56 E-value=39 Score=31.40 Aligned_cols=11 Identities=36% Similarity=0.341 Sum_probs=9.0
Q ss_pred hhhhhhhhhhc
Q 021444 220 SVAVHEIGHLL 230 (312)
Q Consensus 220 ~Va~HEiGHaL 230 (312)
--..|||||-+
T Consensus 221 WG~~HE~GH~~ 231 (307)
T PF13402_consen 221 WGPWHELGHNH 231 (307)
T ss_dssp HHHHHHHHHHH
T ss_pred eeehhhhhhhc
Confidence 35899999985
Done!