Query 037540
Match_columns 340
No_of_seqs 345 out of 1585
Neff 6.5
Searched_HMMs 46136
Date Fri Mar 29 13:17:13 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037540.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037540hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1565 Gelatinase A and relat 100.0 9.5E-51 2.1E-55 408.0 20.7 220 63-302 29-269 (469)
2 PF00413 Peptidase_M10: Matrix 100.0 2.4E-30 5.2E-35 223.4 15.4 133 149-294 1-154 (154)
3 cd04278 ZnMc_MMP Zinc-dependen 100.0 3E-30 6.4E-35 225.3 14.3 137 149-294 1-157 (157)
4 cd04279 ZnMc_MMP_like_1 Zinc-d 99.9 2.1E-23 4.5E-28 181.8 13.3 139 156-294 2-156 (156)
5 cd04268 ZnMc_MMP_like Zinc-dep 99.9 2.3E-21 4.9E-26 169.3 10.6 132 156-293 2-165 (165)
6 smart00235 ZnMc Zinc-dependent 99.9 2.6E-21 5.5E-26 165.4 9.9 134 147-295 2-140 (140)
7 cd04277 ZnMc_serralysin_like Z 99.8 4.6E-20 1E-24 165.2 12.2 122 173-294 32-186 (186)
8 cd04327 ZnMc_MMP_like_3 Zinc-d 99.7 1.6E-17 3.4E-22 150.6 12.2 103 149-262 2-110 (198)
9 cd04280 ZnMc_astacin_like Zinc 99.6 2.8E-15 6.1E-20 134.2 11.7 90 157-262 3-92 (180)
10 PF01400 Astacin: Astacin (Pep 99.6 2.3E-15 4.9E-20 136.0 10.1 129 149-296 2-190 (191)
11 cd04281 ZnMc_BMP1_TLD Zinc-dep 99.6 1.1E-14 2.4E-19 132.4 13.0 99 147-262 7-105 (200)
12 cd00203 ZnMc Zinc-dependent me 99.6 9.7E-15 2.1E-19 127.4 10.8 117 173-293 20-167 (167)
13 cd04282 ZnMc_meprin Zinc-depen 99.5 4.2E-13 9.1E-18 124.3 13.1 96 147-262 43-138 (230)
14 cd04283 ZnMc_hatching_enzyme Z 99.5 4.6E-13 1E-17 120.0 11.5 123 156-294 4-181 (182)
15 KOG3714 Meprin A metalloprotea 99.5 3.9E-13 8.4E-18 134.6 12.0 137 145-298 77-272 (411)
16 cd04276 ZnMc_MMP_like_2 Zinc-d 99.4 1.9E-12 4.2E-17 117.4 12.8 134 156-295 8-197 (197)
17 PF12388 Peptidase_M57: Dual-a 99.2 8.5E-11 1.8E-15 107.1 11.8 138 148-293 32-211 (211)
18 PF01471 PG_binding_1: Putativ 99.1 1.5E-10 3.2E-15 84.1 6.9 55 62-119 3-57 (57)
19 PF02031 Peptidase_M7: Strepto 98.7 6.7E-08 1.4E-12 81.1 9.3 125 156-294 2-130 (132)
20 COG5549 Predicted Zn-dependent 98.6 2.4E-07 5.3E-12 83.7 9.8 112 174-295 103-234 (236)
21 TIGR02869 spore_SleB spore cor 97.9 2.7E-05 5.9E-10 71.1 6.9 69 56-128 8-76 (201)
22 PF13583 Reprolysin_4: Metallo 97.7 4.9E-05 1.1E-09 69.6 4.7 47 244-292 137-197 (206)
23 PRK10594 murein L,D-transpepti 97.4 0.00045 9.7E-09 72.2 7.6 67 56-122 236-343 (608)
24 PF13574 Reprolysin_2: Metallo 97.4 7.9E-05 1.7E-09 66.3 1.7 47 244-292 111-170 (173)
25 PF11350 DUF3152: Protein of u 97.3 0.00067 1.4E-08 61.6 7.3 108 156-264 29-159 (203)
26 COG1913 Predicted Zn-dependent 97.3 0.00013 2.7E-09 64.7 1.9 27 245-271 125-151 (181)
27 cd04267 ZnMc_ADAM_like Zinc-de 97.2 0.00035 7.6E-09 62.6 4.1 48 243-291 132-189 (192)
28 PF13688 Reprolysin_5: Metallo 97.1 0.0002 4.3E-09 64.2 1.5 23 243-265 141-163 (196)
29 PRK13267 archaemetzincin-like 97.1 0.00035 7.5E-09 62.7 2.6 27 244-270 125-151 (179)
30 PF13582 Reprolysin_3: Metallo 97.0 0.00029 6.3E-09 58.4 1.3 18 244-261 107-124 (124)
31 cd04273 ZnMc_ADAMTS_like Zinc- 97.0 0.00053 1.2E-08 62.4 3.1 54 241-294 137-199 (207)
32 PF07998 Peptidase_M54: Peptid 96.8 0.00067 1.4E-08 61.6 2.3 27 244-271 145-171 (194)
33 cd04271 ZnMc_ADAM_fungal Zinc- 96.7 0.0019 4E-08 60.1 4.6 49 246-296 147-220 (228)
34 PF05572 Peptidase_M43: Pregna 96.7 0.00077 1.7E-08 59.0 1.8 25 241-265 66-90 (154)
35 cd04269 ZnMc_adamalysin_II_lik 96.6 0.0024 5.3E-08 57.2 4.6 49 244-294 131-185 (194)
36 COG2989 Uncharacterized protei 96.3 0.0072 1.6E-07 62.0 6.3 65 56-121 233-298 (561)
37 cd04275 ZnMc_pappalysin_like Z 96.2 0.00077 1.7E-08 62.6 -1.4 26 241-266 134-159 (225)
38 PF01421 Reprolysin: Reprolysi 95.8 0.014 3.1E-07 52.5 5.1 52 241-293 128-187 (199)
39 cd04270 ZnMc_TACE_like Zinc-de 95.7 0.015 3.2E-07 54.6 4.7 50 245-294 168-230 (244)
40 KOG4525 Jacalin-like lectin do 95.4 0.023 4.9E-07 57.1 5.2 57 245-303 302-380 (614)
41 PF05548 Peptidase_M11: Gameto 94.8 0.018 3.8E-07 56.2 2.3 21 242-262 148-168 (314)
42 COG3409 Putative peptidoglycan 94.5 0.08 1.7E-06 46.4 5.8 61 59-121 43-104 (185)
43 cd04272 ZnMc_salivary_gland_MP 94.2 0.029 6.2E-07 51.5 2.2 20 243-262 144-163 (220)
44 PF04298 Zn_peptidase_2: Putat 94.1 0.062 1.3E-06 49.8 4.2 54 202-263 51-106 (222)
45 PF10462 Peptidase_M66: Peptid 93.6 0.036 7.8E-07 53.8 1.7 20 243-262 192-211 (305)
46 COG3023 ampD N-acetyl-anhydrom 93.4 0.22 4.7E-06 47.0 6.5 71 45-120 169-252 (257)
47 PF12044 Metallopep: Putative 93.1 0.051 1.1E-06 55.0 2.0 22 245-268 317-338 (423)
48 COG2738 Predicted Zn-dependent 92.2 0.14 3.1E-06 46.4 3.4 76 176-257 20-105 (226)
49 TIGR03296 M6dom_TIGR03296 M6 f 91.0 0.061 1.3E-06 51.6 -0.3 20 245-264 166-185 (286)
50 COG3824 Predicted Zn-dependent 88.6 0.18 4E-06 42.3 0.8 17 244-260 109-125 (136)
51 PF08823 PG_binding_2: Putativ 87.4 1.6 3.4E-05 33.6 5.3 53 64-119 18-74 (74)
52 PF11150 DUF2927: Protein of u 86.8 15 0.00032 34.0 12.3 137 148-296 28-196 (213)
53 PTZ00337 surface protease GP63 85.9 0.86 1.9E-05 48.0 4.1 19 243-261 228-246 (567)
54 KOG3607 Meltrins, fertilins an 84.0 1.1 2.3E-05 48.6 3.9 45 244-289 323-374 (716)
55 PF09471 Peptidase_M64: IgA Pe 83.9 0.54 1.2E-05 44.8 1.5 24 243-266 215-239 (264)
56 PF06262 DUF1025: Possibl zinc 82.2 0.73 1.6E-05 37.3 1.4 16 244-259 73-88 (97)
57 PF05547 Peptidase_M6: Immune 81.5 0.3 6.4E-06 52.1 -1.4 17 245-261 222-238 (645)
58 PF01457 Peptidase_M8: Leishma 80.6 0.83 1.8E-05 47.6 1.5 55 243-297 209-268 (521)
59 PF06114 DUF955: Domain of unk 78.3 1.3 2.8E-05 35.2 1.7 14 243-256 41-54 (122)
60 PF13398 Peptidase_M50B: Pepti 74.7 1.5 3.2E-05 39.9 1.2 16 243-258 21-36 (200)
61 COG5549 Predicted Zn-dependent 72.4 2.2 4.8E-05 39.3 1.7 29 245-273 154-182 (236)
62 PF09374 PG_binding_3: Predict 72.3 3.9 8.4E-05 31.1 2.8 26 94-119 1-30 (72)
63 PF14247 DUF4344: Domain of un 71.6 1.8 4E-05 40.2 1.0 14 243-256 91-104 (220)
64 COG3409 Putative peptidoglycan 71.5 13 0.00029 32.1 6.5 58 61-120 127-184 (185)
65 COG4783 Putative Zn-dependent 70.5 2 4.2E-05 44.1 1.0 16 242-257 128-143 (484)
66 COG2856 Predicted Zn peptidase 69.9 2.1 4.5E-05 39.6 1.0 18 244-263 72-89 (213)
67 KOG3658 Tumor necrosis factor- 69.6 2.9 6.3E-05 44.5 2.1 49 245-293 393-453 (764)
68 PTZ00257 Glycoprotein GP63 (le 68.6 2.5 5.5E-05 44.8 1.4 54 243-296 255-313 (622)
69 cd06455 M3A_TOP Peptidase M3 T 67.2 15 0.00032 37.7 6.7 14 243-256 262-275 (472)
70 PF02163 Peptidase_M50: Peptid 66.8 2.9 6.2E-05 37.1 1.2 21 245-265 8-28 (192)
71 TIGR02290 M3_fam_3 oligoendope 65.3 3.4 7.3E-05 43.5 1.6 19 171-189 307-325 (587)
72 cd06163 S2P-M50_PDZ_RseP-like 65.3 3.2 6.9E-05 37.3 1.2 14 245-258 10-23 (182)
73 cd05709 S2P-M50 Site-2 proteas 64.4 3.4 7.3E-05 36.4 1.2 20 245-264 9-28 (180)
74 PF01435 Peptidase_M48: Peptid 64.1 3.6 7.8E-05 37.0 1.4 20 242-262 87-106 (226)
75 KOG3538 Disintegrin metallopro 62.7 4.7 0.0001 44.5 2.2 32 243-274 316-353 (845)
76 cd06161 S2P-M50_SpoIVFB SpoIVF 61.5 4.1 8.8E-05 37.2 1.2 18 244-261 38-55 (208)
77 COG0501 HtpX Zn-dependent prot 60.4 4 8.7E-05 38.6 1.0 14 242-255 155-168 (302)
78 PRK03982 heat shock protein Ht 59.7 4.2 9.1E-05 39.0 1.0 15 242-256 123-137 (288)
79 PRK03001 M48 family peptidase; 57.4 4.9 0.00011 38.4 1.0 15 242-256 122-136 (283)
80 cd06258 Peptidase_M3_like The 57.2 5.2 0.00011 39.2 1.2 15 242-256 152-166 (365)
81 cd06459 M3B_Oligoendopeptidase 56.8 4.9 0.00011 40.0 1.0 15 242-256 220-234 (427)
82 PRK03072 heat shock protein Ht 56.5 5.1 0.00011 38.5 1.0 16 241-256 124-139 (288)
83 PRK01345 heat shock protein Ht 56.4 5.2 0.00011 39.1 1.0 15 242-256 122-136 (317)
84 cd06164 S2P-M50_SpoIVFB_CBS Sp 55.6 5.9 0.00013 36.8 1.2 18 244-261 53-70 (227)
85 PF01432 Peptidase_M3: Peptida 54.5 6.1 0.00013 40.1 1.2 12 245-256 243-254 (458)
86 PF12725 DUF3810: Protein of u 52.8 6.2 0.00013 38.6 0.9 23 241-263 193-215 (318)
87 PRK04897 heat shock protein Ht 52.5 6.5 0.00014 38.0 1.0 14 242-255 135-148 (298)
88 cd06162 S2P-M50_PDZ_SREBP Ster 52.4 7.1 0.00015 37.6 1.2 16 244-259 135-150 (277)
89 cd06159 S2P-M50_PDZ_Arch Uncha 52.2 7.1 0.00015 37.2 1.2 18 245-262 119-136 (263)
90 PRK02870 heat shock protein Ht 51.3 7 0.00015 38.6 1.0 15 241-255 170-184 (336)
91 PRK05457 heat shock protein Ht 51.3 7 0.00015 37.5 1.0 15 241-255 131-145 (284)
92 PF13485 Peptidase_MA_2: Pepti 50.7 8.2 0.00018 30.8 1.2 14 243-256 24-37 (128)
93 PRK01265 heat shock protein Ht 50.7 7.2 0.00016 38.3 1.0 14 242-255 138-151 (324)
94 PRK02391 heat shock protein Ht 50.5 7.3 0.00016 37.7 1.0 15 241-255 130-144 (296)
95 TIGR02289 M3_not_pepF oligoend 50.5 7.7 0.00017 40.6 1.3 16 241-256 334-349 (549)
96 cd06457 M3A_MIP Peptidase M3 m 48.3 8.6 0.00019 39.4 1.2 25 176-200 132-156 (458)
97 KOG2921 Intramembrane metallop 48.0 8.6 0.00019 38.7 1.1 18 245-262 132-149 (484)
98 TIGR00181 pepF oligoendopeptid 47.8 9.2 0.0002 40.2 1.4 15 242-256 376-390 (591)
99 PF03272 Enhancin: Viral enhan 47.8 21 0.00045 39.2 4.1 136 148-294 128-289 (775)
100 cd06160 S2P-M50_like_2 Unchara 47.4 9.6 0.00021 34.3 1.2 23 245-267 42-64 (183)
101 PF01447 Peptidase_M4: Thermol 46.9 9.6 0.00021 33.2 1.1 12 245-256 136-147 (150)
102 cd06456 M3A_DCP_Oligopeptidase 46.1 10 0.00022 38.5 1.3 14 243-256 207-220 (422)
103 PF04228 Zn_peptidase: Putativ 44.3 11 0.00023 36.6 1.0 11 245-255 171-181 (292)
104 PF01434 Peptidase_M41: Peptid 43.5 11 0.00024 34.4 1.0 16 241-256 25-40 (213)
105 PLN02791 Nudix hydrolase homol 42.9 11 0.00024 41.3 1.0 20 244-263 585-604 (770)
106 PF01863 DUF45: Protein of unk 41.0 16 0.00036 32.4 1.7 26 243-268 163-188 (205)
107 cd06460 M32_Taq Peptidase fami 40.6 13 0.00028 37.6 1.0 12 245-256 160-171 (396)
108 TIGR00054 RIP metalloprotease 40.5 13 0.00028 37.7 1.0 12 245-256 15-26 (420)
109 cd06158 S2P-M50_like_1 Unchara 40.4 14 0.00031 33.0 1.2 13 244-256 9-21 (181)
110 PRK10779 zinc metallopeptidase 39.9 13 0.00029 37.8 1.0 12 245-256 16-27 (449)
111 PF08823 PG_binding_2: Putativ 39.7 71 0.0015 24.5 4.8 32 90-121 15-49 (74)
112 COG1164 Oligoendopeptidase F [ 38.6 14 0.00031 39.3 1.0 14 243-256 379-392 (598)
113 PF08434 CLCA_N: Calcium-activ 37.7 14 0.0003 35.3 0.7 12 244-255 149-160 (262)
114 cd02641 R3H_Smubp-2_like R3H d 35.4 19 0.00042 26.3 1.0 19 244-262 29-47 (60)
115 PF10281 Ish1: Putative stress 35.0 87 0.0019 20.5 4.1 29 65-97 7-35 (38)
116 PRK10911 oligopeptidase A; Pro 35.0 17 0.00038 39.2 1.0 13 244-256 463-475 (680)
117 KOG2719 Metalloprotease [Gener 33.2 19 0.00042 36.6 0.9 14 242-255 278-291 (428)
118 KOG2089 Metalloendopeptidase f 31.0 1E+02 0.0022 33.3 5.7 43 177-220 391-435 (718)
119 PF13699 DUF4157: Domain of un 30.6 25 0.00054 27.2 1.0 13 244-256 61-73 (79)
120 KOG3988 Protein-tyrosine sulfo 29.8 68 0.0015 31.3 3.9 38 44-81 295-332 (378)
121 PF03571 Peptidase_M49: Peptid 29.6 22 0.00047 37.5 0.7 15 244-258 293-308 (549)
122 PF06167 Peptidase_M90: Glucos 29.5 27 0.00058 33.1 1.2 20 242-261 152-171 (253)
123 COG4784 Putative Zn-dependent 27.2 1.3E+02 0.0028 30.2 5.4 16 241-256 121-136 (479)
124 PF10463 Peptidase_U49: Peptid 26.3 37 0.00081 31.2 1.5 43 245-296 102-144 (206)
125 COG4227 Antirestriction protei 25.9 30 0.00065 33.1 0.8 22 240-261 199-220 (316)
126 PRK03094 hypothetical protein; 25.6 49 0.0011 25.9 1.8 17 62-78 8-24 (80)
127 PRK10280 dipeptidyl carboxypep 25.4 32 0.0007 37.2 1.1 31 178-208 364-397 (681)
128 KOG2661 Peptidase family M48 [ 23.6 40 0.00087 33.4 1.2 18 244-262 275-292 (424)
129 PF03698 UPF0180: Uncharacteri 22.3 62 0.0013 25.3 1.8 17 62-78 8-24 (80)
130 PRK10733 hflB ATP-dependent me 21.9 42 0.00091 36.0 1.1 14 243-256 407-420 (644)
131 cd02639 R3H_RRM R3H domain of 21.8 60 0.0013 23.9 1.6 22 244-265 29-50 (60)
132 PF08006 DUF1700: Protein of u 21.4 82 0.0018 27.7 2.7 47 64-123 7-54 (181)
133 COG1059 Thermostable 8-oxoguan 21.2 1.3E+02 0.0028 27.5 3.9 46 67-113 153-198 (210)
134 PF12315 DUF3633: Protein of u 20.8 49 0.0011 30.6 1.1 11 245-255 94-104 (212)
135 PF01431 Peptidase_M13: Peptid 20.2 51 0.0011 29.4 1.1 16 244-259 36-51 (206)
136 cd06461 M2_ACE Peptidase famil 20.1 52 0.0011 34.1 1.3 16 243-258 246-261 (477)
No 1
>KOG1565 consensus Gelatinase A and related matrix metalloproteases [Posttranslational modification, protein turnover, chaperones; Extracellular structures]
Probab=100.00 E-value=9.5e-51 Score=407.99 Aligned_cols=220 Identities=45% Similarity=0.799 Sum_probs=200.4
Q ss_pred ChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcCCCCCCCCcccccccccccccc
Q 037540 63 GMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMSPRCGVSDTSFRQKIHATKRYS 142 (340)
Q Consensus 63 ~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~PRCGvpD~~~~~~~~~~~~~~ 142 (340)
+...++.||++|||+++.+...+...+..+++||+.||++++|++||++|.+|++.|++||||||| ++++
T Consensus 29 ~~~~~~~yl~~~~y~~~~~~~~~~~~~~~~~~al~~~q~~~~l~~tG~lD~~Tl~~m~~prCgvpd----------~~~~ 98 (469)
T KOG1565|consen 29 DKVALQDYLECYGYLPPTDLTATRASQNVLEDALKMMQDFFGLPVTGKLDNATLALMNKPRCGVPD----------GRYR 98 (469)
T ss_pred chhHHHHHhhhcccCCCccccccccCchhhHHHHHhhhhhcCccccCCcchhhhhhccCCCcCCCC----------Cccc
Confidence 678899999999999876543222257899999999999999999999999999999999999999 2488
Q ss_pred cccCCCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-
Q 037540 143 YFYGKPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF- 221 (340)
Q Consensus 143 ~~~~~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf- 221 (340)
++++++||++. +|||+|. |++++++..+++.++++||+.|+++++++|+|+...+.|||+|.|..++|||++||
T Consensus 99 ~~~~~~kW~k~---~lT~ri~--n~~~~l~~~~vr~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFD 173 (469)
T KOG1565|consen 99 YFPGKPKWNKE---HLTYRIK--NYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFD 173 (469)
T ss_pred cCcccCccccc---ccceecc--ccCCCCCHHHHHHHHHHHHhhcccCCCCccccCCCCCCCceeeeeeccCCCCCCccc
Confidence 89999999999 9999999 77899999999999999999999999999999998789999999999999998887
Q ss_pred --------------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCC
Q 037540 222 --------------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDL 281 (340)
Q Consensus 222 --------------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L 281 (340)
|||++|.|+++ ...++|++.||+|||||+|||.||.++++||||+|++...++.|
T Consensus 174 G~~g~laHAf~Pg~~~~G~~hfD~dE~Wt~~-----~~~g~~l~~Va~HEiGH~LGL~HS~~~~aiM~P~y~~~~~~~~L 248 (469)
T KOG1565|consen 174 GPGGVLAHAFFPGPGIGGDLHFDKDETWTYG-----DSNGVDLFLVAAHEIGHALGLGHSSDPDAIMYPFYQPDSGNFDL 248 (469)
T ss_pred CCCCceecccCCCCCCCCccccCcccceecc-----CCccchhHHHhhhhcccccccCCCCCcccccccccccCCCCccc
Confidence 99999999986 13489999999999999999999999999999999976678899
Q ss_pred CHHHHHHHHHHhCCCCCCCcc
Q 037540 282 KVDDVEGVQALYGSNPNFKFS 302 (340)
Q Consensus 282 ~~dDI~~Iq~LYG~~~~~~~~ 302 (340)
++|||.|||.|||.+++....
T Consensus 249 ~~DDv~giq~lYG~~~~~~~~ 269 (469)
T KOG1565|consen 249 SQDDVRGIQHLYGGPPRDYFM 269 (469)
T ss_pred ChhhhhhhHHHhCCCcccccc
Confidence 999999999999998876553
No 2
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=99.97 E-value=2.4e-30 Score=223.38 Aligned_cols=133 Identities=44% Similarity=0.834 Sum_probs=110.3
Q ss_pred CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-------
Q 037540 149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF------- 221 (340)
Q Consensus 149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf------- 221 (340)
||++. +|||+|... .+.++.+++++++++||+.|+++++++|+|+.+ ..+||+|.|....++++..|
T Consensus 1 ~W~~~---~ity~i~~~--~~~~~~~~~~~~i~~A~~~W~~~~~~~F~~~~~-~~adi~i~~~~~~~~~~~~~~~~~~~~ 74 (154)
T PF00413_consen 1 KWPKK---TITYSISNT--TPQLSQSEQRDAIRQAFQAWNDVAPLNFTEVSD-GNADIRISFGSNNHGDGYSFDGSGGTL 74 (154)
T ss_dssp SSSSS---EEEEEESSB--CTTS-HHHHHHHHHHHHHHHHTTSSEEEEEESS-SSCSEEEEEESSSSSSSS-CSSSSSES
T ss_pred CCCCC---cEEEEEECC--CCCCCHHHHHHHHHHHHHHHHhcCCceEEeccC-CCcceeeeeeccccCcccccccceeee
Confidence 79998 999999954 467778999999999999999999999999985 57999999987766654433
Q ss_pred --------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHH
Q 037540 222 --------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVE 287 (340)
Q Consensus 222 --------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~ 287 (340)
+|+.++.|... ..+.++..|++|||||+|||.|+++++|||||.+.+... ..|+.+||.
T Consensus 75 ~~~~~~~~~~~~~i~~~~~~~~~~~------~~~~~~~~v~~HEiGHaLGL~H~~~~~svM~~~~~~~~~-~~l~~~Di~ 147 (154)
T PF00413_consen 75 AHAYFPNNIVSGDIHFNDDESWTID------DSGNDLQSVAIHEIGHALGLDHSNDPNSVMYPYYRGPDN-KTLSEDDID 147 (154)
T ss_dssp EEEEESSSTTTTEEEEETTSHEESS------SSSEEHHHHHHHHHHHHTTBESSSSTTSTTSSSCTSSSS-TSTTHHHHH
T ss_pred eccccccccccccccccccccchhh------hhhhhhhhhhhhccccccCcCcCCCcccceeeecccCCC-CCCCHHHHH
Confidence 34444445443 235678999999999999999999999999999986542 389999999
Q ss_pred HHHHHhC
Q 037540 288 GVQALYG 294 (340)
Q Consensus 288 ~Iq~LYG 294 (340)
+||+|||
T Consensus 148 ~i~~lYg 154 (154)
T PF00413_consen 148 GIQYLYG 154 (154)
T ss_dssp HHHHHHS
T ss_pred HHHHHhC
Confidence 9999998
No 3
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.97 E-value=3e-30 Score=225.31 Aligned_cols=137 Identities=43% Similarity=0.780 Sum_probs=112.8
Q ss_pred CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-------
Q 037540 149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF------- 221 (340)
Q Consensus 149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf------- 221 (340)
||+++ +|||+|.. ..+.++...+++++++|++.|+++++|+|+|+.+...|||+|.|....|+++.+|
T Consensus 1 kW~~~---~itY~i~~--~~~~~~~~~~~~~i~~A~~~W~~~~~l~F~e~~~~~~adi~I~~~~~~~~~~~~~~~~~g~l 75 (157)
T cd04278 1 KWSKT---NLTYRILN--YPPDLPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGTL 75 (157)
T ss_pred CCCCC---ceeEEEEC--CCCCCCHHHHHHHHHHHHHHHHhhcCceeEEcccCCCCCEEEEEeecccCCCCCCCCCcccc
Confidence 79999 99999974 3444667999999999999999999999999987558999999998777653222
Q ss_pred -------------cccCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHHH
Q 037540 222 -------------HFDAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVEG 288 (340)
Q Consensus 222 -------------hfd~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~ 288 (340)
+|+.++.|.+.. ...+.+++.|++|||||+|||.|+.++++||||.+.+......|+.+||.+
T Consensus 76 ~~a~~p~~~~g~i~~~~~~~~~~~~----~~~~~~~~~~~~HEiGHaLGL~H~~~~~~vM~~~~~~~~~~~~l~~~D~~~ 151 (157)
T cd04278 76 AHAFFPGGIGGDIHFDDDEQWTLGS----DSGGTDLFSVAAHEIGHALGLGHSSDPDSIMYPYYQGPVPKFKLSQDDIRG 151 (157)
T ss_pred ccccCCCCcceeEEECCCcccccCC----CCccchHHHHHHHHhccccccCCCCCCcCeecccccCCCcCCCCCHHHHHH
Confidence 344555555432 124678999999999999999999999999999998653456899999999
Q ss_pred HHHHhC
Q 037540 289 VQALYG 294 (340)
Q Consensus 289 Iq~LYG 294 (340)
||+|||
T Consensus 152 ~~~lYg 157 (157)
T cd04278 152 IQALYG 157 (157)
T ss_pred HHHhcC
Confidence 999998
No 4
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.90 E-value=2.1e-23 Score=181.82 Aligned_cols=139 Identities=27% Similarity=0.311 Sum_probs=100.0
Q ss_pred CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccC-CCccEEEEeecC----CCCCC--Cccc-----c
Q 037540 156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDY-DSADIHIGFYHG----DHGDG--QPFH-----F 223 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~-~~ADI~I~f~~g----~hgdg--~pfh-----f 223 (340)
..|+|.+.+.+..+.-...++++++++|++.|+++++|+|+++.+. .++||+|.|... ..|+. .++. +
T Consensus 2 ~~i~~~i~~~~~~~~~~~~~~~~~v~~A~~~W~~~~~l~F~~~~~~~~~adi~I~f~~~~~~~~~g~~~a~a~~p~~~~~ 81 (156)
T cd04279 2 SPIRVYIDPTPAPPDSRAQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDG 81 (156)
T ss_pred CCeEEEEcCCCCccccchHHHHHHHHHHHHHHHHhCCeEEEEccCCCCCCcEEEEecCCCCCCCCCCceEEecccccCCC
Confidence 3789999865332223357899999999999999999999998653 379999999853 11111 1110 1
Q ss_pred cCCcceeecccccccc--ccccchhHHHHhhhhhhCCCcCCCCC-CccccccCCCC-CCCCCCHHHHHHHHHHhC
Q 037540 224 DAAETWAVDMESVKSR--VAVDLESVATHEIGHVLGLAHTSVKE-AVMYPSLTPRT-RKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 224 d~~e~wt~~~~~~~~~--~~~dl~~Va~HEIGHaLGL~Hs~d~~-SIMyP~~~~~~-~~~~L~~dDI~~Iq~LYG 294 (340)
+....|.......... .+.+++.|++|||||+|||.|++++. +||||++.... .+..|+++|+++||.|||
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~siM~p~~~~~~~~~~~l~~~D~~~i~~lY~ 156 (156)
T cd04279 82 NRKLFNRTDINLGPGQPRGAENLQAIALHELGHALGLWHHSDRPEDAMYPSQGQGPDGNPTLSARDVATLKRLYG 156 (156)
T ss_pred cccccccceEeecCCcCccchHHHHHHHHHhhhhhcCCCCCCCccceeeeecccCCcCcCCCCHHHHHHHHHHhC
Confidence 1223333332211111 25678999999999999999999999 99999998642 356899999999999997
No 5
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.85 E-value=2.3e-21 Score=169.32 Aligned_cols=132 Identities=21% Similarity=0.320 Sum_probs=93.4
Q ss_pred CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC--CCCCcc--cc---cCCcc
Q 037540 156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH--GDGQPF--HF---DAAET 228 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h--gdg~pf--hf---d~~e~ 228 (340)
..|+|.|... + ...+|+++++||+.|+++++++|+|+.+...+||+|.+....+ ..++++ +. ...+.
T Consensus 2 ~~I~y~i~~~-~-----~~~~r~~i~~A~~~W~~~~~i~F~e~~~~~~~di~i~~~~~~~~~~~~~~~~~~~~~~~~g~i 75 (165)
T cd04268 2 KPITYYIDDS-V-----PDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEI 75 (165)
T ss_pred CCEEEEEcCC-C-----CHHHHHHHHHHHHHHHHHhCcCceeccccCccCceEEEEEeecCCCCccccCCccCCCCCccE
Confidence 3899999852 2 2679999999999999999999999987558999999876321 112222 11 11233
Q ss_pred eeecccccc---ccccccchhHHHHhhhhhhCCCcCCC----------------CCCccccccCCC------CCCCCCCH
Q 037540 229 WAVDMESVK---SRVAVDLESVATHEIGHVLGLAHTSV----------------KEAVMYPSLTPR------TRKVDLKV 283 (340)
Q Consensus 229 wt~~~~~~~---~~~~~dl~~Va~HEIGHaLGL~Hs~d----------------~~SIMyP~~~~~------~~~~~L~~ 283 (340)
|........ ...+..+..|++|||||+|||.|+++ ..|||++..... .....|+.
T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~~~~~~~~~~~~~~~~~~ 155 (165)
T cd04268 76 LLARVYLYSSFVEYSGARLRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAPSNFSIQLGDGQKYTIGP 155 (165)
T ss_pred EeeEEEEchhHHHHHHHHHHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCccccccccccccCCCCCH
Confidence 321111000 01234578999999999999999998 899998654421 13468999
Q ss_pred HHHHHHHHHh
Q 037540 284 DDVEGVQALY 293 (340)
Q Consensus 284 dDI~~Iq~LY 293 (340)
+||.+||.||
T Consensus 156 ~Di~ai~~lY 165 (165)
T cd04268 156 YDIAAIKKLY 165 (165)
T ss_pred HHHHHHHhcC
Confidence 9999999998
No 6
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.85 E-value=2.6e-21 Score=165.36 Aligned_cols=134 Identities=31% Similarity=0.491 Sum_probs=101.1
Q ss_pred CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC-CCCCcc-ccc
Q 037540 147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH-GDGQPF-HFD 224 (340)
Q Consensus 147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h-gdg~pf-hfd 224 (340)
..+|++. +|+|.|.. +.++..++++++++|++.|+++++++|+|+.+ .+||.|.|...++ |+..++ .+.
T Consensus 2 ~~~W~~~---~v~Y~i~~----~~~~~~~~~~~i~~A~~~w~~~t~i~F~e~~~--~ad~~I~f~~~~~~g~~~a~~g~~ 72 (140)
T smart00235 2 SKKWPKG---TVPYVIDS----SSLSPEEVREAIARAFAEWSDVTCLRFVERTS--TADIYISFGKGDGSGCTLSHAGRP 72 (140)
T ss_pred CCcCCCC---cEEEEEcC----CCCCHHHHHHHHHHHHHHHhcCCeeEEEECCC--CCCeEEEEEECCCCCcceeeeecC
Confidence 4699998 99999972 23344556999999999999999999999865 7999999999887 431132 122
Q ss_pred CCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCCCC---ccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 037540 225 AAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVKEA---VMYPSLTPRTRKVDLKVDDVEGVQALYGS 295 (340)
Q Consensus 225 ~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~~S---IMyP~~~~~~~~~~L~~dDI~~Iq~LYG~ 295 (340)
.+..+ ++.. .++....|++|||||+|||.|++++.+ +|++.+......+.++++|..+++.+||+
T Consensus 73 ~g~~~-~~~~-----~~~~~~~~~~HEigHaLGl~H~~~~~drd~~~~~~~~~~~~~~~~~~~~~~~~~~~yg~ 140 (140)
T smart00235 73 GGDQH-FSLG-----NGCINTGVAAHELGHALGLYHEQSRSDRDNYMYINYTNITRNFDLSNDDSLGIPYDYGS 140 (140)
T ss_pred CCceE-EEcc-----CCcCCcccHHHHHHHHhcCCcCCCCCcccCeEEEehhhhhhccccccccCCCchhccCc
Confidence 22232 2211 133345699999999999999998888 99999875544567888899999999984
No 7
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=99.83 E-value=4.6e-20 Score=165.17 Aligned_cols=122 Identities=25% Similarity=0.342 Sum_probs=84.7
Q ss_pred HHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCC--CCccccc-------CCcceeeccccc-cccccc
Q 037540 173 LTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGD--GQPFHFD-------AAETWAVDMESV-KSRVAV 242 (340)
Q Consensus 173 ~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgd--g~pfhfd-------~~e~wt~~~~~~-~~~~~~ 242 (340)
....++++++|++.|+++++|+|+|+.+...+||+|.+....++. |+++.-. .+..|....... ....+.
T Consensus 32 ~~~~~~~i~~A~~~w~~~~~l~F~e~~~~~~adI~i~~~~~~~~~~~g~a~~p~~~~~~~~~g~i~~~~~~~~~~~~~g~ 111 (186)
T cd04277 32 SAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSYDTNSDSPGS 111 (186)
T ss_pred CHHHHHHHHHHHHHHHhhcCceeEECCCCCcceEEEEeccCCCCCccEEEECCCCCccccccceeEEecCcccccCCCCh
Confidence 467999999999999999999999998866899999987753221 2222111 112232111100 002255
Q ss_pred cchhHHHHhhhhhhCCCcCCCC----------------CCccccccCCC-------CCCCCCCHHHHHHHHHHhC
Q 037540 243 DLESVATHEIGHVLGLAHTSVK----------------EAVMYPSLTPR-------TRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~----------------~SIMyP~~~~~-------~~~~~L~~dDI~~Iq~LYG 294 (340)
....|++|||||+|||.|+++. .|||.+..... .....++..||++||+|||
T Consensus 112 ~~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~~~~~~~~~~~~~~~~~~~~DI~AlQ~lYG 186 (186)
T cd04277 112 YGYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDIAALQYLYG 186 (186)
T ss_pred hhHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecCCCCCCccccCcccCCccHHHHHHHHHhhC
Confidence 6789999999999999999853 46895433221 2346789999999999998
No 8
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.74 E-value=1.6e-17 Score=150.64 Aligned_cols=103 Identities=26% Similarity=0.413 Sum_probs=70.1
Q ss_pred CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccC--
Q 037540 149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDA-- 225 (340)
Q Consensus 149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~-- 225 (340)
+|+++ +.|+|.|... . .+..+++|++||+.|++++||+|+|++. .++||+|.|..++ |+ +++ .-+.
T Consensus 2 ~W~~~--~~~~~~f~~~-~-----~~~~r~~I~~A~~~W~~~t~i~F~~~~~-~~adi~I~f~~~~-Gc-~S~vG~~~~~ 70 (198)
T cd04327 2 LWRNG--TVLRIAFLGG-P-----DAFLKDKVRAAAREWLPYANLKFKFVTD-ADADIRISFTPGD-GY-WSYVGTDALL 70 (198)
T ss_pred CCCCC--CeEEEEeCCC-C-----cHHHHHHHHHHHHHHhhhcCeEEEECCC-CCCCEEEEEecCC-CC-CCCcCCcccc
Confidence 79987 3788998742 1 3778999999999999999999999976 5799999998753 32 222 1110
Q ss_pred --Ccceeeccccc-cccccccchhHHHHhhhhhhCCCcCC
Q 037540 226 --AETWAVDMESV-KSRVAVDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 226 --~e~wt~~~~~~-~~~~~~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
...-++..... ....+++...|++|||||||||.|++
T Consensus 71 ~~~~~~t~~l~~~~~~~~~~~~~~~i~HElgHaLG~~HEh 110 (198)
T cd04327 71 IGADAPTMNLGWFTDDTPDPEFSRVVLHEFGHALGFIHEH 110 (198)
T ss_pred cCCCCceeeeeeecCCCchhhHHHHHHHHHHHHhcCcccc
Confidence 01111111100 00123466789999999999999974
No 9
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=99.62 E-value=2.8e-15 Score=134.18 Aligned_cols=90 Identities=16% Similarity=0.132 Sum_probs=63.2
Q ss_pred cEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCCcceeeccccc
Q 037540 157 TLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESV 236 (340)
Q Consensus 157 ~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~ 236 (340)
.|.|.|..+ ++ ...++.|++|++.|++++||+|+|+.. ++..|.|..+ .|+........++ -.+...
T Consensus 3 ~VpY~i~~~-~~-----~~~~~~I~~A~~~w~~~TcIrF~~~~~---~~~~I~f~~~-~Gc~S~vG~~~~~-q~i~l~-- 69 (180)
T cd04280 3 TVPYVIDGS-FD-----ESDRSLILRAMREIESNTCIRFVPRTT---EKDYIRIVKG-SGCWSYVGRVGGR-QVVSLG-- 69 (180)
T ss_pred EEEEEECCC-CC-----HHHHHHHHHHHHHHHhCCcceEEECCC---CCcEEEEEcC-CCcceecCccCCc-eeEEeC--
Confidence 799999852 32 679999999999999999999999875 4445666655 3432222111111 122222
Q ss_pred cccccccchhHHHHhhhhhhCCCcCC
Q 037540 237 KSRVAVDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 237 ~~~~~~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
.++....+++|||||||||.|++
T Consensus 70 ---~~c~~~g~v~HE~~HalG~~HEh 92 (180)
T cd04280 70 ---SGCFSLGTIVHELMHALGFYHEQ 92 (180)
T ss_pred ---CCcCcCchhHHHHHHHhcCcchh
Confidence 25556799999999999999988
No 10
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=99.61 E-value=2.3e-15 Score=135.96 Aligned_cols=129 Identities=19% Similarity=0.399 Sum_probs=79.1
Q ss_pred CCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccCCc
Q 037540 149 RWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDAAE 227 (340)
Q Consensus 149 kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~~e 227 (340)
||+.. +|.|.|.. ++ ....++.|++|++.|++.+||+|+++.. . ....|.+..+ .|+ +++ ....++
T Consensus 2 ~Wp~~---~IpY~~~~-~~-----~~~~~~~I~~Am~~~e~~TcI~F~~~~~-~-~~~~i~~~~~-~gC-~S~vG~~~g~ 68 (191)
T PF01400_consen 2 KWPNG---TIPYYIDP-SF-----SSSQRQRIRKAMDEWEKNTCIRFVERTE-N-EDDYISFSNG-SGC-WSYVGRQGGE 68 (191)
T ss_dssp S-GGG---EEEEEEET-TS------HHHHHHHHHHHHHHHHHSSEEEEE-SS-S-SSSEEEEESS-SSE-EEESS--SSE
T ss_pred cCCCC---EEEEEECC-CC-----CHHHHHHHHHHHHHHHhCCCEEEEECCC-C-CceEEEeecC-ccc-cchhhhcCcc
Confidence 89999 99999985 33 2678899999999999999999999983 3 3334555543 333 332 122222
Q ss_pred ceeeccccccccccccchhHHHHhhhhhhCCCcCC---------------------------------------CCCCcc
Q 037540 228 TWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS---------------------------------------VKEAVM 268 (340)
Q Consensus 228 ~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~---------------------------------------d~~SIM 268 (340)
+ .+... .++....|++|||||||||.|++ |+.|||
T Consensus 69 q-~i~l~-----~~c~~~~~i~HEl~HaLG~~HEh~RpDRd~yi~i~~~~i~~~~~~~f~~~~~~~~~~~~~pYD~~SIM 142 (191)
T PF01400_consen 69 Q-TINLG-----DGCFSVGTILHELGHALGFWHEHQRPDRDNYITINWDNIQPGYKHNFDKYPPSEWSTYGSPYDYGSIM 142 (191)
T ss_dssp E-EEEE------TTC-SHHHHHHHHHHHHTB--GGGSTTGGGTEEE-GGGB-TTSGGGG---SGGGBE-TTS---TT-TT
T ss_pred e-eEEec-----ceeCCccchHHHHHHHHhhhhhhhccccccEEEEehhcchhhhhhhhhccccccccccCCCcCccCee
Confidence 2 23322 24556789999999999999988 567999
Q ss_pred -ccccCCC-------------------CCCCCCCHHHHHHHHHHhCCC
Q 037540 269 -YPSLTPR-------------------TRKVDLKVDDVEGVQALYGSN 296 (340)
Q Consensus 269 -yP~~~~~-------------------~~~~~L~~dDI~~Iq~LYG~~ 296 (340)
|+..... .+...|+..|+.+|+.+|+.+
T Consensus 143 HY~~~~~s~~~~~~ti~~~~~~~~~~iG~~~~lS~~D~~~in~~Y~C~ 190 (191)
T PF01400_consen 143 HYSSYAFSKNGSQPTITPKDPKYQETIGQRNRLSFTDIKQINKMYNCP 190 (191)
T ss_dssp ---TTTTBSSTTS-SEEESSTTCTS-GGG-SS--HHHHHHHHHHTT--
T ss_pred cccCcccccCCCCCeEEecCCCCcccccccCCCCHHHHHHHHHhcCCC
Confidence 4332211 012358999999999999753
No 11
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=99.60 E-value=1.1e-14 Score=132.37 Aligned_cols=99 Identities=21% Similarity=0.351 Sum_probs=67.1
Q ss_pred CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCC
Q 037540 147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAA 226 (340)
Q Consensus 147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~ 226 (340)
..+|+++ .|.|.|.. +++ ...+++|++|++.|++.|||+|++..+ +...|. |..+..|+.+.......
T Consensus 7 ~~~Wp~~---~VpY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~r~~-~~~yi~--f~~~~~GC~S~vG~~~~ 74 (200)
T cd04281 7 ERIWPGG---VIPYVIDG-NFT-----GSQRAMFKQAMRHWENFTCVTFVERTP-EENYIV--FTYRPCGCCSYVGRRGN 74 (200)
T ss_pred cCcCCCC---EEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceEEEECCC-CCCEEE--EEECCCCeeEcCCCcCC
Confidence 5699999 99999974 332 567889999999999999999999765 344444 44333343221111111
Q ss_pred cceeeccccccccccccchhHHHHhhhhhhCCCcCC
Q 037540 227 ETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 227 e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
..-.+... .++....+++|||||||||.|++
T Consensus 75 g~q~isl~-----~~C~~~Gti~HEl~HaLGf~HEh 105 (200)
T cd04281 75 GPQAISIG-----KNCDKFGIVVHELGHVIGFWHEH 105 (200)
T ss_pred CceeeecC-----CCcCcCchHHHHHHHHhcCcchh
Confidence 11222222 35666789999999999999988
No 12
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.58 E-value=9.7e-15 Score=127.37 Aligned_cols=117 Identities=21% Similarity=0.255 Sum_probs=84.0
Q ss_pred HHHHHHHHHHHHHHhhccCcceEEEeccC-CCccEEEEeecCCCC---CCCccc-----ccCCcceeecccccccccccc
Q 037540 173 LTDVKAAFKSSFQRWSSVIPVNFTEIYDY-DSADIHIGFYHGDHG---DGQPFH-----FDAAETWAVDMESVKSRVAVD 243 (340)
Q Consensus 173 ~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~-~~ADI~I~f~~g~hg---dg~pfh-----fd~~e~wt~~~~~~~~~~~~d 243 (340)
...+++++++|++.|+++++++|+++... ..+||.+.+...+.+ .|.++. ......|..... ..+..
T Consensus 20 ~~~~~~~v~~a~~~w~~~~~i~f~~~~~~~~~~di~~~~~~~~~~~~~~g~a~~~~~c~~~~~~~~~~~~~----~~~~~ 95 (167)
T cd00203 20 SAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQ----SGTKE 95 (167)
T ss_pred HHHHHHHHHHHHHHHHhhhCceEEEeccCCCcCcEEEEeccccCCCCceEEEecCCccCCCCCcEEEecCC----ccccc
Confidence 47799999999999999999999999863 389999988764422 123331 011222321111 11345
Q ss_pred chhHHHHhhhhhhCCCcCCC--------------------CCCccccccCCC--CCCCCCCHHHHHHHHHHh
Q 037540 244 LESVATHEIGHVLGLAHTSV--------------------KEAVMYPSLTPR--TRKVDLKVDDVEGVQALY 293 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d--------------------~~SIMyP~~~~~--~~~~~L~~dDI~~Iq~LY 293 (340)
...|++|||||+|||.|..+ ..+||+|..... .....++..||+.||++|
T Consensus 96 ~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~~d~~~i~~~Y 167 (167)
T cd00203 96 GAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQCDIDQINKLY 167 (167)
T ss_pred chhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCHHHHHHHHhhC
Confidence 78999999999999999876 468998766532 245689999999999998
No 13
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=99.48 E-value=4.2e-13 Score=124.30 Aligned_cols=96 Identities=20% Similarity=0.308 Sum_probs=65.7
Q ss_pred CCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCC
Q 037540 147 KPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAA 226 (340)
Q Consensus 147 ~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~ 226 (340)
..+|++ .|.|.|.. +++ ...++.|++|++.|++.|||+|++... +...| .|..+ .|+.+......+
T Consensus 43 ~~~Wp~----~VPY~i~~-~~~-----~~~~~~I~~Am~~~~~~TCirF~~rt~-e~~yi--~i~~~-~GC~S~vG~~gg 108 (230)
T cd04282 43 TYRWPF----PIPYILDD-SLD-----LNAKGVILKAFEMYRLKSCVDFKPYEG-ESNYI--FFFKG-SGCWSMVGDQQG 108 (230)
T ss_pred ccCCCc----ceeEEECC-CCC-----HHHHHHHHHHHHHHHhCCCeeEEECCC-CCcEE--EEEcC-CCeeeccCccCC
Confidence 469986 59999974 343 567899999999999999999998764 33344 44443 243222211112
Q ss_pred cceeeccccccccccccchhHHHHhhhhhhCCCcCC
Q 037540 227 ETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 227 e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
. -.+... .++....+++|||||||||.|++
T Consensus 109 ~-q~isl~-----~~C~~~Gti~HEl~HalGf~HEq 138 (230)
T cd04282 109 G-QNLSIG-----AGCDYKATVEHEFLHALGFYHEQ 138 (230)
T ss_pred e-EEEEEC-----CCcCCCchHHHHHHHHhCCcccc
Confidence 1 223222 35667899999999999999988
No 14
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=99.46 E-value=4.6e-13 Score=120.05 Aligned_cols=123 Identities=17% Similarity=0.279 Sum_probs=82.7
Q ss_pred CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCcc-cccCCcceeeccc
Q 037540 156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPF-HFDAAETWAVDME 234 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pf-hfd~~e~wt~~~~ 234 (340)
..|.|.|.. +++ ...++.|++|++.|++.+||+|++... +...|+| ..+ .|+ +++ ....+ ...+...
T Consensus 4 ~~IpY~i~~-~~~-----~~~~~~I~~A~~~~~~~TCirF~~~~~-~~~yi~~--~~~-~gC-~S~vG~~gg-~q~i~l~ 71 (182)
T cd04283 4 VYVPYVISP-QYS-----ENERAVIEKAMQEFETLTCVRFVPRTT-ERDYLNI--ESR-SGC-WSYIGRQGG-RQTVSLQ 71 (182)
T ss_pred EEEEEEECC-CCC-----HHHHHHHHHHHHHHHhCCceeeEECCC-CCcEEEE--EcC-CCc-eEecCccCC-ceeEecC
Confidence 478999974 332 567899999999999999999998764 3444544 433 233 332 11222 2223322
Q ss_pred cccccccccchhHHHHhhhhhhCCCcCC------------------------------------CCCCcc-ccccCC---
Q 037540 235 SVKSRVAVDLESVATHEIGHVLGLAHTS------------------------------------VKEAVM-YPSLTP--- 274 (340)
Q Consensus 235 ~~~~~~~~dl~~Va~HEIGHaLGL~Hs~------------------------------------d~~SIM-yP~~~~--- 274 (340)
..++....+++|||||||||.|++ |++||| |+....
T Consensus 72 ----~~~C~~~G~i~HEl~HaLG~~HEhsRpDRD~yV~I~~~nI~~~~~~nF~~~~~~~~~~pYDy~SiMHY~~~afs~~ 147 (182)
T cd04283 72 ----KQGCMYKGIIQHELLHALGFYHEQTRSDRDKYVRINWENIIPDQLYNFDKQDTNNLGTPYDYSSVMHYGRYAFSIN 147 (182)
T ss_pred ----CCCcCccchHHHHHHHHhCCcccccccccCceEEEehhhcCchHHhhhhhccccccCCcccceeeeccCCcccccC
Confidence 125666789999999999999988 557899 432210
Q ss_pred -------C-------CCCCCCCHHHHHHHHHHhC
Q 037540 275 -------R-------TRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 275 -------~-------~~~~~L~~dDI~~Iq~LYG 294 (340)
. .+...|+..||..|+.||.
T Consensus 148 g~~Ti~~~~~~~~~iGqr~~lS~~Di~~iN~~Y~ 181 (182)
T cd04283 148 GKPTIVPIPDPNVPIGQRQGMSNLDILRINKLYN 181 (182)
T ss_pred CCCeEEECCcccccccCCCCCCHHHHHHHHHhcC
Confidence 0 1224689999999999996
No 15
>KOG3714 consensus Meprin A metalloprotease [Posttranslational modification, protein turnover, chaperones]
Probab=99.46 E-value=3.9e-13 Score=134.61 Aligned_cols=137 Identities=18% Similarity=0.235 Sum_probs=94.7
Q ss_pred cCCCCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCccccc
Q 037540 145 YGKPRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFD 224 (340)
Q Consensus 145 ~~~~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd 224 (340)
....+|+.. .|.|.|... ++ ...|++|++||+.|++.||++|+|..+ .+.+..+-+..+ |+.......
T Consensus 77 ~~~~~Wp~~---~ipY~i~~~-~~-----~~~r~~i~~A~~~~~~~TCi~F~~~~~-~~~~~~~~~~~~--gC~S~VGr~ 144 (411)
T KOG3714|consen 77 NPERRWPNG---VIPYYIDGS-FT-----SSQRALIRQAMREIENHTCIRFVERTT-PDKDYLIVFTGG--GCYSYVGRR 144 (411)
T ss_pred ChhhcCCCC---eeeeEECCC-CC-----HHHHHHHHHHHHHHhhCcceEEEeCCC-CCcceEEEeCCC--cceeeeCcc
Confidence 346799998 999999863 43 789999999999999999999999874 334444444332 443222222
Q ss_pred CCcceeeccccccccccccchhHHHHhhhhhhCCCcCC---------------------------------------CCC
Q 037540 225 AAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS---------------------------------------VKE 265 (340)
Q Consensus 225 ~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~---------------------------------------d~~ 265 (340)
....-.+. ...++....|++||||||||+.|++ |++
T Consensus 145 gg~~q~~s-----l~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYDyg 219 (411)
T KOG3714|consen 145 GGGQQLLS-----LGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYDYG 219 (411)
T ss_pred CCCcccee-----cCCCcCcCchhHHHHHHHhhhhhccCcccccCceEEeeccCChhhhhhhhhcChhhhhccCCcccCC
Confidence 22111121 2246777999999999999999988 568
Q ss_pred Cccccc-cC-----------CC--------CCCCCCCHHHHHHHHHHhCCCCC
Q 037540 266 AVMYPS-LT-----------PR--------TRKVDLKVDDVEGVQALYGSNPN 298 (340)
Q Consensus 266 SIMyP~-~~-----------~~--------~~~~~L~~dDI~~Iq~LYG~~~~ 298 (340)
|||++. +. +. .+...++..||..|..||.....
T Consensus 220 SvMHY~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~~ 272 (411)
T KOG3714|consen 220 SVMHYAPYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPEC 272 (411)
T ss_pred cccccCCcccCcCCCCCceecccccccccccccCcCCHHHHHHHHhhcCCCcc
Confidence 999522 11 00 11236899999999999987543
No 16
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.42 E-value=1.9e-12 Score=117.44 Aligned_cols=134 Identities=22% Similarity=0.331 Sum_probs=81.6
Q ss_pred CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhcc------CcceEEEeccC--CCccEE---EEeecCCCC-CCC-ccc
Q 037540 156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSV------IPVNFTEIYDY--DSADIH---IGFYHGDHG-DGQ-PFH 222 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~v------t~l~F~ev~~~--~~ADI~---I~f~~g~hg-dg~-pfh 222 (340)
..|||+|.. .++ .++++++++||+.|+++ .+...+++... ...||+ |.|...+.+ .++ |..
T Consensus 8 ~pItyyI~~--~~p----~~~r~aI~~A~~~Wn~~fe~~Gf~~a~~v~~~p~~~~~~Diry~~ir~~~~~~~~~~~gps~ 81 (197)
T cd04276 8 EPIVYYLDN--TFP----EKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSV 81 (197)
T ss_pred CCEEEEecC--CCc----HHHHHHHHHHHHHHHHHHHhcCCCccEEEEeCCCCcCcccceEEEEEEEecCCCcceecccc
Confidence 489999984 332 78999999999999987 35655565431 236888 555554311 111 222
Q ss_pred cc--CCcceeecc--cccc---------ccccccchhHHHHhhhhhhCCCcCCC---------------------CCCcc
Q 037540 223 FD--AAETWAVDM--ESVK---------SRVAVDLESVATHEIGHVLGLAHTSV---------------------KEAVM 268 (340)
Q Consensus 223 fd--~~e~wt~~~--~~~~---------~~~~~dl~~Va~HEIGHaLGL~Hs~d---------------------~~SIM 268 (340)
.| .+|....+. .... ...+..+..+++||+||+|||.|... ..|||
T Consensus 82 ~dPrTGeIl~a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lGl~hn~~~s~~~~~~~l~~~~~~~~~~~~~SVM 161 (197)
T cd04276 82 VDPRTGEILKADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLGLRHNFKASSDGSNEELEDPLGTKEKGATSSVM 161 (197)
T ss_pred cCCCCCCeEEEEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhcCcccccccccCchhhhcchhhhhhcCCcceee
Confidence 22 234332111 1000 01123367899999999999999651 35788
Q ss_pred -ccccC--CC------CCCCCCCHHHHHHHHHHhCC
Q 037540 269 -YPSLT--PR------TRKVDLKVDDVEGVQALYGS 295 (340)
Q Consensus 269 -yP~~~--~~------~~~~~L~~dDI~~Iq~LYG~ 295 (340)
|+.+. .. .....+...||.+||.+||.
T Consensus 162 dY~~~~~~~~~~~~~~~~~~~~g~yDi~Aiq~~Y~~ 197 (197)
T cd04276 162 DYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYTP 197 (197)
T ss_pred cCccccccccCccccccccCCCchHHHHHHHHHcCC
Confidence 42221 11 12346899999999999983
No 17
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=99.23 E-value=8.5e-11 Score=107.10 Aligned_cols=138 Identities=20% Similarity=0.330 Sum_probs=84.8
Q ss_pred CCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhcc-CcceEEEe--ccCCCccEEEEeecCCC--CC-CCcc
Q 037540 148 PRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSV-IPVNFTEI--YDYDSADIHIGFYHGDH--GD-GQPF 221 (340)
Q Consensus 148 ~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~v-t~l~F~ev--~~~~~ADI~I~f~~g~h--gd-g~pf 221 (340)
.-|... +++|+..+.+.+. ..+ ...++.++++|++.|+.. ..|+|+-+ .....+||.|.-..... |. +.+
T Consensus 32 nlV~~~-~~~I~i~~~~~~~-~~l-~s~~~~al~~AI~~yNalgl~i~F~~t~g~~~~~~di~v~~~~~~~~~G~ggsA- 107 (211)
T PF12388_consen 32 NLVGSS-PRTITIIGYTGGS-QSL-NSAWRTALDEAINNYNALGLSITFTLTFGTNYQNADIIVYSNSSNNPSGAGGSA- 107 (211)
T ss_pred eeecCC-CCEEEEEeCCCcc-ccc-cHHHHHHHHHHHHHHHhhCCceEEEEecCCCcCCCceEEEeccCCCCCCcceec-
Confidence 345433 4588888875321 223 367999999999999986 35777522 23356788764332211 11 122
Q ss_pred ccc-C-----CcceeeccccccccccccchhHHHHhhhhhhCCCcCC------------------------------CCC
Q 037540 222 HFD-A-----AETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTS------------------------------VKE 265 (340)
Q Consensus 222 hfd-~-----~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~------------------------------d~~ 265 (340)
.|. . ...+..+.. .........|++|||||++||.|+. |.+
T Consensus 108 GFP~s~G~P~~~I~I~~~~---~~~~~~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~~nEg~~~vGAi~IpGTPt~~d~~ 184 (211)
T PF12388_consen 108 GFPTSNGNPYKFIQIYGLS---NYSVNVIEHVITHEIGHCIGFRHTDYFNRSSCGSGGNEGSAGVGAIHIPGTPTGADPN 184 (211)
T ss_pred cCCCCCCCCCceEEEEecC---CCchhHHHHHHHHHhhhhccccccCcCCcccccccCCcCccccceEECCCCCCCCCCc
Confidence 122 1 111221111 1112235789999999999999976 457
Q ss_pred CccccccCCCCCCCCCCHHHHHHHHHHh
Q 037540 266 AVMYPSLTPRTRKVDLKVDDVEGVQALY 293 (340)
Q Consensus 266 SIMyP~~~~~~~~~~L~~dDI~~Iq~LY 293 (340)
|||...+... ...+++..||.+++.||
T Consensus 185 SiM~ac~~~~-~~~~f~~~Di~Al~~lY 211 (211)
T PF12388_consen 185 SIMNACFSSG-EDGEFTSNDITALNYLY 211 (211)
T ss_pred hhhhccccCC-CCCCcChhhHHHHHhhC
Confidence 8897776543 34479999999999999
No 18
>PF01471 PG_binding_1: Putative peptidoglycan binding domain; InterPro: IPR002477 This entry represents peptidoglycan binding domain (PGBD), as well as related domains that share the same structure. PGBD may have a general peptidoglycan binding function, has a core structure consisting of a closed, three-helical bundle with a left-handed twist. It is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation [, , ]. Examples are: Muramoyl-pentapeptide carboxypeptidase (3.4.17.8 from EC) N-acetylmuramoyl-L-alanine amidase cwlA precursor (cell wall hydrolase, autolysin, 3.5.1.28 from EC) Autolytic lysozyme (1,4-beta-N-acetylmuramidase, autolysin, 3.2.1.17 from EC) Membrane-bound lytic murein transglycosylase B Zinc-containing D-alanyl-D-alanine-cleaving carboxypeptidase, VanX []. Many of the proteins having this domain are as yet uncharacterised. However, some are known to belong to MEROPS peptidase family M15 (clan MD), subfamily M15A metallopeptidases. A number of the proteins belonging to subfamily M15A are non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. Eukaryotic enzymes can contain structurally similar PGBD-like domains. Matrix metalloproteinases (MMP), which catalyse extracellular matrix degradation, have N-terminal domains that resemble PGBD. Examples are gelatinase A (MMP-2), which degrades type IV collagen [], stromelysin-1 (MMP-3), which plays a role in arthritis and tumour invasion [, ], and gelatinase B (MMP-9) secreted by neutrophils as part of the innate immune defence mechanism []. Several MMPs are implicated in cancer progression, since degradation of the extracellular matrix is an essential step in the cascade of metastasis [].; GO: 0008152 metabolic process; PDB: 1L6J_A 3BKH_A 3BKV_A 1GXD_A 1EAK_D 1CK7_A 1SLM_A 1LBU_A 1SU3_B.
Probab=99.13 E-value=1.5e-10 Score=84.07 Aligned_cols=55 Identities=35% Similarity=0.643 Sum_probs=46.8
Q ss_pred cChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHh
Q 037540 62 TGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTI 119 (340)
Q Consensus 62 ~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m 119 (340)
+.+..++++|.++||++. ..+..+++.+++||+.||+.+||++||++|.+|+++|
T Consensus 3 ~~v~~lq~~L~~~gy~~~---~~~g~~~~~t~~Av~~fQ~~~gL~~tG~~d~~T~~~L 57 (57)
T PF01471_consen 3 PDVKALQQYLNRLGYYPG---PVDGIFDPETREAVKAFQKANGLPVTGVVDPETWEAL 57 (57)
T ss_dssp HHHHHHHHHHHHTTTT-S---STTSBSHHHHHHHHHHHHHHTTS-SSSSBCHHHHHHH
T ss_pred HHHHHHHHHHHHcCCCCC---CCCCCcCHHHHHHHHHHHHHcCcCCCCccCHHHHhcC
Confidence 356789999999999943 2356789999999999999999999999999999987
No 19
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=98.73 E-value=6.7e-08 Score=81.12 Aligned_cols=125 Identities=21% Similarity=0.257 Sum_probs=69.4
Q ss_pred CcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhh-ccCcceEEEeccCCCccEEEEeecCCCCCCCcccccCCcceeeccc
Q 037540 156 MTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWS-SVIPVNFTEIYDYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDME 234 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws-~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~ 234 (340)
++|+|.-.. ..+.+..|.+|.+.|+ .|.+++|++..+ ++|++.-.. +.+..++ .-+.....++...
T Consensus 2 vtv~Ydas~--------A~~f~~~i~~aa~iWN~sV~NV~L~~~s~---a~~~~~~~~-~~~~sya-~~~g~G~G~I~l~ 68 (132)
T PF02031_consen 2 VTVYYDASR--------APEFRSAIAQAAQIWNSSVSNVRLVEGSS---ADIRYYEGN-DPRGSYA-STDGLGSGYIFLD 68 (132)
T ss_dssp EEEEEEEEE---------GGGHHHHHHHHHHHHHH-SSEEEEE-SS----SEEEEEE---TT--EE-EE-SSS-EEEEEE
T ss_pred EEEEEeCCC--------CchhHHHHHHHHHHHhcccCceEEeecCC---CcEEEEEec-CCCCccc-ccCCCCcEEEEec
Confidence 478888763 2568999999999997 688999999754 888876432 1111111 1111222222221
Q ss_pred cccccccccchhHHHHhhhhhhCCCc--CCCCCCccccccCCC-CCCCCCCHHHHHHHHHHhC
Q 037540 235 SVKSRVAVDLESVATHEIGHVLGLAH--TSVKEAVMYPSLTPR-TRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 235 ~~~~~~~~dl~~Va~HEIGHaLGL~H--s~d~~SIMyP~~~~~-~~~~~L~~dDI~~Iq~LYG 294 (340)
.. ...+++-..|++|||||.|||.. +.....+|.-.-.+- -.+..+....+..++++|.
T Consensus 69 ~~-~~qgy~~~RIaaHE~GHiLGLPD~y~GpCS~LMSG~~aG~sCtN~~Pna~E~a~V~~~~a 130 (132)
T PF02031_consen 69 YQ-QNQGYNSTRIAAHELGHILGLPDHYPGPCSELMSGGSAGTSCTNAYPNAAERARVNSLFA 130 (132)
T ss_dssp HH-HHHHS-HHHHHHHHHHHHHT----TTS-TT-GGGTTTT-TT----S--HHHHHHHHHHCT
T ss_pred hH-HhhCCccceeeeehhccccCCCCCCCCccHHhhcCCCCCCCCCCCCCCHHHHHHHHHHhc
Confidence 11 13467788899999999999964 334567887433221 1244578899999999985
No 20
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.61 E-value=2.4e-07 Score=83.68 Aligned_cols=112 Identities=21% Similarity=0.376 Sum_probs=73.3
Q ss_pred HHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCCCCCCc-c----------ccc--------CCcceeeccc
Q 037540 174 TDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGDGQP-F----------HFD--------AAETWAVDME 234 (340)
Q Consensus 174 ~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgdg~p-f----------hfd--------~~e~wt~~~~ 234 (340)
...+.++..|++.|++.-|+..+|- .+.|||+|..... .|-|.. | +|= -.....+...
T Consensus 103 p~wq~a~~tava~wa~~fpl~ive~--~eeaDItie~~n~-pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~~~ 179 (236)
T COG5549 103 PRWQGAYLTAVAGWAKTFPLIIVER--FEEADITIEVGNP-PGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIMYP 179 (236)
T ss_pred hhHHHHHHHHHHHHHHhCCceeeec--ceeeeEEEEecCC-CCCcccccchHHHHHHHHHHhhhccCcccccccccccCC
Confidence 3678899999999999977766655 4689999976543 222211 1 110 0011112111
Q ss_pred cccccccccchhHHHHhhhhhhCCC-cCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCC
Q 037540 235 SVKSRVAVDLESVATHEIGHVLGLA-HTSVKEAVMYPSLTPRTRKVDLKVDDVEGVQALYGS 295 (340)
Q Consensus 235 ~~~~~~~~dl~~Va~HEIGHaLGL~-Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~ 295 (340)
....+.++..+|.||+|||||+. |+.- .++|.+..+ ...++..|++.+.++|-.
T Consensus 180 --pg~~~e~L~~tarhElGhaLgi~ghsd~-ks~aly~~s----~~~is~rdv~tL~r~Yeq 234 (236)
T COG5549 180 --PGELRENLNPTARHELGHALGIWGHSDL-KSDALYGIS----WQGISRRDVKTLERKYEQ 234 (236)
T ss_pred --cccchhhhhHHHHHhhcchheecccccc-hhhhhcchh----hcccCHHHHHHHHHHhcC
Confidence 12335568899999999999998 8764 556653333 224899999999999974
No 21
>TIGR02869 spore_SleB spore cortex-lytic enzyme. Members of this protein family are the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. This protein is stored in an inactive form in the spore and activated during germination.
Probab=97.90 E-value=2.7e-05 Score=71.05 Aligned_cols=69 Identities=23% Similarity=0.308 Sum_probs=57.7
Q ss_pred cCCCCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcCCCCCCCC
Q 037540 56 EMGSNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMSPRCGVSD 128 (340)
Q Consensus 56 ~~g~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~PRCGvpD 128 (340)
++|++-+.+..+++.|.+.||...+ .+..++..+++||+.||+..||+++|.+|..|...|..| +.++.
T Consensus 8 ~~G~~g~~V~~LQ~~L~~lG~~~g~---idG~fg~~T~~AV~~FQ~~~GL~~dG~vg~~T~~~L~~~-~~~~~ 76 (201)
T TIGR02869 8 QRGSTGSDVIEIQRRLKAWGYYNGK---VDGVFGWLTYWAVRKFQSKNGLTVDGIVGPKTKAALGIA-TTQQS 76 (201)
T ss_pred CCCCCcHHHHHHHHHHHHcCCCCCC---CCCccCHHHHHHHHHHHHHhCCCCCCCcCHHHHHHhCCc-cccCc
Confidence 4588889999999999999998533 245678999999999999999999999999999999543 34443
No 22
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=97.67 E-value=4.9e-05 Score=69.55 Aligned_cols=47 Identities=28% Similarity=0.352 Sum_probs=29.8
Q ss_pred chhHHHHhhhhhhCCCcCCCC--------------CCccccccCCCCCCCCCCHHHHHHHHHH
Q 037540 244 LESVATHEIGHVLGLAHTSVK--------------EAVMYPSLTPRTRKVDLKVDDVEGVQAL 292 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d~--------------~SIMyP~~~~~~~~~~L~~dDI~~Iq~L 292 (340)
...+++|||||+|||.|..+. .+||.+.-.+ .. ..++.-+|..|+..
T Consensus 137 ~~~~~aHEiGH~lGl~H~~~~~~~~~~~~~~~~~~~tIMsY~~~~-~~-~~FS~~~i~~i~~~ 197 (206)
T PF13583_consen 137 GYQTFAHEIGHNLGLRHDFDYSGDPYSYAYECDSGQTIMSYASRP-SR-PYFSPPSIEYIREV 197 (206)
T ss_pred cchHHHHHHHHHhcCCCCcccCCCcccccccCCCCCEEeCCCCCC-CC-CCCCchhHHHHhCC
Confidence 347899999999999998742 2577521111 11 22666677666543
No 23
>PRK10594 murein L,D-transpeptidase; Provisional
Probab=97.37 E-value=0.00045 Score=72.24 Aligned_cols=67 Identities=21% Similarity=0.267 Sum_probs=56.1
Q ss_pred cCCCCCcChHHHHHHHHhcCCCCCCC------C-----------------------------------CCCCcchHHHHH
Q 037540 56 EMGSNITGMSELKKYFTRFGYLSLSY------T-----------------------------------NFTDVFDTQLQS 94 (340)
Q Consensus 56 ~~g~~~~~~~~~~~YL~~fGYl~~~~------~-----------------------------------~~~~~~d~~l~~ 94 (340)
++|+.-+.+..+++=|...|++.... . ..+..+|.+|.+
T Consensus 236 rpG~~~~~V~~LR~RL~~~G~l~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~YD~~Lv~ 315 (608)
T PRK10594 236 RPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPAVRAAYDNELVE 315 (608)
T ss_pred CCCCCCchHHHHHHHHHHcCCcccccccccccccccccccccccccccccccccccccccccccccccccccccCHHHHH
Confidence 67999999999999999999985210 0 012357999999
Q ss_pred HHHHHHHhcCCCCCccCCHHHHHHhcCC
Q 037540 95 AVVSYQTKLGLPVTGKLDSETISTIMSP 122 (340)
Q Consensus 95 Ai~~~Q~~~gLpvTG~LD~~T~~~m~~P 122 (340)
|||+||+..||..+|+++..|++.|+.|
T Consensus 316 AVK~FQ~rhGL~~DGvIG~~T~~~LNvs 343 (608)
T PRK10594 316 AVKRFQAWQGLGADGVIGPRTRDWLNVT 343 (608)
T ss_pred HHHHHHHHcCCCCCcCcCHHHHHHHcCC
Confidence 9999999999999999999999998754
No 24
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=97.36 E-value=7.9e-05 Score=66.27 Aligned_cols=47 Identities=19% Similarity=0.212 Sum_probs=19.1
Q ss_pred chhHHHHhhhhhhCCCcCCCCC-------------CccccccCCCCCCCCCCHHHHHHHHHH
Q 037540 244 LESVATHEIGHVLGLAHTSVKE-------------AVMYPSLTPRTRKVDLKVDDVEGVQAL 292 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d~~-------------SIMyP~~~~~~~~~~L~~dDI~~Iq~L 292 (340)
...|++|||||.||+.|..+.+ .||++. .......+++--|+.|+.+
T Consensus 111 ~~~~~aHElGH~lGa~Hd~~~~~~~~~~~~~~sg~~IMsy~--~~~~~~~fS~cS~~~i~~~ 170 (173)
T PF13574_consen 111 GIDTFAHELGHQLGAPHDFDNGCNPGSPTCPGSGGYIMSYA--GICNNDKFSPCSIRQIRAV 170 (173)
T ss_dssp HHHHHHHHHHHHHT---SSSS----SS--T-STTT-TT-SS---GGGG--------------
T ss_pred eeeeehhhhHhhcCCCCCCCCCCCCCCCcCCCCCCEEEeeC--CCCCCcccccccccccccc
Confidence 3568999999999999988643 688744 2223345566666665543
No 25
>PF11350 DUF3152: Protein of unknown function (DUF3152); InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=97.33 E-value=0.00067 Score=61.60 Aligned_cols=108 Identities=19% Similarity=0.269 Sum_probs=66.0
Q ss_pred CcEEEEeecCCcCCC---CCHHHHHHHHHHHH---HHhhccCcceEEEeccCCCccEEEEeecCCCCC--CCcc------
Q 037540 156 MTLTYAFSKTDMIDY---IDLTDVKAAFKSSF---QRWSSVIPVNFTEIYDYDSADIHIGFYHGDHGD--GQPF------ 221 (340)
Q Consensus 156 ~~LTY~f~~~n~~~~---l~~~~~r~ai~~Af---~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~hgd--g~pf------ 221 (340)
..++|++.-++-... .+.++..++|.+.+ +-|.....+.|+.|.+ +.+|++|...+...-+ +.+.
T Consensus 29 ~~~rY~VeVE~Gi~~~~~gg~~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~-~~~Df~I~Lasp~T~~~lC~g~~~~~e~ 107 (203)
T PF11350_consen 29 RLYRYRVEVEDGIDVDAYGGDDAFAAMVDATLADPRGWTADGRVRFQRVDS-GAPDFRISLASPGTTDRLCAGLDTSGET 107 (203)
T ss_pred ceEEEEEEEccCcCCcccccHHHHHHHHHHHhcCCCCCCcCCCEEEEECCC-CCCCEEEEECCcchhhhhccCcCcCcee
Confidence 489999987643221 12355666666655 5699888999999987 4599999987653211 1011
Q ss_pred --cc-------cCCcceeeccccccccccccchhHHHHhhhhhhCCCcCCCC
Q 037540 222 --HF-------DAAETWAVDMESVKSRVAVDLESVATHEIGHVLGLAHTSVK 264 (340)
Q Consensus 222 --hf-------d~~e~wt~~~~~~~~~~~~dl~~Va~HEIGHaLGL~Hs~d~ 264 (340)
+. -+..+|..+...-...-+.....|+=||.||+||-+|..=+
T Consensus 108 SC~~~~g~~VvIN~~RWv~Ga~~f~gdl~~YRqYvINHEVGH~LGh~H~~Cp 159 (203)
T PF11350_consen 108 SCRNPAGGRVVINLARWVRGAPAFGGDLASYRQYVINHEVGHALGHGHEPCP 159 (203)
T ss_pred EeecCCCCeEEEehHHhhccCcccCCcHHHHHHHhhhhhhhhhcccCCCcCC
Confidence 11 01234544332211111233567999999999999887643
No 26
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=97.27 E-value=0.00013 Score=64.74 Aligned_cols=27 Identities=41% Similarity=0.572 Sum_probs=24.3
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCccccc
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAVMYPS 271 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~ 271 (340)
..=++||+||+|||.|=.++.-||++.
T Consensus 125 ~KEv~HElGH~~GL~HC~N~~CVM~FS 151 (181)
T COG1913 125 VKEVLHELGHLLGLSHCPNPRCVMNFS 151 (181)
T ss_pred HHHHHHHhhhhcCcccCCCCCcEEeCC
Confidence 456899999999999999999999854
No 27
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=97.20 E-value=0.00035 Score=62.56 Aligned_cols=48 Identities=19% Similarity=0.263 Sum_probs=32.8
Q ss_pred cchhHHHHhhhhhhCCCcCCCC----------CCccccccCCCCCCCCCCHHHHHHHHH
Q 037540 243 DLESVATHEIGHVLGLAHTSVK----------EAVMYPSLTPRTRKVDLKVDDVEGVQA 291 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~----------~SIMyP~~~~~~~~~~L~~dDI~~Iq~ 291 (340)
....+++||+||.|||.|.... .-||.|.... .....+|.-.++.|+.
T Consensus 132 ~~~~~~aHElGH~lG~~HD~~~~~~~~c~~~~~~IM~~~~~~-~~~~~FS~CS~~~i~~ 189 (192)
T cd04267 132 LTALTMAHELGHNLGAEHDGGDELAFECDGGGNYIMAPVDSG-LNSYRFSQCSIGSIRE 189 (192)
T ss_pred eehhhhhhhHHhhcCCcCCCCCCccccCCCCCCeEEcccccC-CCCCccChhhHHHHHH
Confidence 4567999999999999997753 2489876543 2234566655555553
No 28
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=97.11 E-value=0.0002 Score=64.21 Aligned_cols=23 Identities=35% Similarity=0.430 Sum_probs=14.9
Q ss_pred cchhHHHHhhhhhhCCCcCCCCC
Q 037540 243 DLESVATHEIGHVLGLAHTSVKE 265 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~~ 265 (340)
....|++|||||.||+.|..+..
T Consensus 141 ~~~~~~AHEiGH~lGa~HD~~~~ 163 (196)
T PF13688_consen 141 NGAITFAHEIGHNLGAPHDGDYS 163 (196)
T ss_dssp HHHHHHHHHHHHHTT-----SSS
T ss_pred ceehhhHHhHHHhcCCCCCCCCC
Confidence 34679999999999999987753
No 29
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.06 E-value=0.00035 Score=62.74 Aligned_cols=27 Identities=41% Similarity=0.608 Sum_probs=24.0
Q ss_pred chhHHHHhhhhhhCCCcCCCCCCcccc
Q 037540 244 LESVATHEIGHVLGLAHTSVKEAVMYP 270 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d~~SIMyP 270 (340)
+..+++||+||.|||.|-.++.-||+.
T Consensus 125 ~~k~~~HElGH~lGL~HC~~~~CvM~~ 151 (179)
T PRK13267 125 VRKEVTHELGHTLGLEHCDNPRCVMNF 151 (179)
T ss_pred HHHHHHHHHHHHcCCccCCCCCccCCC
Confidence 345699999999999999999999984
No 30
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.98 E-value=0.00029 Score=58.40 Aligned_cols=18 Identities=50% Similarity=0.682 Sum_probs=12.6
Q ss_pred chhHHHHhhhhhhCCCcC
Q 037540 244 LESVATHEIGHVLGLAHT 261 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs 261 (340)
...|++|||||.|||.|.
T Consensus 107 ~~~~~~HEiGH~lGl~Hd 124 (124)
T PF13582_consen 107 GVDTFAHEIGHNLGLNHD 124 (124)
T ss_dssp STTHHHHHHHHHTT----
T ss_pred cceEeeehhhHhcCCCCC
Confidence 358999999999999995
No 31
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=96.97 E-value=0.00053 Score=62.44 Aligned_cols=54 Identities=22% Similarity=0.345 Sum_probs=38.8
Q ss_pred cccchhHHHHhhhhhhCCCcCCC---------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 037540 241 AVDLESVATHEIGHVLGLAHTSV---------KEAVMYPSLTPRTRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaLGL~Hs~d---------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG 294 (340)
+.....|++|||||.||+.|-.+ ..-||+|.......+...|.=-+..++.+..
T Consensus 137 ~~~~a~~~aHElGH~LG~~HD~~~~~C~~~~~~~~IM~~~~~~~~~~~~fS~CS~~~~~~~l~ 199 (207)
T cd04273 137 GLSSAFTIAHELGHVLGMPHDGDGNSCGPEGKDGHIMSPTLGANTGPFTWSKCSRRYLTSFLD 199 (207)
T ss_pred CceeEEeeeeechhhcCCCCCCCCCCCCCCCCCceeecccccCCCCCCCcCHHHHHHHHHHHc
Confidence 34456799999999999999875 2689998876432344667766666665554
No 32
>PF07998 Peptidase_M54: Peptidase family M54; InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=96.82 E-value=0.00067 Score=61.61 Aligned_cols=27 Identities=41% Similarity=0.588 Sum_probs=19.3
Q ss_pred chhHHHHhhhhhhCCCcCCCCCCccccc
Q 037540 244 LESVATHEIGHVLGLAHTSVKEAVMYPS 271 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d~~SIMyP~ 271 (340)
+..+++||+||++||.|=.+ .-+|++.
T Consensus 145 ~~Kea~HElGH~~GL~HC~~-~CvM~~s 171 (194)
T PF07998_consen 145 VCKEAVHELGHLFGLDHCEN-RCVMNFS 171 (194)
T ss_dssp HHHHHHHHHHHHTT----SS-TSTTS--
T ss_pred HHHHHHHHHHHHcCCcCCCC-CCccCCC
Confidence 35799999999999999999 9999964
No 33
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=96.72 E-value=0.0019 Score=60.13 Aligned_cols=49 Identities=29% Similarity=0.493 Sum_probs=37.2
Q ss_pred hHHHHhhhhhhCCCcCCCC-------------------------CCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 037540 246 SVATHEIGHVLGLAHTSVK-------------------------EAVMYPSLTPRTRKVDLKVDDVEGVQALYGSN 296 (340)
Q Consensus 246 ~Va~HEIGHaLGL~Hs~d~-------------------------~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~~ 296 (340)
.|++|||||.||+.|=.+. .=||+|.... ....++.--+..++.+...+
T Consensus 147 ~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~--~~~~FS~CS~~~~~~~l~~~ 220 (228)
T cd04271 147 QVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSS--GITEFSPCTIGNICSLLGRN 220 (228)
T ss_pred eehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCC--CCCccChhhHHHHHHHHhcC
Confidence 5999999999999996431 1199987554 45578888888888887654
No 34
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=96.70 E-value=0.00077 Score=59.03 Aligned_cols=25 Identities=40% Similarity=0.522 Sum_probs=15.8
Q ss_pred cccchhHHHHhhhhhhCCCcCCCCC
Q 037540 241 AVDLESVATHEIGHVLGLAHTSVKE 265 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaLGL~Hs~d~~ 265 (340)
..+...|++||+||-|||.|.....
T Consensus 66 ~~~~g~TltHEvGH~LGL~HtF~~~ 90 (154)
T PF05572_consen 66 QYNFGKTLTHEVGHWLGLYHTFGGG 90 (154)
T ss_dssp TS-SSHHHHHHHHHHTT---TT---
T ss_pred ccccccchhhhhhhhhcccccccCC
Confidence 3456789999999999999998544
No 35
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=96.64 E-value=0.0024 Score=57.23 Aligned_cols=49 Identities=24% Similarity=0.365 Sum_probs=32.6
Q ss_pred chhHHHHhhhhhhCCCcCCC------CCCccccccCCCCCCCCCCHHHHHHHHHHhC
Q 037540 244 LESVATHEIGHVLGLAHTSV------KEAVMYPSLTPRTRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG 294 (340)
.-.+++|||||.||+.|-.+ ..-||+|.... .....|.=-...++.+..
T Consensus 131 ~a~~~AHElGH~lG~~HD~~~c~C~~~~cIM~~~~~~--~~~~fS~CS~~~~~~~l~ 185 (194)
T cd04269 131 FAVTMAHELGHNLGMEHDDGGCTCGRSTCIMAPSPSS--LTDAFSNCSYEDYQKFLS 185 (194)
T ss_pred HHHHHHHHHHhhcCCCcCCCCCCCCCCCeEecCCCCC--CCCCCChhhHHHHHHHHh
Confidence 35799999999999999543 36789886542 233455544555555444
No 36
>COG2989 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.34 E-value=0.0072 Score=62.04 Aligned_cols=65 Identities=23% Similarity=0.400 Sum_probs=55.2
Q ss_pred cCCCCCcChHHHHHHHHhcCC-CCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhcC
Q 037540 56 EMGSNITGMSELKKYFTRFGY-LSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIMS 121 (340)
Q Consensus 56 ~~g~~~~~~~~~~~YL~~fGY-l~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~~ 121 (340)
++|+....+..+.+=|.+-|= ++.. ...+..+|.++.+|||+||+..||+..|+.++.|+++|+-
T Consensus 233 rpG~~~~~v~aL~~~L~~~~~d~~~a-~~~s~~yd~el~~avKrfQ~~~GL~~DGviG~~T~~aLn~ 298 (561)
T COG2989 233 RPGVTSPDVPALRARLARSGMDLPSA-AGSSPAYDPELVEAVKRFQARHGLPADGVIGPATRAALNV 298 (561)
T ss_pred CCCCCchhHHHHHHHHHhcCccchhh-ccCcccccHHHHHHHHHHHHHhCCCCCCcccHHHHHHhcc
Confidence 679999999999999999883 3322 2235578999999999999999999999999999999974
No 37
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=96.19 E-value=0.00077 Score=62.64 Aligned_cols=26 Identities=46% Similarity=0.622 Sum_probs=22.5
Q ss_pred cccchhHHHHhhhhhhCCCcCCCCCC
Q 037540 241 AVDLESVATHEIGHVLGLAHTSVKEA 266 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaLGL~Hs~d~~S 266 (340)
..++..|++||+||.|||.|.....+
T Consensus 134 ~~n~g~t~~HEvGH~lGL~HtF~~~~ 159 (225)
T cd04275 134 PYNLGDTATHEVGHWLGLYHTFQGGS 159 (225)
T ss_pred cccccceeEEeccceeeeeeeecCCC
Confidence 45677899999999999999997764
No 38
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=95.81 E-value=0.014 Score=52.51 Aligned_cols=52 Identities=21% Similarity=0.443 Sum_probs=31.3
Q ss_pred cccchhHHHHhhhhhhCCCcCCC--------CCCccccccCCCCCCCCCCHHHHHHHHHHh
Q 037540 241 AVDLESVATHEIGHVLGLAHTSV--------KEAVMYPSLTPRTRKVDLKVDDVEGVQALY 293 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaLGL~Hs~d--------~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LY 293 (340)
...+-.+++|||||.||+.|-.. ..-||.|...+.. ....|.=-+..++.+.
T Consensus 128 ~~~~a~~~AHelGH~lGm~HD~~~~C~C~~~~~cIM~~~~~~~~-~~~fS~CS~~~~~~~l 187 (199)
T PF01421_consen 128 GLSFAVIIAHELGHNLGMPHDGDNGCKCPRRDGCIMSPSIGPSS-SDKFSNCSRRQFEEFL 187 (199)
T ss_dssp HHHHHHHHHHHHHHHTT---TTTCTHCSSSSSSSTTSSSSSSSS-TSSS-HHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHhcCCCCCCCCcCCcCCCCCccccccccCCC-CCCcCHHHHHHHHHHH
Confidence 33455789999999999999766 3568998554322 2256665555555543
No 39
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=95.67 E-value=0.015 Score=54.63 Aligned_cols=50 Identities=20% Similarity=0.383 Sum_probs=33.7
Q ss_pred hhHHHHhhhhhhCCCcCCC-----C------CCccccccCC--CCCCCCCCHHHHHHHHHHhC
Q 037540 245 ESVATHEIGHVLGLAHTSV-----K------EAVMYPSLTP--RTRKVDLKVDDVEGVQALYG 294 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d-----~------~SIMyP~~~~--~~~~~~L~~dDI~~Iq~LYG 294 (340)
..|++|||||.||+.|-.+ + .=||+|.-.. ......++.=-++.++...-
T Consensus 168 a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~CS~~~~~~~L~ 230 (244)
T cd04270 168 DLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSPCSKKSISKVLE 230 (244)
T ss_pred HHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCHhHHHHHHHHHc
Confidence 4699999999999999554 1 3489986421 12345677666666665554
No 40
>KOG4525 consensus Jacalin-like lectin domain-containing protein [General function prediction only]
Probab=95.42 E-value=0.023 Score=57.06 Aligned_cols=57 Identities=25% Similarity=0.319 Sum_probs=39.0
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCcc---ccccCCC----C---------------CCCCCCHHHHHHHHHHhCCCCCCCcc
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAVM---YPSLTPR----T---------------RKVDLKVDDVEGVQALYGSNPNFKFS 302 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SIM---yP~~~~~----~---------------~~~~L~~dDI~~Iq~LYG~~~~~~~~ 302 (340)
...++|||||.||+.|. ++.|| |++++.. + .+-+.++.-+++++-+|-...+-.++
T Consensus 302 lGA~~HElGH~lgcpHq--~~GvMlrgy~~lNRsftt~e~~~vrt~s~g~aPv~~keec~w~~lD~~Rf~yhp~f~lp~d 379 (614)
T KOG4525|consen 302 LGAVCHELGHCLGCPHQ--SEGVMLRGYDYLNRSFTTDEFECVRTGSKGLAPVLAKEECPWRILDARRFSYHPTFSLPGD 379 (614)
T ss_pred HHHHHHHhhhccCCCCC--CCceEecCchhhcceeeecceeeeecCCcCcCCccccccCchhhhhhhhhcccccccCCCC
Confidence 46889999999999997 46788 5554421 0 12356778889999999654333344
Q ss_pred c
Q 037540 303 S 303 (340)
Q Consensus 303 ~ 303 (340)
+
T Consensus 380 p 380 (614)
T KOG4525|consen 380 P 380 (614)
T ss_pred C
Confidence 3
No 41
>PF05548 Peptidase_M11: Gametolysin peptidase M11; InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=94.75 E-value=0.018 Score=56.22 Aligned_cols=21 Identities=43% Similarity=0.580 Sum_probs=18.0
Q ss_pred ccchhHHHHhhhhhhCCCcCC
Q 037540 242 VDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 242 ~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
.....|++||+||-|||.|+.
T Consensus 148 ~~~~~~~~HElgHN~GL~Ha~ 168 (314)
T PF05548_consen 148 VQDWATIMHELGHNLGLWHAG 168 (314)
T ss_pred cccHHHHHHHhhhhccccccC
Confidence 344579999999999999996
No 42
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=94.55 E-value=0.08 Score=46.37 Aligned_cols=61 Identities=25% Similarity=0.372 Sum_probs=49.4
Q ss_pred CCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCccCCHHHHHHhcC
Q 037540 59 SNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGL-PVTGKLDSETISTIMS 121 (340)
Q Consensus 59 ~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gL-pvTG~LD~~T~~~m~~ 121 (340)
..-..+..++.-|.+.||.+. ...+..+....+.||+.||+..+| +++|..+++|...+..
T Consensus 43 ~~~~~v~~lq~~L~~~g~~~~--~~~dg~~g~~t~~av~~fQ~~~gl~~~dG~~g~~t~~al~~ 104 (185)
T COG3409 43 AEGPSVRILQAALNALGYYPD--GVIDGVYGPETAAAVRAFQQKNGLSPVDGIVGPATRAALPS 104 (185)
T ss_pred CCCchHHHHHHHHHhcCCCCC--CCccCccCcccHHHHHHHhhhcCcccccccccHHHHHHHHH
Confidence 355677888999999999873 112334567889999999999999 8999999999998865
No 43
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=94.16 E-value=0.029 Score=51.54 Aligned_cols=20 Identities=35% Similarity=0.461 Sum_probs=17.1
Q ss_pred cchhHHHHhhhhhhCCCcCC
Q 037540 243 DLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~ 262 (340)
....+++||+||.||+.|-.
T Consensus 144 ~~~~~~AHElGH~lG~~HD~ 163 (220)
T cd04272 144 YGVYTMTHELAHLLGAPHDG 163 (220)
T ss_pred ccHHHHHHHHHHHhCCCCCC
Confidence 34689999999999999954
No 44
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=94.09 E-value=0.062 Score=49.79 Aligned_cols=54 Identities=24% Similarity=0.412 Sum_probs=32.0
Q ss_pred CCccEEEEeecCCCCCCCcccccCCcceeeccccccccccccc--hhHHHHhhhhhhCCCcCCC
Q 037540 202 DSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESVKSRVAVDL--ESVATHEIGHVLGLAHTSV 263 (340)
Q Consensus 202 ~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~~~~~~~dl--~~Va~HEIGHaLGL~Hs~d 263 (340)
+-.|++|.-..|.-+| |||..++. +...++ ...+.++ ..||+||.|||+ .|...
T Consensus 51 gl~~V~Ve~~~G~LtD----HYdP~~k~-vrLS~~-vy~~~SiaAvaVAAHEvGHAi--Q~a~~ 106 (222)
T PF04298_consen 51 GLSDVRVERVPGELTD----HYDPRNKV-VRLSED-VYNGRSIAAVAVAAHEVGHAI--QHAEG 106 (222)
T ss_pred CCCCeeEEEeCCCCCC----CcCCCCCE-EEeCCc-cCCCCCHHHHHHHHHHHhHHH--hcccc
Confidence 4557788776664445 67766654 322211 1223333 679999999994 55543
No 45
>PF10462 Peptidase_M66: Peptidase M66; InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=93.55 E-value=0.036 Score=53.75 Aligned_cols=20 Identities=35% Similarity=0.639 Sum_probs=14.8
Q ss_pred cchhHHHHhhhhhhCCCcCC
Q 037540 243 DLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~ 262 (340)
....+++||+||++||.|-.
T Consensus 192 s~~~~f~HE~GH~~GL~H~p 211 (305)
T PF10462_consen 192 SYGNEFSHELGHNFGLGHYP 211 (305)
T ss_dssp -SHHHHHHHHHHTTT--SST
T ss_pred CccceeehhhhhhcCCCCCC
Confidence 35789999999999999943
No 46
>COG3023 ampD N-acetyl-anhydromuramyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]
Probab=93.43 E-value=0.22 Score=46.99 Aligned_cols=71 Identities=25% Similarity=0.367 Sum_probs=54.9
Q ss_pred chHhhHhhhhc------------cCCCCCcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCC-CCCccC
Q 037540 45 TWYDFARLLDA------------EMGSNITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGL-PVTGKL 111 (340)
Q Consensus 45 ~~~~~~~~~~~------------~~g~~~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gL-pvTG~L 111 (340)
+|..+...+|- ..+.+..++..+++-|.+|||--+. +.+|++.+.+|+.||.-|.- ..+|+.
T Consensus 169 pWk~l~~~~gigaw~~~~~~~~~~~~~~~~~v~~lq~~L~~YGY~v~~-----~~~d~~t~~vv~aFQ~hfrp~~~dg~~ 243 (257)
T COG3023 169 PWKRLAEALGIGAWLDTAQVQKYLALLKGEDVAALQEMLARYGYGVEI-----GVFDQETQQVVRAFQMHFRPGLYDGEA 243 (257)
T ss_pred CHHHHHhhcCcccCCcHhhhhhhhhhhccCCHHHHHHHHHHhCcCCCc-----chhhHHHHHHHHHHHHHhCCCCCCCCC
Confidence 58777666441 1123338889999999999997643 45789999999999999985 479999
Q ss_pred CHHHHHHhc
Q 037540 112 DSETISTIM 120 (340)
Q Consensus 112 D~~T~~~m~ 120 (340)
|-+|++.+.
T Consensus 244 d~et~a~l~ 252 (257)
T COG3023 244 DVETIAILQ 252 (257)
T ss_pred ChHHHHHHH
Confidence 999998764
No 47
>PF12044 Metallopep: Putative peptidase family; InterPro: IPR021917 This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.06 E-value=0.051 Score=54.97 Aligned_cols=22 Identities=36% Similarity=0.619 Sum_probs=19.4
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCcc
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAVM 268 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SIM 268 (340)
....+|||||++||.|..+ .||
T Consensus 317 iGA~lHEiGH~fg~pH~~~--GIM 338 (423)
T PF12044_consen 317 IGAFLHEIGHLFGCPHQED--GIM 338 (423)
T ss_pred hHHHHHHHHHhcCCCCCCC--Cee
Confidence 4689999999999999865 888
No 48
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=92.17 E-value=0.14 Score=46.42 Aligned_cols=76 Identities=22% Similarity=0.382 Sum_probs=42.3
Q ss_pred HHHHHHHHHHHhhccCc---ceEEEec-----cCCCccEEEEeecCCCCCCCcccccCCcceeecccccccccccc--ch
Q 037540 176 VKAAFKSSFQRWSSVIP---VNFTEIY-----DYDSADIHIGFYHGDHGDGQPFHFDAAETWAVDMESVKSRVAVD--LE 245 (340)
Q Consensus 176 ~r~ai~~Af~~Ws~vt~---l~F~ev~-----~~~~ADI~I~f~~g~hgdg~pfhfd~~e~wt~~~~~~~~~~~~d--l~ 245 (340)
.+.-++.++++|++|.. ++=.||. +.+--|+.+.-.+|.-.| |||...+ .+...+ ....|.. -.
T Consensus 20 AQ~kvks~y~KySkV~~s~g~TGaevAr~iLd~nGl~dV~Ve~v~G~LTD----HYDP~~k-vvrLSe-~~y~g~Sia~~ 93 (226)
T COG2738 20 AQSKVKSTYKKYSKVRASSGLTGAEVARMILDENGLYDVPVEEVPGTLTD----HYDPRRK-VVRLSE-ANYYGPSIAAI 93 (226)
T ss_pred HHHHHHHHHHHHhhccCcCCCcHHHHHHHHHhhcCCccceeeeecCCccc----ccChhhh-eeeccc-cccCCccHHHH
Confidence 45567889999998753 3322332 123456666655543333 6665433 222221 1223333 36
Q ss_pred hHHHHhhhhhhC
Q 037540 246 SVATHEIGHVLG 257 (340)
Q Consensus 246 ~Va~HEIGHaLG 257 (340)
.|++||.|||+=
T Consensus 94 aVAAHEVGHAiQ 105 (226)
T COG2738 94 AVAAHEVGHAIQ 105 (226)
T ss_pred HHHHHHhhHHHh
Confidence 799999999963
No 49
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=90.97 E-value=0.061 Score=51.65 Aligned_cols=20 Identities=40% Similarity=0.552 Sum_probs=16.8
Q ss_pred hhHHHHhhhhhhCCCcCCCC
Q 037540 245 ESVATHEIGHVLGLAHTSVK 264 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~ 264 (340)
..|++||+||.|||.+--+.
T Consensus 166 igv~~HE~gH~lGLPDlY~~ 185 (286)
T TIGR03296 166 VGVIAHELGHDLGLPDLYDT 185 (286)
T ss_pred eeeeehhhhcccCCCccccC
Confidence 57999999999999875543
No 50
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=88.57 E-value=0.18 Score=42.26 Aligned_cols=17 Identities=47% Similarity=0.812 Sum_probs=14.2
Q ss_pred chhHHHHhhhhhhCCCc
Q 037540 244 LESVATHEIGHVLGLAH 260 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~H 260 (340)
...|++|||||-.||+.
T Consensus 109 vthvliHEIgHhFGLsD 125 (136)
T COG3824 109 VTHVLIHEIGHHFGLSD 125 (136)
T ss_pred hhhhhhhhhhhhcCCCh
Confidence 45799999999988864
No 51
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=87.38 E-value=1.6 Score=33.63 Aligned_cols=53 Identities=19% Similarity=0.359 Sum_probs=40.4
Q ss_pred hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC----CCccCCHHHHHHh
Q 037540 64 MSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLP----VTGKLDSETISTI 119 (340)
Q Consensus 64 ~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLp----vTG~LD~~T~~~m 119 (340)
..++++=|.+-||+..+. +..+|+++++|++.||..-|+. .-|++|...++-|
T Consensus 18 ~~evq~~L~~lGyy~g~~---~g~~d~a~~~Al~~~~g~ENfE~R~~~~~~ID~~VL~~L 74 (74)
T PF08823_consen 18 AREVQEALKRLGYYKGEA---DGVWDEATEDALRAWAGTENFEERIRGDGKIDPVVLAYL 74 (74)
T ss_pred HHHHHHHHHHcCCccCCC---CCcccHHHHHHHHHHHHHhhHHhhcCCCCccCHHHHhhC
Confidence 356888999999987442 4567899999999999998875 3455887776543
No 52
>PF11150 DUF2927: Protein of unknown function (DUF2927); InterPro: IPR021323 This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known.
Probab=86.80 E-value=15 Score=33.98 Aligned_cols=137 Identities=15% Similarity=0.199 Sum_probs=79.8
Q ss_pred CCCCCCCCCcEEEEeecCCcCCCCCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEEEeecCCC---------CCC
Q 037540 148 PRWVRESPMTLTYAFSKTDMIDYIDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHIGFYHGDH---------GDG 218 (340)
Q Consensus 148 ~kW~~~~~~~LTY~f~~~n~~~~l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I~f~~g~h---------gdg 218 (340)
.||... |+..+.. .+. -......+.+..-++.-+.++.+....+.+. +|+|.|.|...+. +..
T Consensus 28 ~Rw~~P----Vrv~v~~--~~~-~~~~~d~~~v~~~~~rL~~itg~~I~~~~~~-~aN~~v~~~~~~~~~~~ir~~~p~~ 99 (213)
T PF11150_consen 28 RRWEGP----VRVRVEG--VPP-ADRARDLARVRAYLARLRRITGHPISQVSSP-NANFHVIFVSEDDWRPRIRRLLPGA 99 (213)
T ss_pred ccCCCC----eEEEEec--cCh-hhHHHHHHHHHHHHhhhccccCCceeeccCC-CCcEEEEEeccchhhHHHHHhcccc
Confidence 589865 7777762 111 1234555666666888899999999888875 9999999987654 111
Q ss_pred Ccccc------cC--------Ccceeeccc-------cccccccccchhHHHHhhhhhhCCCcCC--CCCCccccccCCC
Q 037540 219 QPFHF------DA--------AETWAVDME-------SVKSRVAVDLESVATHEIGHVLGLAHTS--VKEAVMYPSLTPR 275 (340)
Q Consensus 219 ~pfhf------d~--------~e~wt~~~~-------~~~~~~~~dl~~Va~HEIGHaLGL~Hs~--d~~SIMyP~~~~~ 275 (340)
..+.. .. ...|..... -.......-+...+..||-.+|||.--+ .+.||-+ ..
T Consensus 100 ~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~vi~i~~~~~~~~~~~Ci~EElaQ~LGl~NDs~~~~pSIFN----Dd 175 (213)
T PF11150_consen 100 ACFVVPNFSSLPRYARCLVFARSNWSGEITRARAVIFIRADHPPQERRDCIHEELAQGLGLPNDSPRARPSIFN----DD 175 (213)
T ss_pred ccccchhhhcccccceEEEEEEecCCCceEEEEEEEEEeccCCchHHHHHHHHHHHHHcCCccCCCcCCCceee----CC
Confidence 11100 00 001110000 0000111235678888899999996322 2334332 22
Q ss_pred CCCCCCCHHHHHHHHHHhCCC
Q 037540 276 TRKVDLKVDDVEGVQALYGSN 296 (340)
Q Consensus 276 ~~~~~L~~dDI~~Iq~LYG~~ 296 (340)
.....|+.-|..-++.||...
T Consensus 176 ~~~~~LT~~D~lLLr~LYdP~ 196 (213)
T PF11150_consen 176 NEFALLTGFDELLLRMLYDPR 196 (213)
T ss_pred CccccccHHHHHHHHHHcCcc
Confidence 344579999999999999754
No 53
>PTZ00337 surface protease GP63; Provisional
Probab=85.86 E-value=0.86 Score=47.96 Aligned_cols=19 Identities=42% Similarity=0.681 Sum_probs=16.1
Q ss_pred cchhHHHHhhhhhhCCCcC
Q 037540 243 DLESVATHEIGHVLGLAHT 261 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs 261 (340)
....|++|||.||||+...
T Consensus 228 ~~~~v~~HEi~HALGFs~~ 246 (567)
T PTZ00337 228 GDVRVAAHELGHALGFVRD 246 (567)
T ss_pred HHHHHHHHHHHHHHccCHH
Confidence 4578999999999999654
No 54
>KOG3607 consensus Meltrins, fertilins and related Zn-dependent metalloproteinases of the ADAMs family [Posttranslational modification, protein turnover, chaperones]
Probab=84.00 E-value=1.1 Score=48.58 Aligned_cols=45 Identities=24% Similarity=0.320 Sum_probs=29.2
Q ss_pred chhHHHHhhhhhhCCCcCC-------CCCCccccccCCCCCCCCCCHHHHHHH
Q 037540 244 LESVATHEIGHVLGLAHTS-------VKEAVMYPSLTPRTRKVDLKVDDVEGV 289 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~-------d~~SIMyP~~~~~~~~~~L~~dDI~~I 289 (340)
...|++|||||.||+.|-. ...-|| +...+...+..++.--++.+
T Consensus 323 ~a~v~AhelgH~lGm~hD~~~c~C~~~~~CiM-~~~~~~~~~~~FS~CS~~~~ 374 (716)
T KOG3607|consen 323 FAVVLAHELGHNLGMIHDESSCSCPSSGLCIM-PEETGFYIPKKFSNCSYQDF 374 (716)
T ss_pred HHHHHHHHHHhhcCcccccccccCCCCCcccc-ccccCcCcccccccchHHHH
Confidence 4579999999999999976 223578 66553322334555444444
No 55
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=83.92 E-value=0.54 Score=44.81 Aligned_cols=24 Identities=33% Similarity=0.542 Sum_probs=13.3
Q ss_pred cchhHHHHhhhhhh-CCCcCCCCCC
Q 037540 243 DLESVATHEIGHVL-GLAHTSVKEA 266 (340)
Q Consensus 243 dl~~Va~HEIGHaL-GL~Hs~d~~S 266 (340)
....|++||+||++ ||+.+-..+.
T Consensus 215 ~~~~v~vHE~GHsf~~LaDEY~~~~ 239 (264)
T PF09471_consen 215 SFKQVVVHEFGHSFGGLADEYYYGE 239 (264)
T ss_dssp THHHHHHHHHHHHTT------ECS-
T ss_pred cccceeeeecccccccccccccCCC
Confidence 46789999999965 5666654443
No 56
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=82.22 E-value=0.73 Score=37.35 Aligned_cols=16 Identities=31% Similarity=0.690 Sum_probs=12.6
Q ss_pred chhHHHHhhhhhhCCC
Q 037540 244 LESVATHEIGHVLGLA 259 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~ 259 (340)
...|++|||||.+|+.
T Consensus 73 I~~tlvhEiah~fG~~ 88 (97)
T PF06262_consen 73 IRDTLVHEIAHHFGIS 88 (97)
T ss_dssp HHHHHHHHHHHHTT--
T ss_pred HHHHHHHHHHHHcCCC
Confidence 3679999999999985
No 57
>PF05547 Peptidase_M6: Immune inhibitor A peptidase M6; InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=81.47 E-value=0.3 Score=52.07 Aligned_cols=17 Identities=47% Similarity=0.661 Sum_probs=14.6
Q ss_pred hhHHHHhhhhhhCCCcC
Q 037540 245 ESVATHEIGHVLGLAHT 261 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs 261 (340)
..|++||+||.|||-.-
T Consensus 222 iGVfaHEfGH~LGLPDl 238 (645)
T PF05547_consen 222 IGVFAHEFGHDLGLPDL 238 (645)
T ss_pred eEEEEeeccccCCCCCc
Confidence 57999999999999643
No 58
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=80.62 E-value=0.83 Score=47.56 Aligned_cols=55 Identities=20% Similarity=0.253 Sum_probs=27.0
Q ss_pred cchhHHHHhhhhhhCCCcCCCCCCccc---cccCCCC--CCCCCCHHHHHHHHHHhCCCC
Q 037540 243 DLESVATHEIGHVLGLAHTSVKEAVMY---PSLTPRT--RKVDLKVDDVEGVQALYGSNP 297 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~~SIMy---P~~~~~~--~~~~L~~dDI~~Iq~LYG~~~ 297 (340)
.+..|++|||.|+||+.-..-...-|. +...+.. ..+--++.=++-.+.-||.+.
T Consensus 209 ~~~~~~~HEi~HaLGFs~~~~~~~~~~~~~~~~~~~~~~~~~l~tp~v~~~ar~hf~C~~ 268 (521)
T PF01457_consen 209 EFFRTVIHEIAHALGFSSSFFRYYGMPRTVSNVRGKPTTVTVLVTPNVVEFARKHFGCPT 268 (521)
T ss_dssp HHHHHHHHHHHHHTT-SHHHHHHTT-EEEES-GGG-SS-EEEE--HHHHHHHHHHHT-TT
T ss_pred cccceeeeeeeeeeeecccccccccccccccceeecCceeEEEeCcHHHHHHHHHhCCCc
Confidence 357899999999999976211110110 0011111 111225566666788888643
No 59
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=78.29 E-value=1.3 Score=35.22 Aligned_cols=14 Identities=36% Similarity=0.648 Sum_probs=11.5
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
....+++|||||.+
T Consensus 41 ~~~f~laHELgH~~ 54 (122)
T PF06114_consen 41 RQRFTLAHELGHIL 54 (122)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 35679999999974
No 60
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=74.70 E-value=1.5 Score=39.89 Aligned_cols=16 Identities=38% Similarity=0.542 Sum_probs=12.9
Q ss_pred cchhHHHHhhhhhhCC
Q 037540 243 DLESVATHEIGHVLGL 258 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL 258 (340)
....|+.||+||++-=
T Consensus 21 ~~l~t~~HE~gHal~a 36 (200)
T PF13398_consen 21 RLLVTFVHELGHALAA 36 (200)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3567999999999753
No 61
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=72.36 E-value=2.2 Score=39.27 Aligned_cols=29 Identities=34% Similarity=0.604 Sum_probs=24.9
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCccccccC
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAVMYPSLT 273 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~ 273 (340)
..+.-|++||++||+|.+++..||+|.+.
T Consensus 154 al~~~efl~~~~~~~~lnhr~~i~~~pg~ 182 (236)
T COG5549 154 ALIAYEFLGHALGLGHLNHRGDIMYPPGE 182 (236)
T ss_pred HHHHHHHhhhccCcccccccccccCCccc
Confidence 45778999999999999999999976543
No 62
>PF09374 PG_binding_3: Predicted Peptidoglycan domain; InterPro: IPR018537 This family contains a potential peptidoglycan binding domain. ; PDB: 2IKB_B 2IS5_A 2NR7_A.
Probab=72.31 E-value=3.9 Score=31.13 Aligned_cols=26 Identities=27% Similarity=0.486 Sum_probs=20.6
Q ss_pred HHHHHHHHhcC----CCCCccCCHHHHHHh
Q 037540 94 SAVVSYQTKLG----LPVTGKLDSETISTI 119 (340)
Q Consensus 94 ~Ai~~~Q~~~g----LpvTG~LD~~T~~~m 119 (340)
.|+|.+|+..| +.+.|.+.+.|+..+
T Consensus 1 rAik~LQ~alg~~~~v~~DG~iGp~Tl~Al 30 (72)
T PF09374_consen 1 RAIKFLQRALGLGADVAVDGIIGPKTLAAL 30 (72)
T ss_dssp HHHHHHHHHHT----S--SS--SHHHHHHH
T ss_pred CHHHHHHHHHccCCccCCCCCcCHHHHHHH
Confidence 38999999999 999999999999999
No 63
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=71.61 E-value=1.8 Score=40.16 Aligned_cols=14 Identities=36% Similarity=0.582 Sum_probs=12.0
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
....|+.||+||||
T Consensus 91 ~~~~~l~HE~GHAl 104 (220)
T PF14247_consen 91 NVLFTLYHELGHAL 104 (220)
T ss_pred HHHHHHHHHHHHHH
Confidence 46789999999985
No 64
>COG3409 Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]
Probab=71.53 E-value=13 Score=32.14 Aligned_cols=58 Identities=24% Similarity=0.371 Sum_probs=41.1
Q ss_pred CcChHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCHHHHHHhc
Q 037540 61 ITGMSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDSETISTIM 120 (340)
Q Consensus 61 ~~~~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~~T~~~m~ 120 (340)
......++.-+..-||..... .+..|....+.|++.||+..+|+++|..-..|...+.
T Consensus 127 ~~~~~~~~~~~~~~~~~~~~~--~dg~fg~~t~~~v~~~q~~~~l~~dgi~g~~t~~~l~ 184 (185)
T COG3409 127 GGDVATLQQPLPLLGYRSGIR--VDGIFGPQTEAAVKAFQRQYGLTVDGIVGPQTWAALR 184 (185)
T ss_pred ccchHHHHHHHHhcccccCCC--CCCcccHHHHHHHHHHHHHhCCCCCeeecHHHHHHhh
Confidence 334444455555555554321 2345678899999999999999999999999887653
No 65
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=70.45 E-value=2 Score=44.10 Aligned_cols=16 Identities=50% Similarity=0.765 Sum_probs=13.7
Q ss_pred ccchhHHHHhhhhhhC
Q 037540 242 VDLESVATHEIGHVLG 257 (340)
Q Consensus 242 ~dl~~Va~HEIGHaLG 257 (340)
..+-+|++|||||..|
T Consensus 128 sElagViAHEigHv~q 143 (484)
T COG4783 128 SELAGVIAHEIGHVAQ 143 (484)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 4578999999999876
No 66
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=69.88 E-value=2.1 Score=39.57 Aligned_cols=18 Identities=39% Similarity=0.623 Sum_probs=13.2
Q ss_pred chhHHHHhhhhhhCCCcCCC
Q 037540 244 LESVATHEIGHVLGLAHTSV 263 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d 263 (340)
-.-|++||+||+| -|...
T Consensus 72 ~rFtlAHELGH~l--lH~~~ 89 (213)
T COG2856 72 KRFTLAHELGHAL--LHTDL 89 (213)
T ss_pred HHHHHHHHHhHHH--hcccc
Confidence 3569999999995 45443
No 67
>KOG3658 consensus Tumor necrosis factor-alpha-converting enzyme (TACE/ADAM17) and related metalloproteases [Extracellular structures]
Probab=69.64 E-value=2.9 Score=44.48 Aligned_cols=49 Identities=20% Similarity=0.359 Sum_probs=32.0
Q ss_pred hhHHHHhhhhhhCCCcCCC----------CCCccccccCC-C-CCCCCCCHHHHHHHHHHh
Q 037540 245 ESVATHEIGHVLGLAHTSV----------KEAVMYPSLTP-R-TRKVDLKVDDVEGVQALY 293 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d----------~~SIMyP~~~~-~-~~~~~L~~dDI~~Iq~LY 293 (340)
.-|++|||||..|=.|-.+ -+=|||++-.. + .++..++.--+.-|...-
T Consensus 393 ~lt~AHEiGHNfGSpHDp~~ECsPg~~~~GnyiM~a~AtSGd~~NN~kFSpCS~ksI~~VL 453 (764)
T KOG3658|consen 393 DLTLAHEIGHNFGSPHDPDIECSPGESQGGNYIMYAYATSGDKPNNKKFSPCSLKSISKVL 453 (764)
T ss_pred heeehhhhccccCCCCCCCCccCCCCCCCCcEEEEEecccCcCccCcccCcchHHHHHHHH
Confidence 3589999999999999664 23489986442 2 234456665665555443
No 68
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=68.62 E-value=2.5 Score=44.81 Aligned_cols=54 Identities=22% Similarity=0.284 Sum_probs=30.3
Q ss_pred cchhHHHHhhhhhhCCCcCCCCCCcccccc---CCCCC--CCCCCHHHHHHHHHHhCCC
Q 037540 243 DLESVATHEIGHVLGLAHTSVKEAVMYPSL---TPRTR--KVDLKVDDVEGVQALYGSN 296 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~~SIMyP~~---~~~~~--~~~L~~dDI~~Iq~LYG~~ 296 (340)
....|++|||.|+||+.-+.-.+--|.-.. .+... ....++.-.+..++-||.+
T Consensus 255 ~~~rv~~HEi~HALGFS~~~f~~~~~~~~~~~~rGk~~~~~vi~SP~v~~~Ar~hfgC~ 313 (622)
T PTZ00257 255 GTTRTVTHEVAHALGFSSVFFENAGIVKSVTNLRGKPFAAPVINSSTAVAKAREQYGCP 313 (622)
T ss_pred HHHHHHHHHHHHHhcCCHHHHhhcCcccceeccCCCcceeeeecCHHHHHHHHHhcCCC
Confidence 357899999999999964332211111111 11111 1112667777788888764
No 69
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=67.19 E-value=15 Score=37.70 Aligned_cols=14 Identities=43% Similarity=0.598 Sum_probs=11.8
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
+-..|+.||+||+|
T Consensus 262 ~~V~TLfHEfGHal 275 (472)
T cd06455 262 DEVETFFHEFGHVI 275 (472)
T ss_pred HHHHHHHHHHHHHH
Confidence 34678999999998
No 70
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=66.77 E-value=2.9 Score=37.11 Aligned_cols=21 Identities=33% Similarity=0.385 Sum_probs=15.3
Q ss_pred hhHHHHhhhhhhCCCcCCCCC
Q 037540 245 ESVATHEIGHVLGLAHTSVKE 265 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~ 265 (340)
..+++||+||++--....++.
T Consensus 8 i~i~~HE~gH~~~a~~~G~~~ 28 (192)
T PF02163_consen 8 ISIVLHELGHALAARLYGDKV 28 (192)
T ss_dssp HHHHHHHHHHHHHHHTTT--B
T ss_pred ccccccccccccccccccccc
Confidence 578999999998776665543
No 71
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=65.30 E-value=3.4 Score=43.55 Aligned_cols=19 Identities=16% Similarity=0.357 Sum_probs=14.0
Q ss_pred CCHHHHHHHHHHHHHHhhc
Q 037540 171 IDLTDVKAAFKSSFQRWSS 189 (340)
Q Consensus 171 l~~~~~r~ai~~Af~~Ws~ 189 (340)
++.++.++.+.+||+.|..
T Consensus 307 ~~~e~a~~~v~~~~~~l~~ 325 (587)
T TIGR02290 307 YTFDEAKELVLEAFGKFSP 325 (587)
T ss_pred ecHHHHHHHHHHHHHhcCH
Confidence 3456778888888887764
No 72
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=65.27 E-value=3.2 Score=37.35 Aligned_cols=14 Identities=36% Similarity=0.491 Sum_probs=11.2
Q ss_pred hhHHHHhhhhhhCC
Q 037540 245 ESVATHEIGHVLGL 258 (340)
Q Consensus 245 ~~Va~HEIGHaLGL 258 (340)
..|++||+||.+=-
T Consensus 10 ~~v~iHElGH~~~A 23 (182)
T cd06163 10 ILIFVHELGHFLVA 23 (182)
T ss_pred HHHHHHHHHHHHHH
Confidence 46899999998643
No 73
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=64.39 E-value=3.4 Score=36.39 Aligned_cols=20 Identities=45% Similarity=0.564 Sum_probs=14.9
Q ss_pred hhHHHHhhhhhhCCCcCCCC
Q 037540 245 ESVATHEIGHVLGLAHTSVK 264 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~ 264 (340)
..+.+||+||++=-.+-.++
T Consensus 9 i~i~iHE~gH~~~A~~~G~~ 28 (180)
T cd05709 9 ISVTVHELGHALVARRLGVK 28 (180)
T ss_pred HHHHHHHHHHHHHHHHcCCC
Confidence 57899999999855554443
No 74
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=64.15 E-value=3.6 Score=36.96 Aligned_cols=20 Identities=35% Similarity=0.576 Sum_probs=15.3
Q ss_pred ccchhHHHHhhhhhhCCCcCC
Q 037540 242 VDLESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 242 ~dl~~Va~HEIGHaLGL~Hs~ 262 (340)
..+..|++||+||.. .+|..
T Consensus 87 ~el~aVlaHElgH~~-~~h~~ 106 (226)
T PF01435_consen 87 DELAAVLAHELGHIK-HRHIL 106 (226)
T ss_dssp HHHHHHHHHHHHHHH-TTHCC
T ss_pred HHHHHHHHHHHHHHH-cCCcc
Confidence 357899999999986 45544
No 75
>KOG3538 consensus Disintegrin metalloproteinases with thrombospondin repeats [Posttranslational modification, protein turnover, chaperones]
Probab=62.71 E-value=4.7 Score=44.54 Aligned_cols=32 Identities=25% Similarity=0.367 Sum_probs=24.9
Q ss_pred cchhHHHHhhhhhhCCCcCCCCC------CccccccCC
Q 037540 243 DLESVATHEIGHVLGLAHTSVKE------AVMYPSLTP 274 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~~------SIMyP~~~~ 274 (340)
...-+++||+||.+|+.|-...+ .||.|....
T Consensus 316 ~~aftiahE~GH~~gm~hd~~~~~C~~~~~im~~~~~~ 353 (845)
T KOG3538|consen 316 PSAFTIAHELGHNFGMEHDGRGNSCGPTFHVMSSDTCG 353 (845)
T ss_pred ccceeeeeccccccCcccCCcCCCCCCcccccCccccc
Confidence 34579999999999999976543 589876553
No 76
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=61.53 E-value=4.1 Score=37.20 Aligned_cols=18 Identities=39% Similarity=0.464 Sum_probs=13.6
Q ss_pred chhHHHHhhhhhhCCCcC
Q 037540 244 LESVATHEIGHVLGLAHT 261 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs 261 (340)
+..|++||+||++=-.+-
T Consensus 38 ~~~v~iHElgH~~~A~~~ 55 (208)
T cd06161 38 FLSVLLHELGHALVARRY 55 (208)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 468999999998754443
No 77
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=60.43 E-value=4 Score=38.56 Aligned_cols=14 Identities=50% Similarity=0.937 Sum_probs=12.2
Q ss_pred ccchhHHHHhhhhh
Q 037540 242 VDLESVATHEIGHV 255 (340)
Q Consensus 242 ~dl~~Va~HEIGHa 255 (340)
..+..|+.||+||.
T Consensus 155 dEl~aVlaHElgHi 168 (302)
T COG0501 155 DELEAVLAHELGHI 168 (302)
T ss_pred HHHHHHHHHHHHHH
Confidence 45889999999996
No 78
>PRK03982 heat shock protein HtpX; Provisional
Probab=59.69 E-value=4.2 Score=38.95 Aligned_cols=15 Identities=40% Similarity=0.651 Sum_probs=12.5
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
..+..|++||+||.-
T Consensus 123 ~El~AVlAHElgHi~ 137 (288)
T PRK03982 123 DELEGVIAHELTHIK 137 (288)
T ss_pred HHHHHHHHHHHHHHH
Confidence 358899999999963
No 79
>PRK03001 M48 family peptidase; Provisional
Probab=57.39 E-value=4.9 Score=38.42 Aligned_cols=15 Identities=33% Similarity=0.618 Sum_probs=12.5
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
.++..|++||+||.-
T Consensus 122 ~El~aVlAHElgHi~ 136 (283)
T PRK03001 122 REIRGVMAHELAHVK 136 (283)
T ss_pred HHHHHHHHHHHHHHh
Confidence 458899999999963
No 80
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=57.23 E-value=5.2 Score=39.22 Aligned_cols=15 Identities=33% Similarity=0.414 Sum_probs=12.3
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
.+-..++.||+||++
T Consensus 152 ~~~v~tl~HE~GHa~ 166 (365)
T cd06258 152 HDDINTLFHEFGHAV 166 (365)
T ss_pred HHHHHHHHHHHhHHH
Confidence 344679999999998
No 81
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=56.78 E-value=4.9 Score=39.99 Aligned_cols=15 Identities=33% Similarity=0.541 Sum_probs=12.3
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
.+-..+++||+||++
T Consensus 220 ~~~v~tl~HE~GHa~ 234 (427)
T cd06459 220 LDDVFTLAHELGHAF 234 (427)
T ss_pred hhhHHHHHHHhhHHH
Confidence 445789999999985
No 82
>PRK03072 heat shock protein HtpX; Provisional
Probab=56.48 E-value=5.1 Score=38.50 Aligned_cols=16 Identities=38% Similarity=0.601 Sum_probs=12.9
Q ss_pred cccchhHHHHhhhhhh
Q 037540 241 AVDLESVATHEIGHVL 256 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaL 256 (340)
..++..|++||+||.-
T Consensus 124 ~~El~aVlAHElgHi~ 139 (288)
T PRK03072 124 ERELRGVLGHELSHVY 139 (288)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3458899999999953
No 83
>PRK01345 heat shock protein HtpX; Provisional
Probab=56.37 E-value=5.2 Score=39.09 Aligned_cols=15 Identities=33% Similarity=0.589 Sum_probs=12.8
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
..+..|++|||||.-
T Consensus 122 dEL~aVlAHElgHi~ 136 (317)
T PRK01345 122 EEVAGVMAHELAHVK 136 (317)
T ss_pred HHHHHHHHHHHHHHH
Confidence 358899999999975
No 84
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=55.65 E-value=5.9 Score=36.80 Aligned_cols=18 Identities=39% Similarity=0.495 Sum_probs=13.4
Q ss_pred chhHHHHhhhhhhCCCcC
Q 037540 244 LESVATHEIGHVLGLAHT 261 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs 261 (340)
+..|++||+||++=-.+-
T Consensus 53 ~~~v~iHElgH~~~A~~~ 70 (227)
T cd06164 53 FASVLLHELGHSLVARRY 70 (227)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 367999999998754443
No 85
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=54.47 E-value=6.1 Score=40.08 Aligned_cols=12 Identities=33% Similarity=0.584 Sum_probs=10.8
Q ss_pred hhHHHHhhhhhh
Q 037540 245 ESVATHEIGHVL 256 (340)
Q Consensus 245 ~~Va~HEIGHaL 256 (340)
..|+.||+|||+
T Consensus 243 v~tLfHE~GHa~ 254 (458)
T PF01432_consen 243 VETLFHEFGHAM 254 (458)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHhHHH
Confidence 579999999986
No 86
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=52.84 E-value=6.2 Score=38.60 Aligned_cols=23 Identities=30% Similarity=0.433 Sum_probs=19.5
Q ss_pred cccchhHHHHhhhhhhCCCcCCC
Q 037540 241 AVDLESVATHEIGHVLGLAHTSV 263 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaLGL~Hs~d 263 (340)
...+..|++||++|-+|...+.+
T Consensus 193 ~~~~P~T~~HElAHq~G~a~E~E 215 (318)
T PF12725_consen 193 PYSLPFTICHELAHQLGFASEDE 215 (318)
T ss_pred cccccHHHHHHHHHHhCCCCHHH
Confidence 45678899999999999998773
No 87
>PRK04897 heat shock protein HtpX; Provisional
Probab=52.51 E-value=6.5 Score=37.96 Aligned_cols=14 Identities=50% Similarity=0.795 Sum_probs=12.2
Q ss_pred ccchhHHHHhhhhh
Q 037540 242 VDLESVATHEIGHV 255 (340)
Q Consensus 242 ~dl~~Va~HEIGHa 255 (340)
..+..|++||+||.
T Consensus 135 ~El~aVlAHElgHi 148 (298)
T PRK04897 135 EELEGVIGHEISHI 148 (298)
T ss_pred HHHHHHHHHHHHHH
Confidence 45889999999995
No 88
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=52.42 E-value=7.1 Score=37.56 Aligned_cols=16 Identities=38% Similarity=0.478 Sum_probs=12.3
Q ss_pred chhHHHHhhhhhhCCC
Q 037540 244 LESVATHEIGHVLGLA 259 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~ 259 (340)
+..+++||+||++=-.
T Consensus 135 ~isvvvHElgHal~A~ 150 (277)
T cd06162 135 LISGVVHEMGHGVAAV 150 (277)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3579999999986433
No 89
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=52.22 E-value=7.1 Score=37.21 Aligned_cols=18 Identities=22% Similarity=0.231 Sum_probs=13.6
Q ss_pred hhHHHHhhhhhhCCCcCC
Q 037540 245 ESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~ 262 (340)
..+++||+||++=-.+..
T Consensus 119 isv~iHElgHa~~Ar~~G 136 (263)
T cd06159 119 VGVVVHELSHGILARVEG 136 (263)
T ss_pred HHHHHHHHHHHHHHHHcC
Confidence 579999999987554444
No 90
>PRK02870 heat shock protein HtpX; Provisional
Probab=51.30 E-value=7 Score=38.60 Aligned_cols=15 Identities=33% Similarity=0.694 Sum_probs=12.7
Q ss_pred cccchhHHHHhhhhh
Q 037540 241 AVDLESVATHEIGHV 255 (340)
Q Consensus 241 ~~dl~~Va~HEIGHa 255 (340)
...+..|++||+||.
T Consensus 170 ~dEL~aVlAHELgHi 184 (336)
T PRK02870 170 RDELQAVMAHELSHI 184 (336)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345889999999997
No 91
>PRK05457 heat shock protein HtpX; Provisional
Probab=51.29 E-value=7 Score=37.54 Aligned_cols=15 Identities=40% Similarity=0.702 Sum_probs=12.5
Q ss_pred cccchhHHHHhhhhh
Q 037540 241 AVDLESVATHEIGHV 255 (340)
Q Consensus 241 ~~dl~~Va~HEIGHa 255 (340)
...+..|++||+||.
T Consensus 131 ~~El~aVlAHElgHi 145 (284)
T PRK05457 131 RDEVEAVLAHEISHI 145 (284)
T ss_pred HHHHHHHHHHHHHHH
Confidence 346889999999994
No 92
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=50.73 E-value=8.2 Score=30.77 Aligned_cols=14 Identities=36% Similarity=0.496 Sum_probs=11.9
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
.+..|++||+.|.+
T Consensus 24 ~~~~~l~HE~~H~~ 37 (128)
T PF13485_consen 24 WLDRVLAHELAHQW 37 (128)
T ss_pred HHHHHHHHHHHHHH
Confidence 35689999999986
No 93
>PRK01265 heat shock protein HtpX; Provisional
Probab=50.66 E-value=7.2 Score=38.30 Aligned_cols=14 Identities=43% Similarity=0.866 Sum_probs=12.0
Q ss_pred ccchhHHHHhhhhh
Q 037540 242 VDLESVATHEIGHV 255 (340)
Q Consensus 242 ~dl~~Va~HEIGHa 255 (340)
.++..|++||+||.
T Consensus 138 ~El~aVlAHElgHi 151 (324)
T PRK01265 138 DEIKAVAGHELGHL 151 (324)
T ss_pred HHHHHHHHHHHHHH
Confidence 45889999999994
No 94
>PRK02391 heat shock protein HtpX; Provisional
Probab=50.52 E-value=7.3 Score=37.66 Aligned_cols=15 Identities=47% Similarity=0.751 Sum_probs=12.4
Q ss_pred cccchhHHHHhhhhh
Q 037540 241 AVDLESVATHEIGHV 255 (340)
Q Consensus 241 ~~dl~~Va~HEIGHa 255 (340)
..++..|++||+||.
T Consensus 130 ~~El~aVlaHElgHi 144 (296)
T PRK02391 130 PDELEAVLAHELSHV 144 (296)
T ss_pred HHHHHHHHHHHHHHH
Confidence 345889999999994
No 95
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=50.47 E-value=7.7 Score=40.58 Aligned_cols=16 Identities=38% Similarity=0.453 Sum_probs=12.9
Q ss_pred cccchhHHHHhhhhhh
Q 037540 241 AVDLESVATHEIGHVL 256 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaL 256 (340)
..+-..|++||+||++
T Consensus 334 t~~dv~TL~HElGHa~ 349 (549)
T TIGR02289 334 TSGDIDVLTHEAGHAF 349 (549)
T ss_pred ChhHHHHHHHHhhHHH
Confidence 3445779999999997
No 96
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=48.33 E-value=8.6 Score=39.36 Aligned_cols=25 Identities=4% Similarity=0.069 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHhhccCcceEEEecc
Q 037540 176 VKAAFKSSFQRWSSVIPVNFTEIYD 200 (340)
Q Consensus 176 ~r~ai~~Af~~Ws~vt~l~F~ev~~ 200 (340)
+..+++--+..-+..-+|+|+|+..
T Consensus 132 l~~vl~gl~~~~~~lfgi~~~~~~~ 156 (458)
T cd06457 132 LGTVMEGLSRLFSRLYGIRLVPVPL 156 (458)
T ss_pred HHHHHHHHHHHHHHHhCeEEEecCC
Confidence 3455566667777888899999753
No 97
>KOG2921 consensus Intramembrane metalloprotease (sterol-regulatory element-binding protein (SREBP) protease) [Posttranslational modification, protein turnover, chaperones]
Probab=48.05 E-value=8.6 Score=38.72 Aligned_cols=18 Identities=39% Similarity=0.615 Sum_probs=13.6
Q ss_pred hhHHHHhhhhhhCCCcCC
Q 037540 245 ESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~ 262 (340)
..+++||+||||--.-+.
T Consensus 132 i~~vvHElGHalAA~seg 149 (484)
T KOG2921|consen 132 ITVVVHELGHALAAASEG 149 (484)
T ss_pred HHHHHHHhhHHHHHHhcC
Confidence 568899999998654443
No 98
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=47.83 E-value=9.2 Score=40.23 Aligned_cols=15 Identities=27% Similarity=0.521 Sum_probs=12.5
Q ss_pred ccchhHHHHhhhhhh
Q 037540 242 VDLESVATHEIGHVL 256 (340)
Q Consensus 242 ~dl~~Va~HEIGHaL 256 (340)
.+-..|++||+||++
T Consensus 376 ~~dv~TLaHElGHa~ 390 (591)
T TIGR00181 376 LNSVFTLAHELGHSM 390 (591)
T ss_pred cchHHHHHHHhhhHH
Confidence 445789999999997
No 99
>PF03272 Enhancin: Viral enhancin protein; InterPro: IPR004954 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M60 (enhancin family, clan MA(E)). The active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The viral enhancin protein, or enhancing factor, is involved in disruption of the peritrophic membrane and fusion of nucleocapsids with mid-gut cells.; GO: 0016032 viral reproduction
Probab=47.82 E-value=21 Score=39.23 Aligned_cols=136 Identities=15% Similarity=0.218 Sum_probs=69.9
Q ss_pred CCCCCCCCCcEEEEeecCCc----CCC-----------CCHHHHHHHHHHHHHHhhccCcceEEEeccCCCccEEE-Eee
Q 037540 148 PRWVRESPMTLTYAFSKTDM----IDY-----------IDLTDVKAAFKSSFQRWSSVIPVNFTEIYDYDSADIHI-GFY 211 (340)
Q Consensus 148 ~kW~~~~~~~LTY~f~~~n~----~~~-----------l~~~~~r~ai~~Af~~Ws~vt~l~F~ev~~~~~ADI~I-~f~ 211 (340)
..|.+. +-.|.+..... .+. .+.+++.+--.+.|+...+.+.|.|..... .+.++.= .|.
T Consensus 128 ~~w~~~---~~~fa~ve~~~i~lLVP~~Dk~~l~~~~~~~l~~L~~~Y~~i~~~Yd~l~Gl~~~~~~~-~~~n~~~kYF~ 203 (775)
T PF03272_consen 128 SEWDDS---DSSFALVELDYIQLLVPPADKPNLNNKDFKSLDELIDFYNDIFKFYDDLTGLSDDPSDP-VDKNFNNKYFA 203 (775)
T ss_pred hhhhhc---CCceEEEECCEEEEEeCcchHHHHhhhcccCHHHHHHHHHHHHHHHHhhhCCCCCCCcc-cccccccceEE
Confidence 468877 77787764321 111 123344445567888888889998865432 2222211 133
Q ss_pred cC-CCCCCCcccccCCcceeecccccccc--ccccchhHHHHhhhhhhCCCcCCCC-------CCccccccCCCCCCCCC
Q 037540 212 HG-DHGDGQPFHFDAAETWAVDMESVKSR--VAVDLESVATHEIGHVLGLAHTSVK-------EAVMYPSLTPRTRKVDL 281 (340)
Q Consensus 212 ~g-~hgdg~pfhfd~~e~wt~~~~~~~~~--~~~dl~~Va~HEIGHaLGL~Hs~d~-------~SIMyP~~~~~~~~~~L 281 (340)
.. .||.|.+|. +..|+.....+... ...-.-..++|||||+-=..=.++. +.|+.-.|+ ...+
T Consensus 204 KAD~~G~G~AYY---~~~w~a~ss~s~~~~L~~~~~nW~~LHEIgHgYd~~F~~n~~~~~EVWnNI~~d~yQ----~~~~ 276 (775)
T PF03272_consen 204 KADKSGPGAAYY---GSNWTAQSSSSLSFYLNPSPTNWGALHEIGHGYDFGFTRNGTYLNEVWNNILADRYQ----YTYM 276 (775)
T ss_pred EecCCCCCCccc---cccceecCchhHHHHhCCCCCCchhhhhhhhhcceeEeeCCcchhhhhhhhhhhhhh----hhhc
Confidence 33 367676652 34566543321100 0001235899999997433322111 122222222 2346
Q ss_pred CHHHHHHHHHHhC
Q 037540 282 KVDDVEGVQALYG 294 (340)
Q Consensus 282 ~~dDI~~Iq~LYG 294 (340)
+.++.+..-.||+
T Consensus 277 ~~~e~~~~~wly~ 289 (775)
T PF03272_consen 277 TQDERQTDGWLYD 289 (775)
T ss_pred Chhhhhhccceec
Confidence 7788888888884
No 100
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=47.42 E-value=9.6 Score=34.26 Aligned_cols=23 Identities=30% Similarity=0.528 Sum_probs=17.2
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCc
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAV 267 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SI 267 (340)
..+.+||+||++--..-.++...
T Consensus 42 ~~l~iHElgH~~~A~~~G~~~~~ 64 (183)
T cd06160 42 AILGIHEMGHYLAARRHGVKASL 64 (183)
T ss_pred HHHHHHHHHHHHHHHHCCCCccc
Confidence 56889999999877666655443
No 101
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=46.93 E-value=9.6 Score=33.21 Aligned_cols=12 Identities=33% Similarity=0.490 Sum_probs=10.1
Q ss_pred hhHHHHhhhhhh
Q 037540 245 ESVATHEIGHVL 256 (340)
Q Consensus 245 ~~Va~HEIGHaL 256 (340)
.-|++|||+|.+
T Consensus 136 lDVvaHEltHGV 147 (150)
T PF01447_consen 136 LDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHHHH
T ss_pred cceeeecccccc
Confidence 459999999974
No 102
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=46.11 E-value=10 Score=38.52 Aligned_cols=14 Identities=43% Similarity=0.605 Sum_probs=11.7
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
+-..|+.||+||+|
T Consensus 207 ~~v~tLfHEfGHal 220 (422)
T cd06456 207 DEVTTLFHEFGHAL 220 (422)
T ss_pred HHHHHHHHHHHHHH
Confidence 34578999999998
No 103
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=44.28 E-value=11 Score=36.62 Aligned_cols=11 Identities=45% Similarity=0.619 Sum_probs=9.5
Q ss_pred hhHHHHhhhhh
Q 037540 245 ESVATHEIGHV 255 (340)
Q Consensus 245 ~~Va~HEIGHa 255 (340)
..|++||.||.
T Consensus 171 ayVlAHEyGHH 181 (292)
T PF04228_consen 171 AYVLAHEYGHH 181 (292)
T ss_pred HHHHHHHHHHH
Confidence 46999999995
No 104
>PF01434 Peptidase_M41: Peptidase family M41 This is family M41 in the peptidase classification. ; InterPro: IPR000642 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The peptidase M41 family belong to a larger family of zinc metalloproteases. This family includes the cell division protein FtsH, and the yeast mitochondrial respiratory chain complexes assembly protein, which is a putative ATP-dependent protease required for assembly of the mitochondrial respiratory chain and ATPase complexes. FtsH is an integral membrane protein, which seems to act as an ATP-dependent zinc metallopeptidase that binds one zinc ion.; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0006508 proteolysis; PDB: 4EIW_C 2DHR_E 1IY1_A 1IY2_A 1IY0_A 1IXZ_A 2CE7_F 2CEA_F 3KDS_E 2QZ4_A ....
Probab=43.53 E-value=11 Score=34.41 Aligned_cols=16 Identities=31% Similarity=0.538 Sum_probs=13.2
Q ss_pred cccchhHHHHhhhhhh
Q 037540 241 AVDLESVATHEIGHVL 256 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaL 256 (340)
..+...+|.||-||||
T Consensus 25 ~~~~~~~A~HEAGhAv 40 (213)
T PF01434_consen 25 EEEKRRIAYHEAGHAV 40 (213)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3456889999999996
No 105
>PLN02791 Nudix hydrolase homolog
Probab=42.94 E-value=11 Score=41.25 Aligned_cols=20 Identities=20% Similarity=0.307 Sum_probs=16.4
Q ss_pred chhHHHHhhhhhhCCCcCCC
Q 037540 244 LESVATHEIGHVLGLAHTSV 263 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d 263 (340)
+..|++||+||.+|=.+.-.
T Consensus 585 ~~~v~lHElgHGsG~~~~~~ 604 (770)
T PLN02791 585 FTHTICHECCHGIGPHTITL 604 (770)
T ss_pred HHHHHHHHhhccccccceec
Confidence 46899999999999877544
No 106
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=41.00 E-value=16 Score=32.45 Aligned_cols=26 Identities=23% Similarity=0.258 Sum_probs=20.9
Q ss_pred cchhHHHHhhhhhhCCCcCCCCCCcc
Q 037540 243 DLESVATHEIGHVLGLAHTSVKEAVM 268 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL~Hs~d~~SIM 268 (340)
-+..|++||+-|..-..||..--+.|
T Consensus 163 ~idYVvvHEL~Hl~~~nHs~~Fw~~v 188 (205)
T PF01863_consen 163 VIDYVVVHELCHLRHPNHSKRFWALV 188 (205)
T ss_pred HHHHHHHHHHHHhccCCCCHHHHHHH
Confidence 35789999999999999997644444
No 107
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=40.57 E-value=13 Score=37.62 Aligned_cols=12 Identities=42% Similarity=0.531 Sum_probs=10.1
Q ss_pred hhHHHHhhhhhh
Q 037540 245 ESVATHEIGHVL 256 (340)
Q Consensus 245 ~~Va~HEIGHaL 256 (340)
..+++||+||+|
T Consensus 160 l~t~iHE~GHal 171 (396)
T cd06460 160 LFSTIHETGHAL 171 (396)
T ss_pred HHHHHHHhhHHH
Confidence 458899999986
No 108
>TIGR00054 RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain.
Probab=40.47 E-value=13 Score=37.70 Aligned_cols=12 Identities=42% Similarity=0.719 Sum_probs=10.2
Q ss_pred hhHHHHhhhhhh
Q 037540 245 ESVATHEIGHVL 256 (340)
Q Consensus 245 ~~Va~HEIGHaL 256 (340)
..|++||+||.|
T Consensus 15 ~~v~~HE~gH~~ 26 (420)
T TIGR00054 15 VLIFVHELGHFL 26 (420)
T ss_pred HHHHHHhHHHHH
Confidence 468999999975
No 109
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=40.41 E-value=14 Score=32.95 Aligned_cols=13 Identities=31% Similarity=0.562 Sum_probs=11.1
Q ss_pred chhHHHHhhhhhh
Q 037540 244 LESVATHEIGHVL 256 (340)
Q Consensus 244 l~~Va~HEIGHaL 256 (340)
+.++.+||+||++
T Consensus 9 ~~~i~~HE~aHa~ 21 (181)
T cd06158 9 LLAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 3578999999997
No 110
>PRK10779 zinc metallopeptidase RseP; Provisional
Probab=39.91 E-value=13 Score=37.84 Aligned_cols=12 Identities=33% Similarity=0.620 Sum_probs=9.7
Q ss_pred hhHHHHhhhhhh
Q 037540 245 ESVATHEIGHVL 256 (340)
Q Consensus 245 ~~Va~HEIGHaL 256 (340)
.-|++||+||-|
T Consensus 16 ~li~vHElGHfl 27 (449)
T PRK10779 16 VLITVHEFGHFW 27 (449)
T ss_pred HHHHHHHHHHHH
Confidence 358899999964
No 111
>PF08823 PG_binding_2: Putative peptidoglycan binding domain; InterPro: IPR014927 This entry may be a peptidoglycan binding domain.
Probab=39.68 E-value=71 Score=24.47 Aligned_cols=32 Identities=19% Similarity=0.259 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHhcCC---CCCccCCHHHHHHhcC
Q 037540 90 TQLQSAVVSYQTKLGL---PVTGKLDSETISTIMS 121 (340)
Q Consensus 90 ~~l~~Ai~~~Q~~~gL---pvTG~LD~~T~~~m~~ 121 (340)
.++.+-|+..=+..|. +++|..|++|.+.+..
T Consensus 15 ~~~~~evq~~L~~lGyy~g~~~g~~d~a~~~Al~~ 49 (74)
T PF08823_consen 15 GDVAREVQEALKRLGYYKGEADGVWDEATEDALRA 49 (74)
T ss_pred HHHHHHHHHHHHHcCCccCCCCCcccHHHHHHHHH
Confidence 3444444444444555 8999999999999853
No 112
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=38.62 E-value=14 Score=39.27 Aligned_cols=14 Identities=29% Similarity=0.387 Sum_probs=11.7
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
+-..|++||+||++
T Consensus 379 ~dV~TLaHElGHs~ 392 (598)
T COG1164 379 RDVFTLAHELGHSV 392 (598)
T ss_pred hHHHHHHHHccHHH
Confidence 34679999999986
No 113
>PF08434 CLCA_N: Calcium-activated chloride channel; InterPro: IPR013642 The CLCA family of calcium-activated chloride channels has been identified in many epithelial and endothelial cell types as well as in smooth muscle cells [] and has four or five putative transmembrane regions. Additionally to their role as chloride channels some CLCA proteins function as adhesion molecules and may also have roles as tumour suppressors []. The domain described here is found at the N terminus of CLCAs.
Probab=37.73 E-value=14 Score=35.32 Aligned_cols=12 Identities=33% Similarity=0.390 Sum_probs=10.0
Q ss_pred chhHHHHhhhhh
Q 037540 244 LESVATHEIGHV 255 (340)
Q Consensus 244 l~~Va~HEIGHa 255 (340)
...|++||.+|.
T Consensus 149 ~grv~VhEWAhl 160 (262)
T PF08434_consen 149 RGRVFVHEWAHL 160 (262)
T ss_pred cceeeeehhhhh
Confidence 357999999995
No 114
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=35.36 E-value=19 Score=26.33 Aligned_cols=19 Identities=37% Similarity=0.609 Sum_probs=16.9
Q ss_pred chhHHHHhhhhhhCCCcCC
Q 037540 244 LESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~ 262 (340)
.+.-++||+.+.+||.|.+
T Consensus 29 ~eR~~vH~lA~~~gL~s~S 47 (60)
T cd02641 29 HDRLLVHELAEELGLRHES 47 (60)
T ss_pred HHHHHHHHHHHHcCCceEe
Confidence 5678999999999999975
No 115
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=35.03 E-value=87 Score=20.48 Aligned_cols=29 Identities=17% Similarity=0.167 Sum_probs=18.7
Q ss_pred HHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHH
Q 037540 65 SELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVV 97 (340)
Q Consensus 65 ~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~ 97 (340)
.++++||..+|...++.. ...++|-+.++
T Consensus 7 ~~L~~wL~~~gi~~~~~~----~~rd~Ll~~~k 35 (38)
T PF10281_consen 7 SDLKSWLKSHGIPVPKSA----KTRDELLKLAK 35 (38)
T ss_pred HHHHHHHHHcCCCCCCCC----CCHHHHHHHHH
Confidence 679999999998765432 12355544444
No 116
>PRK10911 oligopeptidase A; Provisional
Probab=35.01 E-value=17 Score=39.21 Aligned_cols=13 Identities=38% Similarity=0.360 Sum_probs=11.2
Q ss_pred chhHHHHhhhhhh
Q 037540 244 LESVATHEIGHVL 256 (340)
Q Consensus 244 l~~Va~HEIGHaL 256 (340)
-..|+.||+||+|
T Consensus 463 ~v~tlfHEfGHal 475 (680)
T PRK10911 463 EVITLFHEFGHGL 475 (680)
T ss_pred HHHHHHHHHhHHH
Confidence 3569999999998
No 117
>KOG2719 consensus Metalloprotease [General function prediction only]
Probab=33.23 E-value=19 Score=36.62 Aligned_cols=14 Identities=43% Similarity=0.695 Sum_probs=12.0
Q ss_pred ccchhHHHHhhhhh
Q 037540 242 VDLESVATHEIGHV 255 (340)
Q Consensus 242 ~dl~~Va~HEIGHa 255 (340)
..+.+|+.||+||-
T Consensus 278 eel~AVl~HELGHW 291 (428)
T KOG2719|consen 278 EELVAVLAHELGHW 291 (428)
T ss_pred HHHHHHHHHHhhHH
Confidence 45789999999995
No 118
>KOG2089 consensus Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=31.00 E-value=1e+02 Score=33.33 Aligned_cols=43 Identities=21% Similarity=0.408 Sum_probs=29.8
Q ss_pred HHHHHHHHHHhhccCcceEEEeccC--CCccEEEEeecCCCCCCCc
Q 037540 177 KAAFKSSFQRWSSVIPVNFTEIYDY--DSADIHIGFYHGDHGDGQP 220 (340)
Q Consensus 177 r~ai~~Af~~Ws~vt~l~F~ev~~~--~~ADI~I~f~~g~hgdg~p 220 (340)
..++.--|...+..-+|.|.|+++. -.+|++| |..+|.+.|.+
T Consensus 391 ~~v~~Gl~~i~q~LFglkf~e~~da~vWh~dVr~-y~v~D~~Sg~~ 435 (718)
T KOG2089|consen 391 PVVLSGLFGIYQTLFGLKFEEATDAEVWHADVRV-YTVKDSASGNP 435 (718)
T ss_pred HHHHHHHHHHHHHHhCceeeecCCchhcccceeE-EeccCCCCCce
Confidence 3466777888899999999998873 3578885 44445444443
No 119
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=30.60 E-value=25 Score=27.20 Aligned_cols=13 Identities=31% Similarity=0.575 Sum_probs=11.1
Q ss_pred chhHHHHhhhhhh
Q 037540 244 LESVATHEIGHVL 256 (340)
Q Consensus 244 l~~Va~HEIGHaL 256 (340)
-..+++||+.|++
T Consensus 61 ~~~llaHEl~Hv~ 73 (79)
T PF13699_consen 61 GRALLAHELAHVV 73 (79)
T ss_pred cchhHhHHHHHHH
Confidence 4679999999985
No 120
>KOG3988 consensus Protein-tyrosine sulfotransferase TPST1/TPST2 [Posttranslational modification, protein turnover, chaperones]
Probab=29.76 E-value=68 Score=31.30 Aligned_cols=38 Identities=8% Similarity=0.197 Sum_probs=33.6
Q ss_pred cchHhhHhhhhccCCCCCcChHHHHHHHHhcCCCCCCC
Q 037540 44 ATWYDFARLLDAEMGSNITGMSELKKYFTRFGYLSLSY 81 (340)
Q Consensus 44 ~~~~~~~~~~~~~~g~~~~~~~~~~~YL~~fGYl~~~~ 81 (340)
-.-+++.+-.||=+|+.+.+|+.+--.|.+.||=|..+
T Consensus 295 VNl~AlskWvg~ip~dvvrdma~iAPmL~~LGYDp~an 332 (378)
T KOG3988|consen 295 VNLEALSKWVGCIPEDVVRDMADIAPMLAILGYDPYAN 332 (378)
T ss_pred ccHHHHHHHhccCCHHHHHHHHHHHHHHHHhCCCCCCC
Confidence 34688999999999999999999999999999987544
No 121
>PF03571 Peptidase_M49: Peptidase family M49; InterPro: IPR005317 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M49 (dipeptidyl-peptidase III family, clan M-). The predicted active site residues occur in the motif HEXXXH which is unlike that in any other family. The dipeptidyl peptidase III aminopeptidases cleave dipeptides from the N-terminal of peptides consisting of four or more amino acids and have a broad specificity.; GO: 0008239 dipeptidyl-peptidase activity, 0006508 proteolysis, 0005737 cytoplasm; PDB: 3T6B_A 3FVY_A 3T6J_A 3CSK_A.
Probab=29.60 E-value=22 Score=37.48 Aligned_cols=15 Identities=40% Similarity=0.601 Sum_probs=11.1
Q ss_pred chhHHHHhh-hhhhCC
Q 037540 244 LESVATHEI-GHVLGL 258 (340)
Q Consensus 244 l~~Va~HEI-GHaLGL 258 (340)
...|++||+ ||..|=
T Consensus 293 ~~~v~lHEllGHGsGk 308 (549)
T PF03571_consen 293 FVQVGLHELLGHGSGK 308 (549)
T ss_dssp HHHHHHHHHCCCCS--
T ss_pred hhhhhHHhhccCcCcc
Confidence 357999998 998884
No 122
>PF06167 Peptidase_M90: Glucose-regulated metallo-peptidase M90; InterPro: IPR010384 MtfA (earlier known as YeeI) is a transcription factor that binds Mlc (make large colonies), itself a repressor of glucose and hence a protein important in regulation of the phosphoenolpyruvate:glucose-phosphotransferase (PtsG) system, the major glucose transporter in Escherichia coli. Mlc is a repressor of ptsG, and MtfA is found to bind and inactivate Mlc with high affinity []. The membrane-bound protein EIICBGlc encoded by the ptsG gene is the major glucose transporter in E. coli. ; PDB: 3DL1_A 3KHI_A.
Probab=29.47 E-value=27 Score=33.11 Aligned_cols=20 Identities=30% Similarity=0.400 Sum_probs=14.2
Q ss_pred ccchhHHHHhhhhhhCCCcC
Q 037540 242 VDLESVATHEIGHVLGLAHT 261 (340)
Q Consensus 242 ~dl~~Va~HEIGHaLGL~Hs 261 (340)
.|-.+|++||+-|.|=+...
T Consensus 152 ~dg~NVviHEfAH~LD~~~g 171 (253)
T PF06167_consen 152 NDGHNVVIHEFAHKLDMEDG 171 (253)
T ss_dssp SSS--HHHHHHHHHHHCTTS
T ss_pred CCCcchHHHHHHHHHHhhcC
Confidence 34578999999999977543
No 123
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=27.15 E-value=1.3e+02 Score=30.16 Aligned_cols=16 Identities=38% Similarity=0.736 Sum_probs=12.9
Q ss_pred cccchhHHHHhhhhhh
Q 037540 241 AVDLESVATHEIGHVL 256 (340)
Q Consensus 241 ~~dl~~Va~HEIGHaL 256 (340)
..++-.|+.||+||..
T Consensus 121 ~sEvAAVl~HEmgHVt 136 (479)
T COG4784 121 SSEVAAVLAHEMGHVT 136 (479)
T ss_pred HHHHHHHHHhhhhhee
Confidence 3457889999999974
No 124
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=26.27 E-value=37 Score=31.23 Aligned_cols=43 Identities=21% Similarity=0.280 Sum_probs=27.5
Q ss_pred hhHHHHhhhhhhCCCcCCCCCCccccccCCCCCCCCCCHHHHHHHHHHhCCC
Q 037540 245 ESVATHEIGHVLGLAHTSVKEAVMYPSLTPRTRKVDLKVDDVEGVQALYGSN 296 (340)
Q Consensus 245 ~~Va~HEIGHaLGL~Hs~d~~SIMyP~~~~~~~~~~L~~dDI~~Iq~LYG~~ 296 (340)
.-++.||++| +=|+|..+. .|... ..-+ -..|..+...|++..
T Consensus 102 ~fil~HE~~H-v~~~h~~~~----~~~~~---~~eE-~~AD~~A~~~il~~~ 144 (206)
T PF10463_consen 102 AFILLHELAH-VVLGHEGDS----SPSQS---IQEE-KEADSYATEMILSDV 144 (206)
T ss_pred HHHHHHHHHH-HHHcCcccc----ccchh---HHHH-HhhhHHHHHHHHHHH
Confidence 4689999999 678998765 11111 1111 257888888877643
No 125
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=25.92 E-value=30 Score=33.13 Aligned_cols=22 Identities=27% Similarity=0.388 Sum_probs=17.2
Q ss_pred ccccchhHHHHhhhhhhCCCcC
Q 037540 240 VAVDLESVATHEIGHVLGLAHT 261 (340)
Q Consensus 240 ~~~dl~~Va~HEIGHaLGL~Hs 261 (340)
...+.+.|+.||+||--|=.+-
T Consensus 199 da~~yyaTl~HElghwtgh~~r 220 (316)
T COG4227 199 DAINYYATLLHELGHWTGHEAR 220 (316)
T ss_pred chHhHHHHHHHHhccccCchhh
Confidence 3567889999999999875443
No 126
>PRK03094 hypothetical protein; Provisional
Probab=25.59 E-value=49 Score=25.89 Aligned_cols=17 Identities=12% Similarity=0.411 Sum_probs=15.1
Q ss_pred cChHHHHHHHHhcCCCC
Q 037540 62 TGMSELKKYFTRFGYLS 78 (340)
Q Consensus 62 ~~~~~~~~YL~~fGYl~ 78 (340)
+||..+++||++-||--
T Consensus 8 ~~Ls~i~~~L~~~GYeV 24 (80)
T PRK03094 8 QSLTDVQQALKQKGYEV 24 (80)
T ss_pred cCcHHHHHHHHHCCCEE
Confidence 68899999999999954
No 127
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=25.42 E-value=32 Score=37.20 Aligned_cols=31 Identities=16% Similarity=0.385 Sum_probs=19.4
Q ss_pred HHH-HHHHHHhhccCcceEEEecc--CCCccEEE
Q 037540 178 AAF-KSSFQRWSSVIPVNFTEIYD--YDSADIHI 208 (340)
Q Consensus 178 ~ai-~~Af~~Ws~vt~l~F~ev~~--~~~ADI~I 208 (340)
.++ +-.|..-+..-.|+|+|+.+ .-..|+++
T Consensus 364 ~Vl~~Glf~l~~~LfGi~f~~~~~~~vWh~dV~~ 397 (681)
T PRK10280 364 TVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRV 397 (681)
T ss_pred HHHHHhHHHHHHHHcCeEEEECCCCCCCCCCeeE
Confidence 455 34566677778899998653 23456554
No 128
>KOG2661 consensus Peptidase family M48 [Posttranslational modification, protein turnover, chaperones]
Probab=23.58 E-value=40 Score=33.44 Aligned_cols=18 Identities=39% Similarity=0.639 Sum_probs=14.1
Q ss_pred chhHHHHhhhhhhCCCcCC
Q 037540 244 LESVATHEIGHVLGLAHTS 262 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~ 262 (340)
+-.|+.||++|++ +.|+.
T Consensus 275 lAtvLgHE~aHaV-arH~A 292 (424)
T KOG2661|consen 275 LATVLGHEIAHAV-ARHAA 292 (424)
T ss_pred HHHHHHHHHHHHH-HHHHH
Confidence 5789999999996 45554
No 129
>PF03698 UPF0180: Uncharacterised protein family (UPF0180); InterPro: IPR005370 The members of this family are small uncharacterised proteins.
Probab=22.29 E-value=62 Score=25.30 Aligned_cols=17 Identities=29% Similarity=0.509 Sum_probs=15.2
Q ss_pred cChHHHHHHHHhcCCCC
Q 037540 62 TGMSELKKYFTRFGYLS 78 (340)
Q Consensus 62 ~~~~~~~~YL~~fGYl~ 78 (340)
+||..+++||++-||--
T Consensus 8 ~~Ls~v~~~L~~~GyeV 24 (80)
T PF03698_consen 8 EGLSNVKEALREKGYEV 24 (80)
T ss_pred CCchHHHHHHHHCCCEE
Confidence 68899999999999964
No 130
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=21.94 E-value=42 Score=36.00 Aligned_cols=14 Identities=50% Similarity=0.700 Sum_probs=12.1
Q ss_pred cchhHHHHhhhhhh
Q 037540 243 DLESVATHEIGHVL 256 (340)
Q Consensus 243 dl~~Va~HEIGHaL 256 (340)
+...|+.||+||||
T Consensus 407 ~~~~~a~he~gha~ 420 (644)
T PRK10733 407 QKESTAYHEAGHAI 420 (644)
T ss_pred HHHHHHHHHHHHHH
Confidence 35789999999997
No 131
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=21.75 E-value=60 Score=23.90 Aligned_cols=22 Identities=32% Similarity=0.336 Sum_probs=18.6
Q ss_pred chhHHHHhhhhhhCCCcCCCCC
Q 037540 244 LESVATHEIGHVLGLAHTSVKE 265 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~Hs~d~~ 265 (340)
-+.-++|.+-|-|||.|.++.+
T Consensus 29 ~eRriih~la~~lGL~~~s~G~ 50 (60)
T cd02639 29 AERRIVHLLASRLGLNHVSDGT 50 (60)
T ss_pred HHHHHHHHHHHHcCCceEEeCC
Confidence 4678999999999999987654
No 132
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=21.42 E-value=82 Score=27.75 Aligned_cols=47 Identities=26% Similarity=0.483 Sum_probs=34.8
Q ss_pred hHHHHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCC-CCccCCHHHHHHhcCCC
Q 037540 64 MSELKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLP-VTGKLDSETISTIMSPR 123 (340)
Q Consensus 64 ~~~~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLp-vTG~LD~~T~~~m~~PR 123 (340)
++++++||++ +| +++.++++..|.+.+.-. ..|+-+++.++.+..|+
T Consensus 7 L~~L~~~L~~---lp----------~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~ 54 (181)
T PF08006_consen 7 LNELEKYLKK---LP----------EEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPK 54 (181)
T ss_pred HHHHHHHHHc---CC----------HHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHH
Confidence 4667788874 33 357889999999988654 46888888888886664
No 133
>COG1059 Thermostable 8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair]
Probab=21.20 E-value=1.3e+02 Score=27.54 Aligned_cols=46 Identities=17% Similarity=0.442 Sum_probs=31.7
Q ss_pred HHHHHHhcCCCCCCCCCCCCcchHHHHHHHHHHHHhcCCCCCccCCH
Q 037540 67 LKKYFTRFGYLSLSYTNFTDVFDTQLQSAVVSYQTKLGLPVTGKLDS 113 (340)
Q Consensus 67 ~~~YL~~fGYl~~~~~~~~~~~d~~l~~Ai~~~Q~~~gLpvTG~LD~ 113 (340)
+-+||.++||+.+.....+...--+.++.++.+-+..|+++ |+||=
T Consensus 153 Ilr~l~r~g~i~e~~kt~t~K~YLe~E~ilr~iae~~g~s~-gelDL 198 (210)
T COG1059 153 ILRWLVRYGLIDENPKTLTRKLYLEIEEILRSIAEEVGISL-GELDL 198 (210)
T ss_pred HHHHHHHhcccccCcccccHHHHHHHHHHHHHHHHHhCCCc-chhHH
Confidence 35688888888654322333333578899999999999984 66663
No 134
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=20.78 E-value=49 Score=30.56 Aligned_cols=11 Identities=36% Similarity=0.573 Sum_probs=10.0
Q ss_pred hhHHHHhhhhh
Q 037540 245 ESVATHEIGHV 255 (340)
Q Consensus 245 ~~Va~HEIGHa 255 (340)
-++++||++||
T Consensus 94 gsiLAHE~mHa 104 (212)
T PF12315_consen 94 GSILAHELMHA 104 (212)
T ss_pred hhHHHHHHHHH
Confidence 57999999998
No 135
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=20.17 E-value=51 Score=29.40 Aligned_cols=16 Identities=25% Similarity=0.370 Sum_probs=12.8
Q ss_pred chhHHHHhhhhhhCCC
Q 037540 244 LESVATHEIGHVLGLA 259 (340)
Q Consensus 244 l~~Va~HEIGHaLGL~ 259 (340)
+-.+++|||+|++.-.
T Consensus 36 lG~ilahel~hafd~~ 51 (206)
T PF01431_consen 36 LGFILAHELMHAFDPE 51 (206)
T ss_dssp HHHHHHHHHHHCTSTT
T ss_pred HHHHHHHHHHHHHHHh
Confidence 5679999999987553
No 136
>cd06461 M2_ACE Peptidase family M2 Angiotensin converting enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc dependent dipeptidase that catalyzes the conversion of the decapeptide angiotensin I to the potent vasopressor ocatapeptide angiotensin II, by removing two C-terminal amino acids. There are two forms of the enzyme in humans, the ubiquitous somatic ACE and the sperm-specific germinal ACE, both encoded by the same gene through transcription from alternative promoters. Somatic ACE has two tandem active sites with distinct catalytic properties, whereas germinal ACE, the function of which is largely unknown, has just a single active site. Recently, an ACE homolog, ACE2, has been identified in humans that differs from ACE; it preferentially removes carboxy-terminal hydrophobic or basic amino acids and appears to be important in cardiac function. ACE homologs (also known as members of the M2 gluzincin family) have been found in a wide variety of species, including those that neither h
Probab=20.05 E-value=52 Score=34.11 Aligned_cols=16 Identities=25% Similarity=0.331 Sum_probs=12.7
Q ss_pred cchhHHHHhhhhhhCC
Q 037540 243 DLESVATHEIGHVLGL 258 (340)
Q Consensus 243 dl~~Va~HEIGHaLGL 258 (340)
+-..|+.||+||+...
T Consensus 246 ~D~~t~~HE~GH~~yy 261 (477)
T cd06461 246 EDFVTVHHEMGHIQYY 261 (477)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3467899999999853
Done!